Console
Self-hosted AI infrastructure with OpenAI-compatible API, provider routing & resiliency, vector orchestration, files pipeline, guardrails, tracing & observability, and project-level resource management.
What's in the box
Platform teams that need a multi-tenant AI gateway, control plane, and operational surface for inference, vectors, files, and tracing. Each capability is opt-in — use the parts that fit, leave the rest.
OpenAI Compatible API
Enables existing clients to operate as a seamless drop-in replacement, supporting chat, embeddings, streaming, and tool calling.
Provider Routing & Resiliency
Routes requests across model providers with built-in fallback handling and health-based routing strategies.
Vector Orchestration
Manages multiple vector databases through a unified control layer, including index lifecycle and operational workflows.
Files Pipeline
Supports document uploads with optional Markdown conversion, preparing content for RAG-ready ingestion pipelines.
Guardrails & Safety
Evaluate content against configurable safety policies with input and output filtering for production AI deployments.
Tracing & Observability
Tracks agent executions, tool calls, and LLM requests with integrated platform-level visibility.
Projects & Workspaces
Organise resources, models, and API keys into isolated projects with independent configuration and usage tracking.
How it runs
Console is the multi-tenant control plane: an OpenAI-compatible gateway sitting in front of providers, vectors, files, and tracing — with per-project quotas and guardrails baked in.
Inputs
- Chat completions
- Embeddings
- File uploads
- Vector queries
Controls
- Per-project quotas
- Provider fallback
- Guardrail policies
- API key scopes
Outputs
- Trace events
- Token accounting
- Audit log
- Usage exports
Quickstart
Install, configure, run. The example below is the smallest piece of code that does something useful in production.
How it compares
Against the flagship · control plane options teams most often weigh — focused on operational concerns, not feature inventories.
| Capability | Console | LiteLLM | Helicone | Portkey |
|---|---|---|---|---|
| Self-Hosted | ● native | ● native | ● native | ◐ partial |
| Open Source | ● native | ● native | ● native | ◐ partial |
| OpenAI Compatible | ● native | ● native | ● native | ● native |
| Routing / Fallback | ● native | ● native | ◐ partial | ● native |
| Rate Limiting | ● native | ● native | ◐ partial | ● native |
| Caching | ● native | ◐ partial | ◐ partial | ● native |
Patterns that use Console
Production-tested recipes powered by this package.
Quota-Aware LLM Gateway
Run application traffic through Console with project-scoped quotas, model routing, and request-level visibility for cost control.
Vector RAG Operations Control Plane
Operate RAG pipelines through Console by combining file ingestion, vector index management, embeddings, and chat retrieval in one control surface.
PromptOps And MCP Tool Gateway
Use Console as the control plane for prompt versioning, secure config storage, and converting OpenAPI specs into MCP and tool endpoints.