Console

@cognipeer/console  ·  Flagship · Control plane
stable

Self-hosted AI infrastructure with OpenAI-compatible API, provider routing & resiliency, vector orchestration, files pipeline, guardrails, tracing & observability, and project-level resource management.

License  ·  MITTypeScript  ·  100%Audience  ·  Platform teams thatFits  ·  Self-hosted AI platform
Self-hosted AI platformOpenAI-compatible APIsMulti-tenant operationsBrowse recipes
Production install
$npm install @cognipeer/console

What's in the box

Platform teams that need a multi-tenant AI gateway, control plane, and operational surface for inference, vectors, files, and tracing. Each capability is opt-in — use the parts that fit, leave the rest.

OpenAI Compatible API

Enables existing clients to operate as a seamless drop-in replacement, supporting chat, embeddings, streaming, and tool calling.

Provider Routing & Resiliency

Routes requests across model providers with built-in fallback handling and health-based routing strategies.

Vector Orchestration

Manages multiple vector databases through a unified control layer, including index lifecycle and operational workflows.

Files Pipeline

Supports document uploads with optional Markdown conversion, preparing content for RAG-ready ingestion pipelines.

Guardrails & Safety

Evaluate content against configurable safety policies with input and output filtering for production AI deployments.

Tracing & Observability

Tracks agent executions, tool calls, and LLM requests with integrated platform-level visibility.

Projects & Workspaces

Organise resources, models, and API keys into isolated projects with independent configuration and usage tracking.

How it runs

Console is the multi-tenant control plane: an OpenAI-compatible gateway sitting in front of providers, vectors, files, and tracing — with per-project quotas and guardrails baked in.

Client
Your app
OpenAI-compat
Layer 1
Gateway
Routing · quotas
Layer 2
Guardrails
In/out filters
Layer 3
Provider
OpenAI · Anthropic …
Layer 4
Vectors
Files · embeddings
Sink
Tracing
Audit · evals

Inputs

  • Chat completions
  • Embeddings
  • File uploads
  • Vector queries

Controls

  • Per-project quotas
  • Provider fallback
  • Guardrail policies
  • API key scopes

Outputs

  • Trace events
  • Token accounting
  • Audit log
  • Usage exports

Quickstart

Install, configure, run. The example below is the smallest piece of code that does something useful in production.

1# Self-host the gateway with one command
2$ docker run -p 8080:8080 ghcr.io/cognipeer/console:latest
3 
4# Call it from any OpenAI-compatible client
5const res = await fetch("http://localhost:8080/v1/chat/completions", {
6  method: "POST",
7  headers: { "Authorization": "Bearer $COGNIPEER_KEY" },
8  body: JSON.stringify({
9    model: "gpt-4.1",
10    messages: [{ role: "user", content: "Hi" }],
11  }),
12});

How it compares

Against the flagship · control plane options teams most often weigh — focused on operational concerns, not feature inventories.

CapabilityConsoleLiteLLMHeliconePortkey
Self-Hosted●  native●  native●  native◐  partial
Open Source●  native●  native●  native◐  partial
OpenAI Compatible●  native●  native●  native●  native
Routing / Fallback●  native●  native◐  partial●  native
Rate Limiting●  native●  native◐  partial●  native
Caching●  native◐  partial◐  partial●  native

Patterns that use Console

Production-tested recipes powered by this package.

Next steps