Built by Cognipeer, in production, in the open

An open AI platform,
end to end.

Two flagships do the heavy lifting — Console as the multi-tenant control plane, Agent SDK as the deterministic agent runtime. A growing toolkit of libraries plugs into both. We use it ourselves. You can ship on the same stack.

Start with Console Build with Agent SDK

Self-host or hostedAny LLM providerTypeScript-firstAGPL & permissive licenses

cognipeer.stack · one platform, two flagships

// The shape of your AI stack on Cognipeer

interface CognipeerStack {

// flagship: the control plane

console: {

gateway: "openai-compatible",

vectors: "any", rag: true, tracing: true,

guardrails: true, prompts: true, tenancy: "multi",

// flagship: the runtime

agentSdk: {

loop: "message-first", planning: true,

reasoning: "fast | balanced | deep | research",

multiAgent: true, tracing: "otel",

// supporting libraries plug into both

extras: "console-sdk · client-sdk · agent-server · chat-ui · ...",

}

BUILT BY US · USED BY US · OPEN FOR YOU

Not a side project. The same stack runs Cognipeer.

Every library here exists because we needed it ourselves. We open-source the parts other teams can reuse — and keep shipping new ones as our own workloads grow.

In production at

Cognipeer

our hosted & enterprise deployments

Customer agents shipped on it

hundreds

across banking, telco, retail, energy

Inference routed monthly

millions

through Console's LLM gateway

Open-source toolkit

growing · steadily

new libraries as our own workloads grow

Powering production AI for teams in

banking·telco·insurance·retail·airlines·healthcare·energy·public sector

Flagships

Two products carry the platform.

Console is everything you need to run AI in production. Agent SDK is everything you need to build agents that survive contact with production. Use one, both, or layer the rest of the toolkit on top.

Flagship · Control plane

Cognipeer Console

@cognipeer/console · AGPL-3.0 · self-host or hosted

An open, multi-tenant AI gateway. Operate inference, vector stores, RAG, tracing, guardrails, prompts, and quotas behind one console — with tenant isolation built in.

LLM GatewayOpenAI-compatible chat & embeddings across OpenAI, Anthropic, Bedrock, Vertex, Azure, vLLM, Ollama, and more.

Vector store & RAGMulti-provider vector operations, document ingestion, chunking, embedding, and retrieval.

Tracing & tracing sinksBatch and streaming ingest with thread correlation and OTel-compatible IDs.

Guardrails & promptsPII detection, content moderation, prompt shields, versioned templates with environments.

Quotas & isolationMulti-dimensional quotas, rate limits, and per-tenant databases out of the box.

SQLite-by-defaultDocker / HelmMulti-tenant

Flagship · Runtime

Agent SDK

@cognipeer/agent-sdk · npm · Node 18.17+

Lightweight, message-first agent runtime. Tool calls stay transparent, long histories summarize without losing evidence, and one config controls planning, reasoning, and reflection.

Native provider layerCall OpenAI, Anthropic, Azure, Bedrock, Vertex, or any OpenAI-compatible API directly. No LangChain required.

Reasoning + reflectionOne reasoning config switches provider-native reasoning and post-tool reflections.

Runtime profilesFast, balanced, deep, or research presets — real tradeoff bundles for context, depth, memory.

Plans, summaries, recoveryDurable plan on state.plan; archived tool outputs still retrievable via get_tool_response.

Multi-agent & structured outputWrap agents as tools, hand off mid-run, return Zod-validated JSON deterministically.

Message-firstZero-graphOTel tracing

How they fit, end to end

Your app

Web

Mobile

Backend

Internal tools

Build with

Agent SDK

Chat UI

Client SDK

Console SDK

Agent Server

Run on

Cognipeer Console

LLM gateway · vector stores · RAG · prompt mgmt · guardrails · tracing · quotas · multi-tenancy

Talks to

OpenAI

Anthropic

Azure

Bedrock

Vertex

vLLM

Ollama

any OpenAI-compatible

Supporting toolkit

The libraries that plug into both.

Targeted libraries that extend Console and Agent SDK at specific points in the stack — and a growing set of utilities we open-source as we need them.

Actively expanding. New packages ship when our own work needs them — adapters, components, evals, recipes. Watch the GitHub org for the next release.

Browse on GitHub

Quickstart

Pick the flagship that matches your day.

Need to run AI for an org? Start with Console. Building an agent against any LLM? Start with the SDK.

PATH A · Platform team

Deploy Cognipeer Console

Self-host the open AI gateway. SQLite-by-default — no external database required.

runs anywhere with Node 20+

$ git clone https://github.com/Cognipeer/console

$ cd console && npm install

$ cp .env.example .env.local

$ npm run dev

or with Docker

$docker compose up -d

Read the Console guide

PATH B · App developer

Build with Agent SDK

One install, one config, any LLM. Planning, reflection, and tracing are built in.

$npm install @cognipeer/agent-sdk zod

import {
  createSmartAgent, createTool,
  createProvider, fromNativeProvider,
} from "@cognipeer/agent-sdk";
 
const model = fromNativeProvider(
  createProvider({ provider: "openai", apiKey: process.env.OPENAI_API_KEY! }),
  { model: "gpt-4o" },
);
 
const agent = createSmartAgent({
  model,
  tools: [searchKb, openTicket],
  runtimeProfile: "balanced",
  tracing: { enabled: true },
});
 
const result = await agent.invoke({
  messages: [{ role: "user", content: "Where is order #4187?" }],
});

Read the SDK guide

Recipes

Patterns we've shipped — now in code.

Production-tested recipes for agents, RAG, governance, and embedded chat. Each one shows the package mix and a starting point you can copy.

All recipes

Recipe · 6 min read

HR Self-Service Widget

Embed a Studio-powered HR assistant into an employee portal for policy Q&A, leave requests, onboarding help, and document routing.

Client SDKChat · Widget

Recipe · 6 min read

Expense Upload & Approval Widget

Let employees upload receipts, extract fields, and submit expense requests through a Studio-powered finance widget embedded in your product.

Client SDKChat · Widget

Recipe · 8 min read

Mobile Peer Embedded In The App

Run a Studio peer directly inside a mobile app so it can answer users and trigger native app actions such as screen navigation, camera capture, and task execution.

Client SDKChat · Mobile

Recipe · 7 min read

Website Product Advisor Widget

Embed a Studio peer into a public website to qualify visitors, answer pricing questions, and trigger product actions without building a separate bot backend.

Client SDKChat · Widget

Recipe · 8 min read

Quota-Aware LLM Gateway

Run application traffic through Console with project-scoped quotas, model routing, and request-level visibility for cost control.

Console + Console SDKPlatform · Gateway

Recipe · 9 min read

Vector RAG Operations Control Plane

Operate RAG pipelines through Console by combining file ingestion, vector index management, embeddings, and chat retrieval in one control surface.

Console + Console SDKRAG · Platform

Roadmap

Public, opinionated, regularly updated.

What we just shipped, what's next, and what we're researching. Vote, comment, or open an RFC.

Shipped

Past 30 days

agent-sdk: runtime profiles
fast / balanced / deep / research presets — real tradeoff bundles.
console: per-project quotas
Multi-dimensional limits with automatic provider failover.
chat-ui: tool-call inspector
Inline panel for inputs/outputs, latency, partial sink failures.

In progress

This quarter

Evals harness
Offline grading and scenario replay over Console traces.
MCP tool gateway
Quota-aware adapter for MCP servers, exposed through Console.
Edge runtime for Agent SDK
Workers-compatible build for low-latency deployments.

Exploring

Next up

Visual agent planner
DAG-style preview of agent reasoning, served from Console.
Policy DSL
Declarative guardrails composable across tenants and agents.
Mobile Chat UI parity
React Native build of the conversational kit.

Open the public board

Changelog

Released this week

Granular notes for every library, plus migration tips when an API shifts.

All releases

May 18 · 2 days ago

agent-sdk

Reasoning presets and durable plans

New reasoning config switches between fast, balanced, deep, and research presets — real tradeoff bundles for context budget, delegation depth, and memory policy. Plans now live on state.plan instead of transient UI events.

May 14 · 6 days ago

console

Per-project quotas + provider routing v2

Define burst & sustained limits per project, with automatic routing across providers when a primary degrades. The gateway exposes per-tenant headers and surfaces overage events to webhooks.

May 09 · 11 days ago

chat-ui

Tool-call inspector + attachment streaming

A first-class panel that shows tool input/output, latency, and partial sink failures inline. Attachments stream incrementally so users see images and files appear as they're generated.

Open infrastructure for serious AI

Build on the same stack we use to ship Cognipeer.

Self-host Console, install Agent SDK, layer the rest of the toolkit on top. Or skip the operations and use Cognipeer hosted — same APIs, same SDKs.

Start with Console Build with Agent SDK Talk to sales

An open AI platform,end to end.

Not a side project. The same stack runs Cognipeer.

Two products carry the platform.

Cognipeer Console

Agent SDK

The libraries that plug into both.

Console SDK

Client SDK

Agent Server

Chat UI

to-markdown

graph-builder

Pick the flagship that matches your day.

Deploy Cognipeer Console

Build with Agent SDK

Patterns we've shipped — now in code.

HR Self-Service Widget

Expense Upload & Approval Widget

Mobile Peer Embedded In The App

Website Product Advisor Widget

Quota-Aware LLM Gateway

Vector RAG Operations Control Plane

Public, opinionated, regularly updated.

Past 30 days

This quarter

Next up

Released this week

Reasoning presets and durable plans

Per-project quotas + provider routing v2

Tool-call inspector + attachment streaming

Build on the same stack we use to ship Cognipeer.

An open AI platform,
end to end.