Now in private preview v2026.5 → multi-agent orchestration

The agent OS
for the enterprise.

Mandarum is the runtime and control plane where companies build, deploy, and govern fleets of AI agents — with identity, policy, observability, and audit built in. The Kubernetes layer for the agent era.

Cloud · On-prem · BYOCSOC 2 Type IIISO 27001EU AI Act ready200+ integrations
The problem

Agent sprawl is the new shadow IT.

Enterprises are running dozens of agents across LangGraph, CrewAI, in-house stacks, and SaaS copilots. No shared memory. No policy enforcement. No audit trail. IT is panicking — and the CFO wants ROI.

01 · Identity

Every agent action is authenticated

Per-agent service identities, scoped tokens, and short-lived credentials issued by Mandarum — not your model provider.

02 · Policy

Spend caps, escalations, two-person rules

A declarative policy engine that runs before the tool call. Block exfiltration. Require approval. Route to humans.

03 · Observability

Every step traced, every dollar attributed

OpenTelemetry-native traces, prompt diffs, token cost per workflow, and one-click replay of any historical run.

The runtime

One platform. Every agent stack.

Mandarum runs your agents — whether you wrote them in LangGraph, the OpenAI Agents SDK, CrewAI, or our native DSL. Bring your models. Bring your tools. Keep your data.

Workflow engine

Planner → Executor → Verifier → Writer

Durable, retryable, multi-agent state machines on top of Temporal. Self-consistency voting on critical steps. Bounded retries with escalation.

01
Planner
02
Executor
03
Verifier
04
Writer
Memory

Shared across the fleet

Episodic + entity memory per user and per org. Permission-aware retrieval enforced at query time.

mem.read(user.id, scope="case-2031")
Policy

Approve before act

DSL for budgets, allowlists, escalation paths, and two-person rules — enforced at the tool call.

Evals

Golden datasets + CI

Regression nets on every prompt change. Shadow-eval in production. Publishable accuracy.

Marketplace

100+ pre-built workflows

Ship in days. Customize in hours. Revenue-share for partner builders.

Live preview

A real workflow, end to end.

Click to run a sample agent across CRM, calendar, and Slack — with policy review and human approval inline.

mandarum · trace workflow: deal-desk-review
↳ ~/mandarum/workflows/deal-desk-review
By the numbers

Production-grade from day one.

0
Runtime uptime
0
p99 orchestration
0
Agent steps / month
0
Native integrations
Observability

One pane. Every agent. Every dollar.

Drill from a KPI to a specific tool call. Replay any run. Diff any prompt change. Export to your data warehouse.

acme-corp/workflows/deal-desk-review
running · 4 agents
96.4%
Success rate
+1.2 vs last week
$18,420
Labor saved
+12.0%
2,108
Runs (24h)
+318
deal-desk-review · NorthBank renewalpolicy: pending approval6.8s
support-triage · ticket #84293verifier: pass2.1s
finance-close · Q4 reconciliationwriter: complete42.0s
vendor-onboarding · TechCorpescalated to human
sdr-research · 14 accounts enrichedtool: salesforce.update11.4s
Built on standards

Open by design. Yours by deployment.

Bring any model

Anthropic, OpenAI, Google, open-weights via vLLM, or your fine-tunes. Route per-step.

Bring any framework

LangGraph, OpenAI Agents SDK, CrewAI, MCP servers. Adopt incrementally.

Bring your cloud

SaaS, VPC, BYOC, fully air-gapped on-prem. Same control plane, same SLAs.

Stop deploying agents. Start operating them.

Join the design partner program. White-glove onboarding, weekly shipping cadence, and direct access to the founding team.