Start free.
Upgrade when it matters.
The open-source layer gets you running in minutes. The Cloud layer makes it smart.
Open source
Free
Self-hosted - BYO provider keys
- Drop-in OpenAI SDK replacement
- Waterfall routing - free models first
- Ollama and vLLM (local models) support
- 10+ provider adapters built in
- Weekly model pool auto-discovery
- Self-hosted, BYO API keys
- MIT licensed - use it anywhere
Compare
| Feature | OSS | Cloud |
|---|---|---|
| OpenAI-compatible API | yes | yes |
| Groq, Gemini, Ollama support | yes | yes |
| Waterfall / fallback routing | yes | yes |
| BYO provider API keys | yes | - |
| Smart scoring engine | - | yes |
| Semantic response cache | - | yes |
| Console + cost analytics | - | yes |
| Weekly model auto-discovery | yes | yes |
| Hosted - no infra to run | - | yes |
Questions
What is the difference between OSS and Cloud?
The OSS version is a self-hosted proxy you run yourself using your own provider API keys. It handles routing, fallback, and provider abstraction. The Cloud version adds our optimization engine - the part that makes real cost decisions, caches intelligently, and learns over time. That part stays closed.
What does 'BYO keys' mean?
Bring Your Own Keys. With the OSS version you add your own API keys for OpenAI, Anthropic, Groq, Gemini, and any other providers you use to your environment. Tokaroo routes between them. You pay the providers directly - we do not sit in the middle of the money.
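In practice this means the proxy routes only to providers whose keys it finds in your environment. A minimal sketch, assuming conventional environment-variable names (the variable names and helper below are illustrative, not Tokaroo's documented config):

```python
import os

# Illustrative mapping - not Tokaroo's actual config format.
PROVIDER_ENV_KEYS = {
    "openai": "OPENAI_API_KEY",
    "anthropic": "ANTHROPIC_API_KEY",
    "groq": "GROQ_API_KEY",
    "gemini": "GEMINI_API_KEY",
}

def available_providers(env=os.environ):
    """A provider is routable simply if its key is present in the environment."""
    return [name for name, var in PROVIDER_ENV_KEYS.items() if env.get(var)]
```

Set only the keys you have; routing adapts to whatever is available.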
How does Cloud billing work?
Cloud prices each request dynamically based on the tier you choose, the market baseline for that request, and how efficiently Tokaroo routed it. You pre-load credits, and each request deducts from your balance. No monthly seat fees - you only pay for what you use.
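The deduction model can be sketched as simple arithmetic. Everything below - the function name, the efficiency factor, the tier multiplier - is a hypothetical illustration of "baseline times routing efficiency times tier", not Tokaroo's actual pricing formula:

```python
def charge(balance, baseline_cost, efficiency, tier_multiplier):
    """Deduct one dynamically priced request from a prepaid credit balance.

    baseline_cost: market baseline for this request, in credits
    efficiency: 0..1, fraction saved by Tokaroo's routing (0 = no savings)
    tier_multiplier: pricing factor for the chosen tier
    Returns (new_balance, price_charged).
    """
    price = baseline_cost * (1.0 - efficiency) * tier_multiplier
    if price > balance:
        raise RuntimeError("insufficient credits")
    return balance - price, price
```

So a request with a 10-credit baseline, routed 50% more efficiently, on a 1.2x tier, costs 6 credits.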
Can I start on OSS and switch to Cloud?
Yes. The API is identical - the only change is one line pointing baseURL at our servers instead of yours. The rest of your code stays the same.
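With the OpenAI Python SDK, the switch looks like this (the URLs are illustrative placeholders - substitute your actual proxy address and Tokaroo key):

```python
from openai import OpenAI

# OSS: point the standard OpenAI client at your self-hosted proxy.
client = OpenAI(base_url="http://localhost:8080/v1", api_key="not-needed-locally")

# Cloud: same client, same code - only the base_url (and key) change.
# client = OpenAI(base_url="https://api.tokaroo.example/v1", api_key="tk-...")
```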
What's the 'optimization engine'?
It is the part we keep closed. First, a scoring model that decides which provider to use per request based on task type, cost, latency, and recent performance - and that gets smarter over time. Second, a semantic cache, which checks whether a semantically similar question was already answered and returns the cached result instead of paying for a new generation.
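The semantic-cache idea can be sketched in a few lines. This is a toy illustration, not Tokaroo's implementation: real systems use learned embedding models, whereas here a bag-of-words vector and cosine similarity stand in:

```python
from collections import Counter
from math import sqrt

def embed(text):
    # Toy stand-in for a real embedding model.
    return Counter(text.lower().split())

def cosine(a, b):
    dot = sum(a[w] * b[w] for w in a)
    na = sqrt(sum(v * v for v in a.values()))
    nb = sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0

class SemanticCache:
    def __init__(self, threshold=0.8):
        self.entries = []          # list of (embedding, answer)
        self.threshold = threshold

    def get(self, question):
        q = embed(question)
        for emb, answer in self.entries:
            if cosine(q, emb) >= self.threshold:
                return answer      # similar enough: skip a paid generation
        return None                # cache miss: pay for a fresh generation

    def put(self, question, answer):
        self.entries.append((embed(question), answer))
```

A rephrased question that lands above the similarity threshold returns the stored answer for free; an unrelated one falls through to a real provider call.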
Is the OSS version useful on its own?
Absolutely. Groq, Gemini Flash, and local models are free - and Tokaroo OSS will use them first. For many workloads this alone cuts AI costs dramatically. Cloud adds intelligence on top of that foundation.
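The free-first waterfall described above is conceptually simple: order providers so free tiers come first, take the first success, and fall back down the list on failure. A hedged sketch - the provider tuple shape and names are illustrative, not Tokaroo's adapter API:

```python
def waterfall(prompt, providers):
    """providers: list of (name, is_free, call) tuples.

    Free providers are tried first; paid ones are only reached
    when every free option has failed.
    """
    errors = []
    # Stable sort: free (is_free=True) providers first, original order kept.
    for name, is_free, call in sorted(providers, key=lambda p: not p[1]):
        try:
            return name, call(prompt)
        except Exception as exc:
            errors.append((name, exc))   # note the failure, keep cascading
    raise RuntimeError(f"all providers failed: {errors}")
```

If a free provider is rate-limited, the next free one is tried before any paid provider spends a cent.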