No seatsNo subscription requiredUsage-based

Pay for AI usage, not software seats.

Add credits, route through Tokaroo, and track every request. Tokaroo handles model routing, fallback, caching, Knowledge Base context, guardrails, telemetry, and billing meters.

Add credits and startRead docs

Fast

Speed-optimized routing for low-latency workloads. Best for interactive UI, quick summaries, extraction, and simple agent steps.

Auto

Best-value routing. Auto can use cheaper models for easy work and escalate to stronger models when complexity, risk, or confidence calls for it.

Max

Highest-capability routing for complex reasoning, architecture, important reviews, critical decisions, and hard agent work.

What usage includes

Included meter
Model calls and tokens
Included meter
Images, audio, video, and embeddings when used
Included meter
Knowledge Base context packs and retrieval
Included meter
Sources ingestion and document chunking
Included meter
Docs Studio generation and artifact creation
Included meter
URL intelligence scans
Included meter
Mission, trace, approval, and action telemetry

Spend controls

  • Prepaid balance and usage-based drawdown
  • Workspace budgets for teams, customers, or environments
  • Per-key and per-workspace usage views
  • Spend history, tokens, requests, latency, and savings
  • Admin reconciliation for provider cost, customer charge, margin, internal spend, and shadow-test spend

Savings measurement

Tokaroo tracks customer charge, direct-model baseline, actual provider cost, and sampled shadow comparisons where available. Customers see simple spend and savings. Admins see reconciliation, provider spend, margin, internal usage, and shadow-test spend.

Simple examples

Chat app

Route requests through auto, use cache/fallback, and see tokens, spend, requests, and savings in the dashboard.

Agent with memory

Use Knowledge Base context packs, Sources, events, and feedback so future replies/actions get better context.

Gwen-style workforce

Track missions, steps, artifacts, tool actions, approvals, outcomes, and usage without showing per-task cost noise to the end user.

Start with credits. Scale with usage.

No seats. No fixed subscription required. Usage and savings stay visible in the dashboard.

Get started