AI Summary
About
AI21 Labs is a Tel Aviv-based foundation-model company building large language models for the enterprise — most prominently the Jamba family of hybrid SSM-Transformer models and Maestro, an AI planning and orchestration system for multi-step agentic workflows. Unlike consumer-first labs, AI21 sells almost entirely to developers and businesses: there are no per-seat chat subscriptions on its pricing page, just a free trial, pure per-million-token billing for the Jamba API, and a sales-led Custom Plan for organizations that need volume discounts, premium rate limits, or private-cloud hosting.
Founded in November 2017 by Stanford AI professor Yoav Shoham, Ori Goshen, and Mobileye founder Amnon Shashua, AI21 is one of the oldest independent foundation-model labs — it shipped Jurassic-1 in 2021, well before the post-ChatGPT wave. It raised an oversubscribed $208M Series C in 2023 at a 1.4B valuation, with Google, Nvidia, and Intel Capital among its backers, bringing total funding to roughly 336M raised.
Strategically, AI21 has moved from a two-product company — the consumer Wordtune writing assistant plus the Jurassic API — toward an enterprise-only foundation-model and orchestration posture. Wordtune still exists, but the center of gravity shifted in 2024 to Jamba (open-weight, 256k context) and in 2025 to Maestro. That pivot reshaped the pricing: the consumer-facing task APIs of the Jurassic era collapsed into a clean per-token Jamba API, while the highest-value product, Maestro, abandoned per-token billing entirely for a quoted, budget-controlled enterprise model. AI21 competes with open-weight peers like Mistral AI and closed-weight leaders like OpenAI, differentiating on long-context efficiency and a deliberately developer-simple price sheet.
Pricing summary : How AI21 Labs’s pricing model works
AI21 Labs runs a pure usage-based API with no consumer seat tiers. The public ladder has just three rungs, and the real meter is tokens:
- Free Trial — $10 in usage credits, valid 7 days, no credit card required. A pure evaluation funnel into the paid meter.
- Pay As You Go — usage-based billing per million tokens for the Jamba foundation-model APIs and SDK, with unlimited seats (seats are free; tokens are the only meter).
- Custom Plan — sales-led, quoted: everything in Pay As You Go plus volume discounts, premium API rate limits, private cloud hosting, priority support, a dedicated account manager, and AI consultancy. No public floor price.
Per-model token rates (USD per 1M tokens): Jamba Mini at $0.2 input / $0.4 output and Jamba Large at $2 input / $8 output. Maestro sits outside this table — it is priced by a “budget” parameter and quoted to enterprises, not billed per token.
What makes this different: AI21 charges for inference and nothing else on the self-serve path — seats are unlimited and free, there is no monthly platform fee, and the only consumer-style packaging is a 7-day trial. Its tokenizer is itself a pricing lever: AI21 claims ~30% more English text per token than rivals, so the effective per-word cost undercuts the headline per-token rate.
Pricing by product
Plans (self-serve + sales-led)
| Tier | Price | Included | Key mechanics |
|---|---|---|---|
| Free Trial | $10 credits | $10 in usage credits, 7 days, no card, full API & SDK | Pure evaluation funnel |
| Pay As You Go | Usage-based | Per-token Jamba APIs & SDK, unlimited seats | Tokens are the only meter |
| Custom Plan | Contact sales | Everything in PAYG + volume discounts, premium rate limits, private cloud, priority support, dedicated AM, AI consultancy | Sales-led, quoted, no public floor |
Jamba foundation models (per 1M tokens, USD)
| Model | Input /M | Output /M | Key mechanics |
|---|---|---|---|
| Jamba Mini | $0.2 | $0.4 | Efficient, lightweight; 256k context; open-weight |
| Jamba Large | $2 | $8 | Most powerful long-context; 256k context; open-weight |
Output is billed at 2x–4x input on both models (4x on Large). The 256k-token context window is shared across both.
Maestro (AI planning & orchestration)
| Service | Price | Key mechanics |
|---|---|---|
| Maestro | Quoted (budget parameter) | Plans, validates, and self-corrects multi-step tasks at inference; budget dial trades speed vs cost vs reliability; GA in Amazon VPC since Dec 2025 |
Sales motions across products: PLG / self-serve for the Free Trial and Pay As You Go Jamba API; sales-led for the Custom Plan and Maestro (private deployment, volume, support).
Hidden costs : What AI21 Labs users actually pay
AI21’s self-serve sheet is unusually simple — no seat fees, no platform minimum, no per-request task-API surcharges — so the “hidden” cost is really just the shape of token consumption: the output premium on Jamba Large, and the jump from self-serve to a quoted Custom Plan once you need rate-limit headroom or private hosting.
Archetype — a long-context RAG application on the Jamba API. A team summarizing and answering over long documents on Jamba Large, running ~50M input tokens and ~12M output tokens a month, plus a lighter Jamba Mini classifier path at ~30M input / 10M output.
| Line item | Monthly cost |
|---|---|
| Jamba Large input — 50M tok @ $2/M | $100 |
| Jamba Large output — 12M tok @ $8/M | $96 |
| Jamba Mini input — 30M tok @ $0.2/M | $6 |
| Jamba Mini output — 10M tok @ $0.4/M | $4 |
| Estimated total | ~$206/mo |
The lesson: on Jamba Large the $8/M output rate is 4x the input rate, so output-heavy work (long generations, verbose answers) dominates the bill even when input token counts are large. Routing simpler steps to Jamba Mini ($0.2/$0.4) is the main cost lever, and AI21’s ~30% denser tokenizer means the real token count for a given English workload runs lower than a naive word-count estimate suggests. The genuine step-change is Maestro: once you need its planning/orchestration layer, you leave the published per-token sheet entirely for a budget-controlled, sales-quoted price.
Want to estimate your own AI21 Labs bill? Use the AI21 Labs pricing calculator to model your costs based on token volume per model.
Pricing evolution : AI21 Labs pricing history and changes
AI21’s pricing has tracked a clear strategic arc: from a Jurassic-era two-layer model (per-token base models plus per-request task APIs and a consumer Wordtune subscription) toward a clean, single-meter per-token Jamba API — and then a deliberate move up the stack with Maestro, which abandons token pricing for quoted, budget-controlled orchestration. The dated milestones below are reconstructed from primary announcements, contemporaneous press, and the live pricing capture.
Cadence
| Quarter | Price changes | Product / SKU additions | Notes |
|---|---|---|---|
| 2021 Q3 | 1 | 1 | Jurassic-1 + AI21 Studio API launch with per-token pricing |
| 2023 Q1 | 0 | 1 | Jurassic-2 + task-specific APIs (answers, summarize, paraphrase) |
| 2024 Q1 | 0 | 1 | 2024-03 Jamba launches (open-weight, 256k context) |
| 2024 Q3 | 0 | 1 | 2024-08 Jamba 1.5 Mini & Large; GA on Amazon Bedrock |
| 2025 Q1 | 0 | 1 | 2025-03 Maestro introduced (budget-parameter pricing) |
| 2025 Q4 | 0 | 1 | 2025-12 Maestro GA in Amazon VPC |
Tracked range: 2021 Q3–2026 Q2. Quarters not listed had no publicly announced price or SKU change. The headline Jamba token rates (Mini $0.2/$0.4, Large $2/$8) have held steady through the live 2026-06 capture.
Notable changes
- 2021-08 — Jurassic-1 ships with AI21 Studio, establishing per-token API billing (250k-token vocabulary).
- 2023-03 — Jurassic-2 plus task-specific APIs (contextual answers, summarize, paraphrase, grammar), a brief two-layer model later folded into pure token pricing.
- 2023-11 — $208M Series C at a 1.4B valuation (Google, Nvidia, Intel Capital), funding the enterprise pivot.
- 2024-03 — Jamba launches: hybrid Mamba SSM + Transformer MoE, 256k context, open weights — the strategic shift to open-weight enterprise inference.
- 2024-08 — Jamba 1.5 Mini & Large under the Jamba Open Model License; GA on Amazon Bedrock (Sep 2024), plus Vertex AI, Azure, NVIDIA NIM.
- 2025-03 — Maestro introduced with budget-parameter pricing rather than per-token billing — a deliberate move from token metering toward outcome-shaped enterprise pricing.
- 2025-12 — Maestro reaches GA in Amazon VPC; Jamba API stays pure per-token (Mini $0.2/$0.4, Large $2/$8).
The consumer-to-enterprise pivot in detail
AI21 began as a two-front company: the consumer Wordtune writing assistant (launched 2020, several million users) and the Jurassic developer API. After the 2023 Series C, it concentrated on reliable enterprise AI, and the pricing followed. The Jurassic-era patchwork — per-token base models, per-request task APIs, and a consumer subscription — gave way to a single per-token Jamba meter with free unlimited seats. The most telling move is Maestro: instead of extending the token meter to its agentic orchestration layer, AI21 priced it by a budget parameter and sold it through enterprise quotes. That bifurcation — public, simple, per-token for the models; quoted, budget-controlled for orchestration — is the signature of a lab pricing the outcome (a validated multi-step result) separately from the raw inference underneath it.
What’s unique : AI21 Labs’s distinctive pricing mechanics
1. Unlimited free seats, tokens as the only meter. AI21’s Pay As You Go plan explicitly bundles unlimited seats — there is no per-user fee anywhere on the self-serve path. The entire self-serve bill is tokens consumed, which makes the cost model unusually legible: no seat math, no platform minimum, just input/output rates per model. This is the inverse of seat-anchored SaaS and a purer expression of usage-based pricing than most peers.
2. The tokenizer as a price lever. AI21 markets a ~30% denser tokenizer — an average token covers about one word / six English characters — and frames it explicitly as a cost advantage: the same English workload consumes ~30% fewer tokens, so the effective per-word price beats the headline per-token rate. Few labs sell the tokenizer itself as a pricing feature.
3. Orchestration priced by budget, not tokens. Maestro breaks from per-token billing entirely. Customers set a budget dial that trades speed against cost and reliability, and AI21 quotes the deal. As the agentic layer becomes the high-value product, AI21 prices the validated outcome — not the raw inference — a structural step toward outcome-shaped pricing.
Strengths & weaknesses
| Strengths | Weaknesses |
|---|---|
| Fully public per-token Jamba rates (Mini $0.2/$0.4, Large $2/$8) — no “contact sales” wall for inference | Only two models on the public sheet — far narrower catalog than Mistral or OpenAI for buyers comparing options |
| Unlimited free seats; tokens are the only self-serve meter — exceptionally legible cost model | Output premium on Jamba Large ($8/M, 4x input) can surprise generation-heavy workloads |
| Open-weight Jamba (256k context) lets buyers self-host the same models — a credible lock-in hedge | Maestro, the flagship enterprise product, has no public price — fully sales-gated and budget-quoted |
| ~30% denser tokenizer lowers the effective per-word cost below the headline token rate | Custom Plan (volume, private cloud, support) is fully sales-gated with no published floor |
| Clean $10 / 7-day trial with no credit card lowers evaluation friction | No batch discount, cached-input rate, or context-tier pricing published — fewer optimization levers than peers |
| Long, deep-tech founder pedigree and multi-cloud availability (Bedrock, Vertex, Azure, VPC) | Frequent product re-architecture (Jurassic to Jamba to Maestro) makes historical price tracking harder |
Billing UX : AI21 Labs billing controls and transparency
- Free trial credits — new accounts get $10 in usage credits valid for 7 days with no credit card, so spend starts at zero and only begins after explicit conversion to Pay As You Go.
- Per-token usage tracking — AI21 Studio meters API consumption per model (input vs output tokens), the single dimension that drives the self-serve bill.
- Unlimited seats — no per-user billing to manage; team size is decoupled from cost entirely on Pay As You Go.
- Budget control (Maestro) — Maestro exposes a budget parameter that caps the speed/cost/reliability tradeoff per task, giving enterprises a direct lever over agentic spend.
- Multi-cloud billing options — Jamba is available through Amazon Bedrock, Google Vertex AI, Azure, and NVIDIA NIM, so usage can be consolidated onto an existing cloud bill instead of a separate AI21 invoice.
- Custom Plan controls — premium rate limits, volume discounts, and private-cloud hosting are negotiated as part of the sales-led Custom Plan, with a dedicated account manager for ongoing cost governance.
Strategic wins : Why AI21 Labs’s pricing decisions worked
1. Collapsing a messy two-layer model into one clean meter
The Jurassic era mixed per-token base models, per-request task APIs, and a consumer Wordtune subscription. By concentrating on Jamba and billing purely per token with free unlimited seats, AI21 made its self-serve pricing dramatically easier to reason about — a buyer can model their bill from two rates and a token estimate. See usage-based pricing strategy for why a single durable meter scales better than a patchwork.
2. Pricing orchestration above the token line
Rather than stretching the token meter onto Maestro, AI21 priced the orchestration layer by a budget parameter and sold it through enterprise quotes. That lets AI21 capture the value of a validated multi-step outcome separately from raw inference — and mirrors the broader shift away from per-token toward outcome-based pricing as agents become the product.
3. Turning the tokenizer into a price argument
Most labs treat tokenization as plumbing. AI21 markets its ~30% denser tokenizer as a cost advantage, giving its sales motion a concrete “you pay for fewer tokens” story that reframes a raw rate comparison. It’s a reminder that the unit you meter is itself a competitive lever, not just an accounting detail.
Areas to improve : Gaps in AI21 Labs’s pricing approach
1. Publish a Maestro price anchor
Maestro is the flagship enterprise product, yet it has no public price — only a budget parameter and a sales motion. A published starting point or worked example would shorten evaluation for mid-market buyers who can’t justify a sales call but have outgrown the raw Jamba API. The opacity invites exactly the cost-unpredictability anxiety that usage pricing is meant to remove.
2. Add optimization levers to the token sheet
Peers publish batch discounts, cached-input rates, and context-tier pricing; AI21’s sheet is just two flat input/output pairs. Adding even a batch or cached-input discount would give cost-sensitive workloads a reason to optimize on AI21 rather than route to a cheaper rival — and would make the ~30% tokenizer claim land harder against a fuller comparison.
3. Surface a broader public model line
With only Jamba Mini and Large on the public sheet, buyers comparing breadth see a thin catalog next to Mistral AI or OpenAI. Publishing fine-tuning, embedding, or specialized-model rates — even if narrow — would signal a fuller platform and reduce the perception that AI21 is a two-model shop.
Key takeaways
- One meter beats a patchwork. AI21 collapsed Jurassic-era per-token base models, per-request task APIs, and a consumer subscription into a single per-token Jamba meter with free seats — making its self-serve cost legible from two rates and a token estimate.
- Price the outcome above the inference. Maestro abandons per-token billing for a budget-controlled, quoted model, capturing the value of a validated multi-step result separately from raw tokens — an early move toward outcome-shaped pricing.
- The tokenizer is a pricing lever. A ~30% denser tokenizer means the effective per-word cost beats the headline per-token rate — AI21 sells that efficiency as a cost argument, not just a technical footnote.
- Output dominates the bill on premier models. Jamba Large’s $8/M output is 4x its input rate, so generation-heavy workloads pay far more than input-token counts suggest — routing simpler steps to Jamba Mini is the main lever.
- Open weights hedge lock-in. Jamba’s open-weight license (256k context) lets buyers self-host the same models they call per token — the same playbook that anchors Mistral’s strategy.
UBP implications
- A single token meter with free seats is the cleanest UBP story available. AI21 shows that decoupling seats from cost entirely — billing only tokens — produces a model buyers can estimate without seat math or platform minimums. UBP designers should look for the one unit every customer can reason about directly.
- Agent orchestration pushes pricing off the token meter. Maestro’s budget-parameter model signals that as labs move up the stack from inference to validated multi-step outcomes, the meter shifts from tokens toward a quoted, outcome-shaped dimension — UBP strategists should plan for that transition.
- The metering unit itself can be a differentiator. AI21’s ~30% denser tokenizer reframes a rate comparison: a “cheaper” per-token rival can still be more expensive per delivered word. UBP practitioners should define the value metric so that efficiency gains accrue to the customer’s visible bill.
Sources
- AI21 Labs pricing page (accessed 2026-06-11)
- AI21 Maestro — product page (accessed 2026-06-11)
- AI21 blog — Meet Maestro: AI planning & orchestration (accessed 2026-06-11)
- Jamba 1.5 family now available in Amazon Bedrock — AWS (accessed 2026-06-11)
- AI21 completes $208M oversubscribed Series C — PRNewswire (accessed 2026-06-11)
- AI21 Labs — Wikipedia (accessed 2026-06-11)
- Browse the pricing blueprint corpus
Bottom line
AI21 Labs prices like an enterprise foundation-model lab, not a consumer app: a $10 / 7-day trial funnels into a pure per-million-token Jamba API (Mini $0.2/$0.4, Large $2/$8) with free unlimited seats, while a sales-led Custom Plan handles volume, private cloud, and support. Its sharpest moves are collapsing the Jurassic-era patchwork into a single legible token meter, selling its denser tokenizer as a cost advantage, and pricing Maestro’s orchestration by a budget parameter rather than tokens — an early bet on outcome-shaped enterprise pricing. The main gaps are an opaque Maestro price, a thin public model line, and no batch or cached-input levers.
Want to compare AI21 Labs against other foundation-model providers? See Mistral AI and OpenAI, or browse the full pricing blueprint.
Pricing timeline : Major events on a vertical axis
Each milestone below corresponds to a public pricing change, product launch, or material adjustment. Major events use a filled marker; minor adjustments use a faded one.
Live snapshot: pure per-token Jamba API + $10 trial + Custom
Captured live USD pricing at ai21.com/pricing: Free Trial ($10 credits / 7 days, no card), Pay As You Go (usage-based, unlimited seats), and a sales-led Custom Plan. Jamba Mini $0.2 in / $0.4 out and Jamba Large $2 in / $8 out per 1M tokens.
Maestro reaches GA in Amazon VPC
Maestro becomes generally available for deployment in Amazon Virtual Private Cloud on December 1, 2025, cementing the enterprise/agentic focus. Foundation-model API pricing remains pure per-token (Jamba Mini $0.2/$0.4, Jamba Large $2/$8 per 1M); Maestro stays quote-based.
Maestro AI planning & orchestration introduced
AI21 unveils Maestro, an enterprise AI planning and orchestration system that plans, validates, and corrects multi-step tasks at inference time. It is priced not per token but via a 'budget' parameter trading speed against cost and reliability — a structural move toward outcome-shaped, quoted enterprise pricing.
Jamba 1.5 Mini & Large under an open license; on Bedrock
Jamba 1.5 Mini and Large ship August 22-23, 2024 under the Jamba Open Model License (256k context; 12B/52B and 94B/398B active/total params) and reach GA on Amazon Bedrock in September 2024, plus Vertex AI, Azure, and NVIDIA NIM — establishing multi-cloud per-token availability.
Jamba launches — hybrid SSM-Transformer, open weights
AI21 releases Jamba on March 29, 2024 — the first production-grade hybrid Mamba SSM + Transformer mixture-of-experts model, with context up to 256,000 tokens and open weights. It signals the strategic pivot to open-weight enterprise models monetized through hosted per-token inference.
$208M Series C at $1.4B valuation funds enterprise pivot
AI21 closes an oversubscribed $208M Series C (an initial $155M in August 2023 topped up to $208M in November) at a 1.4B valuation, with Google, Nvidia, and Intel Capital among backers. The capital underwrites the shift from consumer Wordtune toward reliable enterprise foundation models.
Jurassic-2 + task-specific APIs
Jurassic-2 (J2) ships March 9, 2023 with faster responses and multilingual support, alongside task-specific APIs (contextual answers, summarize, paraphrase, grammar). Pricing stays per-token for the base models, with task APIs billed per request — a brief two-layer model later collapsed into pure token pricing.
Jurassic-1 launches with AI21 Studio API
AI21 opens AI21 Studio with Jurassic-1, a 178B-parameter model carrying a 250,000-token vocabulary, billed via per-token API credits. This establishes per-token metering as the core primitive years before Jamba — alongside the consumer Wordtune writing assistant launched in 2020.
- · AI21 Labs was co-founded in 2017 by Stanford professor Yoav Shoham, Ori Goshen, and Mobileye founder Amnon Shashua — making it one of the few foundation-model labs with a serial deep-tech founder team.
- · AI21 advertises a pricing edge in the tokenizer itself: it claims an average token covers ~1 word / 6 English characters, ~30% more text per token than other providers — so the same workload costs ~30% fewer tokens.
- · Jamba (March 2024) was the first production-grade hybrid Mamba SSM + Transformer mixture-of-experts model, pairing a 256,000-token context window with open weights under the Jamba Open Model License.
Questions & answers
- What is AI21 Labs's pricing model?
- AI21 Labs runs a pure usage-based API: you pay per million tokens for its Jamba models (Jamba Mini at $0.2 in / $0.4 out, Jamba Large at $2 in / $8 out). The public ladder is a $10 free trial, Pay As You Go, and a sales-led Custom Plan for volume discounts, private cloud, and support.
- Does AI21 Labs offer a free tier?
- Yes — a Free Trial of $10 in credits valid for 7 days, with no credit card required. After that you move to Pay As You Go and are billed per token for the Jamba foundation-model APIs and SDK, with unlimited seats.
- How much do AI21's Jamba models cost per token?
- Jamba Mini is $0.2 per 1M input tokens and $0.4 per 1M output tokens. Jamba Large is $2 per 1M input tokens and $8 per 1M output tokens. AI21 also notes its tokenizer fits ~30% more English text per token than other providers.
- How is AI21 Maestro priced?
- Maestro, AI21's enterprise AI planning and orchestration system, has no published per-token price. It is governed by a budget parameter that trades off speed, cost, and reliability, and is sold via enterprise quote — generally available in Amazon VPC since December 2025.
- Is AI21 Labs pricing usage-based or subscription?
- It is usage-based, not subscription. AI21 bills per million tokens on Pay As You Go for the Jamba API; there are no per-seat consumer plans. Enterprise customers move to a Custom Plan for volume discounts, premium rate limits, and private cloud hosting.