All companies
technology

AI21 Labs pricing

ai21.com facts checked analysis reviewed
Quick summary
Billing units
Sales motion
Product segment
Region
Product
Jamba foundation models, Maestro orchestration & enterprise AI
Industry
technology
Commits
None
In this page
AI Summary
  • AI21 Labs prices its Jamba foundation models on a pure per-million-token API: Jamba Mini at $0.2 in / $0.4 out and Jamba Large at $2 in / $8 out (USD).
  • There are no consumer seat tiers — the public ladder is Free Trial ($10 in credits for 7 days, no card), Pay As You Go (usage-based, unlimited seats), and a sales-led Custom Plan.
  • Maestro, AI21's enterprise AI planning and orchestration system, has no public per-token price — it is metered by a 'budget' parameter and quoted, GA on Amazon VPC since December 2025.
  • AI21 claims its tokenizer packs ~30% more English text per token than rivals, so the effective per-word cost runs below the headline per-token rate.
  • AI21 pivoted from the consumer Wordtune assistant and Jurassic models toward enterprise Jamba (open-weight, 256k context) and agentic orchestration, after a $208M Series C at a $1.4B valuation in 2023.
Pricing summary
AI21 Labs 2026 — pure per-token Jamba API with a $10 trial and sales-led Custom plan
No consumer seat tiers: a free trial funnels into pay-as-you-go per-million-token billing for the Jamba foundation models, with a quoted Custom Plan for enterprise scale.
Free Trial
$10 credits
Developers evaluating the Jamba API
Custom Plan
Contact sales
Companies scaling or needing private deployment
Jamba Mini
$0.2 / $0.4 /M tok
Efficient, lightweight tasks
Jamba Large
$2 / $8 /M tok
Most powerful long-context model
USD per million tokens. Maestro (AI planning & orchestration) is priced by a budget parameter and quoted separately — GA in Amazon VPC since December 2025.

About

AI21 Labs is a Tel Aviv-based foundation-model company building large language models for the enterprise — most prominently the Jamba family of hybrid SSM-Transformer models and Maestro, an AI planning and orchestration system for multi-step agentic workflows. Unlike consumer-first labs, AI21 sells almost entirely to developers and businesses: there are no per-seat chat subscriptions on its pricing page, just a free trial, pure per-million-token billing for the Jamba API, and a sales-led Custom Plan for organizations that need volume discounts, premium rate limits, or private-cloud hosting.

Founded in November 2017 by Stanford AI professor Yoav Shoham, Ori Goshen, and Mobileye founder Amnon Shashua, AI21 is one of the oldest independent foundation-model labs — it shipped Jurassic-1 in 2021, well before the post-ChatGPT wave. It raised an oversubscribed $208M Series C in 2023 at a 1.4B valuation, with Google, Nvidia, and Intel Capital among its backers, bringing total funding to roughly 336M raised.

Strategically, AI21 has moved from a two-product company — the consumer Wordtune writing assistant plus the Jurassic API — toward an enterprise-only foundation-model and orchestration posture. Wordtune still exists, but the center of gravity shifted in 2024 to Jamba (open-weight, 256k context) and in 2025 to Maestro. That pivot reshaped the pricing: the consumer-facing task APIs of the Jurassic era collapsed into a clean per-token Jamba API, while the highest-value product, Maestro, abandoned per-token billing entirely for a quoted, budget-controlled enterprise model. AI21 competes with open-weight peers like Mistral AI and closed-weight leaders like OpenAI, differentiating on long-context efficiency and a deliberately developer-simple price sheet.


Pricing summary : How AI21 Labs’s pricing model works

AI21 Labs runs a pure usage-based API with no consumer seat tiers. The public ladder has just three rungs, and the real meter is tokens:

  • Free Trial — $10 in usage credits, valid 7 days, no credit card required. A pure evaluation funnel into the paid meter.
  • Pay As You Gousage-based billing per million tokens for the Jamba foundation-model APIs and SDK, with unlimited seats (seats are free; tokens are the only meter).
  • Custom Plan — sales-led, quoted: everything in Pay As You Go plus volume discounts, premium API rate limits, private cloud hosting, priority support, a dedicated account manager, and AI consultancy. No public floor price.

Per-model token rates (USD per 1M tokens): Jamba Mini at $0.2 input / $0.4 output and Jamba Large at $2 input / $8 output. Maestro sits outside this table — it is priced by a “budget” parameter and quoted to enterprises, not billed per token.

What makes this different: AI21 charges for inference and nothing else on the self-serve path — seats are unlimited and free, there is no monthly platform fee, and the only consumer-style packaging is a 7-day trial. Its tokenizer is itself a pricing lever: AI21 claims ~30% more English text per token than rivals, so the effective per-word cost undercuts the headline per-token rate.


Pricing by product

Plans (self-serve + sales-led)

TierPriceIncludedKey mechanics
Free Trial$10 credits$10 in usage credits, 7 days, no card, full API & SDKPure evaluation funnel
Pay As You GoUsage-basedPer-token Jamba APIs & SDK, unlimited seatsTokens are the only meter
Custom PlanContact salesEverything in PAYG + volume discounts, premium rate limits, private cloud, priority support, dedicated AM, AI consultancySales-led, quoted, no public floor

Jamba foundation models (per 1M tokens, USD)

ModelInput /MOutput /MKey mechanics
Jamba Mini$0.2$0.4Efficient, lightweight; 256k context; open-weight
Jamba Large$2$8Most powerful long-context; 256k context; open-weight

Output is billed at 2x–4x input on both models (4x on Large). The 256k-token context window is shared across both.

Maestro (AI planning & orchestration)

ServicePriceKey mechanics
MaestroQuoted (budget parameter)Plans, validates, and self-corrects multi-step tasks at inference; budget dial trades speed vs cost vs reliability; GA in Amazon VPC since Dec 2025

Sales motions across products: PLG / self-serve for the Free Trial and Pay As You Go Jamba API; sales-led for the Custom Plan and Maestro (private deployment, volume, support).


Hidden costs : What AI21 Labs users actually pay

AI21’s self-serve sheet is unusually simple — no seat fees, no platform minimum, no per-request task-API surcharges — so the “hidden” cost is really just the shape of token consumption: the output premium on Jamba Large, and the jump from self-serve to a quoted Custom Plan once you need rate-limit headroom or private hosting.

Archetype — a long-context RAG application on the Jamba API. A team summarizing and answering over long documents on Jamba Large, running ~50M input tokens and ~12M output tokens a month, plus a lighter Jamba Mini classifier path at ~30M input / 10M output.

Line itemMonthly cost
Jamba Large input — 50M tok @ $2/M$100
Jamba Large output — 12M tok @ $8/M$96
Jamba Mini input — 30M tok @ $0.2/M$6
Jamba Mini output — 10M tok @ $0.4/M$4
Estimated total~$206/mo

The lesson: on Jamba Large the $8/M output rate is 4x the input rate, so output-heavy work (long generations, verbose answers) dominates the bill even when input token counts are large. Routing simpler steps to Jamba Mini ($0.2/$0.4) is the main cost lever, and AI21’s ~30% denser tokenizer means the real token count for a given English workload runs lower than a naive word-count estimate suggests. The genuine step-change is Maestro: once you need its planning/orchestration layer, you leave the published per-token sheet entirely for a budget-controlled, sales-quoted price.

Want to estimate your own AI21 Labs bill? Use the AI21 Labs pricing calculator to model your costs based on token volume per model.


Pricing evolution : AI21 Labs pricing history and changes

AI21’s pricing has tracked a clear strategic arc: from a Jurassic-era two-layer model (per-token base models plus per-request task APIs and a consumer Wordtune subscription) toward a clean, single-meter per-token Jamba API — and then a deliberate move up the stack with Maestro, which abandons token pricing for quoted, budget-controlled orchestration. The dated milestones below are reconstructed from primary announcements, contemporaneous press, and the live pricing capture.

Cadence

QuarterPrice changesProduct / SKU additionsNotes
2021 Q311Jurassic-1 + AI21 Studio API launch with per-token pricing
2023 Q101Jurassic-2 + task-specific APIs (answers, summarize, paraphrase)
2024 Q1012024-03 Jamba launches (open-weight, 256k context)
2024 Q3012024-08 Jamba 1.5 Mini & Large; GA on Amazon Bedrock
2025 Q1012025-03 Maestro introduced (budget-parameter pricing)
2025 Q4012025-12 Maestro GA in Amazon VPC

Tracked range: 2021 Q3–2026 Q2. Quarters not listed had no publicly announced price or SKU change. The headline Jamba token rates (Mini $0.2/$0.4, Large $2/$8) have held steady through the live 2026-06 capture.

Notable changes

  • 2021-08 — Jurassic-1 ships with AI21 Studio, establishing per-token API billing (250k-token vocabulary).
  • 2023-03 — Jurassic-2 plus task-specific APIs (contextual answers, summarize, paraphrase, grammar), a brief two-layer model later folded into pure token pricing.
  • 2023-11 — $208M Series C at a 1.4B valuation (Google, Nvidia, Intel Capital), funding the enterprise pivot.
  • 2024-03 — Jamba launches: hybrid Mamba SSM + Transformer MoE, 256k context, open weights — the strategic shift to open-weight enterprise inference.
  • 2024-08 — Jamba 1.5 Mini & Large under the Jamba Open Model License; GA on Amazon Bedrock (Sep 2024), plus Vertex AI, Azure, NVIDIA NIM.
  • 2025-03 — Maestro introduced with budget-parameter pricing rather than per-token billing — a deliberate move from token metering toward outcome-shaped enterprise pricing.
  • 2025-12 — Maestro reaches GA in Amazon VPC; Jamba API stays pure per-token (Mini $0.2/$0.4, Large $2/$8).

The consumer-to-enterprise pivot in detail

AI21 began as a two-front company: the consumer Wordtune writing assistant (launched 2020, several million users) and the Jurassic developer API. After the 2023 Series C, it concentrated on reliable enterprise AI, and the pricing followed. The Jurassic-era patchwork — per-token base models, per-request task APIs, and a consumer subscription — gave way to a single per-token Jamba meter with free unlimited seats. The most telling move is Maestro: instead of extending the token meter to its agentic orchestration layer, AI21 priced it by a budget parameter and sold it through enterprise quotes. That bifurcation — public, simple, per-token for the models; quoted, budget-controlled for orchestration — is the signature of a lab pricing the outcome (a validated multi-step result) separately from the raw inference underneath it.


What’s unique : AI21 Labs’s distinctive pricing mechanics

1. Unlimited free seats, tokens as the only meter. AI21’s Pay As You Go plan explicitly bundles unlimited seats — there is no per-user fee anywhere on the self-serve path. The entire self-serve bill is tokens consumed, which makes the cost model unusually legible: no seat math, no platform minimum, just input/output rates per model. This is the inverse of seat-anchored SaaS and a purer expression of usage-based pricing than most peers.

2. The tokenizer as a price lever. AI21 markets a ~30% denser tokenizer — an average token covers about one word / six English characters — and frames it explicitly as a cost advantage: the same English workload consumes ~30% fewer tokens, so the effective per-word price beats the headline per-token rate. Few labs sell the tokenizer itself as a pricing feature.

3. Orchestration priced by budget, not tokens. Maestro breaks from per-token billing entirely. Customers set a budget dial that trades speed against cost and reliability, and AI21 quotes the deal. As the agentic layer becomes the high-value product, AI21 prices the validated outcome — not the raw inference — a structural step toward outcome-shaped pricing.


Strengths & weaknesses

StrengthsWeaknesses
Fully public per-token Jamba rates (Mini $0.2/$0.4, Large $2/$8) — no “contact sales” wall for inferenceOnly two models on the public sheet — far narrower catalog than Mistral or OpenAI for buyers comparing options
Unlimited free seats; tokens are the only self-serve meter — exceptionally legible cost modelOutput premium on Jamba Large ($8/M, 4x input) can surprise generation-heavy workloads
Open-weight Jamba (256k context) lets buyers self-host the same models — a credible lock-in hedgeMaestro, the flagship enterprise product, has no public price — fully sales-gated and budget-quoted
~30% denser tokenizer lowers the effective per-word cost below the headline token rateCustom Plan (volume, private cloud, support) is fully sales-gated with no published floor
Clean $10 / 7-day trial with no credit card lowers evaluation frictionNo batch discount, cached-input rate, or context-tier pricing published — fewer optimization levers than peers
Long, deep-tech founder pedigree and multi-cloud availability (Bedrock, Vertex, Azure, VPC)Frequent product re-architecture (Jurassic to Jamba to Maestro) makes historical price tracking harder

Billing UX : AI21 Labs billing controls and transparency

  • Free trial credits — new accounts get $10 in usage credits valid for 7 days with no credit card, so spend starts at zero and only begins after explicit conversion to Pay As You Go.
  • Per-token usage tracking — AI21 Studio meters API consumption per model (input vs output tokens), the single dimension that drives the self-serve bill.
  • Unlimited seats — no per-user billing to manage; team size is decoupled from cost entirely on Pay As You Go.
  • Budget control (Maestro) — Maestro exposes a budget parameter that caps the speed/cost/reliability tradeoff per task, giving enterprises a direct lever over agentic spend.
  • Multi-cloud billing options — Jamba is available through Amazon Bedrock, Google Vertex AI, Azure, and NVIDIA NIM, so usage can be consolidated onto an existing cloud bill instead of a separate AI21 invoice.
  • Custom Plan controls — premium rate limits, volume discounts, and private-cloud hosting are negotiated as part of the sales-led Custom Plan, with a dedicated account manager for ongoing cost governance.

Strategic wins : Why AI21 Labs’s pricing decisions worked

1. Collapsing a messy two-layer model into one clean meter

The Jurassic era mixed per-token base models, per-request task APIs, and a consumer Wordtune subscription. By concentrating on Jamba and billing purely per token with free unlimited seats, AI21 made its self-serve pricing dramatically easier to reason about — a buyer can model their bill from two rates and a token estimate. See usage-based pricing strategy for why a single durable meter scales better than a patchwork.

2. Pricing orchestration above the token line

Rather than stretching the token meter onto Maestro, AI21 priced the orchestration layer by a budget parameter and sold it through enterprise quotes. That lets AI21 capture the value of a validated multi-step outcome separately from raw inference — and mirrors the broader shift away from per-token toward outcome-based pricing as agents become the product.

3. Turning the tokenizer into a price argument

Most labs treat tokenization as plumbing. AI21 markets its ~30% denser tokenizer as a cost advantage, giving its sales motion a concrete “you pay for fewer tokens” story that reframes a raw rate comparison. It’s a reminder that the unit you meter is itself a competitive lever, not just an accounting detail.


Areas to improve : Gaps in AI21 Labs’s pricing approach

1. Publish a Maestro price anchor

Maestro is the flagship enterprise product, yet it has no public price — only a budget parameter and a sales motion. A published starting point or worked example would shorten evaluation for mid-market buyers who can’t justify a sales call but have outgrown the raw Jamba API. The opacity invites exactly the cost-unpredictability anxiety that usage pricing is meant to remove.

2. Add optimization levers to the token sheet

Peers publish batch discounts, cached-input rates, and context-tier pricing; AI21’s sheet is just two flat input/output pairs. Adding even a batch or cached-input discount would give cost-sensitive workloads a reason to optimize on AI21 rather than route to a cheaper rival — and would make the ~30% tokenizer claim land harder against a fuller comparison.

3. Surface a broader public model line

With only Jamba Mini and Large on the public sheet, buyers comparing breadth see a thin catalog next to Mistral AI or OpenAI. Publishing fine-tuning, embedding, or specialized-model rates — even if narrow — would signal a fuller platform and reduce the perception that AI21 is a two-model shop.


Key takeaways

  1. One meter beats a patchwork. AI21 collapsed Jurassic-era per-token base models, per-request task APIs, and a consumer subscription into a single per-token Jamba meter with free seats — making its self-serve cost legible from two rates and a token estimate.
  2. Price the outcome above the inference. Maestro abandons per-token billing for a budget-controlled, quoted model, capturing the value of a validated multi-step result separately from raw tokens — an early move toward outcome-shaped pricing.
  3. The tokenizer is a pricing lever. A ~30% denser tokenizer means the effective per-word cost beats the headline per-token rate — AI21 sells that efficiency as a cost argument, not just a technical footnote.
  4. Output dominates the bill on premier models. Jamba Large’s $8/M output is 4x its input rate, so generation-heavy workloads pay far more than input-token counts suggest — routing simpler steps to Jamba Mini is the main lever.
  5. Open weights hedge lock-in. Jamba’s open-weight license (256k context) lets buyers self-host the same models they call per token — the same playbook that anchors Mistral’s strategy.

UBP implications

  1. A single token meter with free seats is the cleanest UBP story available. AI21 shows that decoupling seats from cost entirely — billing only tokens — produces a model buyers can estimate without seat math or platform minimums. UBP designers should look for the one unit every customer can reason about directly.
  2. Agent orchestration pushes pricing off the token meter. Maestro’s budget-parameter model signals that as labs move up the stack from inference to validated multi-step outcomes, the meter shifts from tokens toward a quoted, outcome-shaped dimension — UBP strategists should plan for that transition.
  3. The metering unit itself can be a differentiator. AI21’s ~30% denser tokenizer reframes a rate comparison: a “cheaper” per-token rival can still be more expensive per delivered word. UBP practitioners should define the value metric so that efficiency gains accrue to the customer’s visible bill.

Sources


Bottom line

AI21 Labs prices like an enterprise foundation-model lab, not a consumer app: a $10 / 7-day trial funnels into a pure per-million-token Jamba API (Mini $0.2/$0.4, Large $2/$8) with free unlimited seats, while a sales-led Custom Plan handles volume, private cloud, and support. Its sharpest moves are collapsing the Jurassic-era patchwork into a single legible token meter, selling its denser tokenizer as a cost advantage, and pricing Maestro’s orchestration by a budget parameter rather than tokens — an early bet on outcome-shaped enterprise pricing. The main gaps are an opaque Maestro price, a thin public model line, and no batch or cached-input levers.

Want to compare AI21 Labs against other foundation-model providers? See Mistral AI and OpenAI, or browse the full pricing blueprint.

Pricing timeline : Major events on a vertical axis

Each milestone below corresponds to a public pricing change, product launch, or material adjustment. Major events use a filled marker; minor adjustments use a faded one.

Live snapshot: pure per-token Jamba API + $10 trial + Custom

Captured live USD pricing at ai21.com/pricing: Free Trial ($10 credits / 7 days, no card), Pay As You Go (usage-based, unlimited seats), and a sales-led Custom Plan. Jamba Mini $0.2 in / $0.4 out and Jamba Large $2 in / $8 out per 1M tokens.

Live snapshot: pure per-token Jamba API + $10 trial + Custom - Captured live USD pricing at ai21.com/pricing: Free Trial ($10 credits / 7 days,
captured

Maestro reaches GA in Amazon VPC

Maestro becomes generally available for deployment in Amazon Virtual Private Cloud on December 1, 2025, cementing the enterprise/agentic focus. Foundation-model API pricing remains pure per-token (Jamba Mini $0.2/$0.4, Jamba Large $2/$8 per 1M); Maestro stays quote-based.

Maestro AI planning & orchestration introduced

AI21 unveils Maestro, an enterprise AI planning and orchestration system that plans, validates, and corrects multi-step tasks at inference time. It is priced not per token but via a 'budget' parameter trading speed against cost and reliability — a structural move toward outcome-shaped, quoted enterprise pricing.

Jamba 1.5 Mini & Large under an open license; on Bedrock

Jamba 1.5 Mini and Large ship August 22-23, 2024 under the Jamba Open Model License (256k context; 12B/52B and 94B/398B active/total params) and reach GA on Amazon Bedrock in September 2024, plus Vertex AI, Azure, and NVIDIA NIM — establishing multi-cloud per-token availability.

Jamba launches — hybrid SSM-Transformer, open weights

AI21 releases Jamba on March 29, 2024 — the first production-grade hybrid Mamba SSM + Transformer mixture-of-experts model, with context up to 256,000 tokens and open weights. It signals the strategic pivot to open-weight enterprise models monetized through hosted per-token inference.

$208M Series C at $1.4B valuation funds enterprise pivot

AI21 closes an oversubscribed $208M Series C (an initial $155M in August 2023 topped up to $208M in November) at a 1.4B valuation, with Google, Nvidia, and Intel Capital among backers. The capital underwrites the shift from consumer Wordtune toward reliable enterprise foundation models.

Jurassic-2 + task-specific APIs

Jurassic-2 (J2) ships March 9, 2023 with faster responses and multilingual support, alongside task-specific APIs (contextual answers, summarize, paraphrase, grammar). Pricing stays per-token for the base models, with task APIs billed per request — a brief two-layer model later collapsed into pure token pricing.

Jurassic-1 launches with AI21 Studio API

AI21 opens AI21 Studio with Jurassic-1, a 178B-parameter model carrying a 250,000-token vocabulary, billed via per-token API credits. This establishes per-token metering as the core primitive years before Jamba — alongside the consumer Wordtune writing assistant launched in 2020.

Trivia
  • · AI21 Labs was co-founded in 2017 by Stanford professor Yoav Shoham, Ori Goshen, and Mobileye founder Amnon Shashua — making it one of the few foundation-model labs with a serial deep-tech founder team.
  • · AI21 advertises a pricing edge in the tokenizer itself: it claims an average token covers ~1 word / 6 English characters, ~30% more text per token than other providers — so the same workload costs ~30% fewer tokens.
  • · Jamba (March 2024) was the first production-grade hybrid Mamba SSM + Transformer mixture-of-experts model, pairing a 256,000-token context window with open weights under the Jamba Open Model License.

Questions & answers

What is AI21 Labs's pricing model?
AI21 Labs runs a pure usage-based API: you pay per million tokens for its Jamba models (Jamba Mini at $0.2 in / $0.4 out, Jamba Large at $2 in / $8 out). The public ladder is a $10 free trial, Pay As You Go, and a sales-led Custom Plan for volume discounts, private cloud, and support.
Does AI21 Labs offer a free tier?
Yes — a Free Trial of $10 in credits valid for 7 days, with no credit card required. After that you move to Pay As You Go and are billed per token for the Jamba foundation-model APIs and SDK, with unlimited seats.
How much do AI21's Jamba models cost per token?
Jamba Mini is $0.2 per 1M input tokens and $0.4 per 1M output tokens. Jamba Large is $2 per 1M input tokens and $8 per 1M output tokens. AI21 also notes its tokenizer fits ~30% more English text per token than other providers.
How is AI21 Maestro priced?
Maestro, AI21's enterprise AI planning and orchestration system, has no published per-token price. It is governed by a budget parameter that trades off speed, cost, and reliability, and is sold via enterprise quote — generally available in Amazon VPC since December 2025.
Is AI21 Labs pricing usage-based or subscription?
It is usage-based, not subscription. AI21 bills per million tokens on Pay As You Go for the Jamba API; there are no per-seat consumer plans. Enterprise customers move to a Custom Plan for volume discounts, premium rate limits, and private cloud hosting.