AI Summary
About
Scale AI is the data-engine company behind much of the modern AI stack — it collects, curates, and annotates the training and evaluation data that frontier labs and enterprises use to build models. Founded in 2016 by Alexandr Wang and Lucy Guo, Scale grew from a labeling API for self-driving-car datasets into a vertically integrated data operation that bundles pre-labeling software with a global contributor workforce. Today its product surface spans Scale Data Engine (collection, annotation, RLHF, red-teaming, model evaluation across text, image, video and 3D/LiDAR), the Scale GenAI Platform (enterprise generative-AI applications), the Outlier and Remotasks contributor marketplaces, and Donovan, its public-sector / defense offering.
Commercially, Scale is a labor-arbitrage business: it marks up the work of a reported 240,000+ contractors for a 50%+ gross margin, which is exactly why its pricing is private. Revenue scaled with the generative-AI boom — roughly 870M in 2024 (about a 1.5B annualized run rate by year-end) and an estimated 2B in 2025, with applications (GenAI Platform, Donovan) reportedly contributing 200-300M.
The defining 2025 event was strategic, not a price change. In June 2025, Meta invested about 14.3 billion dollars for a roughly 49% non-voting stake (valuing Scale at 29B) and hired founder-CEO Alexandr Wang to lead its superintelligence effort; Jason Droege became Scale’s CEO. The deal reshaped Scale’s customer base — rivals including Google (which had reportedly spent about 150M in 2024) and OpenAI pulled back over data-confidentiality concerns — pushing Scale further toward enterprise and public-sector contracts.
For current pricing, you book a demo at Scale’s pricing page — there is no public rate card to quote from.
Pricing summary : How Scale AI’s pricing model works
Scale AI is sales-only: the core data business is priced by quote, structured around per labeled task / per annotation / per data unit, and usually packaged as a committed annual data-engine contract with volume discounts. There is no published rate card and no ongoing free tier — only a free trial. The pricing page presents two paths, and both route to “book a demo”:
- Self-Serve Data Engine — pay-as-you-go via credit card for experimental or research projects, beyond a free trial of the first 1,000 labeling units and first 10,000 images. Per-unit rates are not published on the page.
- Enterprise — committed contracts spanning Data Engine plus the GenAI Platform, with enterprise SLAs and dedicated customer-operations support. This is where the revenue is, and it is fully sales-quoted.
Underneath, the unit economics are a markup on human labor: Scale monetizes per task and, per third-party reporting, runs a 50%+ gross margin. Third parties cite indicative self-serve figures — roughly 2 cents per image and 6 cents per annotation — and an average contract near 93K per year with complex projects past 400K, but Scale itself publishes none of this.
What makes this different: Scale is one of the largest AI vendors with essentially zero public price transparency for its main product. Where a GPU cloud publishes a per-hour rate, Scale deliberately keeps per-task pricing behind sales — because the price is really a negotiated markup on a labor supply chain it would rather not expose.
Pricing by product
Scale does not publish list prices. The table below describes the buying motion per surface; any cent- or dollar-level figure is a third-party indicative estimate, not an official Scale rate.
| Product | How it’s priced | Buying motion | Notes |
|---|---|---|---|
| Data Engine (self-serve) | Per data unit, pay-as-you-go | Self-serve → sales for volume | Free trial: 1,000 labeling units + 10,000 images; per-unit rates not published |
| Data Engine (enterprise) | Committed annual contract, per-task / per-annotation | Sales-led | Volume discounts; avg contract cited near 93K/yr (third-party) |
| GenAI Platform | Enterprise contract | Sales-led | Generative-AI applications; reported 200-300M apps revenue line |
| Donovan (public sector) | Government contract | Sales-led / partner-led | Defense & public-sector deployments |
| Outlier / Remotasks | Contributor payouts (supply side) | n/a | The labor pool Scale marks up; not a customer SKU |
Sales motions across products: every customer-facing surface is sales-led and quote-based. The free trial is the only self-serve entry point; there is no published per-task rate card.
Hidden costs : What Scale AI users actually pay
Because Scale prices by negotiated contract, the “hidden costs” are less about line-item add-ons and more about the structure of a quote-based data deal:
| Line item | What it means |
|---|---|
| Per-task / per-annotation rate | The base unit; varies by data type and quality bar, set in the quote, not published |
| Quality / review passes | Multi-pass review and QA raise the effective per-task cost vs. a raw label |
| Annual commitment | Enterprise pricing assumes committed volume; under-utilization still bills |
| Project complexity premium | Segmentation, RLHF, red-teaming and 3D/LiDAR cost far more than simple classification |
| Onboarding & ops | Dedicated customer-operations support is bundled into enterprise deals, not free |
The single biggest “cost” is price opacity itself: with no rate card, buyers cannot benchmark a quote without an RFP across vendors, and effective per-unit cost depends heavily on complexity and quality bar (third-party benchmarks for the category span roughly 0.01 to 1.00+ dollars per image, and 0.05 to 3.00 dollars per segmentation mask). The second is vendor concentration risk that the Meta deal made concrete — committing annual volume to a vendor partly owned by a competitor is now a real procurement consideration.
Want to estimate your own data-labeling bill? Use the Scale AI pricing calculator to model costs against indicative per-task assumptions.
Pricing evolution : Scale AI pricing history and changes
Cadence
| Period | Pricing changes | Product / motion shifts | Notes |
|---|---|---|---|
| 2023 | — (per-task model unchanged) | LLM/RLHF demand surges | Contract-based, no public rates |
| 2024 | — | Enterprise contracts dominate | ~870M revenue; avg deal cited near 93K/yr |
| 2025 H1 | — | GenAI Platform + Donovan grow | Apps line reportedly 200-300M |
| 2025 H2 → 2026 | — (still sales-quoted) | Meta stake; customer pullback | Mix shifts to enterprise + public sector |
Tracked range: 2023–present. Pricing itself stayed quote-based throughout; the material changes were strategic (ownership, customer base), not rate-card moves.
Notable changes
- 2023 — Per-labeled-task data-engine model scales with the generative-AI training-data boom; frontier labs buy RLHF and annotation at volume. No public rate card.
- 2024 — Enterprise committed contracts dominate revenue (~870M). Self-serve Data Engine offers a free trial but routes paid usage to sales.
- June 2025 — Meta takes a roughly 49% non-voting stake for about 14.3B (29B valuation); Alexandr Wang departs for Meta, Jason Droege becomes CEO. Customers including Google and OpenAI reportedly pull back, accelerating the shift toward enterprise apps and public-sector Donovan.
The throughline: Scale has barely changed its pricing mechanic in years — it is still per-task, contract-based, sales-quoted. What changed in 2025 was who buys and who owns the vendor, which matters more for a quote-based business than any list-price tweak.
What’s unique : Scale AI’s distinctive pricing mechanics
1. A market leader with no public rate card. Unlike most of the AI-infra corpus, Scale publishes essentially nothing — both the self-serve and enterprise paths route to “book a demo.” Pricing is a negotiated markup on labor, kept private by design.
2. The unit is human labor, not compute. Scale meters per labeled task / per annotation and earns a reported 50%+ gross margin by marking up a 240,000+ contributor workforce. The “value metric” is annotated data units, not GPU-hours or tokens.
3. Ownership became a pricing variable. After Meta’s ~49% stake, a procurement decision about Scale is also a decision about feeding a competitor — a rare case where the cap table, not the rate card, drives buyer behavior.
Strengths & weaknesses
| Strengths | Weaknesses |
|---|---|
| Deep, vertically integrated data stack (collection → annotation → eval) | Zero public price transparency; every quote needs an RFP to benchmark |
| Free trial lowers the barrier to first experiment | No ongoing free tier; production work is contract-only |
| Enterprise SLAs, RLHF/red-teaming, 3D/LiDAR breadth | Effective cost varies wildly with complexity and quality bar |
| Strong gross margins from labor arbitrage | Meta ownership created competitive-conflict customer churn (Google, OpenAI) |
| Growing higher-margin apps line (GenAI Platform, Donovan) | Contributor-labor model carries reputational and supply-chain risk |
Billing UX : Scale AI billing controls and transparency
- Billing controls — Self-serve Data Engine bills pay-as-you-go via credit card after the free trial; enterprise runs on committed annual contracts with invoicing and volume discounts negotiated by sales.
- Usage visibility — The Data Engine console tracks labeling units and project spend, but there is no published rate to reconcile against — effective per-task cost is set in the contract.
- Payment options — Credit card for self-serve; sales-led contracts, POs and invoicing for enterprise and public-sector (Donovan) deals.
Strategic wins : Why Scale AI’s pricing decisions worked
1. Pricing the data, not the model
By charging per labeled task instead of per seat or per model, Scale captured value from the single scarcest input in AI — high-quality training data — and turned a BPO motion into a high-margin software-adjacent business. See how AI companies structure pricing.
2. Opacity as leverage
Keeping per-task rates private let Scale price each frontier-lab deal to willingness-to-pay rather than a public anchor, protecting its labor markup. Related: outcome-based pricing trends.
3. Layering committed contracts over self-serve
A free trial funnels experiments into committed annual data-engine deals, converting spiky project demand into forecastable enterprise revenue. See choosing the right usage metric.
Areas to improve : Gaps in Scale AI’s pricing approach
1. No way to estimate before talking to sales
With no published per-task rate, a buyer cannot ballpark a budget without an RFP. Even an indicative range or a public self-serve rate would reduce friction at the top of the funnel. See bill shock and cost unpredictability.
2. Complexity-driven cost is unpredictable
Effective per-unit cost swings widely with annotation type and quality bar, and the contract structure makes that hard to forecast — clearer complexity tiers would help buyers plan.
3. Ownership conflict needs a pricing answer
Post-Meta, Scale needs commercial guardrails (data isolation guarantees, neutral-vendor assurances) priced into deals to win back labs that pulled back — otherwise quote-based flexibility cannot offset the trust gap.
Key takeaways
- Scale AI is sales-only, per-task pricing — no public rate card, contract-based per labeled task / annotation / data unit, with committed annual data-engine deals. For the underlying model, see the introduction to usage-based pricing.
- The meter is human labor, not compute — Scale marks up a 240,000+ contributor workforce for a reported 50%+ gross margin, which is why pricing stays private.
- A free trial is the only self-serve door (first 1,000 labeling units, first 10,000 images); everything past it is quoted.
- The big 2025 change was strategic, not a price move — Meta’s ~49% / ~14.3B stake and the CEO swap reshaped Scale’s customer base more than any rate-card tweak.
- Opacity is the model’s signature and its weakness — it protects margin but blocks buyer benchmarking and amplified the post-Meta trust problem.
UBP implications
- Usage pricing can be fully private and still scale. Scale proves a per-unit, usage-shaped model doesn’t require a public rate card — but only if you own a defensible, hard-to-benchmark input.
- The value metric should track the scarce input. Pricing per annotated data unit aligns cost with the thing customers actually need (quality data), not a proxy like seats.
- Ownership and trust are pricing inputs for data vendors. When a buyer’s data feeds a vendor partly owned by a rival, no discount fully closes the gap — neutrality has to be built into the offer, not just the price.
Sources
- Scale AI pricing page — sales-quoted, free trial only (accessed 2026-06-15)
- Scale Data Engine — no prices disclosed (accessed 2026-06-15)
- Sacra — Scale AI revenue, valuation & model — indicative per-unit figures, revenue, Meta deal (accessed 2026-06-15)
- CheckThat.ai — Scale AI pricing (third-party) — indicative per-unit and contract figures (accessed 2026-06-15)
- CNN Business — Scale AI making money under Meta — CEO Droege on revenue mix (accessed 2026-06-15)
- AInvest — Meta acquires 49% stake in Scale for $14.3B (accessed 2026-06-15)
Bottom line
Scale AI is the rare market leader that publishes almost no pricing: its data engine and GenAI platform are sold by enterprise quote, structured around per-labeled-task / per-annotation rates and committed annual contracts, with only a free trial (first 1,000 labeling units, first 10,000 images) as a self-serve door. The unit is human labor — Scale marks up a 240,000+ contributor workforce for a reported 50%+ gross margin — which is exactly why the rate card stays private. The defining 2025 event was strategic, not a price change: Meta’s roughly 14.3B investment for a ~49% non-voting stake pulled founder-CEO Alexandr Wang to Meta and reshaped Scale’s customer base as rivals like Google and OpenAI pulled back. Browse the pricing blueprint for more fully-researched company profiles.
Want to compare Scale AI against other Infrastructure, Compute & MLOps companies? Browse the pricing blueprint.
Pricing timeline : Major events on a vertical axis
Each milestone below corresponds to a public pricing change, product launch, or material adjustment. Major events use a filled marker; minor adjustments use a faded one.
Meta invests ~14.3B for ~49%; founder-CEO leaves; customers pull back
Meta takes a roughly 49% non-voting stake for about 14.3B (29B valuation). Alexandr Wang joins Meta's superintelligence effort; Jason Droege becomes Scale CEO. Google, OpenAI and others reportedly reduce or pause work over confidentiality concerns. Pricing stays sales-quoted; mix shifts toward enterprise apps (GenAI Platform) and public-sector Donovan.
Enterprise contracts dominate; ~870M revenue
Scale reaches roughly 870M revenue (about a 1.5B annualized run rate by year-end) on committed annual data-engine deals. Average contract cited near 93K per year, complex projects past 400K. Self-serve Data Engine offers a free trial (1,000 labeling units, 10,000 images) but routes paid usage to sales.
Per-task data-engine model scales with the LLM boom
Scale's contract-based, per-labeled-task / per-annotation model (markup on contractor labor, 50%+ gross margin) rides the generative-AI training-data boom; revenue grows explosively as frontier labs buy RLHF and annotation at volume. No public rate card — large committed data-engine contracts.
Monetization stack & signals : how Scale AI builds & buys its revenue engine
What billing, metering, CPQ, customer-success and revenue tooling Scale AI runs — built in-house vs bought — plus where the revenue/lifecycle org is hiring. Every item below links to the job post, engineering blog, or filing it was drawn from; unconfirmed tools are marked as such rather than guessed.
Director, Forward Deployed Engineering · Engagement Manager, Public Sector · Technical Program Manager, Public Sector
Enterprise Account Executive · Frontier Agents Engineer · Manager of Commercial Partnerships, Robotics
GTM Architect · Director of Technology & Systems · Enterprise AI Development Strategist
Director of Technology & Systems · GTM Systems Analyst
Head of Finance Systems & Automation
Field Marketing & Events Manager, Public Sector
Scale runs a bought, sales-led revenue stack and is investing in re-architecting and AI-automating it rather than building monetization tooling in-house. Public-sector GTM Systems Analyst and Director-of-Technology reqs name a live Salesforce CRM environment, NetSuite as the ERP/rev-rec system of record, and Snowflake as the warehouse the GTM/finance fabric integrates into; a generic CPQ + billing layer sits inside that GTM stack but no specific CPQ vendor is confirmed in-use (Salesforce CPQ appears only as an "e.g." in skills lists). The hiring pattern is dominated by enterprise customer-success/retention and RevOps roles (a labor-arbitrage, contract-quoted business with a private rate card), plus a finance-systems build-out (Head of Finance Systems & Automation owning the NetSuite order-to-cash / billing lifecycle) — all framed around deploying internal AI agents on top of these vendor systems, not replacing them with a home-grown billing platform.
Signals reviewed · derived from public job posts, engineering blogs & filings
- · In June 2025 Meta paid about 14.3 billion dollars for a roughly 49% non-voting stake in Scale (valuing it at 29B) and hired 28-year-old founder-CEO Alexandr Wang to lead its superintelligence team — without taking a board vote.
- · The deal backfired commercially: rival labs including Google (which had reportedly spent about 150M with Scale in 2024) and OpenAI pulled back, wary of feeding training-data signals to a now Meta-aligned vendor.
- · Scale's economics are a labor-arbitrage business — it marks up the work of 240,000+ contractors on its Outlier and Remotasks platforms for a reported 50%+ gross margin, which is why the rate card is private.
Questions & answers
- How does Scale AI's pricing work?
- Scale AI prices its core data business by enterprise sales quote, not a published rate card. Engagements are contract-based, typically priced per labeled task / per annotation / per data unit, often as a committed annual data-engine deal with volume discounts. The only self-serve entry point is a free trial (first 1,000 labeling units and first 10,000 images at no cost); beyond that you book a demo for a quote.
- Does Scale AI publish per-task prices?
- No. Scale does not publish an official per-task rate card for Data Engine or the GenAI Platform. Third parties cite indicative figures — roughly 2 cents per image and 6 cents per annotation for self-serve, and around 0.05 dollars per labeling unit after a free monthly allowance — but Scale itself routes pricing to 'book a demo.' Treat any per-unit number you see as an estimate, not a quote.
- Does Scale AI have a free tier?
- Scale offers a free trial rather than an ongoing free tier: the pricing page advertises the first 1,000 labeling units and the first 10,000 images at no cost. There is no perpetual free plan for production data work — sustained usage requires a paid, sales-quoted contract.
- What changed after Meta invested in Scale AI?
- In June 2025 Meta invested about 14.3 billion dollars for a roughly 49% non-voting stake, valuing Scale at 29 billion, and hired CEO Alexandr Wang to lead Meta's superintelligence team; Jason Droege became Scale's CEO. Several major customers — reportedly Google, OpenAI, and others — pulled back over data-confidentiality and competitive concerns, pushing Scale to lean harder into enterprise and public-sector contracts. Pricing remained quote-based throughout.