All companies
technology

turbopuffer pricing

turbopuffer.com facts checked analysis reviewed
Quick summary
Region
Product
Serverless vector and full-text search database on object storage
Industry
technology
Commits
Available (annual)
In this page
AI Summary
  • turbopuffer is a serverless vector and full-text search database built directly on object storage, priced purely on metered usage rather than nodes or provisioned memory.
  • Customers pay for three usage dimensions — writes per GB, queries per unit of data scanned, and stored data per GB-month — and pay the greater of their usage or a tier-scaled monthly minimum.
  • The three plan tiers (Launch at $64/mo, Scale at $256/mo, Enterprise at $4,096/mo or more) set the monthly minimum and the compliance and support envelope, not the base usage rates.
  • turbopuffer has no free tier; the lowest entry point is the Launch tier's $64 monthly usage minimum.
  • In February 2026 turbopuffer cut its base queried-data rate from $5 per petabyte to $1 per petabyte, an up-to-94% reduction for the largest namespaces, documented in a public pricing changelog.
  • turbopuffer powers retrieval for Cursor and Notion and reached an estimated $100M annualized run-rate in March 2026 on minimal outside capital.
Pricing summary
turbopuffer 2026 — usage pricing with a tier-scaled monthly minimum
Pure usage: every plan meters writes, queries, and stored data; the tier you pick sets the monthly minimum spend and the compliance/support envelope.
Monthly minimum
Launch
$64 /mo
Teams getting to production on multi-tenant search
+35% usage premium
Enterprise
≥$4,096 /mo
Billion-scale, single-tenant or BYOC deployments
Minimums are floors on metered usage, not platform fees — you pay the greater of your usage or the tier minimum. Captured from turbopuffer.com/pricing, 2026-06-04.

About

turbopuffer is a serverless vector and full-text search database built directly on object storage (e.g. S3) with SSD caching, designed to make large-scale retrieval an order of magnitude cheaper than memory-resident vector databases. It serves AI and search teams running retrieval-augmented generation, semantic search, and hybrid (vector + full-text) workloads, with a multi-tenant architecture purpose-built for namespaces-per-customer patterns at billion-document scale.

The company positions itself against in-memory vector databases (Pinecone, Qdrant, Weaviate) by trading a small amount of cold-start latency for a storage cost structure anchored to cheap object storage rather than RAM. Its benchmarks advertise p50 warm-query latency around 14ms at 10M documents, and it markets a path “to billion-scale” handled directly with the customer’s engineering team.

turbopuffer monetizes purely on usage — writes, queries, and stored data — rather than on nodes, pods, or provisioned memory. The plan tier a customer selects (Launch, Scale, Enterprise) changes the monthly minimum spend and the security/support envelope, but the underlying per-unit usage rates are the same on Launch and Scale; only Enterprise adds a 35% usage premium in exchange for single-tenancy, BYOC, and SLAs.

Pricing summary : serverless vector search billed per write, query, and GB-month

turbopuffer uses pure usage-based pricing: there are no seats and no platform fee — you pay for metered writes, queries, and stored data, subject to a tier-scaled monthly minimum. The three published tiers differ in their minimum spend and their compliance/support entitlements, not in their base usage rates (Launch and Scale share rates; Enterprise adds a 35% premium).

  • Storage — billed per GB-month of logical stored data. (Base per-GB rate is shown only inside the interactive cost calculator and was not exposed in the static capture — recorded as unknown rather than guessed.)
  • Writes — billed per GB written. copy_from_namespace copies bill at a 50% discount on write costs.
  • Queries — billed per unit of data queried. The base queried-data rate is $1/PB (reduced from $5/PB in February 2026), with marginal volume discounts: 80% off bytes between 32–128 GB queried and 96% off above 128 GB per query. Minimum billable data per query is 1.28 GB.
  • Namespaces — multi-tenant by design; the calculator includes an allotment (e.g. 10 namespaces) before per-namespace charges apply.
  • Pinning — pinned namespaces (introduced April 2026) bill in GB-hours instead of per-query, scaling with namespace size, replica count, and time pinned (minimums of 64 GB and 10 minutes).

What makes this different: the monthly tier price is a floor on usage, not an add-on fee — customers pay the greater of their metered usage or the tier minimum, so the model behaves like consumption usage-based billing with a commitment floor rather than a classic seat-plus-overage SaaS plan.

Pricing by product

turbopuffer ships a single product (the search database) sold across three tiers. Tiers gate compliance and support; the metered usage dimensions are common to all.

turbopuffer database (plan tiers)

TierPriceIncludedKey mechanics
Launch$64 / mo minimumAll database features; multi-tenancy; SOC 2 + GDPR-ready DPA; community Slack & emailFloor on usage; self-serve sign-up
Scale$256 / mo minimumEverything in Launch plus HIPAA-ready BAA, SSO, audit logs, private Slack channel, 8–5 support hoursSame usage rates as Launch; higher floor buys compliance
Enterprise≥ $4,096 / moSingle-tenancy + BYOC; CMEK per namespace; private networking; 24/7 support; support + 99.95% uptime SLA35% usage premium; sales-led, contact required

Usage rates (metered across all tiers)

DimensionRateNotes
Storageper GB-month (base rate gated)Base $/GB shown only in the interactive calculator; recorded as unknown
Writesper GB written (base rate gated)copy_from_namespace copies at a 50% write discount
Queries$1 / PB queried (base)80% marginal discount 32–128 GB; 96% above 128 GB; min 1.28 GB billable per query
Namespacesper namespace beyond included allotmentAllotment (e.g. 10) included in the calculator before charges
PinningGB-hours (size × replicas × time)Minimums 64 GB and 10 minutes; replaces per-query billing while pinned

Sales motions across products: PLG / self-serve sign-up for Launch and Scale; sales-led, quoted for Enterprise (single-tenancy, BYOC, SLAs).

Hidden costs : the monthly minimum floor and the per-query 1.28 GB scan minimum

turbopuffer’s pure-usage model is honest, but two mechanics quietly shape the bill: the tier minimum (you pay it even when usage is lower) and the 1.28 GB minimum billable data per query (small namespaces still scan a minimum chunk). Because the base storage and write rates live only inside the interactive calculator, the two archetypes below isolate the parts that are publicly stated — the query rate and the minimum floor.

Early-production RAG app (low traffic, under the floor)

A small team runs a single-tenant RAG index of ~3M documents with light query traffic — well under what $64 of usage would buy. They pay the Launch minimum regardless.

Line itemMonthly cost
Metered usage (writes + queries + storage)~$20 (illustrative)
Launch tier minimum floorapplies the difference
Effective bill$64 (the minimum)

The lesson: at low volume turbopuffer behaves like a $64/mo subscription — the usage-based billing only becomes the dominant cost once metered usage clears the floor.

High-QPS multi-tenant search (many small queries)

A consumer app issues a very high rate of queries against many small per-tenant namespaces. Each query bills a minimum of 1.28 GB scanned, so query cost is driven by query count, not data size.

Line itemMonthly cost
Storage (per GB-month, base rate gated)unknown — calculator-only
Writes (per GB, base rate gated)unknown — calculator-only
Queries — billed at min 1.28 GB eachdominant line at high QPS
Effective billdriven by the per-query floor

The lesson: high-QPS workloads on tiny namespaces are billed on the 1.28 GB per-query minimum, not on actual bytes scanned — so the headline $1/PB rate understates cost for query-heavy, data-light patterns. Latency-sensitive teams can switch such namespaces to pinning (GB-hours), trading the per-query floor for a time-based rate. For the storage-versus-RAM economics that make turbopuffer cheaper at rest, see our note on storage-based billing.

Want to estimate your own turbopuffer bill? Use the turbopuffer pricing calculator to model your monthly cost based on document count, write rate, query rate, and namespace count.

Pricing evolution : a public changelog of query-rate cuts and new billing modes

turbopuffer maintains a dated public pricing changelog (last updated April 3, 2026), so its rate history is fully recoverable. The through-line is a series of price cuts on queries — the opposite of the typical SaaS price-creep — plus the introduction of new billing modes (copy_from_namespace, pinning) rather than new tiers.

Cadence

QuarterPrice changesProduct / SKU additionsNotes
2024 Q301Sept 2024copy_from_namespace introduced; copies bill at a 50% discount on write costs.
2025 Q310July 2025 — query pricing for large namespaces cut up to 80% (80% marginal discount on bytes over 32 GB/query).
2026 Q120Feb 2026 — base query rate cut $5/PB → $1/PB (up to 94% off largest namespaces); Mar 2026 — multi-vector-column billing clarified.
2026 Q201April 2026 — namespace pinning introduced, billed in GB-hours with 64 GB / 10-minute minimums.

Tracked range: 2024 Q3–2026 Q2. Quarters not listed were verified stable (0 price changes, 0 SKU additions) against the public pricing changelog.

Notable changes

  • 2024-09 — Introduced copy_from_namespace, copies billed at a 50% discount on write costs (turbopuffer.com/docs/pricing-log).
  • 2025-07 — Query pricing for large namespaces cut up to 80%: 80% marginal discount on bytes queried over 32 GB per query (turbopuffer.com/docs/pricing-log).
  • 2026-02 — Base queried-data rate cut from $5/PB to $1/PB; 80% marginal discount 32–128 GB, 96% above 128 GB; minimum billable data per query raised from 256 MB to 1.28 GB (turbopuffer.com/docs/pricing-log).
  • 2026-03 — Multi-vector-column billing clarified: filterable attributes billed once per vector column for writes and storage; non-filterable attributes billed once total (turbopuffer.com/docs/pricing-log).
  • 2026-04 — Namespace pinning introduced, billed in GB-hours (size × replicas × time) with minimums of 64 GB and 10 minutes (turbopuffer.com/docs/pricing-log).

The February 2026 query-rate cut in detail

The Feb 2026 change is the most consequential: the base scan rate fell 5× ($5/PB → $1/PB), and the marginal-discount curve (80% for 32–128 GB, 96% above 128 GB per query) means the biggest namespaces saw effective reductions up to 94%. This is corroborated by the same period in which turbopuffer was scaling to a reported $100M run-rate (Sacra, March 2026) on workloads like Cursor and Notion — i.e. the cut coincided with, rather than constrained, rapid revenue growth, consistent with object-storage unit economics improving as volume grows. The simultaneous increase of the per-query minimum from 256 MB to 1.28 GB partly offsets the cut for tiny, high-QPS namespaces (see Hidden costs).

What’s unique : object-storage economics, a usage-floor tier model, and price cuts over time

Object storage instead of RAM. turbopuffer’s core differentiator is architectural: vectors and indexes live on object storage (e.g. S3) with SSD caching, not in provisioned memory. This trades a small amount of cold-start latency for a cost structure anchored to cheap storage, and it’s why the company can keep cutting query prices rather than raising them. Pricing follows architecture — there are no pods, nodes, or provisioned-memory SKUs.

The tier price is a floor, not a fee. Launch ($64), Scale ($256), and Enterprise (≥$4,096) are monthly minimums on metered usage, not platform fees stacked on top of usage. Customers pay the greater of their usage or the minimum — a commitment floor wrapped around pure consumption, rather than the seat-plus-overage shape most SaaS uses.

Compliance gates the tier, rates stay flat. Launch and Scale charge identical usage rates; the higher Scale floor buys HIPAA-ready BAA, SSO, and audit logs — not cheaper or pricier units. Only Enterprise changes the unit economics, adding a 35% usage premium in exchange for single-tenancy, BYOC, CMEK, and a 99.95% uptime SLA. This cleanly separates “what compliance do you need” from “how much do you use.”

Prices go down, publicly and on the record. Across 2024–2026 every metered change in the changelog was a cut or a discount (50% copy discount, 80% then 94% query reductions). turbopuffer publishes these in a dated pricing changelog — unusually transparent for an infrastructure vendor and a deliberate trust signal.

New billing modes, not new tiers. Rather than adding plans, turbopuffer adds meters: copy_from_namespace (50% write discount), and namespace pinning (GB-hours). The pricing surface area grows by exposing real cost levers to customers, not by fragmenting the plan grid.

Strengths & weaknesses

StrengthsWeaknesses
Pure usage with a transparent, dated pricing changelog — rare honesty for infra pricingBase storage and write rates are exposed only inside the calculator, not as static published numbers
Object-storage economics let prices fall over time (query rate cut 5× in Feb 2026)No free tier — the $64/mo Launch minimum is a hard entry cost vs. free tiers at Pinecone/Qdrant/Weaviate
Tier = compliance + minimum, with identical usage rates on Launch and Scale (no rate penalty)The 1.28 GB per-query minimum can make high-QPS, data-light workloads cost more than the $1/PB rate implies
Built-in cost levers: copy_from_namespace 50% discount and GB-hour pinningEnterprise 35% usage premium is opaque until you model real volumes through the calculator
Public cost calculator lets buyers self-estimate before talking to salesPinning, branching, and multi-vector-column billing add modeling complexity for finance teams

Billing UX : an interactive cost calculator and a public pricing changelog

  • Cost calculator — an interactive widget on the pricing page lets you set vector dimensions, attribute size, plan tier, and per-dimension volumes (storage GB, writes/WPM, queries/QPM, namespaces) to produce an estimated monthly cost (with the Launch $64/mo floor applied).
  • Plan selector — Launch / Scale / Enterprise toggle inside the calculator recomputes the estimate, surfacing the Enterprise 35% usage premium.
  • Pricing Changelog — a public, dated docs page (/docs/pricing-log, last updated April 3, 2026) records every rate change over time, including the Feb 2026 query-rate cut from $5/PB to $1/PB.
  • copy_from_namespace — a first-class API operation billed at a 50% write discount, exposing migration/copy economics directly in the product.
  • Namespace pinning — an explicit billing mode (GB-hours) customers can opt into for latency-sensitive namespaces, with documented 64 GB / 10-minute minimums.
  • Contact-sales workflow — the Enterprise path routes to a sales form (“let’s get puffin’”) advertising response within a few hours.

Strategic wins : why object-storage economics and a floor-based model landed

1. Pricing follows architecture — and the architecture gets cheaper

By building on object storage instead of RAM, turbopuffer’s unit cost falls as object-storage and caching improve, which is why it can keep cutting query prices (5× in Feb 2026) while growing revenue. Most vector databases priced on provisioned memory cannot do this without margin pain. This is a textbook case of choosing a value metric aligned to a deflating cost base — a principle we cover in choosing the right usage metric.

2. The minimum-as-floor model captures small accounts without a free tier

By making the tier price a floor on usage rather than a platform fee, turbopuffer guarantees $64+ from every account while still letting heavy users pay pure usage. It is a commitment floor that funds support and compliance without the seat math, and it sidesteps the free-tier abuse that plagues consumption infra. See our take on usage minimums and commitments.

3. The public pricing changelog is a trust weapon

Publishing every rate change on a dated /docs/pricing-log page — all of them cuts — signals to buyers that prices will not creep up after lock-in. For infrastructure that sits in a critical path, that predictability is worth more than a marginal discount, and it pre-empts the pricing-change backlash that hits vendors who change quietly.

4. Compliance is decoupled from unit price

Launch and Scale share usage rates; the higher Scale floor buys HIPAA/SSO/audit, not pricier units. Customers never feel taxed on usage for needing compliance — they pay a higher minimum, not a higher rate — which keeps the metered story clean and defensible.

Areas to improve : surface the gated rates and the per-query floor up front

1. Publish the base storage and write rates as static numbers

Today the per-GB storage and write rates live only inside the interactive calculator, so they are invisible to AI search engines and hard to cite in a budget memo. Publishing them as plain text (even as “from $X/GB”) would improve price transparency and let finance teams model without the widget.

2. Make the 1.28 GB per-query minimum prominent

High-QPS, data-light workloads are billed on the per-query floor, not the $1/PB headline — a surprise for teams that size their bill off the marquee rate. A worked “query-heavy vs data-heavy” example on the pricing page would set expectations and reduce sticker shock, in the spirit of honest bill-shock prevention.

3. Quote the Enterprise 35% premium against a worked baseline

”≥$4,096/mo + 35% usage premium” is hard to internalize without numbers. Showing a single illustrative single-tenant workload at both Scale and Enterprise rates would make the premium concrete and shorten sales cycles.

4. Offer an annual commit discount for predictable spend

turbopuffer already has a commitment floor; layering a documented annual-commit discount on top would reward predictable large accounts and formalize the commitment economics buyers expect at the Enterprise tier.

Key takeaways

  1. Align the value metric to a deflating cost base. turbopuffer prices on object-storage-backed usage, so it can cut prices as its costs fall — a structural advantage RAM-priced competitors lack.
  2. A minimum-as-floor beats a platform fee for consumption infra. Charging the greater of usage or a tier minimum captures small accounts and funds support without a free tier or seat math.
  3. Decouple compliance from unit price. Selling HIPAA/SSO/SLA as a higher minimum rather than a higher rate keeps the metered story clean and avoids penalizing regulated buyers on usage.
  4. A public, dated pricing changelog is a moat, not a liability. When every logged change is a cut, transparency becomes a reason to trust the vendor through lock-in.
  5. Watch the per-unit minimums. A 1.28 GB per-query floor means the headline $1/PB rate understates cost for query-heavy, data-light workloads — always model the minimums, not just the marquee rate.

UBP implications

  1. Object storage reframes vector-DB economics. Pricing anchored to storage instead of memory lets a usage-based vendor lower prices over time, shifting the competitive axis from features to unit cost — a pattern other usage-based infra categories will follow.
  2. The “floor, not fee” pattern is a clean substitute for free tiers. For high-cost consumption products, a tier-scaled monthly minimum monetizes small accounts and discourages abuse while preserving pure-usage economics for heavy users.
  3. Per-request minimums are the hidden lever in metered pricing. turbopuffer’s 1.28 GB per-query floor shows how a single minimum can dominate the bill for a whole workload class — every usage-based vendor should model and disclose its smallest billable unit.

Sources

Bottom line

turbopuffer is one of the cleanest examples of pricing that follows architecture: by building search on object storage rather than RAM, it can charge pure usage, keep cutting query prices, and still hit a $100M run-rate. The tier price is a floor — not a fee — and the public pricing changelog turns transparency into a trust advantage. The catch is in the minimums: no free tier, a $64/mo entry floor, and a 1.28 GB per-query scan minimum that can dominate query-heavy bills. Model the floors, not just the $1/PB headline — and compare it against the rest of the blueprint corpus.

Pricing timeline : Major events on a vertical axis

Each milestone below corresponds to a public pricing change, product launch, or material adjustment. Major events use a filled marker; minor adjustments use a faded one.

Current snapshot — usage pricing with tier-scaled minimums

Three tiers (Launch $64/mo, Scale $256/mo, Enterprise ≥$4,096/mo at a 35% usage premium) act as monthly minimums on metered usage. Queried-data base rate is $1/PB, min 1.28 GB billable per query; writes and storage metered per GB with copy_from_namespace at a 50% write discount. Storage and write base rates are exposed only inside the interactive calculator — recorded as unknown rather than guessed.

Current snapshot — usage pricing with tier-scaled minimums - Three tiers (Launch $64/mo, Scale $256/mo, Enterprise ≥$4,096/mo at a 35% usage
captured

Namespace pinning billed in GB-hours

Introduced namespace pinning, which bills pinned namespaces in GB-hours instead of per-query rates; cost scales with namespace size, replica count, and time pinned, with minimums of 64 GB and 10 minutes. Source: turbopuffer.com/docs/pricing-log (last updated April 3, 2026).

Multi-vector-column billing clarified

Filterable attributes are billed once per vector column for writes and storage (reflecting per-column ANN index cost); non-filterable attributes are billed once regardless of vector-column count. Source: turbopuffer.com/docs/pricing-log.

Base query rate cut from $5/PB to $1/PB (up to 94% off largest namespaces)

Base queried-data rate dropped from $5/PB to $1/PB; added an 80% marginal discount between 32–128 GB and 96% above 128 GB per query; minimum billable data per query rose from 256 MB to 1.28 GB. Source: turbopuffer.com/docs/pricing-log.

Query pricing for large namespaces cut by up to 80%

Introduced an 80% marginal discount on bytes queried over 32 GB per query, reducing the cost of scanning large namespaces. Source: turbopuffer.com/docs/pricing-log.

copy_from_namespace introduced at a 50% write discount

turbopuffer added the copy_from_namespace operation, allowing data to be copied between namespaces at a 50% discount on write costs — making bulk migration and branching cheaper than re-ingesting. Source: turbopuffer.com/docs/pricing-log.

Trivia
  • · turbopuffer's $1/PB base query rate is the price for scanning a full petabyte of data per query — the result of a February 2026 cut from $5/PB that lowered query costs for the largest namespaces by up to 94%.
  • · turbopuffer has no free tier at all; unlike most vector databases its cheapest entry point is a $64/month usage minimum on the Launch plan.
  • · turbopuffer reached an estimated $100M annualized run-rate in March 2026 having raised less than $1M in total primary capital, per its CEO — an unusually capital-efficient infrastructure business.

Questions & answers

Does turbopuffer have a free tier?
No. turbopuffer has no free tier — its cheapest entry point is the Launch plan's $64/month usage minimum. You pay the greater of your metered usage or that minimum.
How is turbopuffer priced?
Purely on usage across three dimensions: writes per GB, queries per unit of data scanned (base $1/PB), and stored data per GB-month. A tier-scaled monthly minimum ($64 / $256 / ≥$4,096) acts as a floor on that usage.
What's the difference between the Launch, Scale, and Enterprise tiers?
The tiers set your monthly minimum and your compliance/support envelope, not your base usage rates. Launch and Scale charge identical usage rates; Scale adds HIPAA-ready BAA, SSO, and audit logs. Enterprise adds single-tenancy, BYOC, and SLAs but applies a 35% premium on usage.
How much did turbopuffer's query pricing change in 2026?
In February 2026 the base queried-data rate dropped from $5/PB to $1/PB, with up to 94% reductions for the largest namespaces via marginal discounts (80% for 32–128 GB, 96% above 128 GB per query). The minimum billable data per query rose from 256 MB to 1.28 GB.
Does turbopuffer charge for storage and writes too?
Yes. Beyond queries, you pay per GB written and per GB-month of stored data. The exact base storage and write rates are exposed only inside turbopuffer's interactive cost calculator, not as static published numbers.
What is namespace pinning and how is it billed?
Pinning keeps a namespace warm for latency-sensitive workloads. Introduced in April 2026, pinned namespaces bill in GB-hours (scaling with size, replica count, and time pinned) instead of per-query rates, with minimums of 64 GB and 10 minutes.