Per-API-Call Pricing: Examples & Companies

What is it

Per-API-call pricing is a billing unit where customers are charged per API request, regardless of payload size or processing time. The unit is the request itself: one call to the endpoint produces one charge, whether the response is a single search result or a full page of structured data.

It is the simplest of all usage meters. There is nothing to estimate about tokens, megabytes, or seconds of compute — a developer can forecast a bill by multiplying expected request volume by a published rate. That legibility is the model’s defining strength, and it is why per-call pricing dominates categories where the unit of work maps cleanly to a single request: web search, scraping and extraction, and per-image or per-clip media processing.

The model shows up in two ways across the corpus. For pure-request APIs like SerpApi, Exa, and Tavily, the call is the product, and the entire rate card is denominated in requests (often expressed as credits that map back to calls). For broader platforms, such as image generators like Recraft and Ideogram or scraping stacks like ZenRows that also meter data by the gigabyte, per-call pricing is one line on a multi-dimensional bill, used for the discrete operations (a background removal, an upscale) that don’t fit a continuous meter.

The model’s weakness is the flip side of its simplicity: not all calls cost the same to serve. A deep-research request and a cached lookup both count as one call unless the vendor splits them into separate SKUs or tiers — which, as the worked examples below show, is exactly what the better-designed APIs do.

One "call" fans out into endpoint sub-types priced ~285× apart

How it works

The mechanic is a multiplication: bill = calls × rate. Vendors layer three things on top of that base formula:

Lever	What it does	Example from the corpus
Free allowance	A monthly call quota at $0 to seed adoption	SerpApi Free: 250 searches/mo; Tavily Free: 1,000 credits/mo; Linkup: a $20 balance that auto-refills monthly (~4,000 searches)
Per-1k quoting	Normalizes sub-cent prices into readable figures	Exa Search “$7 / 1k” (~$0.007/call); ScraperAPI’s $3-per-1,000 floor without rendering; You.com Search “$5.00 / 1k calls”
Endpoint / mode tiers	Different call types priced differently	You.com Research at $12 (lite) → $450 (exhaustive) per 1,000 calls; Exa Agent at $0.025 / $0.10 / $0.50 / $2.00 per request
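
The base formula and two of the levers in the table (free allowance, per-1k quoting) can be sketched in a few lines of Python. This is a minimal illustration, not any vendor's billing logic: the function names are invented for the example, and the 1,000-call free allowance paired with the Exa Search rate is a hypothetical figure chosen only to exercise the code.

```python
from decimal import Decimal


def per_call_rate(price_per_1k: str) -> Decimal:
    """Convert a 'per 1,000 calls' quote into a per-call rate."""
    return Decimal(price_per_1k) / Decimal(1000)


def monthly_bill(calls: int, rate_per_call: Decimal, free_calls: int = 0) -> Decimal:
    """bill = billable calls x rate, with any free allowance deducted first."""
    billable = max(calls - free_calls, 0)
    return billable * rate_per_call


# "$7 / 1k" is the Exa Search quote from the table above.
exa_search_rate = per_call_rate("7")

# Hypothetical scenario: 20,000 calls in a month with 1,000 free calls.
example_bill = monthly_bill(20_000, exa_search_rate, free_calls=1_000)
print(f"per-call rate: ${exa_search_rate}, monthly bill: ${example_bill:.2f}")
```

Decimal arithmetic is used instead of floats so that sub-cent rates multiply out without rounding noise.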

Worked example — search API at scale. A team running an agent on Linkup issues 50,000 standard searches a month. At the raw searchResults rate of $0.005 per call, that’s $250. Switch every call to sourcedAnswer (an LLM-composed answer) at $0.006 and the same volume costs $300: a 20% per-call premium that adds $50/month purely from the output type chosen. Add 10,000 deep sourcedAnswer searches at $0.055 and another $550 lands on the bill, for $850 in total. The base formula never changes; the rate does, per endpoint and per mode.
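
The arithmetic in this example can be reproduced with a small rate table keyed by mode. The rates are the ones quoted above; the dictionary keys (in particular "sourcedAnswer_deep") are labels made up for this sketch, not Linkup API identifiers.

```python
from decimal import Decimal

# Per-call rates (USD) quoted in the worked example above.
# Keys are illustrative labels, not real API parameter values.
LINKUP_RATES = {
    "searchResults": Decimal("0.005"),
    "sourcedAnswer": Decimal("0.006"),
    "sourcedAnswer_deep": Decimal("0.055"),
}


def bill_for(usage: dict[str, int]) -> Decimal:
    """Sum calls x rate over every mode in the usage mix."""
    return sum(
        (LINKUP_RATES[mode] * calls for mode, calls in usage.items()),
        Decimal("0"),
    )


raw_only = bill_for({"searchResults": 50_000})
answers_only = bill_for({"sourcedAnswer": 50_000})
with_deep = bill_for({"sourcedAnswer": 50_000, "sourcedAnswer_deep": 10_000})
print(raw_only, answers_only, with_deep)
```

Keeping the rate per mode in one table makes it easy to re-price a workload when the mix of output types changes.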

Worked example — success-based metering. SerpApi charges only for successful searches. Its Enterprise plan bundles 100,000 complimentary searches into a $3,750/month base, then meters extra calls per 1,000 by speed mode: on-demand at $7.50 (Best Effort), $15.00 (Ludicrous Speed), or $30.00 (Ludicrous Speed Max), or reserved at $2.75 / $5.50 / $11.00 per 1,000 on a commit. Blocked, errored, and CAPTCHA’d requests are not counted, so the meter tracks delivered value rather than attempted calls — a meaningful difference for scraping-adjacent workloads where failure rates are real.
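
A rough sketch of how a success-based meter with a bundled allowance might be computed is shown below. The base fee, included volume, and on-demand rates come from the paragraph above; the mode keys are invented labels, and billing the overage pro rata per call (rather than in whole blocks of 1,000) is an assumption made for the example, not something the source confirms.

```python
from decimal import Decimal

BASE_FEE = Decimal("3750")   # Enterprise base per month, from the text above
INCLUDED_SEARCHES = 100_000  # searches bundled into the base fee

# On-demand overage rates per 1,000 searches, keyed by illustrative labels.
ON_DEMAND_PER_1K = {
    "best_effort": Decimal("7.50"),
    "ludicrous_speed": Decimal("15.00"),
    "ludicrous_speed_max": Decimal("30.00"),
}


def enterprise_bill(successful: int, failed: int, mode: str) -> Decimal:
    """Bill on successful searches only; failed attempts never reach the meter.

    Assumption: overage is charged pro rata per call, not per block of 1,000.
    """
    _ = failed  # accepted for clarity, deliberately ignored by the meter
    overage = max(successful - INCLUDED_SEARCHES, 0)
    return BASE_FEE + Decimal(overage) / 1000 * ON_DEMAND_PER_1K[mode]


# Hypothetical month: 120,000 successes and 15,000 blocked or errored requests.
print(enterprise_bill(120_000, 15_000, "best_effort"))
```

In this sketch the 15,000 failed requests change nothing; only the 20,000 successful searches beyond the allowance are metered.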

Worked example — per-operation media. On Recraft, API units are prepaid at $1 = 1,000 units and each operation deducts a fixed amount: a background removal or vectorization costs $0.01 per request, a V4 raster generation $0.04, and a V4 Pro vector image $0.30. Five thousand background removals is a flat $50 — no tokens, no compute time, just calls. See the introduction to usage-based pricing guide for how this maps to value-metric selection, and the usage invoicing and billing cycles guide for how per-call counts roll into a monthly invoice.
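
The prepaid-unit mechanic can be illustrated as follows. The unit costs are derived from the dollar prices above at the stated $1 = 1,000 units exchange rate (so $0.01 becomes 10 units); the operation names are labels made up for this sketch rather than Recraft API identifiers.

```python
UNITS_PER_DOLLAR = 1_000  # "$1 = 1,000 units" from the text above

# Units deducted per request, derived from the quoted dollar prices.
# Keys are illustrative labels only.
UNIT_COST = {
    "remove_background": 10,  # $0.01
    "vectorize": 10,          # $0.01
    "v4_raster": 40,          # $0.04
    "v4_pro_vector": 300,     # $0.30
}


def units_needed(operations: dict[str, int]) -> int:
    """Total units a batch of operations would deduct from the prepaid balance."""
    return sum(UNIT_COST[name] * count for name, count in operations.items())


def to_dollars(units: int) -> float:
    """Convert a unit total back into the prepaid dollar amount."""
    return units / UNITS_PER_DOLLAR


batch = {"remove_background": 5_000}
print(units_needed(batch), to_dollars(units_needed(batch)))
```

Working in integer units and converting to dollars only at the end mirrors how a prepaid balance is typically decremented.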

Companies using this

The 53 companies below all list api-calls as a billing unit in their blueprint entry — verified against each vendor’s official pricing page. The category clusters tightly into three groups: search APIs, scraping/extraction APIs, and per-image or per-clip media generation, with a long tail of inference platforms (OpenAI, Anthropic, Google, DeepSeek) that expose a per-request meter alongside tokens.

Patterns observed

Per-call pricing is not one model but a family of them. The 53 companies split into a handful of recurring designs, each identifiable by a named vendor in the corpus.

Search APIs are the purest expression of the model. SerpApi, Tavily, Exa, Linkup, You.com, and Jina AI all price the search request as the atomic unit. Whether the vendor denominates it in credits (Tavily at $0.008/credit pay-as-you-go), quotes per 1,000 calls (You.com at $5.00), or sells monthly search allotments (SerpApi), the underlying meter is the call.
Per-1k quoting is near-universal for sub-cent prices. Because a single search or scrape often costs a fraction of a cent, vendors quote per 1,000 to stay legible. Exa’s “$7 / 1k” (~$0.007 each), You.com’s Contents API at “$1.00 / 1k pages,” and ScraperAPI’s floor of “$3 per 1,000 requests without rendering, $7 with” are the canonical examples. The model stays purely usage-based; only the display unit is scaled.
The better APIs split one “call” into priced sub-types. Rather than charging one flat rate, Exa prices Search ($7/1k), Deep Search ($12/1k), Contents ($1/1k pages), and Monitors ($15/1k) separately; You.com tiers its Research API by effort from $12 to $450 per 1,000 calls; and Tavily publishes a per-endpoint credit table where different calls consume different credit counts. This preserves the simplicity of per-call billing while letting price track the real cost of serving each request type.
Success-based metering is a credible variant. SerpApi and ZenRows bill only on successful results — ZenRows charges a CPM on successful Scraper API responses (from $0.10 per 1,000 basic requests up to $2.50 with JS rendering plus premium proxies), and SerpApi excludes blocked, errored, and CAPTCHA’d searches. For any API with a meaningful failure rate, charging per delivered result rather than per attempted call aligns the meter with value.
Media generators converge on a penny for utility calls. Recraft prices background removal and vectorization at $0.01 per request while Ideogram prices Remove Background and Describe at $0.01 per input — even as their generative image calls span $0.022 to $0.30. The utility operation is cheap and standardized; the generative one carries the model premium. PhotoRoom sits just above the pack at $0.02 per background-removal image, with generative Image Editing at $0.10.
The multiplier is where headline capacity lies. ScraperAPI advertises “100,000 API credits for $49,” but a plain request costs 1 credit while a JS-rendered page costs 10 and an ultra-premium render 75 — so the real per-scrape cost lands near $0.0049 the moment rendering turns on. ZenRows applies the same idea with ×5 (JS), ×10 (premium proxies), and ×25 (both) multipliers on a shared balance. The credit count on the pricing page is a headline, not a scrape count.
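
The credit-multiplier pattern is easiest to see by converting a headline credit count into an effective per-request cost and a real request capacity. The sketch below uses the ScraperAPI figures quoted above ($49 for 100,000 credits; 1, 10, and 75 credits per request); the request-type keys are illustrative labels, and the calculation assumes an entire plan is spent on a single request type.

```python
from decimal import Decimal

PLAN_PRICE = Decimal("49")  # USD, headline plan price from the text above
PLAN_CREDITS = 100_000      # headline credit count

# Credits consumed per request type (illustrative labels).
CREDITS_PER_REQUEST = {"plain": 1, "js_render": 10, "ultra_premium": 75}


def cost_per_request(kind: str) -> Decimal:
    """Effective dollar cost of one request of the given type."""
    return PLAN_PRICE / PLAN_CREDITS * CREDITS_PER_REQUEST[kind]


def requests_in_plan(kind: str) -> int:
    """How many requests of one type the plan's credits actually cover."""
    return PLAN_CREDITS // CREDITS_PER_REQUEST[kind]


for kind in CREDITS_PER_REQUEST:
    print(kind, cost_per_request(kind), requests_in_plan(kind))
```

The same two functions work for any shared-balance scheme, such as the ZenRows multipliers, once the multiplier table is swapped in.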

Counterexamples & variants

The cleanest counterexample sits inside the model itself. Linkup’s Research endpoint shows where flat per-call pricing breaks down: a single Research call ranges from $0.25 (S) to $2.50 (XL) depending on reasoningDepth, a 10x spread on what is nominally “one request” — a single XL Research call costs as much as 500 standard searches. You.com has the same shape: its Research API runs from $12 per 1,000 calls (lite) to $450 (exhaustive), with a Contact-Sales Frontier tier listed above $2,000 per 1,000. When the work behind a request varies by two orders of magnitude, “per call” stops being a useful budgeting unit and the vendor has to expose an explicit effort ladder.

Worse is when the range hides in the docs. Linkup’s own marketing page advertises a Deep-search range of “$0.05–$0.55” while the billing docs quote $0.05–$0.055: a 10x discrepancy at the top of the same tier’s range, which undermines the simplicity the rest of the model works hard to achieve. A per-call price is only as trustworthy as the consistency between the pricing page and the invoice.

The variant worth calling out is credits as a per-call proxy. Diffbot, Firecrawl, Tavily, and Recraft all expose a credit currency rather than a raw per-call price, where each endpoint or model deducts a different number of units. This lets a vendor keep a single legible unit on the pricing page while charging more for expensive calls — a deliberate softening of pure per-call pricing that trades transparency for flexibility. ZenRows goes further still, layering a CPM on successful scraper calls plus a separate per-GB charge for proxy and browser data plus a $0.09 per-session-hour fee, so the request is only one of several meters.

Finally, the large inference platforms — OpenAI, Anthropic, Google, DeepSeek, Cohere, and Mistral AI — list api-calls as a billing unit but are not really per-call businesses. Their economics run on tokens; the request count is incidental. They appear in this list because the corpus records the meter, not because per-call pricing drives their bill. For those vendors, the token meter is the one that matters.

What this means for buyers vs vendors

For buyers

Per-call pricing is the easiest model to forecast — until it isn’t. For fixed-cost endpoints (a SerpApi search, a Recraft background removal at $0.01, a PhotoRoom background removal at $0.02), multiply your expected request volume by the published rate and you have a budget. The trap is the variable-cost endpoint hiding inside an otherwise flat rate card: Linkup’s Research at up to $2.50/request, You.com’s Research at up to $450 per 1,000 calls. Before committing, map which endpoints your workload actually hits and price the expensive ones explicitly rather than off the headline rate.

Two second-order costs deserve scrutiny. First, watch for failure billing: on SerpApi and ZenRows, high error rates won’t inflate your bill because they meter on success, but on APIs that charge per attempted call, a target site that fails half the time doubles your cost per delivered result. Second, watch for credit multipliers: ScraperAPI’s “100,000 credits” is only 10,000 scrapes once JS rendering (×10) is on, and a 3,000,000-credit plan yields only ~120,000 Google SERP pulls at ×25. Read the multiplier table before you trust the headline.
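
The failure-billing point can be made concrete with a short calculation of cost per delivered result. This is a simplified model under stated assumptions: every attempt is billed at the same rate, failed requests are retried until they succeed, and the success rate is a long-run average. The function name and the $0.005 rate in the usage lines are hypothetical.

```python
from decimal import Decimal


def cost_per_delivered_result(
    rate_per_call: Decimal, success_rate: Decimal, bills_on_success: bool
) -> Decimal:
    """Average spend needed to obtain one successful response."""
    if not (Decimal("0") < success_rate <= Decimal("1")):
        raise ValueError("success_rate must be in (0, 1]")
    if bills_on_success:
        # Failed attempts are free, so each result costs exactly one rate.
        return rate_per_call
    # Every attempt is billed, and 1 / success_rate attempts are needed on average.
    return rate_per_call / success_rate


rate = Decimal("0.005")  # hypothetical per-call rate
print(cost_per_delivered_result(rate, Decimal("0.5"), bills_on_success=True))
print(cost_per_delivered_result(rate, Decimal("0.5"), bills_on_success=False))
```

At a 50% success rate the per-attempt meter costs twice as much per delivered result as the success-based one, which is the doubling described above.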

Model your scenario with the pricing calculator and read the choosing the right usage metric guide to sanity-check that the vendor’s unit matches your value. If your workload concentrates on one endpoint at steady volume, per-call pricing is likely to be the most predictable model you can buy.

For vendors

Per-call pricing buys you legibility, which converts: a developer who can estimate a bill in ten seconds signs up faster. But the unit only works if your cost-to-serve is roughly uniform per call. The moment one endpoint costs 100x another, flat per-call pricing either loses you money on the expensive calls or overcharges on the cheap ones.
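
A toy margin calculation shows why a 100x cost gap breaks a single flat rate. All numbers here are hypothetical and chosen only to mirror the 100x gap described above; the endpoint names are invented for the sketch.

```python
from decimal import Decimal

# Hypothetical cost-to-serve per call (USD), 100x apart.
COST_TO_SERVE = {
    "cached_lookup": Decimal("0.001"),
    "deep_research": Decimal("0.100"),
}

FLAT_PRICE = Decimal("0.010")  # hypothetical single per-call price


def margin_per_call(price: Decimal, endpoint: str) -> Decimal:
    """Price minus cost-to-serve for one call to the given endpoint."""
    return price - COST_TO_SERVE[endpoint]


for endpoint in COST_TO_SERVE:
    print(endpoint, margin_per_call(FLAT_PRICE, endpoint))
```

With these figures the flat price is ten times the cost of the cheap call while losing money on every expensive one, which is the squeeze that per-endpoint rate cards are designed to avoid.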

The corpus’s best designs solve this without abandoning the model: split calls into priced sub-types (Exa’s per-endpoint rate card, You.com’s effort-tiered Research ladder), quote per-1k to keep sub-cent prices readable, and offer success-based metering where failure rates are real so buyers trust the meter. If your endpoints vary wildly, a credit currency (Diffbot, Firecrawl, Tavily) lets you keep one unit on the page while charging differentially underneath — at the cost of the transparency that made per-call attractive in the first place.

One discipline matters more than any pricing lever: keep the pricing page and billing docs in lockstep. The Linkup marketing-vs-docs gap flagged in the counterexamples above is a 10x discrepancy that erodes trust in an otherwise clean model — the meter is only as credible as the consistency between the quote and the invoice. See usage invoicing and billing cycles for how to roll per-call counts into a clean monthly invoice, and choosing the right usage metric for whether the request is really your value metric or just the easiest thing to count.

Company	Product	Pricing model	Billing units	Free tier	Verified
01.AI	Yi open-weight models + Yi API + enterprise vertical solutions	pure-usage freemium	tokens api-calls	Yes	2026-06-11
AI21 Labs	Jamba foundation models, Maestro orchestration & enterprise AI	pure-usage freemium	tokens api-calls	Yes	2026-06-11
Anthropic	Claude API (token-based) + Claude.ai consumer subscriptions (Free/Pro/Team/Enterprise)	freemium subscription seat-based	tokens seats api-calls	Yes	2026-07-06
AssemblyAI	Speech-to-Text & Audio AI APIs	pure-usage	api-calls tokens	Yes	2026-07-06
Baichuan AI	Baichuan & medical M-series LLM APIs	pure-usage freemium	tokens api-calls	Yes	2026-06-11
Bland AI	AI phone call automation platform — inbound and outbound voice agents at scale	hybrid pure-usage subscription	api-calls credits media-minutes	Yes	2026-05-29
Browserbase	Browser-agent infrastructure: headless browser sessions, web Search/Fetch APIs, agent identity, runtime, and a model gateway behind one API key	freemium hybrid pure-usage	browser-hours api-calls requests	Yes	2026-06-02
Cartesia	Real-time voice AI platform (Sonic TTS, voice cloning, voice agents)	freemium subscription hybrid	credits requests api-calls	Yes	2026-05-29
Cerebras	Wafer-scale AI inference cloud and WSE hardware systems	pure-usage subscription commitment	tokens api-calls gpu-hours	Yes	2026-05-30
Chroma	Open-source vector database + Chroma Cloud	pure-usage freemium	storage-gb bandwidth-gb api-calls	Yes	2026-06-09
Clipdrop	AI image-editing and generation tools (background removal, upscaling, text-to-image), now part of Jasper	freemium subscription	requests credits api-calls	Yes	2026-06-05
Cohere	Command, Embed, Rerank APIs	pure-usage	tokens api-calls requests	Yes	2026-05-29
Composio	Tool-calling and integration infrastructure that connects AI agents to 1,000+ apps with managed auth and tool execution	hybrid freemium	api-calls	Yes	2026-06-10
Daily	Real-time voice and video WebRTC APIs (Video SDK + Pipecat Cloud)	pure-usage	media-minutes api-calls	Yes	2026-07-14
Deepgram	Usage-based speech-to-text, text-to-speech, and voice agent APIs	pure-usage freemium	media-minutes tokens credits	Yes	2026-05-31
DeepSeek	DeepSeek API (V4-Flash + V4-Pro models, 1M context) with token-based pricing and aggressive cache discounts	freemium pure-usage	tokens api-calls	Yes	2026-06-05
Diffbot	Web-extraction APIs (Extract, Crawl, Natural Language) plus a Knowledge Graph, metered on monthly credits	hybrid freemium	credits api-calls	Yes	2026-06-04
Exa	AI web search API for agents — search, contents, deep research, and monitoring endpoints billed per request	pure-usage freemium	requests credits api-calls	Yes	2026-07-14
Firecrawl	Web-scraping and data-extraction API for AI agents — scrape, crawl, map, search, and extract pages into clean markdown/JSON	subscription hybrid freemium	credits pages-rendered api-calls	Yes	2026-06-30
Freepik	AI creative suite — image, video, audio generation plus a 200M+ stock library	subscription hybrid pure-usage	seats credits api-calls	Yes	2026-06-05
Google	Gemini API & AI Studio	pure-usage freemium	tokens requests api-calls	Yes	2026-07-14
Groq	GroqCloud — LPU-based ultra-low-latency inference API for Llama, GPT-OSS, Qwen, Whisper transcription, and Orpheus text-to-speech	pure-usage hybrid commitment	tokens requests api-calls	Yes	2026-07-14
Hedra	AI video, avatar, image, and audio generation platform (Hedra Studio + API)	subscription freemium	credits media-minutes characters	Yes	2026-06-04
HeyGen	AI avatar and video generation platform	subscription freemium	credits seats api-calls	Yes	2026-05-30
Hume AI	Empathic Voice Interface (EVI) + Octave TTS + expression-measurement APIs	hybrid freemium	media-minutes characters api-calls	Yes	2026-06-30
Ideogram	Text-aware AI image generation platform	freemium subscription hybrid	credits api-calls	Yes	2026-06-15
Jina AI	Search Foundation API (Embeddings, Reranker, Reader, DeepSearch, Classifier)	pure-usage freemium	tokens requests api-calls	Yes	2026-06-03
Linkup	Web search API for AI agents — Search, Fetch, and async Research endpoints with grounded, structured results	pure-usage freemium	requests credits api-calls	Yes	2026-07-14
Mistral AI	Open and commercial LLM APIs	pure-usage freemium	tokens seats api-calls	Yes	2026-07-06
Moonshot AI	Kimi assistant + Kimi/Moonshot open-weight LLM API	pure-usage freemium	tokens seats api-calls	Yes	2026-06-11
Novita AI	Pay-as-you-go AI cloud: 200+ model inference APIs, on-demand GPUs, and per-second agent sandboxes under one API	pure-usage freemium	tokens gpu-hours cpu-hours	Yes	2026-07-06
OpenAI	ChatGPT consumer subscriptions + GPT-5.x API with token-based usage billing	freemium subscription seat-based	tokens seats api-calls	Yes	2026-06-30
OpenMeter	Open-source usage metering and billing platform for AI, agentic, and developer tools	freemium	events api-calls	Yes	2026-06-03
Outreach	AI Agent Platform for revenue teams — sales execution, deal management, conversation intelligence and forecasting for AEs, sales leaders and RevOps	hybrid seat-based	seats credits api-calls	No	2026-07-06
Patronus AI	LLM and AI agent evaluation, monitoring, and guardrail platform	freemium pure-usage	api-calls credits	Yes	2026-06-04
Perplexity AI	AI-native answer engine with citations and multi-model search	freemium subscription seat-based	seats tokens requests	Yes	2026-05-29
PhotoRoom	AI image-editing app and per-image Image Editing / Remove Background API for e-commerce product visuals	subscription pure-usage freemium	api-calls credits seats	Yes	2026-06-05
Playground	AI image generation and graphic-design studio with a monthly credit pool	freemium subscription hybrid	credits api-calls	Yes	2026-06-04
PlayHT	Text-to-speech & voice cloning API (PlayAI)	subscription freemium pure-usage	characters words api-calls	Yes	2026-06-09
Recraft	AI image and vector generation studio plus a per-image generation API	freemium subscription hybrid	credits api-calls seats	Yes	2026-07-14
Reka AI	Natively multimodal models (Spark, Edge, Flash, Core) + Research & Vision APIs	pure-usage freemium	tokens api-calls requests	Yes	2026-06-11
Resemble AI	AI deepfake detection & watermarking + voice generation APIs	pure-usage	credits media-minutes seats	No	2026-07-14
Rev AI	Pay-as-you-go speech-to-text, transcription, and audio-intelligence APIs	pure-usage freemium	media-minutes credits api-calls	Yes	2026-06-04
Rows	Rows AI spreadsheet	subscription hybrid	seats tasks api-calls	Yes	2026-06-08
Sarvam AI	Sovereign Indic LLM, speech & translation APIs	pure-usage freemium	tokens characters media-minutes	Yes	2026-06-11
ScraperAPI	Web scraping API that handles proxies, browsers, and CAPTCHAs behind a single endpoint	subscription pure-usage	credits requests api-calls	No	2026-06-04
SerpApi	Real-time search-results API (Google, Bing, and other engines)	subscription pure-usage	api-calls requests	Yes	2026-06-04
Tavily	Tavily Search API	pure-usage freemium	credits api-calls requests	Yes	2026-06-03
Upstash	Upstash (Redis, Vector, QStash, Search, Workflow)	pure-usage freemium hybrid	requests api-calls vectors-indexed	Yes	2026-07-14
Weaviate	AI-native vector database (open-source core + Weaviate Cloud managed serverless, dedicated/Enterprise Cloud, BYOC)	pure-usage hybrid commitment	vectors-indexed tokens api-calls	Yes	2026-07-06
xAI	Grok API and agentic AI stack	pure-usage freemium	tokens api-calls seats	Yes	2026-07-14
You.com	Web search, contents, research, and finance-research APIs for AI systems	pure-usage freemium	api-calls requests pages-rendered	Yes	2026-06-01
ZenRows	Universal Scraper API, Scraping Browser, and Residential Proxies	hybrid subscription pure-usage	requests api-calls bandwidth-gb	Yes	2026-06-04
Zhipu AI	GLM foundation models, per-token API, and GLM Coding Plan	pure-usage freemium subscription	tokens api-calls seats	Yes	2026-06-11

FAQ

What is per-API-call pricing?

Per-API-call pricing is a billing unit where the customer is charged per API request, regardless of payload size or processing time. One request triggers one charge, usually quoted per call or per 1,000 calls.

How is per-API-call pricing different from token-based pricing?

Token pricing meters the volume of text processed inside a request, so two calls to the same endpoint can differ widely in cost. Per-call pricing charges a set amount per request no matter how large the input or output is, which makes it simpler to forecast but a looser fit to the vendor's actual serving cost.

Do vendors charge for failed API calls?

It varies. SerpApi and ZenRows bill only on successful results and do not count blocked, errored, or CAPTCHA'd requests. Many other request-metered APIs charge for every call that hits the endpoint, so failure handling is a real cost line for high-error workloads like scraping.

Which companies use per-API-call pricing?

It dominates the search-API category (SerpApi, Tavily, Exa, Linkup, You.com, Jina AI), web scraping and extraction (Firecrawl, ScraperAPI, ZenRows, Diffbot), and per-image media generation (PhotoRoom, Recraft, Ideogram, HeyGen). Across the corpus, 53 companies list api-calls as a billing unit.

Why do vendors quote prices per 1,000 calls instead of per call?

Because the per-call price is often a fraction of a cent. Quoting '$7 per 1,000 requests' instead of '$0.007 each' keeps the model purely usage-based while staying readable, and it makes volume tiers easy to compare on a pricing page.

Is per-API-call pricing the same as credit-based pricing?

Often they are the same meter wearing different clothes. Vendors like Tavily, Firecrawl, and Diffbot expose a credit currency where each endpoint deducts a set number of credits, which lets an expensive call cost more while keeping one legible unit on the pricing page.
