AI Pricing Calculator

Input tokens per requestOutput tokens per requestRequests per day

Estimate costs for GPT-5.6 Sol, GPT-5.6 Terra, GPT-5.6 Luna, GPT-5.5, GPT-5.4, GPT-5.2, GPT-4.1, o3, o4-mini, and other OpenAI models based on your token usage.

Anthropic

Input tokens per requestOutput tokens per requestRequests per day

Calculate costs for Claude Fable 5, Claude Opus 4.8, Claude Sonnet 5, Claude Sonnet 4.6, Claude Haiku 4.5, and other Anthropic models.

Google

Input tokens per requestOutput tokens per requestRequests per day

Estimate costs for Gemini 3.5 Flash, Gemini 3.1 Pro, Gemini 3.1 Flash-Lite, Gemini 3 Flash, Gemini 2.5 Pro, and other Google AI models.

DeepSeek

Input tokens per requestOutput tokens per requestRequests per day

Calculate costs for DeepSeek V4-Pro, V4-Flash, V3.2, and R1 — high-quality models at a fraction of proprietary pricing.

xAI

Input tokens per requestOutput tokens per requestRequests per day

Estimate costs for Grok 4.5, Grok 4.3, Grok 4.20 Beta, Grok 4, Grok 4.1 Fast, Grok 3 Mini, and other xAI models based on your token usage.

Mistral

Input tokens per requestOutput tokens per requestRequests per day

Calculate costs for Mistral Large 3, Mistral Medium 3.5, Mistral Small 4, and other Mistral AI models.

AI-Powered Applications

Use-case pricing

Estimate costs based on your actual business metrics — support tickets, audio minutes, developer seats, and more. No need to think in tokens.

ElevenLabs

Characters to synthesizeConversational AI minutes

Estimate your monthly ElevenLabs cost based on text-to-speech character volume and conversational AI usage.

Gorgias AI

Monthly support ticketsAI automation rate

Calculate your Gorgias cost based on ticket volume and AI automation rate.

Cursor

Number of developersPremium model requests per dev/day

Estimate your Cursor AI code editor costs based on team size and premium model usage.

Vercel

Function invocations per monthAverage execution timeAverage memory allocation

Estimate your Vercel serverless function costs based on invocations, execution time, and memory usage.

Abacus.AI

Monthly credits usedTeam seats

Per-seat subscription (ChatLLM) + sales-led enterprise platform

AssemblyAI

Async audio transcribedAsync transcription modelReal-time streaming audioStreaming model

Pure usage-based — pay per hour of audio processed; add-on fees for Speech Understanding features and per-token LLM Gateway calls

Augment Code

AI tasks per monthModel routed per taskDeveloper seats

Hybrid (per-seat plan + pooled usage credits)

Bland AI

Connected call minutesOutbound call attemptsHuman-transfer minutes (Bland numbers)SMS messages

Hybrid: tiered monthly subscription (Start free / Build $299 / Scale $499) + per-minute usage billing; Enterprise custom

Browserbase

Browser hoursSearch API requestsFetch API callsProxy bandwidthConcurrent browsers

Freemium + hybrid (flat plan fee + usage on browser-hours, Search/Fetch calls, proxies, and model tokens)

Cartesia

TTS minutes / monthVoice-agent call minutes / monthTelephony minutes (Cartesia number) / month

Freemium credit-based subscription + usage overages + enterprise commitments

Cerebras

Input tokens / moOutput tokens / moModel — input rateModel — output rate

Usage-based per-token inference API ($5-credit Free Trial, $10 self-serve Developer, and Enterprise tiers) plus fixed-price Cerebras Code coding subscriptions; hardware systems on custom enterprise contracts

Clay

Data Credits / moActions / mo

Hybrid (Actions capacity tier + Data Credits usage pool)

Cohere

Input tokens (Command)Output tokens (Command)Embed tokensRerank queries

Pure usage-based (per-token for generation/embeddings, per-query for reranking); enterprise private-deployment on custom contract

Comet

Users / seatsSpans / month (Opik)Training hours / month (MLOps)Data storage (MLOps)

Freemium + seat (Opik flat per-account; MLOps per-user with training-hour/storage usage)

Deepgram

Speech-to-Text minutesSpeech-to-Text modelText-to-Speech charactersText-to-Speech modelVoice Agent minutesVoice Agent tier

Pure usage (per-minute / per-character / per-token) with prepaid Growth credits

DeepInfra

Input tokens / moLLM input rateOutput tokens / moLLM output rateDedicated GPU-hours / moGPU type

Pure-usage per-token inference + per-hour GPU instances + reserved multi-year GPU clusters + startup credits

Descript

Media minutes / monthAI credits / monthEditor seats

Hybrid (per-seat + metered media hours & AI credits)

E2B

Sandbox run-hours / movCPUs per sandboxRAM per sandbox

Freemium Hobby tier + Pro platform fee ($150/mo) layered on per-second compute usage, with concurrency add-ons and sales-led Enterprise

Exa

Search requestsDeep Search requestsDeep Search variantContents pages crawledMonitors requestsAgent runsAgent effort mode

Pure usage (pay-as-you-go credits) with free tier and Enterprise

Fal

Video modelVideo seconds / monthImage modelImages / monthDedicated GPUGPU-hours / month

Pure usage (per-output model APIs + per-second/per-hour GPU compute)

Fathom

Seat-based freemium (free forever + per-user paid plans)

Seats (paid users)

Firecrawl

Credit-based subscription (monthly credit pools sized per tier) with auto-recharge usage overflow and a free tier

Pages scraped (credits)

Fireworks AI

Serverless input tokensServerless model size (input rate)Serverless output tokensServerless model (output rate)Dedicated GPU hoursGPU type (per-hour rate)Fine-tuning training tokensFine-tuning size × method (per 1M tokens)

Pure-usage per-token serverless + per-hour GPU + per-1M-token fine-tuning + sales-led enterprise

GitHub Copilot

Monthly AI Credits usageOrganization seats

Hybrid (per-seat + GitHub AI Credits usage pool)

Google

Input tokens / monthOutput tokens / monthGoogle Search grounding queries / month

Pure usage-based (pay-per-token) via Gemini API/AI Studio and Vertex AI; consumer Gemini app free with Google AI Pro ($19.99/mo) and AI Ultra (from $100/mo) subscription upsell

Groq

Input tokensModel (input rate)Output tokensModel (output rate)Audio transcriptionTranscription model (per hour)Web search requestsSearch depth (per request)Code execution

Pure-usage per-token serverless + per-hour transcription + per-1M-character text-to-speech + per-use tools + sales-led enterprise

Gumloop

Freemium + usage-tiered subscription (credit pool with overage)

Monthly credits

HeyGen

Minutes of video / monthAvatar engineAdditional team seats

Freemium credit-based subscription (Free/Creator/Pro/Business/Enterprise) plus per-seat add-ons and a Pay-As-You-Go API wallet

Ideogram

Priority credits / monthTeam seatsAPI images / month (pay-as-you-go)API model / quality

Freemium credit-based subscription (Free/Plus/Pro/Team/Enterprise) plus a per-image, pay-as-you-go generation API

Intercom

Support team seatsMonthly support conversationsFin resolution rate

Hybrid (seats + per-resolution outcome pricing)

Jasper

Seats (users)Brand Voices needed

Hybrid (per-seat subscription + credit meter on Business)

Lightning AI

GPU hours / monthGPU machine type ($/GPU/hr)Team seats (Teams plan)

Hybrid (freemium seat tiers + per-GPU-hour usage credits)

Make

Credit-metered (volume-tiered) + free tier

Credits / mo

Manus

Credits / monthTeam seats

Credit-based subscription (tiered credit allowances + Team seats)

Midjourney

Subscription (tiered GPU-hour bundles, fast/relax dual-mode)

Fast GPU hours / mo

MiniMax

Input tokensOutput tokensModel — input rateModel — output rateHailuo video clipsSpeech charactersToken Plan subscriptions

Three-surface: consumer apps (Talkie, Hailuo AI) + Token Plan subs ($20–$120/mo) + per-token API from $0.30/M

Mistral AI

Input tokensOutput tokensModel — input rateModel — output rateOCR pagesAgent tool calls (web search / code exec)Vibe seats

Two-track: Vibe assistant subscriptions ($0–$24.99/user/mo) + pure per-token API

Modal

GPU hours / monthGPU typeCPU core-hours / monthMemory GiB-hours / monthWorkspace seats

Pure-usage per-second GPU/CPU/memory + flat plan fees (Starter $0, Team $250) + sales-led enterprise

Moonshot AI

Input tokensOutput tokensModel — input rateModel — output rateKimi assistant seats

Two-track: tempo-named Kimi assistant subscriptions ($0–$199/mo) + a per-model, cache-discounted per-token API

Murf AI

Studio voice generationMurf API charactersAPI TTS modelMurf Dub — file durationDubbed languages per file

Hybrid: flat Studio subscriptions + pure-usage API per-character/per-minute

n8n

Monthly workflow executions

Execution-tiered subscription (workflow executions, not seats or steps)

Novita AI

LLM input tokens / moLLM input rateLLM output tokens / moLLM output rateGPU-hours / moGPU type & packagingImages / moImage modelVideos / moVideo modelText-to-speech characters / moTTS modelAgent-sandbox vCPU-hours / mo

Pure usage (per-token inference + per-hour GPU + per-second sandbox), free to start

Perplexity AI

Team seatsSonar API requests / monthSonar API input tokens / monthSonar API output tokens / month

Freemium subscription (individual + enterprise tiers) + usage-based API (Sonar)

Recraft

Studio credits / moTeam seatsAPI images / mo (V4 raster)

Hybrid (subscription credits for Studio + per-image usage for the API)

Reka AI

Input tokensOutput tokensModel — input rateModel — output rateReka Vision — video indexedReka Research requests

Pure usage-based: per-1M-token + per-multimodal-unit API, Research per-1k-requests, Vision per-video-minute + per-image on credits; Enterprise sold by quote

Relevance AI

Actions / monthVendor Credits / month

Hybrid (seat-tiered plans + Actions usage + pass-through Vendor Credits)

Replicate

Hardware compute timeHardware tier (per-second rate)Per-output imagesImage model (per-output rate)LLM input tokensLLM model (input rate)LLM output tokensLLM model (output rate)

Pure-usage per-second public-model + per-output image + per-token LLM + per-second dedicated GPU + sales-led enterprise

Roboflow

Credits / monthAdditional seats (beyond included)

Hybrid (seat + credit-based usage) with a freemium tier

RunPod

Pod GPU-hours / monthPod GPU type (Secure Cloud rate)Serverless worker-hours / monthServerless worker class (flex per-hr)Persistent storage (GB)Storage SKU ($/GB-month)

Pure-usage per-hour Pods + per-second Serverless + tiered storage + sales-led enterprise commits

Runway

Credits used / monthSeats (users)

Per-seat subscription with bundled monthly credits; usage-based credit API

Suno

Songs generated per month

Subscription (credit-metered tiers) + freemium

Synthesia

Tiered subscription with credit-metered video minutes; sales-led enterprise

Video minutes per month

Tavus

Conversational video minutesVideo generation minutesCustom replica trainingsConversation recording minutes

Hybrid (monthly access fee + pay-as-you-go video minutes); separate flat-tier consumer plans

tl;dv

Paid seats (people recording meetings)Meetings with AI notes / month

Per-seat subscription with a freemium free tier (optional usage-based AI on Enterprise)

Together AI

LLM input tokensChat model (input rate)LLM output tokensChat model (output rate)Generated imagesImage model (per image or per MP)Dedicated / cluster GPU hoursGPU type (per-hour rate)Code Sandbox vCPU-hoursCode Sandbox GiB-hours

Pure-usage per-token serverless + per-hour dedicated GPU + reserved capacity + sales-led enterprise

Twelve Labs

Video indexedEmbeddings stored (monthly)Search queriesPegasus video analyzedPegasus output text

Pure usage (pay-as-you-go video minutes) with a free tier and committed-use enterprise contracts

Udio

Songs generated per month

Subscription (credit-metered tiers) + freemium, with purchasable add-on credit packs

Vast.ai

GPU-hours / monthGPU type (on-demand market rate)GPUs per machine

Marketplace usage (dynamic $/hr GPU + $/GB/hr storage + $/TB bandwidth) with reserved pre-pay discounts

VEED AI

AI credits used / yearSeats (users)

Per-seat subscription (Free/Creator/Pro/Studio) with each paid plan bundling an annual AI-credit allowance that meters Gen-AI Studio video, dubbing and avatars; Enterprise is quoted.

You.com

Search API callsLivecrawl pagesContents API pagesResearch API callsResearch effortFinance Research API callsFinance Research effort

Pure usage (per-1k-call metering with effort tiers)

Zapier

Task-metered Platform tiers + separately-metered AI add-ons + free tier

Tasks per month

Zhipu AI

Input tokensOutput tokensModel — input rateModel — output rateGLM Coding Plan seats

Per-token GLM API (free Flash tiers up) plus a flat GLM Coding Plan subscription listed at $18-$160/mo