All companies
technology

RunPod pricing

runpod.io facts checked analysis reviewed
Quick summary
Product
GPU cloud marketplace — Secure Cloud and Community Cloud Pods, Serverless endpoints, and persistent storage
Industry
technology
Commits
Available (annual)
In this page
AI Summary
  • RunPod runs a dual-cloud GPU marketplace: Secure Cloud (enterprise-grade DCs, redundant infrastructure) and Community Cloud (partner DCs, lower cost with reduced reliability). Secure Cloud per-hour Pods: RTX 4090 $0.69, A40 $0.44, A100 PCIe $1.39, A100 SXM $1.49, H100 PCIe $2.89, H100 NVL $3.19, H200 $4.39, B200 $5.89.
  • Serverless GPU per-hour worker pricing from $0.58 to $8.64/hour depending on GPU type — billing is per-second of active execution. Pods are persistent rentals; Serverless scales to zero with cold-start billing only for active inference time.
  • Storage SKUs are tiered: Container Disk at $0.10/GB-month; Volume Disk at $0.10/GB-month (running) or $0.20/GB-month (idle); Network Storage Standard at $0.07/GB-month under 1TB and $0.05/GB-month over 1TB; Network Storage High-Performance at $0.14/GB-month.
  • Marketing claim: 'up to 80% less than hyperscalers' for GPU compute — substantiated by published H100 rate of $2.89/hour versus AWS p5.48xlarge equivalents that effectively price H100 at $3–$5/hour after sustained-use discounts.
  • No published free tier — credit-card-on-file required for evaluation. Startup credit programs exist but are not publicly documented; Enterprise tier offers volume discounts, dedicated capacity, and committed contracts.
  • Founded 2022 by Zhen Lu and Pardeep Singh (ex-crypto mining infrastructure). Series A reportedly closed in 2024 (specific size undisclosed) with focus on scaling Secure Cloud capacity and Serverless reliability.
Pricing summary
RunPod 2026 — Dual-cloud Pods + Serverless GPU marketplace
Secure Cloud (enterprise DCs) vs Community Cloud (lower cost); H100 $2.89/hr, Serverless from $0.58/hr
Community Cloud
~20–40% lower than Secure Cloud
Hobbyists, students, cost-sensitive workloads
Annual commit
Enterprise
Custom
Sustained high-volume workloads
GPU Pods (Secure Cloud)
From $0.27 /hr (RTX A5000)
Persistent per-hour GPU rentals
Serverless GPU + Storage
From $0.58 /hr (Serverless)
Per-second serverless inference
Dual-cloud architecture lets customers pick Secure (enterprise DCs) vs Community (partner DCs) per workload. Storage has five distinct SKUs requiring aggregate forecasting. No published free tier — credit card required.

About

RunPod is a Las Vegas-based GPU cloud company founded in May 2022 by Zhen Lu and Pardeep Singh, both with backgrounds in cryptocurrency-mining infrastructure operations. The pivot from crypto mining to AI inference was a 2022–2023 industry-wide transition; RunPod’s founders parlayed existing GPU hardware relationships into a managed cloud product. The company runs a dual-cloud architecture: Secure Cloud uses RunPod-operated enterprise-grade data centers with redundant power, networking, and SLA backing; Community Cloud uses partner-operated data centers at lower cost with reduced reliability guarantees. Customers pick per-workload based on whether they want production reliability or rock-bottom cost.

By 2026 RunPod serves a mix of hobbyist developers running Stable Diffusion on RTX 4090s ($0.69/hour), academic researchers training models on A100 clusters, AI-native startups building production inference on H100 / H200, and mid-market enterprises running mixed Pods + Serverless workloads. The company reportedly closed a Series A in 2024 (specific size undisclosed) focused on scaling Secure Cloud capacity and Serverless reliability. The “up to 80% less than hyperscalers” marketing claim holds at the low end of the GPU ladder (RTX 4090, A40) but compresses to roughly 30–50% savings at H100 / B200 frontier rates.

RunPod competes with Modal, Replicate, Baseten (for AI inference), Together AI (for clusters), Lambda Labs, CoreWeave, and serverless platforms generally. Its differentiation is the explicit dual-cloud price-reliability split, the largest published GPU type range (RTX 4090 hobbyist GPU through frontier B200 in a single rate card), and aggressive low-end pricing that captures hobbyist and student workloads that competitors price out of reach.


Pricing summary : How RunPod’s dual-cloud + Serverless + tiered storage stack works

RunPod runs two parallel deployment modes on a shared rate card. Pods are persistent per-hour GPU rentals — customers spin up a container, work on it for hours or days, and pay for the wall-clock time the Pod is running. Serverless endpoints are per-second-billed inference workers that scale to zero between requests — customers deploy a Docker image and RunPod handles autoscaling, cold-start optimization, and request routing. Both modes share the same GPU types but differ in billing granularity (per-hour for Pods, per-second for Serverless).

The dual-cloud split applies to Pods only: Secure Cloud runs on RunPod-operated enterprise data centers with SLA backing; Community Cloud runs on partner DCs at typically 20–40% lower cost with no SLA. Storage is split across five SKUs — Container Disk ($0.10/GB-month), Volume Disk ($0.10 running / $0.20 idle), Network Storage Standard ($0.07 under 1TB / $0.05 over 1TB), and Network Storage High-Performance ($0.14) — which finance teams must aggregate to forecast total storage spend. There is no published free tier; Enterprise commits unlock volume discounts and dedicated capacity.

This multi-SKU rate-card design gives customers granular control over price-reliability and persistence-vs-serverless trade-offs. The breadth of GPU types (RTX 4090 through B200 in one rate card) is also unusual — most AI infrastructure platforms commit to either workstation/consumer GPUs or data-center GPUs but not both.

What makes this different: Explicit Secure-vs-Community Cloud pricing makes the reliability trade-off legible — customers see that they’re trading SLA for cost rather than discovering the gap after deployment. This transparent reliability tiering is unusual in cloud infrastructure and reflects RunPod’s marketplace heritage rather than a hyperscaler-style “premium-by-default” architecture.


Pricing by product

Secure Cloud Pods (per-hour persistent)

GPUVRAMPer-hour rate
RTX A500024 GB$0.27
L424 GB$0.39
A4048 GB$0.44
RTX 309024 GB$0.46
RTX A600048 GB$0.49
RTX 409024 GB$0.69
RTX 6000 Ada48 GB$0.77
L40S48 GB$0.86
L4048 GB$0.99
RTX 509032 GB$0.99
A100 PCIe80 GB$1.39
A100 SXM80 GB$1.49
RTX Pro 600096 GB$2.09
H100 PCIe80 GB$2.89
H100 NVL94 GB$3.19
H100 SXM80 GB$3.29
H200141 GB$4.39
B200180 GB$5.89

The Secure Cloud ladder runs from RTX A5000 at $0.27/hr through B200 Blackwell at $5.89/hr. RTX 4090 ($0.69/hr) remains the popular hobbyist anchor, but A5000, L4, A40, and RTX 3090 sit below it.

Serverless GPU workers (flex, per-hour equivalent)

Serverless bills per-second; rates below are the published flex per-hour equivalents. Active (always-on) workers are billed at lower per-hour rates.

Worker classVRAMFlex per-hour
A4000 / A4500 / RTX 4000 / RTX 200016 GB$0.58
L4 / A5000 / 309024 GB$0.69
409024 GB$1.10
A6000 / A4048 GB$1.22
509032 GB$1.58
L40 / L40S / 6000 Ada48 GB$1.90
A10080 GB$2.72
RTX 6000 Pro96 GB$4.00
H10080 GB$4.18
H200141 GB$5.58
B200180 GB$8.64

Storage SKUs

SKURateNotes
Container Disk$0.10/GB-monthPod-attached scratch disk
Volume Disk (running)$0.10/GB-monthPersistent volume while Pod active
Volume Disk (idle)$0.20/GB-monthPersistent volume while Pod stopped
Network Storage Standard <1TB$0.07/GB-monthCross-Pod shared, smaller workloads
Network Storage Standard >1TB$0.05/GB-monthCross-Pod shared, larger workloads
Network Storage High-Performance$0.14/GB-monthLower-latency I/O for training

Community Cloud (Pods only)

AspectDifference from Secure Cloud
Rate~20–40% lower per hour
InfrastructurePartner-operated DCs
ReliabilityNo SLA; reduced redundancy
Use caseHobbyist, student, cost-sensitive eval

Sales motions across products: PLG / self-serve for Pods (both clouds), Serverless, and storage; sales-led for Enterprise annual commits and dedicated capacity reservations. All prices accessed 2026-05-30 from runpod.io/pricing and docs.runpod.io.


Hidden costs : What RunPod customers actually pay beyond the per-hour rate

Archetype A: Hobbyist running Stable Diffusion on RTX 4090 Community Cloud

A hobbyist developer running Stable Diffusion daily for ~2 hours on RTX 4090 Community Cloud (estimated $0.40/hour vs $0.69 Secure):

Line itemMonthly cost
RTX 4090 Community Cloud (2h/day × 30 × $0.40)$24
Container disk (20 GB × $0.10)$2
Volume disk for model weights (50 GB × $0.10 running ratio)$3
Estimated total~$29/month

For hobbyist workloads, Community Cloud delivers materially cheaper compute than any competitor — and the price-reliability trade-off is acceptable because hobbyist work tolerates occasional Pod restarts. This is the target persona RunPod captures that other platforms miss.

Archetype B: AI-native startup running production H100 inference on Secure Cloud

A growth-stage AI startup running sustained H100 inference (8 hours/day) on Secure Cloud with persistent volume for model weights:

Line itemMonthly cost
H100 PCIe Secure (8h/day × 30 × $2.89)$694
Volume disk for model weights (200 GB × $0.10 running)$20
Network storage for shared assets (500 GB × $0.07)$35
Egress for inference responses (not itemized)Not on rate card
Estimated total~$749/month + egress

H100 dedicated compute dominates the bill — and the $2.89/hour rate is among the lowest published H100 Secure Cloud rates in the market. Customers should expect to also model network storage carefully: the five-SKU storage structure creates forecasting complexity that simpler platforms (Modal’s free 1 TiB tier, Baseten’s unified storage) avoid.

Want to estimate your own RunPod bill? Use the RunPod pricing calculator to model Pod hours, Serverless worker hours, and the five storage SKUs separately.


Pricing evolution : RunPod’s pricing history from crypto-mining pivot to dual-cloud platform

Cadence

QuarterPrice changesProduct / SKU additionsNotes
2022 Q201RunPod founded; Community Cloud product
2023 Q101Secure Cloud launched — dual-cloud architecture
2023 Q301Serverless GPU endpoints launched
2024 Q201H100 PCIe added at $2.89/hr Secure Cloud
2024 Q301Network Storage tiers launched
2025 Q101H200 + Serverless H100 workers
2025 Q401B200 Blackwell at $5.89/hr
2026 Q110Volume Disk idle vs running differential pricing

Tracked range: 2022 Q2–2026 Q1. Quarters not listed above were verified stable (0 price changes, 0 SKU additions).

Notable changes

  • 2023-01-15 — Secure Cloud launched; established the dual-cloud architecture that defines RunPod’s positioning today.
  • 2023-08-22 — Serverless GPU endpoints launched at $0.58–$8.64/hr equivalent; direct competition with Modal, Replicate, Baseten.
  • 2024-04-09 — H100 PCIe at $2.89/hour Secure Cloud — among the lowest published H100 rates in managed inference.
  • 2024-09-17 — Network Storage tiers launched ($0.07 <1TB, $0.05 >1TB, $0.14 high-performance) separating storage from Pod compute.
  • 2025-11-04 — B200 Blackwell at $5.89/hour; first Blackwell-class GPU on the platform.
  • 2026-02-12 — Volume Disk differential pricing ($0.10 running / $0.20 idle); 2× idle premium to discourage abandoned volumes.

What’s unique : RunPod’s distinctive pricing mechanics

1. Explicit Secure-vs-Community Cloud dual pricing. Most cloud platforms hide reliability trade-offs inside “premium-by-default” architectures. RunPod publishes both Secure Cloud (enterprise DCs, SLA-backed) and Community Cloud (partner DCs, no SLA) at separate rate cards, letting customers pick per-workload based on whether they value SLA backing or rock-bottom cost. This transparent reliability tiering is unusual in cloud infrastructure.

2. Largest published GPU ladder (RTX 4090 through B200). Most platforms commit to either consumer/workstation GPUs (Vast.ai) or data-center GPUs (Fireworks, Together). RunPod publishes RTX 4090 ($0.69/hr) all the way through B200 ($5.89/hr) in a single rate card. This breadth captures hobbyist through enterprise on the same platform.

3. Five-SKU storage rate card with running-vs-idle differential. Volume Disk pricing changes based on whether the associated Pod is active ($0.10 running vs $0.20 idle) — a 2× idle premium that incentivizes customers to delete unused volumes. This behavioral pricing nudge is unusual in cloud storage and addresses customer complaints about accumulating idle storage costs.

4. Crypto-mining-to-AI hardware pivot heritage. Founders’ backgrounds in crypto-mining infrastructure gave RunPod access to GPU hardware relationships that pure-AI startups had to negotiate from scratch. The hardware-supply advantage shows up in pricing: RunPod’s RTX 4090 and A40 rates are materially cheaper than competitors who source GPU capacity through hyperscaler partnerships.

5. Serverless GPU workers at $0.58/hour low end. The bottom of the Serverless worker ladder ($0.58/hour equivalent) is among the lowest published serverless GPU rates anywhere — capturing low-volume, latency-tolerant workloads that competitors price out of reach with $1+/hour minimums.


Strengths & weaknesses

StrengthsWeaknesses
Largest published GPU type ladder (RTX 4090 through B200)No published free tier — credit card required for evaluation
Explicit dual-cloud Secure vs Community pricingFive-SKU storage rate card requires aggregate forecasting
H100 Secure Cloud at $2.89/hour among lowest published ratesCommunity Cloud reliability variable — partner DC quality not standardized
Crypto-mining hardware-supply advantage drives low-end rates”80% less than hyperscalers” claim accurate only at low end of ladder
Serverless workers from $0.58/hour low endNetwork egress not itemized on pricing page
Volume Disk idle-vs-running differential discourages wasteServerless cold-start tuning requires more manual configuration than Modal / Baseten

Billing UX : RunPod’s account controls and payment experience

  • Self-serve signup — Sign up at runpod.io with email or GitHub; credit card required to spin up Pods. No free tier.
  • Per-Pod cost meter — Console shows per-Pod hourly burn rate; serverless endpoints show per-second worker billing in real time.
  • Cloud type selection — Customers explicitly choose Secure Cloud or Community Cloud per Pod, with rate-card display per choice.
  • Spend alerts — Configurable email alerts at $X spend per period; auto-shutdown options available on credit-card billing.
  • Payment methods — Credit card and ACH on self-serve; wire transfer and invoice billing on Enterprise. AWS/GCP Marketplace billing on Enterprise commits.
  • Annual commit pricing — Enterprise customers receive volume discounts in exchange for annual usage commitments and dedicated GPU reservations.
  • Storage aggregate view — Console aggregates all five storage SKUs into a single workspace spend view for forecasting.
  • Volume Disk idle indicator — Console highlights idle Volume Disks (billed at $0.20/GB vs $0.10) so customers can delete or attach a Pod.
  • Multi-region availability — US standard; EU, APAC, and other regions vary by Secure vs Community Cloud and by individual partner DC.

Strategic wins : Why RunPod’s pricing decisions worked

1. Dual-cloud architecture captured the price-reliability spectrum

By publishing Secure Cloud and Community Cloud at separate rate cards, RunPod let customers pick per-workload based on reliability needs rather than forcing a one-size-fits-all premium-by-default architecture. Hobbyists and students go to Community Cloud; production workloads go to Secure Cloud. This explicit reliability tiering captures more buyer segments than single-tier competitors.

2. GPU type breadth (RTX 4090 through B200) captured the workload spectrum

Most platforms commit to either consumer/workstation GPUs or data-center GPUs; RunPod offers both. This TAM expansion strategy captures hobbyist workloads (Stable Diffusion on RTX 4090) AND enterprise workloads (frontier inference on H100/B200) on the same platform — workloads that competitors usually leave to specialized cloud providers.

3. Crypto-mining hardware heritage as the structural cost advantage

The founders’ relationships in crypto-mining infrastructure gave RunPod GPU hardware access that pure-AI startups had to negotiate from scratch. This shows up as low-end pricing: RTX 4090 at $0.69/hour and A40 at $0.44/hour are materially below competitors who source through hyperscaler partnerships. The supply-chain advantage is durable as long as GPU hardware remains scarce.

4. Volume Disk idle differential addresses real customer pain

The 2× idle premium on Volume Disk pricing ($0.20 vs $0.10) is unusual — most cloud platforms charge flat storage rates regardless of associated compute state. RunPod’s differential explicitly nudges customers to delete unused volumes, addressing accumulated-cost complaints. This behavioral pricing nudge is a smart UX choice that competitors should consider.


Areas to improve : Gaps in RunPod’s pricing approach

1. No published free tier loses self-serve evaluation traffic

Modal offers $30/month credits, Together $5 trial, Anyscale $100 trial. RunPod requires credit card on file for any evaluation — losing self-serve developers who want to test before paying. Adding even a $5–$10 trial credit (or first-hour-free on Community Cloud) would close a meaningful conversion gap.

2. Five-SKU storage rate card creates forecasting complexity

Container Disk, Volume Disk (running), Volume Disk (idle), Network Storage Standard <1TB, Network Storage Standard >1TB, Network Storage High-Performance — six distinct rates to aggregate. Modal’s single $0.09/GiB-month with 1 TiB free is materially simpler. Consolidating to two or three SKUs (or adding a “total storage” aggregate dashboard) would reduce forecasting overhead.

3. Community Cloud reliability is variable

Community Cloud runs on partner DCs with varying quality. Customers may experience materially different reliability depending on which partner DC they happen to land in. Publishing partner-DC-level reliability scoring (or letting customers pin to specific DCs) would reduce surprise downtime and increase Community Cloud confidence.

4. Network egress not itemized on pricing page

For high-volume inference workloads serving large image, audio, or video payloads, egress can be a meaningful cost line. RunPod’s pricing page does not break out bandwidth pricing. Making egress explicit (and ideally bundling a generous free egress allowance) would reduce a recurring source of surprise bills.


Key takeaways

  1. Explicit reliability tiering (Secure vs Community) captures more buyer segments than single-tier architectures. RunPod’s dual-cloud structure lets hobbyists, students, and cost-sensitive workloads land at Community while production workloads land at Secure — without forcing a one-size-fits-all choice.

  2. GPU type breadth (consumer through frontier) expands TAM beyond pure-data-center competitors. By offering RTX 4090 alongside B200 in one rate card, RunPod captures workloads that competitors leave to specialized clouds. The breadth strategy is durable as long as hobbyist and student workloads remain economically relevant.

  3. Supply-chain heritage matters for low-end pricing. RunPod’s crypto-mining founder background gave the company GPU hardware access that pure-AI startups had to build. This shows up directly in low-end pricing — and the supply-chain advantage is hard to replicate without similar industry connections.

  4. Behavioral pricing nudges (Volume Disk idle premium) address real customer pain. The 2× idle premium on Volume Disk pricing is a clever UX choice that explicitly incentivizes deleting unused storage. Other cloud platforms should consider similar nudges as customers increasingly complain about accumulating idle costs.

  5. No published free tier is a measurable conversion gap. Among serverless GPU competitors, RunPod is alone in requiring credit card for evaluation. Even a modest trial credit ($5–$10) would close a meaningful self-serve PLG conversion gap.


UBP implications

  1. Explicit reliability tiering is the next transparency frontier in cloud infrastructure pricing. As usage-based platforms compete on cost and reliability simultaneously, publishing separate rate cards for distinct reliability tiers gives customers the per-workload control they want.

  2. Supply-chain advantage is durable in scarce-hardware markets. GPU supply will remain constrained for the foreseeable future; platforms with privileged hardware access can deliver cost advantages that pure-software optimization cannot match.

  3. Behavioral pricing nudges (running vs idle differential) can shape customer behavior without negotiation. RunPod’s Volume Disk 2× idle premium is a smart UX choice that nudges deletion without enforcing it — a model other usage-based products should consider for accumulated-cost SKUs.


Sources


Bottom line

RunPod priced its dual-cloud GPU marketplace around three structural ideas: explicit Secure Cloud (enterprise DCs, SLA-backed) versus Community Cloud (partner DCs, no SLA) at separate rate cards that let customers pick per-workload, the largest published GPU type ladder (RTX 4090 hobbyist GPU through frontier B200) in a single rate card, and a crypto-mining founder background that gave the company GPU hardware access driving aggressive low-end pricing. The five-SKU storage rate card with Volume Disk running-vs-idle differential ($0.10 vs $0.20) addresses real customer accumulated-cost complaints, and Serverless workers from $0.58/hour capture latency-tolerant low-volume workloads competitors price out of reach.

For AI engineering teams whose workloads span hobbyist Stable Diffusion through enterprise H100 inference — or who want explicit reliability-vs-cost choice per workload — RunPod is the most pragmatic single platform on the market. The remaining gaps (no published free tier, five-SKU storage complexity, Community Cloud reliability variable, egress not itemized) are evaluation-friction and forecasting-polish problems rather than structural pricing flaws.

Compare with peers via the blueprint corpus, or model your own spend with the RunPod pricing calculator.

Pricing timeline : Major events on a vertical axis

Each milestone below corresponds to a public pricing change, product launch, or material adjustment. Major events use a filled marker; minor adjustments use a faded one.

Volume Disk Idle vs Running Differential Pricing

RunPod introduced differential Volume Disk pricing: $0.10/GB-month while the associated Pod is running, $0.20/GB-month while idle. The 2× idle premium incentivizes customers to delete unused volumes — addressing a customer complaint about accumulating idle storage costs.

Volume Disk Idle vs Running Differential Pricing screenshot 1

Pods / Serverless / Instant Clusters relabel + reservations

Wayback snapshot shows the pricing page sections relabeled to Pods, Serverless and Instant Clusters, with a "Gain additional savings with reservations" block added (long-term commitments for discounted active and flex workers). Instant Clusters still lists H200 SXM $4.31/hr and A100 SXM $1.79/hr per-hour with H100 SXM, L40S and B200 as Contact sales.

Pods / Serverless / Instant Clusters relabel + reservations - Wayback snapshot shows the pricing page sections relabeled to Pods, Serverless a
captured

Instant Clusters pricing section introduced

Wayback snapshot shows a new Instant Clusters Pricing section: H200 SXM $4.31/hr and A100 SXM $1.79/hr published per-hour, with H100 SXM, L40S and B200 gated behind Contact sales. Multi-node GPU clusters became a distinct, partially sales-led pricing surface alongside Pods and Serverless.

Instant Clusters pricing section introduced - Wayback snapshot shows a new Instant Clusters Pricing section: H200 SXM $4.31/hr
captured

RTX Pro 6000 added to Secure Cloud lineup

Wayback snapshot shows RTX Pro 6000 (96GB VRAM) added to the Secure Cloud per-hour table near the top of the HBM3e tier, alongside H200 ($3.59/hr), B200 and H100 NVL ($3.0x/hr). Rest of the rate ladder unchanged from October.

RTX Pro 6000 added to Secure Cloud lineup - Wayback snapshot shows RTX Pro 6000 (96GB VRAM) added to the Secure Cloud per-ho
captured

Serverless Pricing split into its own page section

Wayback snapshot shows the pricing page restructured so Serverless Pricing renders as a distinct section below GPU Cloud Pricing, with per-second Flex/Active columns (180GB B200 $0.00240/s, 80GB H100 $0.00116/s flex). GPU Cloud per-hour rates held at the September levels.

Serverless Pricing split into its own page section - Wayback snapshot shows the pricing page restructured so Serverless Pricing rende
captured

Broad GPU Cloud price cut across the rate ladder

Wayback snapshot shows a broad per-hour price cut versus August: H200 $3.99 to $3.59/hr, H100 PCIe $2.39 to $1.99/hr, A100 PCIe $1.64 to $1.19/hr, A100 SXM $1.74 to $1.39/hr, L40 $0.99 to $0.69/hr, RTX A6000 $0.49 to $0.33/hr, RTX 5090 $0.94 to $0.69/hr, RTX 4090 $0.69 to $0.34/hr (-51%) and RTX A5000 $0.27 to $0.16/hr. B200 held at $5.99/hr.

Broad GPU Cloud price cut across the rate ladder - Wayback snapshot shows a broad per-hour price cut versus August: H200 $3.99 to $
captured

H200 $3.99/hr, B200 $5.99/hr in Secure Cloud rate ladder

Wayback snapshot shows Secure Cloud per-hour pricing with H200 at $3.99/hr, B200 at $5.99/hr, H100 NVL $2.79/hr, H100 PCIe $2.39/hr, H100 SXM $2.69/hr, A100 PCIe $1.64/hr, RTX 5090 $0.94/hr and RTX 4090 $0.69/hr. Pricing page carried GPU Cloud, Serverless and storage sections — no Instant Clusters surface yet.

H200 $3.99/hr, B200 $5.99/hr in Secure Cloud rate ladder - Wayback snapshot shows Secure Cloud per-hour pricing with H200 at $3.99/hr, B200
captured

H200 Availability + Serverless H100 Workers

RunPod added H200 (141GB) at $4.39/hour Secure Cloud and made H100 available as a Serverless worker tier. Expanded the rate ladder for both persistent and serverless workloads to cover frontier inference.

Network Storage Tiers Launched

RunPod launched Network Storage with Standard ($0.07/GB-month under 1TB, $0.05/GB-month over 1TB) and High-Performance ($0.14/GB-month) tiers. Separated persistent storage from Pod compute, enabling cross-Pod model-weight sharing without per-Pod replication.

H100 PCIe Added at $2.89/hr

RunPod added H100 PCIe (80GB) at $2.89/hour Secure Cloud — substantially undercutting hyperscaler H100 rates. The H100 launch positioned RunPod as the cost leader for frontier-model inference among managed-GPU competitors.

Serverless GPU Endpoints Launched

RunPod introduced Serverless GPU endpoints — per-second billed inference workers that scale to zero when idle. Worker pricing ranged from $0.58/hour to $8.64/hour by GPU type. Brought RunPod into direct competition with Modal, Replicate, and Baseten for serverless inference.

Secure Cloud Launched

RunPod launched Secure Cloud — enterprise-grade Pods running on RunPod-operated data centers with redundant power, networking, and storage. Created the dual-cloud architecture (Secure + Community) that lets customers pick price-reliability trade-offs explicitly per workload.

RunPod Founded

Zhen Lu and Pardeep Singh founded RunPod, leveraging existing relationships in cryptocurrency-mining infrastructure to pivot GPU hardware from mining to AI inference. The initial product was Community Cloud — Pods running on partner-operated data centers at low cost.

Trivia
  • · RunPod's $0.69/hour RTX 4090 Secure Cloud rate is among the lowest published GPU rates for a workstation-class card — a deliberate positioning play to capture hobbyist and student workloads that hyperscalers price out of reach.
  • · RunPod was founded in 2022 by Zhen Lu and Pardeep Singh, both ex-cryptocurrency-mining infrastructure operators who pivoted hardware from GPU mining to AI inference as the mining-to-AI transition accelerated through 2022–2023.
  • · RunPod runs two distinct clouds: Secure Cloud (enterprise-grade data centers, redundant infrastructure) and Community Cloud (lower-cost, partner-operated DCs with reduced reliability guarantees) — letting customers pick the price-reliability trade-off explicitly per workload.

Questions & answers

How much does RunPod cost per hour for GPUs?
Secure Cloud Pods per hour: RTX 4090 $0.69, A40 $0.44, A100 PCIe $1.39, A100 SXM $1.49, H100 PCIe $2.89, H100 NVL $3.19, H200 $4.39, B200 $5.89. Community Cloud rates are typically 20–40% lower with reduced reliability guarantees. Serverless workers bill per-second at $0.58–$8.64/hour equivalent depending on GPU type.
What is the difference between Secure Cloud and Community Cloud?
Secure Cloud runs on RunPod-operated enterprise-grade data centers with redundant infrastructure and SLA backing. Community Cloud runs on partner-operated data centers at lower cost but with reduced reliability guarantees and no SLA. Customers pick per-workload based on their reliability needs.
Does RunPod have a free tier?
No — RunPod requires credit card on file for evaluation. Startup credit programs exist but are not publicly documented; customers should contact sales for credits. The lack of a published free tier is a notable difference from Modal ($30/mo) and Together ($5 trial).
How is RunPod Serverless GPU priced?
Serverless workers bill per-second of active execution. Flex per-hour equivalents range from $0.58/hour (16GB cards) to $8.64/hour (B200); H100 is $4.18/hour and H200 is $5.58/hour. Workers scale to zero between requests — customers pay only for cold-start and active inference time.
What are RunPod's storage costs?
Five distinct storage SKUs: Container Disk $0.10/GB-month; Volume Disk $0.10/GB-month (running) or $0.20/GB-month (idle); Network Storage Standard $0.07/GB-month under 1TB or $0.05/GB-month over 1TB; Network Storage High-Performance $0.14/GB-month. Finance teams must aggregate all five to forecast total storage spend.
How does RunPod's 'up to 80% less than hyperscalers' claim hold up?
The $2.89/hour H100 PCIe Secure Cloud rate undercuts AWS p5.48xlarge effective H100 pricing ($3–$5/hour after sustained-use discounts) by 30–50%, not 80%. The 80% figure is closer to Community Cloud RTX 4090 rates ($0.69/hour) versus equivalent hyperscaler workstation GPU rates — accurate at the low end of the ladder but less so at the H100 end.