How much does DeepSeek API cost?

DeepSeek API has two current models. DeepSeek-V4-Flash: $0.14/1M input (cache miss), $0.0028/1M (cache hit), $0.28/1M output. DeepSeek-V4-Pro: $0.435/1M input (cache miss), $0.003625/1M (cache hit), $0.87/1M output. Both models have a 1M-token context window, and these are the current published first-party rates (verified 2026-06-05).

How does DeepSeek compare to OpenAI GPT-4o on price?

DeepSeek-V4-Flash cache-miss input is $0.14/1M vs GPT-4o's roughly $2.50/1M — about 18× cheaper — and output is $0.28 vs about $10.00. With cache hits, V4-Flash input drops to $0.0028/1M, making reused-context input effectively negligible compared with US frontier models.

Is DeepSeek open source?

Yes. DeepSeek model weights are released under the MIT license on Hugging Face. You can self-host DeepSeek on your own infrastructure or via third-party providers (Fireworks AI, Together AI, Amazon Bedrock) without using DeepSeek's API.

What happened to deepseek-chat and deepseek-reasoner?

The model names deepseek-chat and deepseek-reasoner are now compatibility aliases. deepseek-chat maps to the non-thinking mode of DeepSeek-V4-Flash and deepseek-reasoner maps to its thinking mode. Both aliases are scheduled to be deprecated on 2026-07-24, so new integrations should target deepseek-v4-flash or deepseek-v4-pro directly.

Is DeepSeek safe to use for enterprise workloads?

Enterprise risk must be evaluated carefully. DeepSeek is a Chinese company subject to Chinese law, which creates data residency and sovereignty concerns. It offers no enterprise SLA, BAA, or compliance certifications. Many organizations instead self-host the open-source weights on their own infrastructure to avoid data transfer to DeepSeek servers, or access models via Amazon Bedrock with AWS enterprise agreements.

Does DeepSeek offer a free tier?

DeepSeek's web chat (chat.deepseek.com) is free with no account required for basic use. New API accounts receive a small amount of free credits. There is no ongoing free API tier for production workloads — production use requires purchasing credits.

DeepSeek Pricing

AI Summary

DeepSeek offers a free web chat product and a pay-per-token API with two current models: DeepSeek-V4-Flash (general-purpose, from $0.0028/1M cached input, $0.14 cache-miss in, $0.28 out) and DeepSeek-V4-Pro (higher-capacity, from $0.003625/1M cached input) — both with a 1M-token context window and dramatically cheaper than equivalent OpenAI and Anthropic models.
The legacy model names deepseek-chat and deepseek-reasoner are now compatibility aliases that map to the non-thinking and thinking modes of DeepSeek-V4-Flash, and are scheduled to be deprecated on 2026-07-24.
Both DeepSeek model families ship open-source MIT-licensed weights, enabling self-hosting on private infrastructure. The API is the managed cloud offering; self-hosting via Fireworks AI, Together AI, or Amazon Bedrock provides an alternative cost tier.
DeepSeek-R1's January 2025 launch — a reasoning model competitive with OpenAI o1 at approximately 1/30th the inference cost — triggered a global market reassessment of AI compute economics and caused a 17% single-day drop in Nvidia's stock price.
Data sovereignty risk is the primary constraint on DeepSeek enterprise adoption: the company is subject to Chinese law, offers no enterprise SLA or compliance certifications, and has no data residency controls outside China for its managed API.

Pricing summary

DeepSeek 2026 — Free web chat + API pricing

Free web chat; API from $0.0028/1M tokens (V4-Flash cache hit); V4-Pro from $0.003625/1M

Web Chat (Free)

Free

Individual users, casual AI use

Cache hit $0.0028/1M

API — DeepSeek-V4-Flash

$0.14 /1M input (cache miss)

Developers, production AI applications

Cache hit $0.003625/1M

API — DeepSeek-V4-Pro

$0.435 /1M input (cache miss)

Higher-capacity reasoning, math, coding tasks

Self-Hosted (Open Source)

Infra cost only

Privacy-sensitive orgs, high-volume deployments

API prices are per 1M tokens, verified against DeepSeek's first-party API docs on 2026-06-05. Self-hosting requires significant GPU infrastructure.

About

DeepSeek (深度求索) is a Chinese AI research laboratory founded in July 2023 by Liang Wenfeng, who also founded High-Flyer Capital Management, a quantitative hedge fund. Unlike most AI labs that are independent startups or big-tech subsidiaries, DeepSeek is a research organization funded by a profitable hedge fund — giving it unusual financial autonomy from external investor pressure and quarterly revenue expectations.

DeepSeek’s impact on the global AI industry vastly outstrips its commercial scale. The January 2025 release of DeepSeek-R1 — a reasoning model competitive with OpenAI o1 at roughly 1/30th the inference cost, with full model weights open-sourced under the MIT license — triggered the largest single-day stock market move attributable to an AI event in history. Nvidia’s stock dropped approximately 17%, erasing roughly $600B in market cap, as markets re-evaluated whether massive GPU clusters were actually necessary for frontier AI.

By mid-2025, DeepSeek operates a modest commercial presence compared to its technical impact: a free web chat product, a developer API, and public model weight releases. It has no enterprise sales team, no SLA offerings, and no compliance certifications. Revenue comes entirely from API token consumption. Its open-source releases have driven billions of inference calls globally via third-party hosting providers such as Fireworks AI, Together AI, and Amazon Bedrock — expanding DeepSeek’s reach without proportional commercial investment.

Pricing summary : How DeepSeek’s free chat and pay-per-token API work together

DeepSeek runs a minimal two-surface model: a free web chat for end-users and a pure usage-based API for developers. There is no consumer subscription tier — no equivalent to ChatGPT Plus or Claude Pro. The API is the only revenue model, billed per token with deep cache-hit discounts.

The API pricing is structured around three levers. Base prices are far cheaper than equivalent US-frontier models (V4-Flash at $0.14/1M cache-miss input vs GPT-4o at roughly $2.50/1M). Cache hit pricing is the headline lever: V4-Flash cache-hit input is $0.0028/1M — reduced to one-tenth of its launch price in April 2026 — versus $0.14 cache miss, a 98% reduction on reused context. Tiering by capacity offers V4-Pro (at $0.435/1M cache-miss input) for heavier reasoning workloads alongside the cheaper V4-Flash.

The fourth lever is the open-source alternative: because DeepSeek’s weights are MIT-licensed, developers can self-host entirely independent of DeepSeek’s API. This creates a natural ceiling on API pricing — DeepSeek cannot charge much above the cost of self-hosting without losing users to third-party hosts or on-premise deployment. This is the purest form of usage-based pricing with limited pricing power.

Pricing by product

DeepSeek API — Current Rates

Model	Input Cache Miss ($/1M)	Input Cache Hit ($/1M)	Output ($/1M)	Context	Max output
DeepSeek-V4-Flash	$0.14	$0.0028	$0.28	1M	384K
DeepSeek-V4-Pro	$0.435	$0.003625	$0.87	1M	384K

The legacy model names deepseek-chat and deepseek-reasoner are now compatibility aliases for the non-thinking and thinking modes of DeepSeek-V4-Flash respectively, and are scheduled to be deprecated on 2026-07-24 15:59 UTC. The DeepSeek-V4-Pro figures above are the current published first-party rates (verified 2026-06-05); DeepSeek does not currently publish an off-peak or time-of-use discount window.

Price Comparison: DeepSeek vs. US Frontier Models

Model	Input ($/1M)	Output ($/1M)	vs. DeepSeek-V4-Flash input
DeepSeek-V4-Flash	$0.14	$0.28	—
DeepSeek-V4-Pro	$0.435	$0.87	~3× more expensive
GPT-4.1 nano	~$0.10	~$0.40	~1.4× cheaper
GPT-4o mini	~$0.15	~$0.60	~comparable
GPT-4o	~$2.50	~$10.00	~18× more expensive
Claude 3.5 Sonnet	~$3.00	~$15.00	~21× more expensive

Note: US-frontier prices are approximate reference rates for comparison, not DeepSeek’s advertised prices. DeepSeek’s advantage is most pronounced on cache-hit input and on raw output rates against GPT-4o-class models.

DeepSeek Web Chat

Feature	Details
Access	Free, no account required for basic use
Models	DeepSeek-V4 (non-thinking and thinking/DeepThink modes)
Web search	Available
File uploads	Supported
API access	Not included; API requires separate registration

Sales motions across products: PLG / self-serve only for both web chat and API. No enterprise sales, no support contracts. Open-source self-hosting available at infrastructure cost. Prices accessed 2026-05-30.

Hidden costs : What DeepSeek API users actually pay beyond published per-token rates

Archetype A: Developer switching from GPT-4o to DeepSeek-V4-Flash

A production application processing 1B input tokens and 200M output tokens per month (illustrative, using current V4-Flash rates):

Scenario	Input cost	Output cost	Total
GPT-4o standard	approximately $2,500	approximately $2,000	approximately $4,500
DeepSeek-V4-Flash, all cache-miss	approximately $140	approximately $56	approximately $196
DeepSeek-V4-Flash, heavy cache reuse	approximately $3–$30	approximately $56	approximately $60–$90

The headline savings are real, but the hidden costs of switching to DeepSeek include: legal review for data sovereignty compliance (approximately $5K–$20K one-time), reliability engineering for API instability during peak global demand, and re-architecting prompt-caching boundaries to actually realize the steep cache-hit discount (cache hits only apply to repeated context prefixes within the cache window).

Archetype B: Enterprise evaluating self-hosting DeepSeek-V3

Line item	Monthly cost estimate
8× H100 80GB GPUs (cloud, on-demand)	$18,000–$28,000
Storage + networking	$500–$1,000
Engineering maintenance (0.25 FTE)	$5,000–$10,000
Estimated total	$23,500–$39,000

Self-hosting V3 at scale (8 H100s) costs more per month than the API for most workloads below ~50B tokens/month. Above that volume, self-hosting economics become compelling. Third-party managed hosts (Fireworks AI, Together AI) offer a middle ground with lower management overhead and enterprise agreements.

Use the DeepSeek pricing calculator to estimate your monthly API cost and compare it against self-hosting economics at your token volume.

Pricing evolution : DeepSeek’s pricing history from V2 to R1

Cadence

Quarter	Price changes	Product / SKU additions	Notes
2023 Q3	0	0	DeepSeek founded; internal research only
2023 Q4	0	1	DeepSeek Coder released open-source; no commercial API
2024 Q2	1	1	DeepSeek-V2 API launched at $0.14/1M input
2024 Q4	1	1	DeepSeek-V3: $0.27/1M, frontier performance, MIT weights
2025 Q1	0	2	DeepSeek-R1: $0.55/1M reasoning; off-peak discount added
2025 Q2	0	1	DeepSeek-V3-0324 update; pricing unchanged

Tracked range: 2023 Q3–2025 Q2. Quarters not listed above were verified stable.

Notable changes

2024-05-07 — DeepSeek-V2 API launched at $0.14/1M input — roughly 1/100th of GPT-4 Turbo prices. Forced Chinese AI labs (Baidu, Alibaba Qwen, ByteDance) to cut prices within weeks. First evidence of a credible pricing floor challenge to US frontier labs.
2024-12-26 — DeepSeek-V3 released with MIT open-source weights. At $0.27/1M input vs GPT-4o’s $2.50, V3 was 9× cheaper while matching GPT-4o on many benchmarks. Training cost estimated at $5.5M. (DeepSeek-V3 technical report)
2025-01-20 — DeepSeek-R1 released with MIT open-source weights. Competitive with OpenAI o1 at $0.55/1M vs o1’s $15/1M — a 27× price difference. The open-source release meant any developer could self-host frontier-class reasoning. (DeepSeek-R1 arXiv paper)
2025-01-27 — Nvidia stock dropped ~17% following the R1 release as markets repriced AI compute necessity. The “DeepSeek moment” became shorthand for commoditization risk in AI infrastructure. (Reuters coverage)
2025-03-24 — DeepSeek-V3-0324 update released. Improved coding benchmark performance. No pricing changes.

What’s unique : DeepSeek’s distinctive pricing mechanics

1. Cache-hit pricing is among the most aggressive in AI APIs. DeepSeek cut its cache-hit input price to one-tenth of the launch price in April 2026: V4-Flash cache hits cost approximately $0.0028/1M versus $0.14/1M on a cache miss — a 98% reduction on reused context. For applications with large, stable system prompts or repeated document contexts, architecting around prompt caching is the highest-leverage cost optimization available in any AI API. (Note: DeepSeek’s earlier off-peak / time-of-use discount window is no longer published.)

2. Open-source weights create a self-imposed price ceiling. Because DeepSeek’s flagship weights are MIT-licensed, DeepSeek cannot price its API much above the cost of self-hosting without losing developers to third-party hosts. OpenAI, Anthropic, and Google do not release weights for their flagship models and thus have no such constraint. The result is commodity-level pricing power for DeepSeek — permanently anchored to infrastructure cost rather than value-based pricing.

3. No subscription tier is both a simplification and a limitation. Unlike every US AI company, DeepSeek has no paid consumer subscription. The entire business model is API token consumption. This simplicity creates a pure pay-as-you-go model but limits total addressable revenue: there is no $20/month per-user captured from the millions of people using the free web chat.

4. A 1M-token context window closes a former weakness. Earlier DeepSeek models were capped at 64K tokens — well below US competitors. The V4 generation now ships a 1M-token context window with up to 384K tokens of output, removing what had been DeepSeek’s most cited adoption barrier for long-document analysis, full-codebase review, and long-running agent sessions. Combined with cheap cache-hit input, large-context workloads are now economically viable on DeepSeek in a way they were not a year ago.

5. Technical transparency builds trust in a trust-deficit environment. DeepSeek publishes detailed technical reports with training costs, architecture details, and benchmark methodologies. For a company facing significant trust concerns around data sovereignty, this radical technical transparency serves as a credibility signal that partially offsets sovereignty concerns for developer audiences, even if it doesn’t address enterprise compliance requirements.

Strengths & weaknesses

Strengths	Weaknesses
API pricing 5–30× cheaper than GPT-4o equivalents on reasoning models	Chinese company subject to Chinese law; data sovereignty risk for enterprise
Cache-hit input pricing (V4-Flash $0.0028/1M) is among the most aggressive in any AI API	API has had reliability and capacity issues during peak global demand periods
MIT open-source weights — self-host on any infrastructure	No enterprise SLA, no BAA, no SOC 2, no compliance certifications
Free web chat with no account required	No off-peak / time-of-use discount in the current pricing table
No subscription complexity — pure pay-per-token simplicity	No consumer subscription tier — misses $20/mo per-user monetization entirely
1M-token context window (V4 generation) closes the former 64K context gap	No multimodal capabilities (image generation, audio) in the API
Radical technical transparency (training costs, architecture published)	Many US enterprises restrict or ban DeepSeek API use for legal/compliance reasons

Billing UX : DeepSeek’s API billing and account management experience

Registration — API access requires account creation at platform.deepseek.com. Credit card or equivalent payment required.
Prepaid credits — API usage is billed against a prepaid credit balance. No postpaid billing available.
Free credits — New accounts have historically received a small amount of free trial credits, though the amount has varied over time and is not a guaranteed published figure.
Spend alerts — Basic usage dashboard in the API platform. No documented configurable spend cap or proactive budget alert feature.
Off-peak pricing — Applied automatically based on server time; no developer action required to receive the discount.
Cache pricing — Applied automatically when the same context prefix is reused within the cache TTL window.
No enterprise tier — There is no enterprise sales, SLA, or support contract offering from DeepSeek directly. Organizations requiring enterprise agreements can access DeepSeek models via Amazon Bedrock or other third-party providers.
Third-party hosting — Fireworks AI, Together AI, Amazon Bedrock, and others host DeepSeek models at slightly different price points with enterprise support, data residency options, and compliance features.
Web chat — Free access at chat.deepseek.com, no account required for basic use. V3 (general) and R1 (thinking mode) both available.

Strategic wins : Why DeepSeek’s pricing decisions worked

1. V2’s shock pricing forced a Chinese AI industry price war

DeepSeek-V2’s $0.14/1M input price in May 2024 was not just competitive — it was a strategic declaration. Within weeks, Alibaba’s Qwen, Baidu’s ERNIE, and ByteDance’s Doubao had cut prices to match or undercut V2. DeepSeek established the price anchor for an entire regional AI market from a position of technical and cost advantage. This pricing-as-competitive-strategy approach mirrors how AWS’s EC2 pricing redefined cloud compute economics.

2. Open-sourcing V3 and R1 maximized global adoption over API revenue

The decision to MIT-license frontier-class models eliminated DeepSeek’s API pricing power but maximized adoption. Developers worldwide evaluated and deployed DeepSeek through third-party providers and self-hosted infrastructure, creating a global user base that a commercial API alone could never reach. This adoption-over-margin strategy mirrors Meta’s Llama playbook and is antithetical to OpenAI’s proprietary positioning — the bet is that ecosystem size and developer trust creates long-term strategic value.

3. Off-peak pricing created a unique value proposition for batch workloads

For a period in 2025, DeepSeek offered an off-peak discount that solved a real developer problem: batch AI workloads (nightly enrichment, bulk classification, async document processing) don’t need real-time response but were paying the same rate as latency-sensitive calls. At the time, no US AI API offered a comparable time-of-use discount, giving DeepSeek a usage-based pricing advantage for cost-not-latency workloads. DeepSeek no longer publishes a time-of-use window in its current pricing table — the headline cost lever today is its aggressive cache-hit input pricing.

4. The R1 release reframed the global AI pricing narrative

By demonstrating that a reasoning model competitive with o1 could be built at 1/30th the inference cost and open-sourced, DeepSeek forced every AI lab and enterprise buyer to recalibrate their AI cost floor assumptions. Enterprise buyers who previously accepted GPT-4o pricing as “market rate” now have a credible alternative benchmark at 9× cheaper. DeepSeek’s technical cost transparency changed every AI procurement conversation in 2025 — even for organizations that never use DeepSeek directly.

Areas to improve : Gaps in DeepSeek’s pricing and product approach

1. Data sovereignty and compliance gaps block regulated enterprise adoption

DeepSeek has no BAA, no DPA, no SOC 2 certification, and no data residency controls outside China for its managed API. For healthcare, finance, government, and defense organizations, these are disqualifying factors regardless of price. The fix requires structural investment: offering a data processing agreement, establishing US/EU hosting options, and pursuing compliance certifications. Without this, DeepSeek misses the regulated enterprise segment entirely — leaving significant addressable market to OpenAI Enterprise and Anthropic Enterprise.

2. No default spend caps create runaway cost risk for agentic workloads

DeepSeek’s API has no documented spend cap or proactive budget alert system. While the prepaid balance provides a hard ceiling, a developer whose balance is fully funded has no protection against an agentic loop consuming the entire balance overnight. Configurable spend alerts and hard caps at 50%/80%/100% of balance threshold are table stakes for production API billing — DeepSeek’s absence of these features is a developer trust gap.

3. No multimodal API and a deprecating alias surface add migration friction

DeepSeek’s API remains text-only — no image, audio, or vision endpoints — so teams needing multimodal capabilities must pair it with another provider. At the same time, the legacy deepseek-chat and deepseek-reasoner aliases that many integrations still target are scheduled for deprecation on 2026-07-24, forcing a migration to deepseek-v4-flash / deepseek-v4-pro. For teams that standardized on the old alias names, this is avoidable churn that better-managed deprecation and versioning policies would soften with longer notice windows and clearer migration tooling.

Monetization stack & signals : how DeepSeek builds & buys its revenue engine

Buys 0 Builds 1

The read — where the monetization investment is going

DeepSeek builds rather than buys: it meters its own inference economics in-house — its Open-Source-Week disclosure pegged the V3/R1 serving system's theoretical cost-profit margin at 545% ($87,072 daily H800 cost vs $562,027 of R1-priced tokens) — and that in-house cost meter, not a vendor tool, is what backs its aggressive token pricing.

Stack — build vs buy

Builds in-house · 1

In-house V3/R1 inference-serving + cost-economics system (cross-node Expert Parallelism) Cost & FinOps Blog Mar 2025

“To optimize these two objectives, our solution employs cross-node Expert Parallelism (EP). ... Assuming the leasing cost of one H800 GPU is $2 per hour, the total daily cost amounts to $87,072. ... If all tokens were billed at DeepSeek-R1's pricing, the total daily revenue would be $562,027, with a cost profit margin of 545%.”

Signals reviewed Jun 2026 · derived from engineering blogs

Key takeaways

Cache-hit input pricing is the most underutilized cost lever on DeepSeek today. V4-Flash cache-hit input is $0.0028/1M versus $0.14/1M on a cache miss — a ~98% reduction on reused context. Any team with large, stable system prompts or repeated document context should architect around prompt-cache boundaries to capture it. (DeepSeek’s earlier off-peak / time-of-use discount is no longer published, so this pricing optimization now lives in caching, not time-of-day routing.)
Open-source model weights are a pricing strategy, not just a research output. By releasing V3 and R1 under MIT license, DeepSeek anchored its API price to self-hosting economics and drove global adoption through third-party hosts. For AI companies with strong technical capabilities, open-source releases accelerate distribution faster than any sales motion — at the cost of long-term pricing power.
The cheapest API is not always the lowest total cost. DeepSeek’s price advantage must be weighed against reliability risk, data sovereignty legal overhead, and the absence of enterprise SLAs or compliance certifications. True cost modeling for AI API selection must include these non-token costs.
DeepSeek proved that frontier AI cost is not a sustainable moat. The $5.5M training cost for V3 demonstrated that frontier model capability is achievable at dramatically lower cost than US labs implied. This reframed every AI pricing conversation in 2025 — value-based pricing in AI APIs must now compete against a commodity-priced open-source baseline.
No enterprise tier is a deliberate constraint, not an oversight. DeepSeek’s absence of enterprise sales, support, and compliance features reflects its research-first identity. This means DeepSeek is positioned as a cost-optimization tool within existing AI stacks, not a primary AI vendor for regulated enterprises. Understanding this positioning clarifies where DeepSeek fits vs. OpenAI Enterprise or Anthropic Enterprise.

UBP implications

Cache-aware pricing is becoming a first-class cost lever across AI APIs. DeepSeek’s aggressive cache-hit discount — V4-Flash cache-hit input at one-tenth of cache-miss — shows how much pricing power lives in distinguishing reused context from net-new tokens. (DeepSeek also experimented with an off-peak time-of-use discount in 2025, since discontinued.) Teams should design AI billing systems to meter and price cached vs uncached input as a first-class optimization — not an afterthought.
The open-source/API price parity creates the most efficient usage-based market in AI. Because DeepSeek API competes against self-hosted alternatives at roughly the same marginal cost, it has the most commodity-like pricing of any frontier AI API. This commodity pricing dynamic is the long-run equilibrium for AI APIs as open-source models reach parity — suggesting that AI API pricing strategies must eventually shift toward proprietary data, tooling, or reliability rather than model capability alone.
Sovereignty risk is an emerging hidden cost in AI procurement. Using DeepSeek requires explicitly pricing in legal review, DPA overhead, compliance certification gaps, and potential regulatory restrictions. This non-token cost is invisible in per-token comparisons but real in total cost of ownership. As AI APIs become geopolitically complex, procurement teams need a framework for pricing sovereignty risk alongside model capability and token cost.

Sources

DeepSeek API pricing (docs) (accessed 2026-06-05 — re-verified V4-Flash and V4-Pro rates first-party; no promo/standard split published)
DeepSeek API pricing details, USD (docs) (accessed 2026-05-30)
DeepSeek-V3 technical report on GitHub (accessed 2026-05-29)
DeepSeek-R1 technical paper (arXiv 2501.12948) (accessed 2026-05-29)
DeepSeek model weights on Hugging Face (accessed 2026-05-29)
Amazon Bedrock — DeepSeek model availability (accessed 2026-05-29)
Reuters — DeepSeek R1 and Nvidia stock reaction (accessed 2026-05-29)

Bottom line

DeepSeek is the most disruptive pricing story in AI history: a Chinese research lab trained frontier models at 1/50th the claimed cost of incumbents, open-sourced the weights under MIT license, and priced the managed API at 5–30× below US equivalents. Its aggressive cache-hit input pricing — V4-Flash cache hits at $0.0028/1M, one-tenth of the launch price — creates cost structures that no US AI API can match for repeated-context and long-context workloads, now backed by a 1M-token context window. The gaps are real: data sovereignty concerns block regulated enterprise adoption, there is no multimodal API, and no enterprise support means DeepSeek fits as a cost-optimization layer within existing stacks rather than a primary AI vendor. For developers building cost-sensitive production applications who can manage sovereignty risk — or who self-host the open-source weights — DeepSeek offers the most aggressive value in the AI API market.

Browse the full pricing blueprint to compare DeepSeek against OpenAI, Anthropic, and other AI platforms.

Pricing timeline : Major events on a vertical axis

Each milestone below corresponds to a public pricing change, product launch, or material adjustment. Major events use a filled marker; minor adjustments use a faded one.

DeepSeek-V3-0324 Update — Improved Coding

Mar 2025

DeepSeek released V3-0324, an update to V3 with improved coding and math reasoning. Scored above Claude 3.7 Sonnet on SWE-bench coding benchmark. API pricing unchanged.

DeepSeek-V3-0324 Update — Improved Coding screenshot 1

DeepSeek-V3-0324 Update — Improved Coding screenshot 2

Nvidia Stock Drops 17% Following R1 Release

Jan 2025

Nvidia's stock fell approximately 17% (~$600B market cap loss) as markets re-evaluated whether the massive GPU clusters assumed necessary for frontier AI training were actually required. The R1 technical report showed training at 1/30th assumed compute.

DeepSeek-R1 Released — Reasoning Model, MIT Open-Source

Jan 2025

DeepSeek-R1 launched with MIT open-source weights. Competitive with OpenAI o1 on reasoning benchmarks. API: $0.55/1M input (cache miss), $2.19/1M output — approximately 27× cheaper than OpenAI o1 at $15/$60 per 1M.

DeepSeek-V3 Released — Frontier Performance at $0.27/1M

Dec 2024

DeepSeek-V3 released with MIT open-source weights. Matched or exceeded GPT-4o on multiple benchmarks. API: $0.27/1M input (cache miss), $1.10/1M output — approximately 9× cheaper than GPT-4o. Estimated training cost: $5.5M.

DeepSeek-V2 Launched — $0.14/1M Input, Market Shock

May 2024

DeepSeek-V2, a 236B parameter mixture-of-experts model, launched via API at $0.14/1M input and $0.28/1M output — roughly 1/100th the price of GPT-4 Turbo. Forced immediate price cuts from Chinese AI labs within weeks.

DeepSeek Coder V1 Released Open-Source

Nov 2023

DeepSeek released DeepSeek Coder, a coding-focused open-source model. Weights available on Hugging Face. No commercial API pricing at this stage.

DeepSeek Founded

Jul 2023

DeepSeek AI founded in Hangzhou, China by Liang Wenfeng as a research subsidiary of High-Flyer hedge fund. Initial focus on foundation model research, not commercial deployment.

Trivia

· DeepSeek-R1's January 2025 release caused Nvidia's stock to drop approximately 17% (~$600B in market cap) in a single day — the largest single-day market cap loss attributable to an AI event in history — because R1 demonstrated frontier AI reasoning at roughly 1/30th the inference cost of OpenAI o1.
· DeepSeek is funded by High-Flyer Capital Management, a Chinese quantitative hedge fund. DeepSeek reportedly trained V3 on approximately 2,000 Nvidia H800 GPUs at an estimated total cost of $5.5M — a fraction of the 10,000–100,000 GPU clusters used by US frontier labs for comparable models.
· DeepSeek-V3 and R1 model weights are open-sourced under the MIT license, allowing any developer or company to self-host the models. This makes DeepSeek the only frontier-class model family that is both commercially cheap via API and fully free for self-hosting.

Questions & answers

How much does DeepSeek API cost?: DeepSeek API has two current models. DeepSeek-V4-Flash: $0.14/1M input (cache miss), $0.0028/1M (cache hit), $0.28/1M output. DeepSeek-V4-Pro: $0.435/1M input (cache miss), $0.003625/1M (cache hit), $0.87/1M output. Both models have a 1M-token context window, and these are the current published first-party rates (verified 2026-06-05).
How does DeepSeek compare to OpenAI GPT-4o on price?: DeepSeek-V4-Flash cache-miss input is $0.14/1M vs GPT-4o's roughly $2.50/1M — about 18× cheaper — and output is $0.28 vs about $10.00. With cache hits, V4-Flash input drops to $0.0028/1M, making reused-context input effectively negligible compared with US frontier models.
Is DeepSeek open source?: Yes. DeepSeek model weights are released under the MIT license on Hugging Face. You can self-host DeepSeek on your own infrastructure or via third-party providers (Fireworks AI, Together AI, Amazon Bedrock) without using DeepSeek's API.
What happened to deepseek-chat and deepseek-reasoner?: The model names deepseek-chat and deepseek-reasoner are now compatibility aliases. deepseek-chat maps to the non-thinking mode of DeepSeek-V4-Flash and deepseek-reasoner maps to its thinking mode. Both aliases are scheduled to be deprecated on 2026-07-24, so new integrations should target deepseek-v4-flash or deepseek-v4-pro directly.
Is DeepSeek safe to use for enterprise workloads?: Enterprise risk must be evaluated carefully. DeepSeek is a Chinese company subject to Chinese law, which creates data residency and sovereignty concerns. It offers no enterprise SLA, BAA, or compliance certifications. Many organizations instead self-host the open-source weights on their own infrastructure to avoid data transfer to DeepSeek servers, or access models via Amazon Bedrock with AWS enterprise agreements.
Does DeepSeek offer a free tier?: DeepSeek's web chat (chat.deepseek.com) is free with no account required for basic use. New API accounts receive a small amount of free credits. There is no ongoing free API tier for production workloads — production use requires purchasing credits.