Mistral Medium 3 vs Mistral Small 3
Detailed pricing comparison and cost analysis.
Updated April 2026
Cost Simulator
| Feature | Mistral Medium 3 | Mistral Small 3 |
|---|---|---|
| Provider | Mistral | Mistral |
| Input Price (1M) | $0.40 | $0.03 |
| Output Price (1M) | $2.00 | $0.11 |
| Context Window | 131,000 | 33,000 |
Verdict
Mistral Medium 3 costs $0.40 per 1M input tokens and $2.00 per 1M output tokens. Mistral Small 3 costs $0.03 per 1M input tokens and $0.11 per 1M output tokens. Mistral Small 3 is 93% cheaper on input tokens than Mistral Medium 3. For output tokens, Mistral Small 3 is the more affordable option at $0.11/1M vs $2.00.
On context window, Mistral Medium 3 supports 131,000 tokens — meaning it can fit more conversation history, documents, or code in a single request. This matters for RAG pipelines, long document analysis, and agentic workflows where context builds up over many turns.
When to choose Mistral Medium 3
- ✓ You need a larger context window (131,000 tokens)
- ✓ You are already integrated with Mistral
When to choose Mistral Small 3
- ✓ You need the lowest input token cost ($ 0.03/1M)
- ✓ Your workload is output-heavy — Mistral Small 3 generates text cheaper
- ✓ You are already integrated with Mistral
Use the calculator above to simulate your specific workload and find the exact break-even point. For most applications, the cheapest model is the one that minimises your total monthly bill given your input-to-output token ratio.
Frequently Asked Questions
Is Mistral Medium 3 cheaper than Mistral Small 3? ▼
Mistral Small 3 is cheaper on input tokens at $0.03/1M vs $0.40/1M for Mistral Medium 3 — a 93% saving.
What is the context window of Mistral Medium 3 vs Mistral Small 3? ▼
Mistral Medium 3 has a 131,000-token context window. Mistral Small 3 has a 33,000-token context window. Mistral Medium 3 supports the larger context, suitable for longer documents and agentic workflows.
Which model is better: Mistral Medium 3 or Mistral Small 3? ▼
The best choice depends on your use case. For cost efficiency on input tokens, Mistral Small 3 is the cheaper option. For maximum context length, Mistral Medium 3 supports 131,000 tokens. Use the comparison table above to find the right fit for your workload.