Nimish Shinde, 3/3/2025

This comparison highlights the key differences between o3-mini and DeepSeek-R1 in terms of performance benchmarks and model features. DeepSeek-R1 scores slightly higher on the coding (Codeforces) and math (AIME 2024) benchmarks listed below, while o3-mini leads on GPQA.

Benchmark Comparison

| Benchmark | o3-mini | DeepSeek-R1 |
| --- | --- | --- |
| Codeforces rating (competitive programming platform) | 1997 | 2029 |
| AIME 2024 (American Invitational Mathematics Examination) | 78.2% | 79.8% |
| GPQA (Graduate-Level Google-Proof Q&A) | 74.9% | 71.5% |
| MMLU (Massive Multitask Language Understanding) | Not available | 90.8% |
| MMLU-Pro (Massive Multitask Language Understanding, Pro version) | Not available | 84% |
| IFEval (Instruction-Following Evaluation) | Not available | 83.3% |
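
To make the margins in the table concrete, the short Python sketch below computes the difference on each benchmark where both models have a reported score. The names `scores` and `margins` are illustrative and introduced here; the numbers are copied from the table above and are not re-measured.

```python
# Scores copied from the benchmark table above: (o3-mini, DeepSeek-R1).
# Benchmarks with no reported o3-mini score are left out.
scores = {
    "Codeforces": (1997, 2029),
    "AIME 2024": (78.2, 79.8),
    "GPQA": (74.9, 71.5),
}


def margins(table):
    """Return DeepSeek-R1 minus o3-mini for each benchmark, rounded to 1 decimal."""
    return {name: round(r1 - o3, 1) for name, (o3, r1) in table.items()}


if __name__ == "__main__":
    for name, diff in margins(scores).items():
        leader = "DeepSeek-R1" if diff > 0 else "o3-mini"
        print(f"{name}: {leader} leads by {abs(diff)}")
```

Note that the Codeforces margin is in rating points, while the AIME 2024 and GPQA margins are in percentage points, so the three values are not directly comparable with each other.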

Key Features Comparison

| Feature | o3-mini | DeepSeek-R1 |
| --- | --- | --- |
| Architecture | Not specified | Mixture of Experts (MoE) |
| Total Parameters | Not specified | 671 billion |
| Active Parameters | Not specified | 37 billion per forward pass |
| Training Approach | Reinforcement learning, deliberative alignment | Multi-stage training with RL and SFT |
| Reasoning Levels | Low, medium, high | Not specified |
| Chain of Thought | Yes | Yes |
| Input Context Window | 200K tokens | 128K tokens |
| Maximum Output | 100K tokens | 32K tokens |
| Function Calling | Yes | Not specified |
| Structured Outputs | Yes | Not specified |
| Developer Messages | Supported | Not specified |
| Safety Approach | Deliberative alignment | Not specified |
| Open Source | No | Yes |
| Customization | Not specified | Supports fine-tuning |
| Deployment Options | ChatGPT, API | API, mobile app, cloud services |
| Unsafe Response Rate | 1.19% | 11.98% |
| Release Date | January 31, 2025 | January 21, 2025 |
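
Two rows in the table lend themselves to quick arithmetic: the share of DeepSeek-R1's parameters that is active per forward pass, and the ratio between the two reported unsafe response rates. The sketch below works both out in Python. The function names are illustrative, and the inputs are the figures from the table, not independent measurements.

```python
def active_fraction(active_params, total_params):
    """Fraction of a Mixture of Experts model's parameters used per forward pass."""
    return active_params / total_params


def rate_ratio(rate_a, rate_b):
    """How many times larger rate_a is than rate_b."""
    return rate_a / rate_b


if __name__ == "__main__":
    # DeepSeek-R1: 37 billion active out of 671 billion total (from the table).
    share = active_fraction(37e9, 671e9)
    print(f"Active share per forward pass: {share:.1%}")

    # Unsafe response rates from the table: 11.98% (DeepSeek-R1) vs 1.19% (o3-mini).
    ratio = rate_ratio(11.98, 1.19)
    print(f"DeepSeek-R1 unsafe rate is about {ratio:.1f}x that of o3-mini")
```

On these figures, roughly 5.5% of DeepSeek-R1's parameters are active for any single forward pass, and its reported unsafe response rate is about ten times that of o3-mini.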

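This comparison highlights the different approaches of o3-mini and DeepSeek-R1: DeepSeek-R1 is an open-source Mixture of Experts model that supports fine-tuning, while o3-mini is a closed model with a larger context window, selectable reasoning levels, and a lower reported unsafe response rate.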
