frsh

This comparison highlights the key differences between o3-mini and DeepSeek-R1 in terms of performance benchmarks and model features.DeepSeek-R1 shows better performance in some coding and math tasks

Benchmark Comparison

Benchmark	o3-mini	DeepSeek-R1
Codeforces (Competitive Programming Platform)	1997	2029
AIME 2024 (American Invitational Mathematics Examination)	78.2%	79.8%
GPQA (General-Purpose Question Answering)	74.9%	71.5%
MMLU (Massive Multitask Language Understanding)	Not available	90.8%
MMLU-Pro (Massive Multitask Language Understanding - Pro Version)	Not available	84%
IFEval (Instruction-Following Evaluation)	Not available	83.3%

Key Features Comparison

Feature	o3-mini	DeepSeek-R1
Architecture	Not specified	Mixture of Experts (MoE)
Total Parameters	Not specified	671 billion
Active Parameters	Not specified	37 billion per forward pass
Training Approach	Reinforcement learning, Deliberative alignment	Multi-stage training with RL and SFT
Reasoning Levels	Low, medium, high	Not specified
Chain of Thought	Yes	Yes
Input Context Window	200K tokens	128K tokens
Maximum Output	100K tokens	32K tokens
Function Calling	Yes	Not specified
Structured Outputs	Yes	Not specified
Developer Messages	Supported	Not specified
Safety Approach	Deliberative alignment	Not specified
Open Source	No	Yes
Customization	Not specified	Supports fine-tuning
Deployment Options	ChatGPT, API	API, mobile app, cloud services
Unsafe Response Rate	1.19%	11.98%
Release Date	January 31, 2025.	January 21, 2025

This expanded comparison highlights the different approaches and features of o3-mini and DeepSeek-R1, showcasing their unique architectures, training methodologies, and capabilities.

Citations: 1 https://builtin.com/artificial-intelligence/deepseek-r12 https://www.trendingtopics.eu/openai-launches-o3-mini-its-cheapest-but-most-dangerous-ai-model-to-date/3 https://fastbots.ai/blog/deepseek-r1-explained-features-benefits-and-use-cases4 https://aws.amazon.com/blogs/machine-learning/deepseek-r1-model-now-available-in-amazon-bedrock-marketplace-and-amazon-sagemaker-jumpstart/5 https://www.theverge.com/news/603849/openai-o3-mini-launch-chatgpt-api-available-now6 https://www.amitysolutions.com/blog/deepseek-r1-ai-giant-from-china7 https://openai.com/index/openai-o3-mini/8 https://www.vellum.ai/blog/the-training-of-deepseek-r1-and-ways-to-use-it9 https://www.datacamp.com/blog/deepseek-r110 https://ai2sql.io/openai-o3-o3-mini-arc-agi11 https://cdn.openai.com/o3-mini-system-card.pdf12 https://www.infoq.com/news/2024/12/openai-announces-o3/13 https://techcommunity.microsoft.com/discussions/marketplace-forum/o3-mini-reasoning-model-now-available-in-microsoft-azure-openai-service/43728001 https://forum.effectivealtruism.org/posts/d3iFbMyu5gte8xriz/is-deepseek-r1-already-better-than-o3-when-inference-costs2 https://docsbot.ai/models/compare/deepseek-r1/o33 https://www.reddit.com/r/LocalLLaMA/comments/1iebj8p/new_paper_o3mini_vs_deepseekr1_which_one_is_safer/4 https://arxiv.org/html/2501.18438v15 https://docsbot.ai/models/compare/o3/deepseek-r16 https://www.datacamp.com/blog/deepseek-r17 https://www.zdnet.com/article/openais-launches-new-o3-mini-model-heres-how-free-chatgpt-users-can-try-it/8 https://www.trendingtopics.eu/openai-launches-o3-mini-its-cheapest-but-most-dangerous-ai-model-to-date/9 https://docsbot.ai/models/compare/deepseek-v3/o310 https://www.arxiv.org/abs/2501.18438