Top Mathematics discussions
@medium.com
DeepSeek, a Chinese AI unicorn, has released DeepSeek-R1-0528, a significant update to its R1 reasoning model. This new release aims to enhance the model's capabilities in mathematics, programming, and general logical reasoning, positioning it as a formidable open-source alternative to leading proprietary models like OpenAI's o3 and Google's Gemini 2.5 Pro. The updated model is available on Hugging Face under the MIT license, promoting transparency and accessibility in AI development.
The R1-0528 update improves reasoning depth and inference accuracy. On the AIME 2025 math benchmark, accuracy rose from 70% to 87.5%. The gain comes with longer reasoning chains: the model now averages 23,000 tokens per question, up from 12,000 in the previous version. These enhancements are attributed to increased computational resources and algorithmic optimizations during post-training. The model also performs better on code generation, ranking just below OpenAI's o4 mini and o3 models on LiveCodeBench and outperforming xAI's Grok 3 mini and Alibaba's Qwen 3.
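To put the reported benchmark figures in relative terms, here is a minimal Python sketch. The numbers are the ones quoted above; the variable and function names are illustrative only.

```python
# Figures reported for the previous R1 release and for R1-0528.
AIME_2025_ACCURACY = {"previous": 70.0, "updated": 87.5}  # percent
AVG_TOKENS_PER_QUESTION = {"previous": 12_000, "updated": 23_000}


def summarize_gains(accuracy, tokens):
    """Return (percentage-point gain, relative gain, token growth factor)."""
    point_gain = accuracy["updated"] - accuracy["previous"]
    relative_gain = point_gain / accuracy["previous"]
    token_ratio = tokens["updated"] / tokens["previous"]
    return point_gain, relative_gain, token_ratio


point_gain, relative_gain, token_ratio = summarize_gains(
    AIME_2025_ACCURACY, AVG_TOKENS_PER_QUESTION
)
print(f"Accuracy gain: {point_gain:.1f} percentage points ({relative_gain:.0%} relative)")
print(f"Reasoning length: {token_ratio:.2f}x the previous average")
```

By these figures, a 17.5-point accuracy gain came with roughly twice as many tokens per question.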
DeepSeek has also released a distilled version of R1-0528, named DeepSeek-R1-0528-Qwen3-8B. This lightweight model, fine-tuned from Alibaba’s Qwen3-8B, achieves state-of-the-art performance among open-source models on the AIME 2024 benchmark and is designed to run efficiently on a single GPU.
DeepSeek’s API currently costs $0.14 per 1 million input tokens during regular hours, which run from 8:30 pm to 12:30 pm the following day, and $0.035 during discount hours. Output is consistently priced at $2.19 per 1 million tokens.
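As a rough illustration of how the quoted API prices translate into a per-request cost, here is a minimal Python sketch. The prices are the ones stated above. The function name, the 1,000-token prompt, and the assumption that reasoning tokens are billed as output tokens are hypothetical and should be checked against DeepSeek's own pricing page.

```python
# Prices in USD per 1 million tokens, as quoted above.
INPUT_PRICE_REGULAR = 0.14
INPUT_PRICE_DISCOUNT = 0.035
OUTPUT_PRICE = 2.19  # quoted as the same at all hours


def estimate_cost(input_tokens, output_tokens, discount_hours=False):
    """Estimate the USD cost of one request (illustrative helper, not an official API)."""
    input_price = INPUT_PRICE_DISCOUNT if discount_hours else INPUT_PRICE_REGULAR
    return (input_tokens * input_price + output_tokens * OUTPUT_PRICE) / 1_000_000


# Hypothetical request: a 1,000-token prompt and 23,000 output tokens,
# assuming (unverified) that reasoning tokens count as output.
regular = estimate_cost(1_000, 23_000)
discounted = estimate_cost(1_000, 23_000, discount_hours=True)
print(f"Regular hours:  ${regular:.5f}")
print(f"Discount hours: ${discounted:.5f}")
```

Under these assumptions a long reasoning answer costs about five cents, and the output tokens dominate the bill, so the input discount changes the total only slightly.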