Top Mathematics discussions

NishMath

@the-decoder.com - 40d
DeepSeek has unveiled its v3 large language model (LLM). The model was trained on 14.8 trillion tokens using 2,788,000 H800 GPU hours, at a cost of approximately $5.576 million, a figure remarkably lower than other models of similar capability. Training combined supervised fine-tuning with reinforcement learning, and the resulting model reaches benchmark performance comparable to Claude 3.5 Sonnet. Architecturally, DeepSeek v3 is a Mixture-of-Experts (MoE) model with 671 billion total parameters, of which 37 billion are activated for each token.
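
As a rough sanity check of these figures, the sketch below (Python, illustrative only) derives the per-GPU-hour rate implied by the reported cost and the fraction of parameters active per token. The ~$2/GPU-hour rate is an inference from the quoted numbers, not an official price.

```python
# Back-of-the-envelope check of the reported DeepSeek v3 training figures.
# The implied $/GPU-hour rate is derived from the quoted numbers, not an official price.

GPU_HOURS = 2_788_000      # reported H800 GPU hours
COST_USD = 5_576_000       # reported training cost (~$5.576 million)

implied_rate = COST_USD / GPU_HOURS
print(f"Implied rental rate: ${implied_rate:.2f} per H800 GPU-hour")  # ~$2.00

# MoE sparsity: only a fraction of the 671B parameters are active per token.
TOTAL_PARAMS = 671e9
ACTIVE_PARAMS = 37e9
print(f"Active parameter fraction: {ACTIVE_PARAMS / TOTAL_PARAMS:.1%}")  # ~5.5%
```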

The release of DeepSeek v3 also includes API access at highly competitive prices: input costs $0.27 per million tokens ($0.07 with cache hits) and output $1.10 per million tokens. For comparison, Claude 3.5 Sonnet charges $3 per million input tokens and $15 per million output tokens. Combined with its strong benchmark results, this pricing positions DeepSeek v3 to disrupt the market on both model quality and affordability. The model was also released fully open source, with the associated papers and training frameworks provided to the research community.
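
To make the pricing gap concrete, the following sketch compares the cost of a single hypothetical request at the quoted per-million-token prices. The workload size (10,000 input tokens, 2,000 output tokens) is an assumed example, not a figure from the article.

```python
# Rough cost comparison at the quoted per-million-token prices.
# The request size below is a hypothetical workload, not from the article.

PRICES = {
    "DeepSeek v3":       {"input": 0.27, "output": 1.10},   # USD per 1M tokens
    "Claude 3.5 Sonnet": {"input": 3.00, "output": 15.00},
}

def request_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Cost in USD for one request at the listed prices."""
    p = PRICES[model]
    return (input_tokens * p["input"] + output_tokens * p["output"]) / 1_000_000

# Hypothetical workload: 10k input tokens, 2k output tokens per request.
for model in PRICES:
    print(f"{model}: ${request_cost(model, 10_000, 2_000):.4f} per request")
# DeepSeek v3: ~$0.0049, Claude 3.5 Sonnet: ~$0.06 -- roughly a 12x difference.
```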

References:
  • Hacker News: DeepSeek v3 beats Claude sonnet 3.5 and way cheaper
  • THE DECODER: Deepseek V3 emerges as China's most powerful open-source language model to date
  • github.com: DeepSeek_V3.pdf
  • www.marktechpost.com: The field of Natural Language Processing (NLP) has made significant strides with the development of large-scale language models (LLMs).