DeepSeek V4-Pro Claims Benchmark Parity With Top Closed-Source Models on Math and STEM
DeepSeek has released DeepSeek-V4-Pro (1.6T total / 49B active parameters, MoE) and V4-Flash (284B total / 13B active), both open-weight and live via API today. V4-Pro claims open-source state of the art on agentic coding and math/STEM benchmarks, rivaling closed-source frontier models, and makes 1M-token context the new default across all DeepSeek services. A novel sparse attention mechanism (DSA plus token-wise compression) underpins the efficiency claims, and the deepseek-chat and deepseek-reasoner endpoints are formally deprecated, with a July 2026 sunset.
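For teams on the deprecated endpoints, migration should amount to a model-id swap. Below is a minimal sketch assuming DeepSeek keeps its existing OpenAI-compatible API at https://api.deepseek.com; the identifier "deepseek-v4-pro" is an assumption, since the release notes here do not name the replacement API model ids:

```python
# Migration sketch: deepseek-chat / deepseek-reasoner -> V4-Pro.
# Assumes DeepSeek's existing OpenAI-compatible endpoint is unchanged.
from openai import OpenAI

client = OpenAI(
    api_key="YOUR_DEEPSEEK_API_KEY",      # placeholder credential
    base_url="https://api.deepseek.com",  # DeepSeek's current API host
)

response = client.chat.completions.create(
    # Hypothetical model id; the article does not specify the new ids.
    model="deepseek-v4-pro",
    messages=[{"role": "user", "content": "Summarize sparse attention in one line."}],
)
print(response.choices[0].message.content)
```

Because the API is OpenAI-compatible, existing request and response handling code should carry over unchanged; only the model field (and any context-length limits hard-coded for the old 128K window) needs revisiting.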
[FIG. 01: DeepSeek's sparse attention architecture reduces compute costs for 1M-token context.]
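The release does not detail how DSA or the token-wise compression are implemented. As a rough intuition for the efficiency claim, the sketch below shows a generic top-k sparse attention in PyTorch: each query attends only to a small selected subset of keys, so the attention step scales with the selection size k rather than the full sequence length. This is an illustrative stand-in, not DeepSeek's mechanism; the function name and top_k parameter are invented for the example.

```python
# Illustrative top-k sparse attention, NOT DeepSeek's actual DSA.
# Shows why selecting k << L keys per query cuts attention cost.
import torch
import torch.nn.functional as F

def topk_sparse_attention(q, k, v, top_k=64):
    """q, k, v: (L, d). Each query attends only to its top_k keys.
    The scoring pass is computed in full here for clarity; a real
    system would use a cheaper compressed/indexer pass to pick keys."""
    L, d = q.shape
    scores = q @ k.T / d**0.5                              # (L, L) selection scores
    top_idx = scores.topk(min(top_k, L), dim=-1).indices   # (L, top_k)
    k_sel = k[top_idx]                                     # (L, top_k, d)
    v_sel = v[top_idx]                                     # (L, top_k, d)
    attn = torch.einsum("ld,lkd->lk", q, k_sel) / d**0.5   # scores over selected keys
    w = F.softmax(attn, dim=-1)                            # (L, top_k)
    return torch.einsum("lk,lkd->ld", w, v_sel)            # (L, d)

out = topk_sparse_attention(torch.randn(128, 32), torch.randn(128, 32), torch.randn(128, 32))
print(out.shape)  # torch.Size([128, 32])
```

At 1M tokens, the softmax and value aggregation here touch only top_k keys per query instead of all of them, which is the kind of saving a sparse mechanism would need to make a 1M-token default economical.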

