Agree
Flash with 13 billion active parameters can reach reasoning performance comparable to GPT-5.2 and Gemini 3.0 Pro when given a larger thinking budget.
Author reports DeepSeek's claim about Flash performance.
Source: How Did DeepSeek Make V4 So Cheap?Agree
Pro Max is claimed as the strongest open model, beating previous open-source models on reasoning, coding, long context, and agentic benchmarks; on Artificial Analysis it wins Terminal Bench Hard and ranks second or third as an open-weights model.
Performance claims from DeepSeek and third-party benchmarks.
Source: How Did DeepSeek Make V4 So Cheap?Neutral
On reasoning, Pro Max falls slightly behind GPT 5.4 and Gemini 3.1 Pro, representing roughly a 3 to 6 month gap behind frontier closed models.
Honest self-assessment from the paper as described by author.
Source: How Did DeepSeek Make V4 So Cheap?Neutral
DeepSeek left some benchmark entries blank when comparing against Kimi K2.6 and GLM 5.1 because their APIs were too busy to return responses, indicating serving capacity issues.
Anecdote from the paper about missing benchmark results.
Source: How Did DeepSeek Make V4 So Cheap?