Agree
DeepSeek V4 Pro is on par with top closed models Opus 4.6 Max and Gemini 3.1 Pro across knowledge, reasoning, and agentic benchmarks.
The author shows benchmark charts and states it's 'pretty much on par with some of the top closed models out there, including Opus 4.6 Max and Gemini 3.1 Pro.'
Source: The insane engineering of Deepseek V4
Neutral
DeepSeek V4 achieved a perfect score of 120/120 on the Putnam 2025 undergraduate mathematics competition benchmark.
The video says 'Deepseek V4 achieved a perfect score, 120 out of 120' on the Putnam 2025.
Source: The insane engineering of Deepseek V4
Agree
At the extreme 1-million-token context length, DeepSeek V4's retrieval accuracy surpasses Google Gemini 3.1 Pro.
The transcript claims 'its retrieval accuracy even beats Google's latest Gemini 3.1 Pro' when pushed to the 1M limit.
Source: The insane engineering of Deepseek V4
Neutral
On the Artificial Analysis leaderboard, DeepSeek V4 Pro is the second-best open-source model, behind Kimi K2.6, and close to the top closed models.
The video references the independent leaderboard, which shows DeepSeek V4 Pro as the second-best open-source model and edging close to the top closed models.
Source: The insane engineering of Deepseek V4