Home/Bijan Bowen/DeepSeek V4

DeepSeek V4

Bijan Bowen · 23 Claims

Release
Neutral
DeepSeek V4 was released suddenly without warning on Hugging Face, with an official announcement on X following shortly after.
Author describes the event sequence as an unannounced HF appearance then official X post.
Source: DeepSeek V4 Is HERE – Testing the LARGEST Open Source Model Ever!
Open-source nature
Neutral
The model is open-source, with weights available on Hugging Face for anyone to download and run locally.
Author explains that model weights are uploaded, enabling local execution for those with sufficient hardware.
Source: DeepSeek V4 Is HERE – Testing the LARGEST Open Source Model Ever!
Hardware requirements
Neutral
The author does not have a machine capable of running the V4 Pro model locally.
Author states he cannot run the V4 Pro model on his own system.
Source: DeepSeek V4 Is HERE – Testing the LARGEST Open Source Model Ever!
Flash model
Neutral
DeepSeek V4 Flash has 158 billion parameters and is a good candidate for hobbyist/enthusiast local setups.
Author highlights the Flash variant's parameter count and its suitability for enthusiasts.
Source: DeepSeek V4 Is HERE – Testing the LARGEST Open Source Model Ever!
Pro model specs
Neutral
DeepSeek V4 Pro is a 1.66 trillion parameter mixture-of-experts model with 49 billion active parameters and a 1 million token context length.
Provides technical specifications directly from the model page.
Source: DeepSeek V4 Is HERE – Testing the LARGEST Open Source Model Ever!
Performance benchmarks
Neutral
Benchmarks compare V4 to older Opus 4.6, GPT 5.4, and Gemini 3.1 Pro High, showing comparable or superior performance in some tests.
Author notes the benchmark chart comparisons, though he personally does not emphasize benchmarks.
Source: DeepSeek V4 Is HERE – Testing the LARGEST Open Source Model Ever!
Efficiency
Neutral
V4 models are significantly more efficient than V3.2 in computational cost and memory usage.
Citing a comparison chart, the author points out lower computational cost and memory requirements.
Source: DeepSeek V4 Is HERE – Testing the LARGEST Open Source Model Ever!
Technical limits
Neutral
The model has a maximum output length of 384K tokens.
Author reads this specification from the announcement.
Source: DeepSeek V4 Is HERE – Testing the LARGEST Open Source Model Ever!
API pricing
Neutral
API pricing is $1.74 per million input tokens and $348 per million output tokens for V4 Pro, while GPT-5.5 costs $5 per million input and $30 per million output.
Pricing comparison provided as reported by the author, though the output token cost for V4 Pro seems exceptionally high.
Source: DeepSeek V4 Is HERE – Testing the LARGEST Open Source Model Ever!
Browser OS test
Neutral
In the browser OS test, V4 Pro generated a 2700-line webOS with a functional right-click menu, correct clock, and Windows-style start menu.
Observation from the first test result delivered via the custom chat UI.
Source: DeepSeek V4 Is HERE – Testing the LARGEST Open Source Model Ever!
GTA clone quality
Neutral
The GTA clone included buildings, trees, grid streets, and an NPC car but lacked a wanted system or pickups, with the car spawning inside a building.
Detail comparison with GPT-5.5's result; notes mesh colliders but missing interactivity.
Source: DeepSeek V4 Is HERE – Testing the LARGEST Open Source Model Ever!
WebOS apps
Neutral
Initial calculator, notepad, terminal, and file explorer apps did not load; the issues were resolved by fixing the code in Open Code.
Documents the initial bug and subsequent fix using Open Code.
Source: DeepSeek V4 Is HERE – Testing the LARGEST Open Source Model Ever!
Terminal capability
Neutral
The terminal could fully control the GUI environment, including moving windows and changing backgrounds via commands.
Shows a unique feature discovered after the fix.
Source: DeepSeek V4 Is HERE – Testing the LARGEST Open Source Model Ever!
Subway scene test
Neutral
The subway scene featured arched ceilings, vending machines, and detailed tiles, but movement was inverted and stairs lacked collision.
Author describes the generated 3D subway environment and its quirks.
Source: DeepSeek V4 Is HERE – Testing the LARGEST Open Source Model Ever!
Flight simulator test
Agree
The flight combat simulator had an impressive plane selection screen with hover effects, but no enemies were visible during gameplay.
Author praised the start screen as one of the more impressive ones but noted the absence of enemies.
Source: DeepSeek V4 Is HERE – Testing the LARGEST Open Source Model Ever!
Drum kit simulation
Agree
The drum kit simulation (via chat.deepseek.com) produced good model details, stands, and an autoplay feature; the hip-hop beat was notably hard.
Author highlighted sound quality and model rendering, particularly the hip-hop autoplay beat.
Source: DeepSeek V4 Is HERE – Testing the LARGEST Open Source Model Ever!
3D printer simulation
Neutral
The 3D printer simulation with thinking mode showed a coreXY printer with correct filament path and stepped bed movement; without thinking, it was basic pancake stacking.
Direct comparison between thinking and non-thinking modes demonstrates significant quality difference.
Source: DeepSeek V4 Is HERE – Testing the LARGEST Open Source Model Ever!
Skateboarding game
Neutral
The C++ skateboarding game had a boardwalk aesthetic with buildings and many humanoids, but glitches like falling through the map undermined its potential.
Author saw promise in the environment and character density but noted major glitches.
Source: DeepSeek V4 Is HERE – Testing the LARGEST Open Source Model Ever!
UI design test
Neutral
The front-end AI analytics dashboard was clean and aesthetically pleasing but had a non-functional search bar.
Evaluates the static UI as well-made visually but lacking interactive search.
Source: DeepSeek V4 Is HERE – Testing the LARGEST Open Source Model Ever!
Overall verdict
Neutral
Overall, the V4 Pro preview model produced satisfactory but not mind-blowing results, feeling like a preview release.
Author's summary evaluation of the model's performance across all tests.
Source: DeepSeek V4 Is HERE – Testing the LARGEST Open Source Model Ever!
Cost efficiency
Neutral
Total testing cost was $3.68, and the model's API pricing is significantly cheaper than competitors like GPT-5.5 and Opus.
Author reports actual cost incurred and emphasizes cost advantage.
Source: DeepSeek V4 Is HERE – Testing the LARGEST Open Source Model Ever!
Open-weight milestone
Neutral
DeepSeek V4 is the largest ever open-weight model release.
Author asserts this as a fact about the release.
Source: DeepSeek V4 Is HERE – Testing the LARGEST Open Source Model Ever!
Flash potential
Agree
The Flash version's size makes it very exciting for local AI enthusiasts.
Author explicitly states excitement for the local-user potential of the smaller Flash model.
Source: DeepSeek V4 Is HERE – Testing the LARGEST Open Source Model Ever!