DeepSeek V4 Pro costs 15x more to run than V3.2 and the efficiency narrative just got complicated

Independent benchmarking data from Artificial Analysis shows DeepSeek V4 Pro costs $1,071 to evaluate on the full Intelligence Index benchmark, roughly 15 times the cost of running V3.2 through the same suite, a figure that forces a more nuanced read of what DeepSeek's latest model actually delivers and for whom.

The headline from today's DeepSeek V4 launch has been capability: a 1.6 trillion parameter Mixture-of-Experts model with a one-million-token context window, open-source weights, and benchmark scores that challenge the best closed-source models from OpenAI and Anthropic. That story is accurate, and it represents a significant leap forward for the open-source community. The story running underneath it, surfaced by Artificial Analysis's benchmark cost data and trending on Reddit's r/singularity throughout Friday, is that V4 Pro is not a cheap model to operate. At $1.74 per million input tokens and $3.48 per million output tokens, it costs Artificial Analysis exactly $1,071 to run its comprehensive Intelligence Index evaluation, compared to a mere $71 for DeepSeek V3.2 on the exact same benchmark. This massive pricing gap is impossible to ignore. The 15x cost increase is real, and it matters fundamentally for how developers and enterprises will actually use the model in daily production.

The context that makes this less alarming than the raw numbers suggest is exactly what you get for that higher cost. DeepSeek V4 Pro is absolutely not V3.2 with a higher price sticker slapped on the API. It is a fundamentally different and much more powerful architecture. We are looking at 1.6 trillion total parameters versus V3.2's 671 billion. You also get a full one-million-token context window powered by the new Engram memory architecture, native multimodal input for handling diverse data types, and benchmark scores that include 80.6% on SWE-bench Verified and 93.5 on LiveCodeBench. These are the highest coding scores of any model publicly evaluated to date. The Reddit thread discussing the cost jump reached an early consensus that makes practical sense: for complex agentic workflows and tasks that actually require V4 Pro's elite capabilities, the operational cost is completely justified. For routine tasks that V3.2 already handles adequately, DeepSeek V4 Flash at $0.14 per million input tokens, which is roughly one-twelfth the price, is the sensible and cost-effective choice.

The framing of V4 Pro as simply an upgrade misses the reality of how DeepSeek is positioning its new lineup. It is a specialized tool designed to go head-to-head with premium models like GPT-4o and Claude 3.5 Sonnet. When you compare V4 Pro's pricing to those premium Western competitors, the cost looks entirely different. You are still getting incredible value for the performance. Enterprise engineering teams looking to automate large-scale code refactors, complex debugging, or advanced data extraction will find V4 Pro highly competitive. Smaller startups and independent developers, however, need to be highly strategic about their routing. Sending basic text summarization or simple customer support queries to V4 Pro will quickly drain your API budget. The smart move is implementing tiered model routing. You use V3.2 or V4 Flash for your bulk daily operations, and reserve V4 Pro specifically for the heavy lifting that requires its unmatched reasoning and coding proficiency. This ensures you leverage their breakthroughs without breaking the bank, proving that reading beyond the benchmark headlines is essential for real-world implementation.

Also read: Researchers say deep learning can have a rigorous scientific theory and the math to prove it is already emerging • A microwave clock and a Reddit thread just taught millions of people how AI image editing works • How to Build a Local Marketing Agency Without a Large Team - Using Agentic Agency