CLUSTER · TIER 2
NVIDIA Blackwell inference stack cuts DeepSeek V4 token costs by 80%.
NVIDIA's Blackwell inference software delivers a 5x throughput improvement for DeepSeek V4 through hardware-software co-design, cutting token costs to roughly one-fifth of previous levels, with reported gains of up to 20x higher throughput on the same GPU.
Sources
2
X mentions
5.9k ▲
First seen
2d ago
Velocity
+5%/6h