Rising · 1 source · last seen 5h ago · first seen 5h ago

The exact KV cache usage of DeepSeek V4

Figure 1 of the DSV4 paper seems to imply that DSV3.2 uses ~50 GB at 1M context and DSV4 uses ~5 GB: https://huggingface.co/deepseek-ai/DeepSeek-V4-Pro/blob/main/DeepSeek_V4.pdf From my own calculations, the correct FP16
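For readers who want to sanity-check figures like these, here is a minimal back-of-the-envelope sketch of the usual KV cache arithmetic: tokens × layers × cached elements per token per layer × bytes per element. The function names and every input value below are assumptions chosen for illustration (an MLA-style cache with a 512-dim latent plus a 64-dim RoPE key over 61 layers). They are not taken from the post or the DSV4 paper, and the sketch does not attempt to reproduce the ~50 GB or ~5 GB figures.

```python
def kv_cache_bytes(tokens, layers, elems_per_token_per_layer, bytes_per_elem):
    """Total cache size in bytes for one sequence.

    elems_per_token_per_layer is whatever the attention variant stores per
    token in each layer (e.g. K and V heads, or a compressed latent).
    """
    return tokens * layers * elems_per_token_per_layer * bytes_per_elem


def to_gb(n_bytes):
    """Convert bytes to decimal gigabytes (1 GB = 1e9 bytes)."""
    return n_bytes / 1e9


# Hypothetical inputs, for illustration only:
TOKENS = 1_000_000          # "1M context", taken as exactly one million tokens
LAYERS = 61                 # assumed layer count
ELEMS = 512 + 64            # assumed compressed latent + RoPE key dims
FP16_BYTES = 2              # FP16 stores 2 bytes per element

example = kv_cache_bytes(TOKENS, LAYERS, ELEMS, FP16_BYTES)
print(f"{to_gb(example):.1f} GB")  # prints "70.3 GB" for these assumed inputs
```

Swapping in a different element count, layer count, or 1 byte per element for FP8 shows how quickly the total moves, which is why the precision and the exact per-token layout matter when comparing numbers like these.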

Lead: r/LocalLLaMA · Bigness: 25 · exact, cache, usage, deepseek
📡 Coverage: 10 (1 news source)
🟠 Hacker News: 0
🔴 Reddit: 59 (72 upvotes across 1 sub)
📈 Google Trends: 0

Receipts (all sources)

The exact KV cache usage of DeepSeek V4
REDDIT · r/LocalLLaMA · 5h ago · ⬆ 72 · 💬 34
score 120
