Cluster · 1 source · last seen 1h ago · first seen 1h ago

Getting a feel for how fast X tokens/second really is.

I love following all your adventures with local LLM setups. Quality and size of the models are important, but so is performance. Numbers don't really convey the experienced speed well, however. If someone claims they run Qwen 3.6-27B at 21 tokens/second, how fast is that? Is 10 tokens/second unusab
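One way to put a number like 21 tokens/second in perspective is to convert it to words per minute and to replay text at that rate. The sketch below is illustrative only: the 0.75 words-per-token ratio is an assumed rule of thumb for English text (the real ratio depends on the tokenizer and the content), the function names are hypothetical, and the simulator uses whitespace-separated words as stand-ins for tokens.

```python
import sys
import time

# Assumption: roughly 0.75 English words per token. This is a rule of thumb,
# not a property of any specific model or tokenizer.
WORDS_PER_TOKEN = 0.75


def words_per_minute(tokens_per_second, words_per_token=WORDS_PER_TOKEN):
    """Convert a generation rate in tokens/second to approximate words/minute."""
    return tokens_per_second * words_per_token * 60


def seconds_for_reply(num_tokens, tokens_per_second):
    """Time to generate a reply of num_tokens at a steady rate."""
    return num_tokens / tokens_per_second


def simulate_stream(text, tokens_per_second, out=sys.stdout, sleep=time.sleep):
    """Write words one at a time, pausing 1/rate seconds after each.

    Words stand in for tokens here, so this slightly overstates the
    perceived speed compared with a real tokenizer.
    """
    delay = 1.0 / tokens_per_second
    for word in text.split():
        out.write(word + " ")
        out.flush()
        sleep(delay)
    out.write("\n")


if __name__ == "__main__":
    for rate in (10, 21):
        print(f"{rate} tok/s is about {words_per_minute(rate):.0f} words/min; "
              f"a 500-token reply takes {seconds_for_reply(500, rate):.1f} s")
    simulate_stream("The quick brown fox jumps over the lazy dog", 21)
```

Under the assumed ratio, 10 tokens/second works out to about 450 words per minute and 21 tokens/second to about 945. Whether that feels fast depends on whether the output is read as it streams or only once it has finished.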

Lead: r/LocalLLaMA · Bigness: 24 · gettingfeelfasttokenssecond
📡 Coverage: 10 (1 news source)
🟠 Hacker News: 0
🔴 Reddit: 57 (70 upvotes across 1 sub)
📈 Google Trends: 0

Receipts (all sources)

Getting a feel for how fast X tokens/second really is.
REDDIT · r/LocalLLaMA · 1h ago · ⬆ 70 · 💬 23
score 126
