Cluster · 1 source · last seen 1h ago · first seen 1h ago

Getting a feel for how fast X tokens/second really is.

I love following all your adventures with local LLM setups. Quality and size of the models are important, but so is performance. Numbers don't really convey the experienced speed well, however. If someone claims they run Qwen 3.6-27B at 21 tokens/second, how fast is that? Is 10 tokens/second unusab
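One rough way to get a feel for a tokens/second figure is to convert it to words per minute and to watch text appear at that pace. The sketch below is only an illustration, not anything from the post: the 0.75 words-per-token ratio is an assumed rule of thumb for English text (real tokenizers vary by model and language), and the helper names `tokens_per_second_to_wpm`, `seconds_for_reply`, and `stream_demo` are hypothetical. The demo also treats each whitespace-separated word as one token, which slightly overstates the real speed.

```python
import sys
import time

# Assumption: roughly 0.75 English words per token. Real tokenizers differ.
WORDS_PER_TOKEN = 0.75


def tokens_per_second_to_wpm(tps, words_per_token=WORDS_PER_TOKEN):
    """Convert a generation speed in tokens/second to approximate words/minute."""
    return tps * words_per_token * 60


def seconds_for_reply(n_tokens, tps):
    """Time in seconds to generate a reply of n_tokens at tps tokens/second."""
    return n_tokens / tps


def stream_demo(text, tps, out=sys.stdout, sleep=time.sleep):
    """Print text one word at a time, pausing 1/tps seconds after each word.

    Simplification: each whitespace-separated word is treated as one token.
    """
    for word in text.split():
        out.write(word + " ")
        out.flush()
        sleep(1.0 / tps)
    out.write("\n")
```

Under these assumptions, 21 tokens/second works out to about 945 words per minute, and a 500-token reply at 10 tokens/second takes 50 seconds. Calling `stream_demo(some_paragraph, 21)` and then `stream_demo(some_paragraph, 10)` in a terminal gives a side-by-side impression of the two speeds.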

Lead: r/LocalLLaMA · Bigness: 24

📡 Coverage

1 news source

🟠 Hacker News

🔴 Reddit

70 upvotes across 1 sub

📈 Google Trends

Receipts (all sources)

REDDIT · r/LocalLLaMA · 1h ago · ⬆ 70 · 💬 23

score 126