Rising1 sources· last seen 2h ago· first seen 2h ago

BeeLlama v0.2.0 – major DFlash update. Single RTX 3090: Qwen 3.6 27B up to 164 tps (4.40x), Gemma 4 31B up to 177.8 tps (4.93x). Prompt processing speed near baseline.

**BeeLlama v0.2.0 is here!** >Not quite a pegasus, but close enough. [**GitHub**](https://github.com/Anbeeld/beellama.cpp) **|** [**Qwen 3.6 27B Quick Start**](https://github.com/Anbeeld/beellama.cpp/blob/main/docs/quickstart-qwen36-dflash.md) **|** [**Gemma 4 31B Quick Start**](https://github.

Lead: r/LocalLLaMABigness: 25beellamamajordflashsinglertx

Open primary source

📡 Coverage

1 news source

🟠 Hacker News

🔴 Reddit

61 upvotes across 1 sub

📈 Google Trends

Full methodology: How scoring works

Receipts (all sources)

BeeLlama v0.2.0 – major DFlash update. Single RTX 3090: Qwen 3.6 27B up to 164 tps (4.40x), Gemma 4 31B up to 177.8 tps (4.93x). Prompt processing speed near baseline.

REDDIT · r/LocalLLaMA · 2h ago · ⬆ 61 · 💬 48

score 124