Rising1 sources· last seen 1d ago· first seen 1d ago

Gemma 4 31B beats several frontier models on the FoodTruck Bench

Gemma 4 31B takes an incredible 3rd place on FoodTruck Bench, beating GLM 5, Qwen 3.5 397B and all Claude Sonnets! I'm looking forward to how they'll explain the result. Based on the previous models that failed to finish the run, it would seem that Gemma 4 handles long horizon tasks better and actu

Lead: r/LocalLLaMABigness: 29gemma31bbeatsseveralfrontier
📡 Coverage
10
1 news source
🟠 Hacker News
0
🔴 Reddit
69
660 upvotes across 1 sub
📈 Google Trends
0
Full methodology: How scoring works

Receipts (all sources)

Gemma 4 31B beats several frontier models on the FoodTruck Bench
REDDIT · r/LocalLLaMA · 1d ago · ⬆ 660 · 💬 108
score 107

Gemma 4 31B takes an incredible 3rd place on FoodTruck Bench, beating GLM 5, Qwen 3.5 397B and all Claude Sonnets! I'm looking forward to how they'll explain the result. Based on the previous models that failed to finish the run, it would seem that Gemma 4 handles long horizon tasks better and actu

Related clusters