Rising · 1 source · last seen 6h ago · first seen 6h ago

I tested as many of the small local and OpenRouter models as I could with my own agentic text-to-SQL benchmark. Surprises ensued...

Last week I asked for feedback on which extra models I should test. I've added them all, and the benchmark is now available at [https://sql-benchmark.nicklothian.com/](https://sql-benchmark.nicklothian.com/). I didn't say a lot about the agent at the time, but in simple terms it takes an...

Lead: r/LocalLLaMA · Bigness: 28 · tested · small · local · openrouter · own

📡 Coverage

1 news source

🟠 Hacker News

🔴 Reddit

137 upvotes across 1 sub

📈 Google Trends

Receipts (all sources)

I tested as many of the small local and OpenRouter models as I could with my own agentic text-to-SQL benchmark. Surprises ensued...

REDDIT · r/LocalLLaMA · 6h ago · ⬆ 137 · 💬 37

score 124