Rising · 1 source · last seen 6h ago · first seen 6h ago

I tested as many of the small local and OpenRouter models as I could with my own agentic text-to-SQL benchmark. Surprises ensued...

Last week I asked for some feedback about what extra models I should test. I've added them all, and the benchmark is now available at [https://sql-benchmark.nicklothian.com/](https://sql-benchmark.nicklothian.com/). I didn't say a lot about the agent at the time, but in simple terms it takes an

Lead: r/LocalLLaMA · Bigness: 28 · tested, small, local, openrouter, own
📡 Coverage: 10 (1 news source)
🟠 Hacker News: 0
🔴 Reddit: 66 (137 upvotes across 1 sub)
📈 Google Trends: 0
Full methodology: How scoring works
