Rising1 sources· last seen 8h ago· first seen 8h ago

Qwen 3.5 craters on hard coding tasks — tested all Qwen3.5 models (And Codex 5.3) on 70 real repos so you don't have to.

Hey everyone, some of you might remember [https://www.reddit.com/r/LocalLLaMA/comments/1r7shtv/i\_built\_a\_benchmark\_that\_tests\_coding\_llms\_on/](https://www.reddit.com/r/LocalLLaMA/comments/1r7shtv/i_built_a_benchmark_that_tests_coding_llms_on/) where I shared APEX Testing — my benchmark that

Lead: r/LocalLLaMABigness: 32qwencratershardcodingtasks
📡 Coverage
10
1 news source
🟠 Hacker News
0
🔴 Reddit
79
331 upvotes across 1 sub
📈 Google Trends
0
Full methodology: How scoring works

Receipts (all sources)

Hey everyone, some of you might remember [https://www.reddit.com/r/LocalLLaMA/comments/1r7shtv/i\_built\_a\_benchmark\_that\_tests\_coding\_llms\_on/](https://www.reddit.com/r/LocalLLaMA/comments/1r7shtv/i_built_a_benchmark_that_tests_coding_llms_on/) where I shared APEX Testing — my benchmark that