Cluster1 sources· last seen 1h ago· first seen 1h ago

p-e-w/gemma-4-E2B-it-heretic-ara: Gemma 4's defenses shredded by Heretic's new ARA method 90 minutes after the official release

Google's Gemma models have long been known for their strong "alignment" (censorship). I am happy to report that even the latest iteration, Gemma 4, is not immune to Heretic's new [Arbitrary-Rank Ablation (ARA)](https://github.com/p-e-w/heretic/pull/211) method, which uses matrix optimization to supp

Lead: r/LocalLLaMABigness: 23p-e-wgemma-4-e2b-it-heretic-aragemma4'sdefenses
📡 Coverage
10
1 news source
🟠 Hacker News
0
🔴 Reddit
52
43 upvotes across 1 sub
📈 Google Trends
0
Full methodology: How scoring works

Receipts (all sources)

Google's Gemma models have long been known for their strong "alignment" (censorship). I am happy to report that even the latest iteration, Gemma 4, is not immune to Heretic's new [Arbitrary-Rank Ablation (ARA)](https://github.com/p-e-w/heretic/pull/211) method, which uses matrix optimization to supp