AI Intelligence Briefing: The SaaSpocalypse, METR's Terrifying Graph, DeepSeek's Blackwell Scandal, and More
Inside Anthropic’s existential negotiations with the Pentagon
Anthropic's weekslong battle with the Department of Defense has played out over social media posts, admonishing public statements, and direct quotes from unnamed Pentagon officials to the news media. But the future of the $380 billion AI startup comes down to just three words: "any lawful use." The
Meta strikes up to $100B AMD chip deal as it chases ‘personal superintelligence’
Meta is buying billions of dollars in AMD AI chips in a multiyear deal tied to a 160 million-share warrant, deepening its push to diversify beyond Nvidia and expand data center capacity.
Anthropic Education Report: The AI Fluency Index
Anthropic Education Report: The AI Fluency Index Anthropic
How it feels listening to Anthropic complain about competitors distilling their models
Anthropic Banned OpenClaw: The OAuth Lockdown That Fractured the Claude Developer Community
Anthropic officially banned the use of Claude subscription OAuth tokens in all third-party tools including OpenClaw. Full timeline, economics, and what it means.
Gemini 3.1 Pro Is Here: Google's Reasoning Just Jumped 2.5× in Three Months
Google ships Gemini 3.1 Pro with a verified 77.1% on ARC-AGI-2 — more than double Gemini 3 Pro. It's now the second-best reasoning model behind Deep Think, and it's available to everyone today.
Anthropic Studied Millions of AI Agent Sessions. The Biggest Finding: Humans Are the Bottleneck.
Anthropic analyzed millions of human-AI interactions and found a massive 'deployment overhang' — AI can handle 5-hour tasks, but users cap them at 42 minutes.
DeepMind's Aletheia Solved 4 Problems No Human Could — And Got 68.5% of Everything Else Wrong
Google DeepMind's Aletheia is an AI research agent that wrote a publishable paper solo and cracked unsolved math problems. But its 6.5% useful answer rate tells the real story.
OpenAI's AI Can Now Hack 72% of Smart Contract Vulnerabilities — And That's the Good News
OpenAI and Paradigm launch EVMbench, a benchmark showing AI agents can exploit most known smart contract bugs. The offense-defense gap is the real story.
Claude Sonnet 4.6: The Mid-Range Model That Keeps Embarrassing Flagships
Anthropic's new Sonnet 4.6 nearly matches Opus on coding and reasoning, crushes computer use benchmarks, and somehow beats every model at office work and finance. Full benchmark breakdown inside.
Anthropic vs The Pentagon: When Your AI Company Gets Treated Like a Foreign Adversary
The Pentagon is threatening to designate Anthropic as a 'supply chain risk' over Claude's military usage guardrails. Here's what's actually happening.
Perplexity.ai tries to connect via UDP without being open
New Benchmark "InsanityBench", Gemini 3.1 Pro scores 15%
InsanityBench is supposed to be a benchmark encapsulating something we deeply care about (the "insane" leaps of creativity often needed in science), can hardly be gamed (because every task is completely different from another) and is nowhere near saturated yet (the best model scores 15%). Leaderboa
People are getting it wrong; Anthropic doesn't care about the distillation, they just want to counter the narrative about Chinese open-source models catching up with closed-source frontier models
Why would they care about distillation when they probably have done the same with OpenAI models and the Chinese labs are paying for the tokens? This is just their attempt to explain to investors and the US government that cheap Chinese models will never be as good as their models without distillatio
Anthropic Says Chinese Firms Used Claude Data to Improve Models
Anthropic claims DeepSeek and two other Chinese AI companies misused its Claude AI model in an attempt to improve their own products. In an announcement on Monday, Anthropic says the "industrial-scale campaigns" involved the creation of around 24,000 fraudulent accounts and more than 16 million exch
OpenAI makes GPT-5.3-Codex available through their API
xAI and Pentagon reach deal to use Grok in classified systems, Anthropic Given Ultimatum
https://www.axios.com/2026/02/23/ai-defense-department-deal-musk-xai-grok >Elon Musk's artificial intelligence company xAI has signed an agreement to allow the military to use its model, Grok, in classified systems, a Defense official confirmed to Axios. >Why it matters: Up to now, Anthropic
Software stocks rebound as Anthropic announces new partnerships
Connected LFM2.5-VL-1.6B to my Blink security camera — 51 tokens/sec with APPLE GPU
I've tested a lot of local VLMs for security camera analysis — SmolVLM2, Qwen3-VL, MiniCPM-V, LLaVA. LFM2.5-VL-1.6B from LiquidAI is the one I keep coming back to. Here's why. **One example output:** >"A mailman is delivering mail to a suburban house. The mailman is wearing a blue uniform and