Perplexity launches Perplexity Computer, a new multi-model system that can solve tasks end-to-end, details below
**Perplexity AI:** Introducing Perplexity Computer. Computer **unifies** every current AI capability into one system. It can research, design, code, deploy and manage any project end-to-end. Perplexity Computer is massively multi-model. Computer orchestrates models to **run agents** in parallel, le
Google relaunches its AI creative studio Flow with new features and integrations
Google is turning its AI creative studio Flow into an all-in-one tool for images and videos, with free image generators and new editing features. The article Google relaunches its AI creative studio Flow with new features and integrations appeared first on The Decoder.
Anthropic at the Rubicon: The Maduro Raid, the Pentagon Ultimatum, and the Death of 'Safety First'
Claude was used in the Venezuela raid. The Pentagon issued a Friday ultimatum. Anthropic gutted its safety pledge. A complete timeline of the crisis.
Andrej Karpathy: Programming Changed More in the Last 2 Months Than in Years
Karpathy says coding agents crossed a reliability threshold in December and can now handle long, multi-step tasks autonomously. He describes this as a major shift from writing code manually to orchestrating AI agents. **Source:** Andrej [Tweet](https://x.com/i/status/2026731645169185220)
AI Intelligence Briefing: The SaaSpocalypse, METR's Terrifying Graph, DeepSeek's Blackwell Scandal, and More
Google Gemini can book an Uber or order food for you on Pixel 10 and Galaxy S26
Gemini can now prep a rideshare or grocery order, though you’ll have to submit the order yourself. | Image: Google Google's Gemini AI is getting one step closer to being more like an actual assistant. Starting with some Pixel 10 phones and the Samsung Galaxy S26 series, Gemini will be able to hail
Sonnet 4.6 states "I am DeepSeek-V3, an AI assistant developed by DeepSeek" when asked "what model are you" by multiple users in Chinese
Anthropic Banned OpenClaw: The OAuth Lockdown That Fractured the Claude Developer Community
Anthropic officially banned the use of Claude subscription OAuth tokens in all third-party tools including OpenClaw. Full timeline, economics, and what it means.
Gemini 3.1 Pro Is Here: Google's Reasoning Just Jumped 2.5× in Three Months
Google ships Gemini 3.1 Pro with a verified 77.1% on ARC-AGI-2 — more than double Gemini 3 Pro. It's now the second-best reasoning model behind Deep Think, and it's available to everyone today.
Anthropic Studied Millions of AI Agent Sessions. The Biggest Finding: Humans Are the Bottleneck.
Anthropic analyzed millions of human-AI interactions and found a massive 'deployment overhang' — AI can handle 5-hour tasks, but users cap them at 42 minutes.
DeepMind's Aletheia Solved 4 Problems No Human Could — And Got 68.5% of Everything Else Wrong
Google DeepMind's Aletheia is an AI research agent that wrote a publishable paper solo and cracked unsolved math problems. But its 6.5% useful answer rate tells the real story.
OpenAI's AI Can Now Hack 72% of Smart Contract Vulnerabilities — And That's the Good News
OpenAI and Paradigm launch EVMbench, a benchmark showing AI agents can exploit most known smart contract bugs. The offense-defense gap is the real story.
Qwen 3.5 craters on hard coding tasks — tested all Qwen3.5 models (And Codex 5.3) on 70 real repos so you don't have to.
Hey everyone, some of you might remember [https://www.reddit.com/r/LocalLLaMA/comments/1r7shtv/i\_built\_a\_benchmark\_that\_tests\_coding\_llms\_on/](https://www.reddit.com/r/LocalLLaMA/comments/1r7shtv/i_built_a_benchmark_that_tests_coding_llms_on/) where I shared APEX Testing — my benchmark that
Claude can now jump between Excel and PowerPoint on its own
Anthropic now lets Claude switch independently between Excel and PowerPoint - for example, running an analysis and then building a presentation directly from the results. The article Claude can now jump between Excel and PowerPoint on its own appeared first on The Decoder.
Anthropic Drops Flagship Safety Pledge
Anthropic scrapped its 2023 promise to halt AI training if safety measures fell behind, with CEO Dario Amodei approving a revamped policy, TIME reported
Anthropic is the leading contributor to open weight models
It just happens to be entirely against their will and TOS. I say: Distill Baby Distill!
Alphabet-owned robotics software company Intrinsic joins Google
Nearly five years after graduating into an independent Alphabet company, Intrinsic is moving under Google's domain.
update your llama.cpp for Qwen 3.5
Qwen 3.5 27B multi-GPU crash fix [https://github.com/ggml-org/llama.cpp/pull/19866](https://github.com/ggml-org/llama.cpp/pull/19866) prompt caching on multi-modal models [https://github.com/ggml-org/llama.cpp/pull/19849](https://github.com/ggml-org/llama.cpp/pull/19849) [https://github.com/ggml