Google releases Gemini 3.1 Pro with improved reasoning capabilities
With Gemini 3.1 Pro, Google wants to improve the core intelligence of its model family. On a demanding reasoning benchmark, performance has more than doubled compared to its predecessor. But benchmarks are just that: benchmarks.
Receipts (all sources)
Definitely a noticeable improvement. Some notes:
* The actual JSONs which were created from the model's output were noticeably *much* longer than 3.0 Pro; the model's increase in output length is very nice 😋
* The model actually created JSONs which were over 50MB long (for which I actually had t
Google ships Gemini 3.1 Pro with a verified 77.1% on ARC-AGI-2, more than double Gemini 3 Pro's score. It's now the second-best reasoning model behind Deep Think, and it's available to everyone today.