Massive · 2 sources · last seen 17h ago · first seen 18h ago
GPT-5.5 slightly outperformed Mythos on a multi-step cyber-attack simulation. One challenge that took a human expert 12 hours took GPT-5.5 only 11 minutes, at a cost of $1.73.
Link to tweets: https://x.com/deredleritt3r/status/2049890601236390098?s=20 https://x.com/AISecurityInst/status/2049868227740565890?s=20
Link to associated blogs: [https://www.aisi.gov.uk/blog/our-evaluation-of-openais-gpt-5-5-cyber-capabilities](https://www.aisi.gov.uk/blog/our-evaluation-of-op
Lead: r/singularity · Bigness: 83 · gpt-5 · second · complete · aisi · multi-step
📡 Coverage: 50 (2 news sources)
🟠 Hacker News: 27 (4 pts, 1 comment)
🔴 Reddit: 88 (794 upvotes across 1 sub)
📈 Google Trends: 0
Receipts (all sources)
GPT-5.5 is the second model to complete AISI multi-step cyber-attack simulation
HACKERNEWS · Hacker News · 17h ago · ▲ 4 · 💬 1
score 130
GPT-5.5 slightly outperformed Mythos on a multi-step cyber-attack simulation. One challenge that took a human expert 12 hours took GPT-5.5 only 11 minutes, at a cost of $1.73.
REDDIT · r/singularity · 18h ago · ⬆ 794 · 💬 155
score 118