Rising1 sources· last seen 9h ago· first seen 9h ago
A Chinese AI lab just built an AI that writes CUDA code better than torch.compile. 40% better than Claude Opus 4.5. on the hardest benchmark.
Paper: https://cuda-agent.github.io/ Abstract GPU kernel optimization is fundamental to modern deep learning but remains a specialized task requiring deep hardware expertise. Existing CUDA code generation approaches either rely on training-free refinement or fixed execution-feedback loops, which l
Lead: r/singularityBigness: 30chineselabbuiltwritescuda
📡 Coverage
10
1 news source
🟠 Hacker News
0
🔴 Reddit
73
260 upvotes across 1 sub
📈 Google Trends
0
Full methodology: How scoring works
Receipts (all sources)
A Chinese AI lab just built an AI that writes CUDA code better than torch.compile. 40% better than Claude Opus 4.5. on the hardest benchmark.
REDDIT · r/singularity · 9h ago · ⬆ 260 · 💬 30
score 124
Paper: https://cuda-agent.github.io/ Abstract GPU kernel optimization is fundamental to modern deep learning but remains a specialized task requiring deep hardware expertise. Existing CUDA code generation approaches either rely on training-free refinement or fixed execution-feedback loops, which l