Rising1 sources· last seen 9h ago· first seen 9h ago

A Chinese AI lab just built an AI that writes CUDA code better than torch.compile. 40% better than Claude Opus 4.5. on the hardest benchmark.

Paper: https://cuda-agent.github.io/ Abstract GPU kernel optimization is fundamental to modern deep learning but remains a specialized task requiring deep hardware expertise. Existing CUDA code generation approaches either rely on training-free refinement or fixed execution-feedback loops, which l

Lead: r/singularityBigness: 30chineselabbuiltwritescuda
📡 Coverage
10
1 news source
🟠 Hacker News
0
🔴 Reddit
73
260 upvotes across 1 sub
📈 Google Trends
0
Full methodology: How scoring works

Receipts (all sources)

Paper: https://cuda-agent.github.io/ Abstract GPU kernel optimization is fundamental to modern deep learning but remains a specialized task requiring deep hardware expertise. Existing CUDA code generation approaches either rely on training-free refinement or fixed execution-feedback loops, which l