Cluster1 sources· last seen 11h ago· first seen 11h ago
Code Review Dataset: 200k+ Cases of Human-Written Code Reviews from Top OSS Projects
I compiled 200k+ human-written code reviews from top OSS projects including React, Tensorflow, VSCode, and more. This dataset helped me finetune a version of Qwen2.5-Coder-32B-Instruct specialized in code reviews. The finetuned model showed significant improvements in generating better cod
Lead: r/LocalLLaMABigness: 23codereviewdataset200kcases
📡 Coverage
10
1 news source
🟠 Hacker News
0
🔴 Reddit
54
58 upvotes across 1 sub
📈 Google Trends
0
Full methodology: How scoring works
Receipts (all sources)
Code Review Dataset: 200k+ Cases of Human-Written Code Reviews from Top OSS Projects
REDDIT · r/LocalLLaMA · 11h ago · ⬆ 58 · 💬 9
score 111
I compiled 200k+ human-written code reviews from top OSS projects including React, Tensorflow, VSCode, and more. This dataset helped me finetune a version of Qwen2.5-Coder-32B-Instruct specialized in code reviews. The finetuned model showed significant improvements in generating better cod