Cluster1 sources· last seen 11h ago· first seen 11h ago

Code Review Dataset: 200k+ Cases of Human-Written Code Reviews from Top OSS Projects

I compiled 200k+ human-written code reviews from top OSS projects including React, Tensorflow, VSCode, and more. This dataset helped me finetune a version of Qwen2.5-Coder-32B-Instruct specialized in code reviews. The finetuned model showed significant improvements in generating better cod

Lead: r/LocalLLaMABigness: 23codereviewdataset200kcases
📡 Coverage
10
1 news source
🟠 Hacker News
0
🔴 Reddit
54
58 upvotes across 1 sub
📈 Google Trends
0
Full methodology: How scoring works

Receipts (all sources)

score 111

I compiled 200k+ human-written code reviews from top OSS projects including React, Tensorflow, VSCode, and more. This dataset helped me finetune a version of Qwen2.5-Coder-32B-Instruct specialized in code reviews. The finetuned model showed significant improvements in generating better cod

Related clusters