Skip to main content
Dryad

Code review regression analysis of open source GitHub projects

Data files

Aug 31, 2017 version files 10.56 MB

Abstract

This dataset contains the repository data used for our study "A Large-Scale Study of Modern Code Review and Security in Open Source Projects". This dataset was collected from GitHub, and includes 3,126 projects in 143 languages, with 489,038 issues and 382,771 pull requests. We also include the regression analysis notebooks for reproducing our results from this data.