A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2021; you can also visit the original URL.
The file type is application/pdf
.
Problems with SZZ and Features: An empirical study of the state of practice of defect prediction data collection
[article]
2021
arXiv
pre-print
Context: The SZZ algorithm is the de facto standard for labeling bug fixing commits and finding inducing changes for defect prediction data. Recent research uncovered potential problems in different parts of the SZZ algorithm. Most defect prediction data sets provide only static code metrics as features, while research indicates that other features are also important. Objective: We provide an empirical analysis of the defect labels created with the SZZ algorithm and the impact of commonly used
arXiv:1911.08938v3
fatcat:xrj2fi7o6jbdfbym2jdflgeex4