The SEOSS 33 Dataset — Requirements, Bug Reports, Code History, and Trace Links for Entire Projects

Michael Rath, Patrick Mäder
2019 Data in Brief  
This paper provides a systematically retrieved dataset of 33 open-source software projects, each containing a large number of typed artifacts and the trace links between them. The artifacts stem from the projects' issue tracking systems and source version control systems, which enables their joint analysis. Enriched with additional metadata, such as time stamps, release versions, component information, and developer comments, the dataset is highly suitable for empirical research, e.g., in ... and software traceability analysis, software evolution, bug and feature localization, and stakeholder collaboration. It can stimulate new research directions, facilitate the replication of existing studies, and act as a benchmark for the comparison of competing approaches. The data is hosted on Harvard Dataverse under DOI 10.7910/DVN/PDDZ4Q.
doi:10.1016/j.dib.2019.104005 pmid:31198827 pmcid:PMC6557728 fatcat:bg6xvcyiynbijcwbvm7pdh6nni