CodeDJ: Reproducible Queries over Large-Scale Software Repositories

Petr Maj, Konrad Siek, Alexander Kovalenko, Jan Vitek (volume editors: Anders Møller, Manu Sridharan)
Analyzing massive code bases is a staple of modern software engineering research – a welcome side effect of the advent of large-scale software repositories such as GitHub. ... CodeDJ supports reproducibility: historical queries are answered deterministically using past states of the datastore, so researchers can reproduce published results. ... We intend to parallelize queries and explore ideas from the database community regarding query compilation strategies. Finally, we plan on extending our infrastructure. ...
doi:10.4230/lipics.ecoop.2021.6 fatcat:rinid5dapvguvdb2mt4jhe3hpa