From Whence It Came: Detecting Source Code Clones by Analyzing Assembler

Ian J. Davis, Michael W. Godfrey
2010 2010 17th Working Conference on Reverse Engineering  
To date, most clone detection techniques have concentrated on various forms of source code analysis, often by analyzing token streams. In this paper, we introduce a complementary technique of analyzing generated assembler for clones. This approach is appealing as it is mostly impervious to trivial changes in the source, with compilation serving as a kind of normalization technique. We have built detectors to analyze both Java VM code as well as GCC Linux assembler for C and C++. In the paper,
more » ... describe our approach and show how it can serve as a valuable complementary semantic approach to syntactic source code based detection.
doi:10.1109/wcre.2010.35 dblp:conf/wcre/DavisG10 fatcat:47bwkubuwja7teov42qfrprvd4