An efficient algorithm for type-safe structural diffing

Victor Cacciari Miraldo, Wouter Swierstra
2019 Proceedings of the ACM on Programming Languages (PACMPL)  
Effectively computing the difference between two version of a source file has become an indispensable part of software development. The de facto standard tool used by most version control systems is the UNIX diff utility, that compares two files on a line-by-line basis without any regard for the structure of the data stored in these files. This paper presents an alternative datatype generic algorithm for computing the difference between two values of any algebraic datatype. This algorithm
more » ... zes sharing between the source and target trees, while still running in linear time. Finally, this paper demonstrates that by instantiating this algorithm to the Lua abstract syntax tree and mining the commit history of repositories found on GitHub, the resulting patches can often be merged automatically, even when existing technology has failed.
doi:10.1145/3341717 fatcat:4cmuozxgafafhme2rh4u2glbsm