Identifying reasons for software changes using historic databases

Mockus, Votta
2000 Proceedings International Conference on Software Maintenance ICSM-94  
Large scale software products must constantly change in order to adapt to a changing environment. Studies of historic data from legacy software systems have identified three specific causes of this change: adding new features; correcting faults; and restructuring code to accommodate future changes. Our hypothesis is that a textual descriptionjeld of a change is essential to understanding why that change was performed. Also, we expect that dificulty, size, and interval would vary strongly across
more » ... different types of changes. To test these hypotheses we have designed a program which automatically classi$es maintenance activity based on a textual description of changes. Developer surveys showed that the automatic classijication was in agreement with developer opinions. Tests of the classifer on a different product found that size and interval for difSerent types of changes did not vary across two products. We have found strong relationships between the type and size of a change and the time required to carry it out. We also discovered a relatively large amount of perfective changes in the system we examined. From this study we have arrived at several suggestions on how to make version control data usejid in diagnosing the state of a software project, without significantly increasing the overhead for the developer using the change management system.
doi:10.1109/icsm.2000.883028 dblp:conf/icsm/MockusV00 fatcat:yksrnp6panbt7g6ixjldvl35le