Software Module Clustering: An In-Depth Literature Analysis

Qusay I. Sarhan, Bestoun S. Ahmed, Miroslav Bures, Kamal Z. Zamli
2020
Software module clustering is an unsupervised learning method used to cluster software entities (e.g., classes, modules, or files) with similar features. The obtained clusters may be used to study, analyze, and understand the software entities' structure and behavior. Implementing software module clustering with optimal results is challenging. Accordingly, researchers have addressed many aspects of software module clustering in the past decade. Thus, it is essential to present the research
more » ... nce that has been published in this area. In this study, 143 research papers from well-known literature databases that examined software module clustering were reviewed to extract useful data. The obtained data were then used to answer several research questions regarding state-of-the-art clustering approaches, applications of clustering in software engineering, clustering processes, clustering algorithms, and evaluation methods. Several research gaps and challenges in software module clustering are discussed in this paper to provide a useful reference for researchers in this field.
doi:10.48550/arxiv.2012.01057 fatcat:w53ee7thprf2na64bd7pm2vmxy