Using ART1 Neural Networks for Clustering Computer Forensics Documents

Georger Araújo, Célia Ralha
2012 The International Journal of Forensic Computer Science  
Computer forensic text corpora are usually very heterogeneous and easily surpass the terabyte range. Classification methods should aid the exploration of such corpora, but they do not help with thematically grouping documents. In this paper, we propose using Adaptive Resonance Theory (ART), specifically the ART1 algorithm, to group computer forensics documents thematically into clusters. For the clustering approach we present the conceptual model and the implemented software package, in which a modified version of the ART1 algorithm was developed to improve the running time. Furthermore, real-world forensic experiments were carried out to validate the model using a two-fold approach that combines quantitative and qualitative analysis. The results demonstrate that our approach can generate good clusters when compared to the gold standard defined by domain experts, with one clear advantage over other clustering methods (e.g. SOM and k-means): there is no need to supply parameters beforehand, such as the number of clusters.
doi:10.5769/j201201003 fatcat:62gwhkvbjnfepmlijvknb57y44
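
The abstract's point that ART1 needs no preset number of clusters can be illustrated with a minimal sketch of the textbook fast-learning ART1 procedure. This is not the authors' modified, faster version, which the abstract does not describe. The sketch assumes each document is encoded as a binary term-presence vector; the function name `art1_cluster` and the parameters `vigilance`, `beta`, and `max_epochs` are hypothetical names chosen for illustration.

```python
from typing import List, Sequence, Tuple


def art1_cluster(
    patterns: Sequence[Sequence[int]],
    vigilance: float = 0.6,   # assumed name: minimum match ratio to join a cluster
    beta: float = 1.0,        # assumed name: small constant in the choice function
    max_epochs: int = 10,     # assumed cap on passes over the data
) -> Tuple[List[int], List[List[int]]]:
    """Cluster binary vectors with a simplified, textbook-style ART1."""
    prototypes: List[List[int]] = []   # one binary prototype per cluster
    labels = [-1] * len(patterns)      # -1 marks all-zero (unclustered) inputs

    for _ in range(max_epochs):
        changed = False
        for idx, pattern in enumerate(patterns):
            norm = sum(pattern)
            if norm == 0:
                continue  # an empty document cannot be matched

            # Overlap |I AND w_j| between the input and each prototype.
            overlaps = [
                sum(p & w for p, w in zip(pattern, proto))
                for proto in prototypes
            ]
            # Rank candidates by the choice function |I AND w_j| / (beta + |w_j|).
            order = sorted(
                range(len(prototypes)),
                key=lambda j: overlaps[j] / (beta + sum(prototypes[j])),
                reverse=True,
            )

            winner = -1
            for j in order:
                # Vigilance test: the prototype must cover enough of the input.
                if overlaps[j] / norm >= vigilance:
                    # Fast learning: the prototype shrinks to the intersection.
                    prototypes[j] = [p & w for p, w in zip(pattern, prototypes[j])]
                    winner = j
                    break

            if winner == -1:
                # No prototype resonates, so a new cluster is created.
                prototypes.append(list(pattern))
                winner = len(prototypes) - 1

            if labels[idx] != winner:
                labels[idx] = winner
                changed = True

        if not changed:
            break  # assignments are stable

    return labels, prototypes


if __name__ == "__main__":
    # Toy term-presence vectors, made up for illustration only.
    docs = [
        [1, 1, 1, 0, 0, 0],
        [1, 1, 0, 0, 0, 0],
        [0, 0, 0, 1, 1, 1],
        [0, 0, 0, 0, 1, 1],
    ]
    labels, prototypes = art1_cluster(docs, vigilance=0.6)
    print(labels)  # [0, 0, 1, 1]
```

In this sketch the number of clusters is never supplied: it emerges from the data and the vigilance threshold, and raising the threshold tends to produce more, tighter clusters. Note that vigilance is itself a parameter, so the advantage claimed in the abstract concerns the cluster count specifically.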