Neural Gas Clustering Adapted for Given Size of Clusters

Iveta Dirgová Luptáková, Marek Šimon, Ladislav Huraj, Jiří Pospíchal
2016 Mathematical Problems in Engineering  
Clustering algorithms belong to major topics in big data analysis. Their main goal is to separate an unlabelled dataset into several subsets, with each subset ideally characterized by some unique characteristic of its data structure. Common clustering approaches cannot impose constraints on sizes of clusters. However, in many applications, sizes of clusters are bounded or known in advance. One of the more recent robust clustering algorithms is called neural gas which is popular, for example,
more » ... data compression and vector quantization used in speech recognition and signal processing. In this paper, we have introduced an adapted neural gas algorithm able to accommodate requirements for the size of clusters. The convergence of algorithm towards an optimum is tested on simple illustrative examples. The proposed algorithm provides better statistical results than its direct counterpart, balancedk-means algorithm, and, moreover, unlike the balancedk-means, the quality of results of our proposed algorithm can be straightforwardly controlled by user defined parameters.
doi:10.1155/2016/9324793 fatcat:mbtsbmv6onb53atxdhry467mi4