A principal component analysis-based feature dimensionality reduction scheme for content-based image retrieval system
TELKOMNIKA (Telecommunication Computing Electronics and Control)
In content-based image retrieval (CBIR) systems, one approach to image representation is to combine several low-level visual features, concatenated into a single flat vector. While this provides more descriptive information, the resulting high dimensionality and the high computational cost of the feature extraction algorithms pose serious challenges to deploying CBIR on platforms (devices) with limited computational and storage resources. Hence, in this work a feature dimensionality reduction technique based on principal component analysis (PCA) is implemented. Each image in a database is indexed using a 174-dimensional feature vector comprising 54-dimensional colour moments (CM54), a 32-bin HSV histogram (HIST32), 48-dimensional Gabor wavelet features (GW48) and 40-dimensional wavelet moments (MW40). The PCA scheme was incorporated into a CBIR system that utilized the entire feature vector space. The k largest eigenvalues that yielded no more than a 5% degradation in mean precision were retained for dimensionality reduction. Three image databases (DB10, DB20 and DB100) were used for testing. The results showed that with an 80% reduction in feature dimensions, tolerable losses of 3.45%, 4.39% and 7.40% in mean precision were obtained on DB10, DB20 and DB100, respectively.
Keywords: Content-based image retrieval system; Feature dimensionality reduction; Low-level visual feature; Principal component analysis
This is an open access article under the CC BY-SA license.
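The PCA step summarized in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the feature values are random placeholders, the database size of 500 images is arbitrary, and k is fixed at 35 (roughly an 80% reduction of 174 dimensions) rather than selected by the mean-precision criterion described above. The function names `fit_pca` and `project` are hypothetical, and Euclidean distance is assumed for ranking.

```python
import numpy as np

# Feature block sizes as listed in the abstract; their sum is the 174-D vector.
BLOCKS = {"CM54": 54, "HIST32": 32, "GW48": 48, "MW40": 40}
TOTAL_DIMS = sum(BLOCKS.values())


def fit_pca(features, k):
    """Return the feature mean and the eigenvectors of the k largest eigenvalues."""
    mean = features.mean(axis=0)
    cov = np.cov(features - mean, rowvar=False)
    eigvals, eigvecs = np.linalg.eigh(cov)      # eigh returns ascending eigenvalues
    order = np.argsort(eigvals)[::-1][:k]       # indices of the k largest
    return mean, eigvecs[:, order]


def project(features, mean, components):
    """Map feature vectors into the reduced k-dimensional space."""
    return (features - mean) @ components


# Placeholder database: 500 images, each a concatenation of the four blocks.
rng = np.random.default_rng(0)
database = np.hstack([rng.normal(size=(500, d)) for d in BLOCKS.values()])

k = round(0.2 * TOTAL_DIMS)                     # assumed: keep about 20% of dimensions
mean, components = fit_pca(database, k)
reduced_db = project(database, mean, components)

# Retrieval in the reduced space: rank database images by distance to a query.
reduced_query = project(database[0:1], mean, components)
distances = np.linalg.norm(reduced_db - reduced_query, axis=1)
ranking = np.argsort(distances)
```

In a real system, k would be chosen by sweeping candidate values and measuring the change in mean precision against the full 174-dimensional baseline, as the abstract describes.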