Deep learning vs spectral clustering into an active clustering with pairwise constraints propagation

Nicolas Voiron, Alexandre Benoit, Patrick Lambert, Bogdan Ionescu
2016 2016 14th International Workshop on Content-Based Multimedia Indexing (CBMI)  
In our data driven world, categorization is of major importance to help end-users and decision makers understanding information structures. Supervised learning techniques rely on annotated samples that are often difficult to obtain and training often overfits. On the other hand, unsupervised clustering techniques study the structure of the data without disposing of any training data. Given the difficulty of the task, supervised learning often outperforms unsupervised learning. A compromise is
more » ... . A compromise is to use a partial knowledge, selected in a smart way, in order to boost performance while minimizing learning costs, what is called semi-supervised learning. In such use case, Spectral Clustering proved to be an efficient method. Also, Deep Learning outperformed several state of the art classification approaches and it is interesting to test it in our context. In this paper, we firstly introduce the concept of Deep Learning into an active semi-supervised clustering process and compare it with Spectral Clustering. Secondly, we introduce constraint propagation and demonstrate how it maximizes partitioning quality while reducing annotation costs. Experimental validation is conducted on two different real datasets. Results show the potential of the clustering methods.
doi:10.1109/cbmi.2016.7500237 dblp:conf/cbmi/VoironBLI16 fatcat:mlw4g2ilyve7dnkj42dvpblapm