Prototype generation on structural data using dissimilarity space representation

Jorge Calvo-Zaragoza, Jose J. Valero-Mas, Juan R. Rico-Juan
2016 Neural computing & applications (Print)  
Data Reduction techniques play a key role in instance-based classification to lower the amount of data to be processed. Among the different existing approaches, Prototype Selection (PS) and Prototype Generation (PG) are the most representative ones. These two families differ in the way the reduced set is obtained from the initial one: while the former aims at selecting the most representative elements from the set, the latter creates new data out of it. Although PG is considered to delimit more
more » ... efficiently decision boundaries, the operations required are not so well defined in scenarios involving structural data such as strings, trees or graphs. This work studies the possibility of using Dissimilarity Space (DS) methods as an intermediate process for mapping the initial structural representation to a statistical one, thereby allowing the use of PG methods. A comparative experiment over string data is carried out in which our proposal is faced to PS methods on the original space. Results show that the proposed strategy is able to achieve significantly similar results to PS in the initial space, thus standing as a clear alternative to the classic approach, with some additional advantages derived from the DS representation.
doi:10.1007/s00521-016-2278-8 fatcat:2elbhayncnhgvo4d3ldhm6hudu