Evaluation of information loss for privacy preserving data mining through comparison of fuzzy partitions

Isaac Cano, Susana Ladra, Vicenc Torra
2010 International Conference on Fuzzy Systems  
In this paper, we focus on the problem of preserving the data confidentiality when sharing the data for clustering. This problem poses new challenges for novel uses of privacy preserving data mining (PPDM) techniques. Specifically, this paper considers the synthetic data generation as a way to preserve the data privacy. One of the state of the art synthetic data generators is the IPSO family of methods. It has been stated that the use of IPSO to generate synthetic data is appropriate when the
more » ... ropriate when the user plans to apply clustering to the data. Moreover, this paper aims to associate the same property to the FCRM synthetic data generator, and at the same time, to assess the relationship between the information loss produced when generating synthetic data with FCRM and the clustering similarity between the original and synthetic data.
doi:10.1109/fuzzy.2010.5584186 dblp:conf/fuzzIEEE/CanoLT10 fatcat:zqggliauurgeldhycu6ndmt7du