Measuring risk and utility of anonymized data using information theory

Josep Domingo-Ferrer, David Rebollo-Monedero
2009, Proceedings of the 2009 EDBT/ICDT Workshops (EDBT/ICDT '09)
Before releasing anonymized microdata (individual data), it is essential to evaluate whether: i) their utility is high enough for their release to make sense; ii) the risk that the anonymized data result in disclosure of respondent identity or respondent attribute values is low enough. Utility and disclosure risk measures are used for this evaluation, but they normally lack a common theoretical framework that allows utility and risk to be traded off in a consistent way. We explore in this paper the ... e of information-theoretic measures based on the notion of mutual information.
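As an illustration of the quantity the abstract names, the following is a minimal sketch of a plug-in estimate of mutual information between two columns of categorical values, for example an attribute before and after anonymization. The abstract does not state which attribute pairs the paper compares or how it estimates the measure, so the function name `mutual_information`, the toy data, and the choice of estimator are all assumptions made here for illustration.

```python
from collections import Counter
from math import log2


def mutual_information(xs, ys):
    """Plug-in estimate of I(X;Y) in bits from paired categorical samples.

    Hypothetical helper for illustration; not taken from the paper.
    """
    if len(xs) != len(ys):
        raise ValueError("xs and ys must have the same length")
    n = len(xs)
    if n == 0:
        raise ValueError("need at least one sample")

    joint = Counter(zip(xs, ys))  # counts of each (x, y) pair
    px = Counter(xs)              # marginal counts of x
    py = Counter(ys)              # marginal counts of y

    mi = 0.0
    for (x, y), c in joint.items():
        # p(x,y) * log2(p(x,y) / (p(x) p(y))), written in terms of counts
        mi += (c / n) * log2(c * n / (px[x] * py[y]))
    return mi


# Toy data (assumed): an original attribute and two candidate releases.
original = ["a", "a", "b", "b"]
identical = list(original)          # released unchanged
constant = ["*", "*", "*", "*"]     # fully suppressed

mi_identical = mutual_information(original, identical)
mi_constant = mutual_information(original, constant)
```

In this toy case the unchanged release shares one full bit with the original, while the fully suppressed release shares none. Whether a high value counts as utility or as disclosure risk depends on which attributes are being compared, and the abstract does not specify that.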
doi:10.1145/1698790.1698811 dblp:conf/edbtw/Domingo-FerrerR09 fatcat:72tid32vtfhjjdcsoo67bwtdda