Wenliang Du, Zhouxuan Teng, Zutao Zhu
2008 Proceedings of the 2008 ACM SIGMOD international conference on Management of data - SIGMOD '08  
Privacy-Preserving Data Publishing (PPDP) deals with the publication of microdata while preserving people' private information in the data. To measure how much private information can be preserved, privacy metrics is needed. An essential element for privacy metrics is the measure of how much adversaries can know about an individual's sensitive attributes (SA) if they know the individual's quasi-identifiers (QI), i.e., we need to measure P (SA | QI). Such a measure is hard to derive when
more » ... derive when adversaries' background knowledge has to be considered. We propose a systematic approach, Privacy-MaxEnt, to integrate background knowledge in privacy quantification. Our approach is based on the maximum entropy principle. We treat all the conditional probabilities P (SA | QI) as unknown variables; we treat the background knowledge as the constraints of these variables; in addition, we also formulate constraints from the published data. Our goal becomes finding a solution to those variables (the probabilities) that satisfy all these constraints. Although many solutions may exist, the most unbiased estimate of P (SA | QI) is the one that achieves the maximum entropy.
doi:10.1145/1376616.1376665 dblp:conf/sigmod/DuTZ08 fatcat:g5jeveprljhn3gnop726blmqoy