Leaking Sensitive Information in Complex Document Files--and How to Prevent It

Simson L. Garfinkel
2014 IEEE Security and Privacy  
Complex document formats such as PDF and Microsoft's Compound File Binary Format can contain information that is hidden but recoverable, as a result of text highlighting, cropping, or the embedding of high-resolution JPEG images. Private information can be released inadvertently if these files are distributed in electronic form. Simple experiments involving the creation of test documents can determine whether a particular program embeds hidden information.
doi:10.1109/msp.2013.131 fatcat:v25gpt3yyzch3iyz4swvhgb3ru
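
The test-document experiment mentioned in the abstract might be sketched as follows: place a unique marker string in a document, hide it (for example by cropping or covering it), save the file, and then check whether the marker is still present in the saved bytes. This is only an illustrative sketch and not the article's procedure; the function name `find_canary` and the marker value are hypothetical, and a raw byte search will miss text stored inside compressed or encrypted streams.

```python
def find_canary(data: bytes, canary: str) -> list[str]:
    """Return the text encodings under which `canary` appears in `data`.

    Hypothetical helper: it only searches the raw bytes, so content held in
    compressed or encrypted streams will not be detected.
    """
    hits = []
    for encoding in ("utf-8", "utf-16-le", "utf-16-be"):
        if canary.encode(encoding) in data:
            hits.append(encoding)
    return hits


if __name__ == "__main__":
    import sys

    # Usage: python find_canary.py saved_document CANARY-STRING
    with open(sys.argv[1], "rb") as f:
        contents = f.read()
    found = find_canary(contents, sys.argv[2])
    print("marker found as:", found if found else "not found in raw bytes")
```

An empty result does not prove the marker is gone; it only shows that the marker is not stored as plain UTF-8 or UTF-16 text in the file.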