Fingerprinting Search Keywords over HTTPS at Scale [article]

Junhua Yan, Hasan Faik Alan, Jasleen Kaur
2020 arXiv   pre-print
The possibility of fingerprinting the search keywords issued by a user on popular web search engines is a significant threat to user privacy. This threat has received surprisingly little attention in the network traffic analysis literature. In this work, we consider the problem of keyword fingerprinting of HTTPS traffic -- we study the impact of several factors, including client platform diversity, choice of search engine, feature sets as well as classification frameworks. We conduct both
more » ... -world and open-world evaluations using nearly 4 million search queries collected over a period of three months. Our analysis reveals several insights into the threat of keyword fingerprinting in modern HTTPS traffic.
arXiv:2008.08161v1 fatcat:z7r4dpknkvafzni5km7bkmspz4