Detecting controversy on the web

Shiri Dori-Hacohen, James Allan
2013 Proceedings of the 22nd ACM international conference on Conference on information & knowledge management - CIKM '13  
A useful feature to facilitate critical literacy would alert users when they are reading a controversial web page. This requires solving a binary classification problem: does a given web page discuss a controversial topic? We explore the feasibility of solving the problem by treating it as supervised k-nearest-neighbor classification. Our approach (1) maps a webpage to a set of neighboring Wikipedia articles which were labeled on a controversiality metric; (2) coalesces those labels into an
more » ... labels into an estimate of the webpage's controversiality; and finally (3) converts the estimate to a binary value using a threshold. We demonstrate the applicability of our approach by validating it on a set of webpages drawn from seed queries. We show absolute gains of 22% in F0.5 on our test set over a sentiment-based approach, highlighting that detecting controversy is more complex than simply detecting opinions.
doi:10.1145/2505515.2507877 dblp:conf/cikm/Dori-HacohenA13 fatcat:wkyozwhmmze4toacukiavrjxqm