UNED-READERS: Filtering Relevant Tweets using Probabilistic Signature Models

Henry Anaya-Sánchez, Anselmo Peñas, Bernardo Cabaleiro
2013 Conference and Labs of the Evaluation Forum  
This paper describes the unsupervised, knowledge-based approach to filtering relevant tweets for a given entity that the UNED-READERS system followed at RepLab 2013. The approach relies on a new way of contextualizing entity names from relatively large and broad collections of texts using probabilistic signature models (i.e., discrete probability distributions of words lexically related to the knowledge or topic underlying a set of entities in background text collections). The contextualization is intended to recover relevant information about the entity (specifically, lexically related words) from background knowledge. Results obtained in the filtering task are presented.
dblp:conf/clef/Anaya-SanchezPC13 fatcat:rtuv77stgnd6tjm77bvrqxxopm