294 Hits in 3.0 sec

External Query Expansion in the Blogosphere

Wouter Weerkamp, Maarten de Rijke
2008 Text Retrieval Conference  
In the blog distillation task we tried to improve over our (strong) baseline using external expansion, but due to differences in the run setup, comparing these runs is hard.  ...  We describe the participation of the University of Amsterdam's ILPS group in the blog track at TREC 2008. We mainly explored different ways of using external corpora to expand the original query.  ...  Acknowledgments This research was supported by the DuOMAn project carried out within the STEVIN programme which is funded by the Dutch and Flemish Governments (http://www. under project  ... 
dblp:conf/trec/WeerkampR08 fatcat:tes3tqlchbba7ptxq2vr4z3rnm

Predicting the Usefulness of Collection Enrichment for Enterprise Search [chapter]

Jie Peng, Ben He, Iadh Ounis
2009 Lecture Notes in Computer Science  
Query Expansion (QE) often improves the retrieval performance of an Information Retrieval (IR) system.  ...  In this paper, we propose the use of query performance predictors to selectively apply CE on a per-query basis.  ...  In the future, we plan to deploy our proposed method for blog search as collections from the blogosphere contain many spam documents and other noisy vocabulary, meaning that query expansion might benefit  ... 
doi:10.1007/978-3-642-04417-5_41 fatcat:zp55sx7rebeidkibi3z6ll7elq

Blog track research at TREC

Craig Macdonald, Rodrygo L.T. Santos, Iadh Ounis, Ian Soboroff
2010 SIGIR Forum  
The TREC Blog track aims to explore information seeking behaviour in the blogosphere, by building reusable test collections for blog-related search tasks.  ...  In particular, we describe the used corpora, the tasks addressed within the track, and the resulting published research.  ...  We are also thankful to Gilad Mishne and Maarten de Rijke for joining us in organising the TREC 2006 Blog track.  ... 
doi:10.1145/1842890.1842899 fatcat:aydy5eclwnfvdkm2zv5jygyt7y

Selection and context scoping for digital video collections

Robert G. Capra, Christopher A. Lee, Gary Marchionini, Terrell Russell, Chirag Shah, Fred Stutzman
2008 Proceedings of the 8th ACM/IEEE-CS joint conference on Digital libraries - JCDL '08  
Since May 2007, we have been monitoring the results of 57 queries on YouTube related to the 2008 U.S. presidential election.  ...  In this paper, we describe research conducted to help inform digital curation of on-line video.  ...  These query expansions represent differences in how the Blogosphere and YouTube data sets were collected and are a result of the data collection efforts starting independently.  ... 
doi:10.1145/1378889.1378925 dblp:conf/jcdl/CapraLMRSS08 fatcat:lclh6i2envef7kifxvixeuttmu

Overview of the TREC 2009 Blog Track

Craig Macdonald, Iadh Ounis, Ian Soboroff
2009 Text Retrieval Conference  
We would like to express our thanks and appreciation to the New York Times (http://www.nytimes. com/), for providing the sample of headlines used in the top stories identification task.  ...  They are provided to support research in the TREC blog track. Thanks are also due to the 7 groups who assessed this year's top stories identification task topics.  ...  In addition, UAms used external query expansion on a news corpus and on Wikipedia to further enhance their baseline.  ... 
dblp:conf/trec/MacdonaldOS09 fatcat:3fjndgiqznhsndm2sxa4t5hiqi

Heuristic Ranking and Diversification of Web Documents

Jiyin He, Krisztian Balog, Katja Hofmann, Edgar Meij, Maarten de Rijke, Manos Tsagkias, Wouter Weerkamp
2009 Text Retrieval Conference  
in the top ranked results.  ...  We describe the participation of the University of Amsterdam's Intelligent Systems Lab in the web track at TREC 2009. We participated in the adhoc and diversity task.  ...  Acknowledgments This research was supported by the DAESO and DuO-MAn project carried out within the STEVIN program which is funded by the Dutch and Flemish Governments under project number STE-09-12, and  ... 
dblp:conf/trec/HeBHMRTW09 fatcat:wzfjctnsbbeqvflzhgujeay2je

University of Strathclyde at Headline Ranking TREC BLOG 2010

Dmitri Roussinov
2010 Text Retrieval Conference  
Automated run "strath3" was similar to "strath1" except the weights were estimated based on the amount of the increase in the frequency of use and applied to the query words.  ...  For automated run "strath1", our algorithm identified the word unigrams, the frequencies of mentioning of which in the blog corpus increased substantially on the day of the query.  ...  The University of Glasgow group has kindly provided the index of the news corpus, the daily lexicons of the Blog corpus and the classification of the headlines into the required set of categories.  ... 
dblp:conf/trec/Roussinov10 fatcat:spkwuclsbrhq3aqhlkhppbsxpy

Finding a needle in the blogosphere: An information fusion approach for blog distillation search

José M. Chenlo, Javier Parapar, David E. Losada, José Santos
2015 Information Fusion  
As a matter of fact, the size of the blogosphere is estimated to double every six months.  ...  In the blogosphere, different actors express their opinions about multiple topics. Users, companies or editors socially interact by commenting, recommending and linking blogs and posts.  ...  Acknowledgments This work was supported by the "Ministerio de Economía y Competitividad" of the Goverment of Spain under the research projects TIN2012-33867 and TIN2013-40981-R.  ... 
doi:10.1016/j.inffus.2014.09.001 fatcat:7dygrzctrrgblftrsuhsfh2vra

Connecting Sparsely Distributed Similar Bloggers

Nitin Agarwal, Huan Liu, Shankara Subramanya, John J. Salerno, Philip S. Yu
2009 2009 Ninth IEEE International Conference on Data Mining  
The nature of the Blogosphere determines that the majority of bloggers are only connected with a small number of fellow bloggers, and similar bloggers can be largely disconnected from each other.  ...  In this work, we define the problem, delineate its challenges, and present an approach that uses innovative ways to employ contextual information and collective wisdom to aggregate similar bloggers.  ...  CONCLUSIONS The sparsely linked blogosphere presents a new problem -searching for similar bloggers in the long tail of Blogosphere, in order to discover and connect niches in the long tail where the majority  ... 
doi:10.1109/icdm.2009.38 dblp:conf/icdm/AgarwalLSSY09 fatcat:tw2xfudrgfardju4lfkk6cpony

Retrieval and feedback models for blog feed search

Jonathan L. Elsas, Jaime Arguello, Jamie Callan, Jaime G. Carbonell
2008 Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval - SIGIR '08  
We perform an in-depth analysis of the behavior of pseudorelevance feedback for this task and develop a novel query expansion technique using the link structure in Wikipedia.  ...  This query expansion technique provides significant and consistent performance improvements for this task, yielding a 22% and 14% improvement in MAP over the unexpanded query for our baseline and federated  ...  Note that PRF.WIKI is identical to the external expansion method developed in [6] , where the relevance model is estimated entirely from our external corpus.  ... 
doi:10.1145/1390334.1390394 dblp:conf/sigir/ElsasACC08 fatcat:bfrwvgs54ffbdcqntsij6cpxr4

An effective coherence measure to determine topical consistency in user-generated content

Jiyin He, Wouter Weerkamp, Martha Larson, Maarten de Rijke
2009 International Journal on Document Analysis and Recognition  
blog to the user query.  ...  The coherence score must, however, be used judiciously in order to avoid boosting the ranking of irrelevant but topically focused blogs.  ...  Open Access This article is distributed under the terms of the Creative Commons Attribution Noncommercial License which permits any noncommercial use, distribution, and reproduction in any medium, provided  ... 
doi:10.1007/s10032-009-0089-5 fatcat:ql7sz23vnnc5plpqdbclyq23he

Exploiting External Collections for Query Expansion

Wouter Weerkamp, Krisztian Balog, Maarten de Rijke
2012 ACM Transactions on the Web  
In this article we propose a general generative query expansion model that uses external document collections for term generation: the External Expansion Model (EEM).  ...  The main rationale behind our model is our hypothesis that each query requires its own mixture of external collections for expansion and that an expansion model should account for this.  ...  ACKNOWLEDGMENTS We are grateful to our reviewers and the editors of the journal for providing valuable comments and feedback, helping us to improve the quality of this article.  ... 
doi:10.1145/2382616.2382621 fatcat:u3w5pbggjfdnjpup7jnp4ktipa

An effective statistical approach to blog post opinion retrieval

Ben He, Craig Macdonald, Jiyin He, Iadh Ounis
2008 Proceeding of the 17th ACM conference on Information and knowledge mining - CIKM '08  
Most of the current solutions involve the use of external resources and manual efforts in identifying subjective features.  ...  Finding opinionated blog posts is still an open problem in information retrieval, as exemplified by the recent TREC blog tracks.  ...  Our approach is inspired by the Divergence From Randomness (DFR) query expansion mechanism, which measures the divergence of a term's distribution in a pseudo-relevance set from its distribution in the  ... 
doi:10.1145/1458082.1458223 dblp:conf/cikm/HeMHO08 fatcat:gs7nm4ofindyhh2lg4jmwq2wgy

Overview of the TREC 2006 Blog Track

Iadh Ounis, Craig Macdonald, Maarten de Rijke, Gilad Mishne, Ian Soboroff
2006 Text Retrieval Conference  
We would like to thank Ali Azimi Bolourian for monitoring some of the crawls during the Blog06 test collection creation.  ...  Finally, we would like to thank Gianni Amati for various comments on the Blog track.  ...  In addition to measuring the effect of opinion-detection heuristics, some participants evaluated the benefit of using traditional IR techniques, such as passage retrieval (e.g., UMaryland), or query expansion  ... 
dblp:conf/trec/OunisMRMS06 fatcat:brdjesoacfdvzntjbft6uwohyu

Opinion mining: reviewed from word to document level

Malik Muhammad Saad Missen, Mohand Boughanem, Guillaume Cabanac
2012 Social Network Analysis and Mining  
discussion of the related work gives an overview of opinion mining task in blogosphere, one of most popular online social network, and (4) highlights the importance of online social networks for opinion  ...  In this paper, we provide a very detailed overview of the related work of opinion mining.  ...  Role of external data collections as a tool for query expansion and training data Many opinion finding approaches seek help of some external data collection whether for query expansion or for training  ... 
doi:10.1007/s13278-012-0057-9 fatcat:c3hoqyql5bculfzv42pesm2obu
« Previous Showing results 1 — 15 out of 294 results