External Query Expansion in the Blogosphere
2008
Text Retrieval Conference
In the blog distillation task we tried to improve over our (strong) baseline using external expansion, but due to differences in the run setup, comparing these runs is hard. ...
We describe the participation of the University of Amsterdam's ILPS group in the blog track at TREC 2008. We mainly explored different ways of using external corpora to expand the original query. ...
Acknowledgments: This research was supported by the DuOMAn project carried out within the STEVIN programme which is funded by the Dutch and Flemish Governments (http://www.stevin-tst.org) under project ...
dblp:conf/trec/WeerkampR08
fatcat:tes3tqlchbba7ptxq2vr4z3rnm
Predicting the Usefulness of Collection Enrichment for Enterprise Search
[chapter]
2009
Lecture Notes in Computer Science
Query Expansion (QE) often improves the retrieval performance of an Information Retrieval (IR) system. ...
In this paper, we propose the use of query performance predictors to selectively apply CE on a per-query basis. ...
In the future, we plan to deploy our proposed method for blog search as collections from the blogosphere contain many spam documents and other noisy vocabulary, meaning that query expansion might benefit ...
doi:10.1007/978-3-642-04417-5_41
fatcat:zp55sx7rebeidkibi3z6ll7elq
Blog track research at TREC
2010
SIGIR Forum
The TREC Blog track aims to explore information seeking behaviour in the blogosphere, by building reusable test collections for blog-related search tasks. ...
In particular, we describe the used corpora, the tasks addressed within the track, and the resulting published research. ...
We are also thankful to Gilad Mishne and Maarten de Rijke for joining us in organising the TREC 2006 Blog track. ...
doi:10.1145/1842890.1842899
fatcat:aydy5eclwnfvdkm2zv5jygyt7y
Selection and context scoping for digital video collections
2008
Proceedings of the 8th ACM/IEEE-CS joint conference on Digital libraries - JCDL '08
Since May 2007, we have been monitoring the results of 57 queries on YouTube related to the 2008 U.S. presidential election. ...
In this paper, we describe research conducted to help inform digital curation of on-line video. ...
These query expansions represent differences in how the Blogosphere and YouTube data sets were collected and are a result of the data collection efforts starting independently. ...
doi:10.1145/1378889.1378925
dblp:conf/jcdl/CapraLMRSS08
fatcat:lclh6i2envef7kifxvixeuttmu
Overview of the TREC 2009 Blog Track
2009
Text Retrieval Conference
We would like to express our thanks and appreciation to the New York Times (http://www.nytimes.com/), for providing the sample of headlines used in the top stories identification task. ...
They are provided to support research in the TREC blog track. Thanks are also due to the 7 groups who assessed this year's top stories identification task topics. ...
In addition, UAms used external query expansion on a news corpus and on Wikipedia to further enhance their baseline. ...
dblp:conf/trec/MacdonaldOS09
fatcat:3fjndgiqznhsndm2sxa4t5hiqi
Heuristic Ranking and Diversification of Web Documents
2009
Text Retrieval Conference
in the top ranked results. ...
We describe the participation of the University of Amsterdam's Intelligent Systems Lab in the web track at TREC 2009. We participated in the adhoc and diversity task. ...
Acknowledgments: This research was supported by the DAESO and DuOMAn projects carried out within the STEVIN program which is funded by the Dutch and Flemish Governments under project number STE-09-12, and ...
dblp:conf/trec/HeBHMRTW09
fatcat:wzfjctnsbbeqvflzhgujeay2je
University of Strathclyde at Headline Ranking TREC BLOG 2010
2010
Text Retrieval Conference
Automated run "strath3" was similar to "strath1" except the weights were estimated based on the amount of the increase in the frequency of use and applied to the query words. ...
For automated run "strath1", our algorithm identified the word unigrams, the frequencies of mentioning of which in the blog corpus increased substantially on the day of the query. ...
The University of Glasgow group has kindly provided the index of the news corpus, the daily lexicons of the Blog corpus and the classification of the headlines into the required set of categories. ...
dblp:conf/trec/Roussinov10
fatcat:spkwuclsbrhq3aqhlkhppbsxpy
Finding a needle in the blogosphere: An information fusion approach for blog distillation search
2015
Information Fusion
As a matter of fact, the size of the blogosphere is estimated to double every six months. ...
In the blogosphere, different actors express their opinions about multiple topics. Users, companies or editors socially interact by commenting, recommending and linking blogs and posts. ...
Acknowledgments: This work was supported by the "Ministerio de Economía y Competitividad" of the Government of Spain under the research projects TIN2012-33867 and TIN2013-40981-R. ...
doi:10.1016/j.inffus.2014.09.001
fatcat:7dygrzctrrgblftrsuhsfh2vra
Connecting Sparsely Distributed Similar Bloggers
2009
2009 Ninth IEEE International Conference on Data Mining
The nature of the Blogosphere determines that the majority of bloggers are only connected with a small number of fellow bloggers, and similar bloggers can be largely disconnected from each other. ...
In this work, we define the problem, delineate its challenges, and present an approach that uses innovative ways to employ contextual information and collective wisdom to aggregate similar bloggers. ...
CONCLUSIONS: The sparsely linked blogosphere presents a new problem: searching for similar bloggers in the long tail of the Blogosphere, in order to discover and connect niches in the long tail where the majority ...
doi:10.1109/icdm.2009.38
dblp:conf/icdm/AgarwalLSSY09
fatcat:tw2xfudrgfardju4lfkk6cpony
Retrieval and feedback models for blog feed search
2008
Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval - SIGIR '08
We perform an in-depth analysis of the behavior of pseudorelevance feedback for this task and develop a novel query expansion technique using the link structure in Wikipedia. ...
This query expansion technique provides significant and consistent performance improvements for this task, yielding a 22% and 14% improvement in MAP over the unexpanded query for our baseline and federated ...
Note that PRF.WIKI is identical to the external expansion method developed in [6], where the relevance model is estimated entirely from our external corpus. ...
doi:10.1145/1390334.1390394
dblp:conf/sigir/ElsasACC08
fatcat:bfrwvgs54ffbdcqntsij6cpxr4
An effective coherence measure to determine topical consistency in user-generated content
2009
International Journal on Document Analysis and Recognition
blog to the user query. ...
The coherence score must, however, be used judiciously in order to avoid boosting the ranking of irrelevant but topically focused blogs. ...
Open Access This article is distributed under the terms of the Creative Commons Attribution Noncommercial License which permits any noncommercial use, distribution, and reproduction in any medium, provided ...
doi:10.1007/s10032-009-0089-5
fatcat:ql7sz23vnnc5plpqdbclyq23he
Exploiting External Collections for Query Expansion
2012
ACM Transactions on the Web
In this article we propose a general generative query expansion model that uses external document collections for term generation: the External Expansion Model (EEM). ...
The main rationale behind our model is our hypothesis that each query requires its own mixture of external collections for expansion and that an expansion model should account for this. ...
ACKNOWLEDGMENTS: We are grateful to our reviewers and the editors of the journal for providing valuable comments and feedback, helping us to improve the quality of this article. ...
doi:10.1145/2382616.2382621
fatcat:u3w5pbggjfdnjpup7jnp4ktipa
An effective statistical approach to blog post opinion retrieval
2008
Proceeding of the 17th ACM conference on Information and knowledge management - CIKM '08
Most of the current solutions involve the use of external resources and manual efforts in identifying subjective features. ...
Finding opinionated blog posts is still an open problem in information retrieval, as exemplified by the recent TREC blog tracks. ...
Our approach is inspired by the Divergence From Randomness (DFR) query expansion mechanism, which measures the divergence of a term's distribution in a pseudo-relevance set from its distribution in the ...
doi:10.1145/1458082.1458223
dblp:conf/cikm/HeMHO08
fatcat:gs7nm4ofindyhh2lg4jmwq2wgy
Overview of the TREC 2006 Blog Track
2006
Text Retrieval Conference
We would like to thank Ali Azimi Bolourian for monitoring some of the crawls during the Blog06 test collection creation. ...
Finally, we would like to thank Gianni Amati for various comments on the Blog track. ...
In addition to measuring the effect of opinion-detection heuristics, some participants evaluated the benefit of using traditional IR techniques, such as passage retrieval (e.g., UMaryland), or query expansion ...
dblp:conf/trec/OunisMRMS06
fatcat:brdjesoacfdvzntjbft6uwohyu
Opinion mining: reviewed from word to document level
2012
Social Network Analysis and Mining
discussion of the related work gives an overview of the opinion mining task in the blogosphere, one of the most popular online social networks, and (4) highlights the importance of online social networks for opinion ...
In this paper, we provide a very detailed overview of the related work of opinion mining. ...
Role of external data collections as a tool for query expansion and training data: Many opinion finding approaches seek the help of some external data collection, whether for query expansion or for training ...
doi:10.1007/s13278-012-0057-9
fatcat:c3hoqyql5bculfzv42pesm2obu