A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2016; you can also visit the original URL.
The file type is application/pdf
.
Filters
Natural Language Processing as a Foundation of the Semantic Web
2007
Foundations and Trends® in Web Science
This perspective places NLP and its associated techniques like Information Extraction at the core of the Semantic Web/Web Science enterprise. ...
Those who continue to maintain that "universal words" are not the English words they look most like, must at least tell us which of the senses of the real word closest to the "universal word" they intend ...
doi:10.1561/1800000002
fatcat:n2xfw3qdhverrokidb2globwyq
Statistical source expansion for question answering
2011
Proceedings of the 20th ACM international conference on Information and knowledge management - CIKM '11
and 67% on TREC questions. iv First of all, I would like to thank my advisor Eric Nyberg for his support and guidance throughout my studies at Carnegie Mellon. ...
quiz show and TREC evaluations. By expanding various seed corpora with web search results, we were able to improve the QA accuracy of Watson from 66% to 71% on regular Jeopardy! ...
., , 2007 developed at Carnegie Mellon University and previously at Universität Karlsruhe is a modular and extensible framework that supports the integration of different QA algorithms and knowledge resources ...
doi:10.1145/2063576.2063632
dblp:conf/cikm/SchlaeferCNFZF11
fatcat:whoy62klazctbdo4p57wbevkdu
Entity Centric Information Retrieval
2016
SIGIR Forum
Motivated by the importance of the enterprise search, the enterprise track in TREC has been launched since 2005 [8, 10, 46, 134] . ...
the search quality of enterprise search. ...
doi:10.1145/2964797.2964815
fatcat:qdmhwfminnaefonienqyggckbm
Fast candidate generation for real-time tweet search with bloom filter chains
2013
ACM Transactions on Information Systems
at the time the query is issued. ...
The rise of social media and other forms of user-generated content have created the demand for real-time search: against a high-velocity stream of incoming documents, users desire a list of relevant results ...
Asadi, Department of Computer Science, University of Maryland at College Park; email: nima@cs.umd.edu; J. ...
doi:10.1145/2493175.2493178
fatcat:lp3unrp4bvgwzbs4d72bpqyupy
Building Watson: An Overview of the DeepQA Project
2010
The AI Magazine
After 3 years of intense research and development by a core team of about 20 researches, Watson is performing at human expert-levels in terms of precision, confidence and speed at the Jeopardy! ...
IBM Research undertook a challenge to build a computer system that could compete at the human champion level in real time on the American TV Quiz show, Jeopardy! ...
Acknowledgements We would like to acknowledge the talented team of research scientists and engineers at IBM and at partner universities, listed below, for the incredible work they are doing to influence ...
doi:10.1609/aimag.v31i3.2303
fatcat:bqzu6kcak5gnhpabvyv6vuc7n4
Term-Based Models for Entity Ranking
[chapter]
2018
Advanced Topics in Information Retrieval
In this chapter, we adhere to the "single search box" paradigm, which accepts "free text" search queries, and simply treat queries as sequences of words, referred to hereinafter as terms. ...
Returning specific entities, instead of a mere list of documents, can provide better answers to a broad range of information needs. ...
TREC Enterprise The TREC (Text Retrieval Conference) 2005-2008 Enterprise track [8, 25] featured an expert finding task, where a single type of entity was sought: people, who are experts on a given topic ...
doi:10.1007/978-3-319-93935-3_3
fatcat:dtyxpoiqsbfhtm6m5q24oxp5ii
Remedies against the Vocabulary Gap in Information Retrieval
[article]
2017
arXiv
pre-print
Search engines rely heavily on term-based approaches that represent queries and documents as bags of words. ...
When presented with a search query, the engine then ranks documents according to their relevance scores by computing, among other things, the matching degrees between query and document terms. ...
and published by the New York Times between 1987 and 2007. ...
arXiv:1711.06004v1
fatcat:6vkhvfby3zbzrepgopunm7gie4
Semantic question answering on big data
2016
Proceedings of the International Workshop on Semantic Big Data - SBD '16
Traditionally, the task of answering natural language questions has involved a keyword-based document retrieval step, followed by in-depth processing of candidate answer documents and paragraphs. ...
The improvements in performance over a regular free text search index-based question answering engine prove that QA systems can benefit greatly from the addition and consumption of deep semantic information ...
Mithun Balakrishna received his PhD in Computer Science from The University of Texas at Dallas in 2007. ...
doi:10.1145/2928294.2928302
dblp:conf/sigmod/TatuWBEM16
fatcat:vordb2ihurcmbbgda5lto54ozi
Information Retrieval with Verbose Queries
2015
Proceedings of the 38th International ACM SIGIR Conference on Research and Development in Information Retrieval - SIGIR '15
We also discussed various applications where supporting search for verbose queries can make a significant difference. ...
CERC is the CSIRO Enterprise Research Collection (CERC), a crawl of *.csiro.au (public) web sites conducted in March 2007 and used in the 2007 edition of the TREC Enterprise track. ...
"In 1981, Obama transferred to Columbia University in New York City, where he majored in political science with a specialty in international relations". ...
doi:10.1145/2766462.2767877
dblp:conf/sigir/GuptaB15
fatcat:tgjnvqbbwfepjiggyecfaupqsa
Content Facets For Individual Information Needs In Media
2018
Zenodo
This enables any media consumer to search for information according to their personal information need. This is a substantial improvement to individualize search in media. ...
Several proposed content facets have successfully been implemented in APA Labs, a Web-based framework for faceted search in traditional and social me- dia. ...
Enterprise Information Retrieval Enterprise Information Retrieval is the task of searching and filtering document collections from a company's knowledge base. ...
doi:10.5281/zenodo.1196397
fatcat:udr3736ejbek5lzl34tu4g4ppq
Content Facets For Individual Information Needs In Media
2018
Zenodo
This enables any media consumer to search for information according to their personal information need. This is a substantial improvement to individualize search in media. ...
Several proposed content facets have successfully been implemented in APA Labs, a Web-based framework for faceted search in traditional and social me- dia. ...
Enterprise Information Retrieval Enterprise Information Retrieval is the task of searching and filtering document collections from a company's knowledge base. ...
doi:10.5281/zenodo.1195993
fatcat:ce3ljnthjfhkpir3y4atnnlicy
Selective Search
2015
ACM Transactions on Information Systems
An increasing number of organizations and enterprises need search solutions that can process large volumes of data. ...
This search technique first partitions the corpus, based on documents' similarity, into topic-based shards. ...
Callan, Carnegie Mellon University, 5000 Forbes Ave, Pittsburgh, PA 15213. email: callan@cs.cmu.edu 1 Part of this work was done when the first author was a doctoral student at Carnegie Mellon University ...
doi:10.1145/2738035
fatcat:fistpgm5abemdeecnpiqmt4szi
Query modeling for entity search based on terms, categories, and examples
2011
ACM Transactions on Information Systems
Users often search for entities instead of documents, and in this setting, are willing to provide extra input, in addition to a series of query terms, such as category information and example entities. ...
Query modeling for entity search based on terms, categories, and examples. ...
The TREC 2005 TREC -2008 Enterprise track ] featured an expert finding task: given a topic, return a ranked list of experts on the topic. ...
doi:10.1145/2037661.2037667
fatcat:gotilqrajfdrtl2wykg72agiia
Efficient query processing in distributed search engines
2012
SIGIR Forum
For the second approach, we follow an alternative direction and look at document-at-a-time processing of sub-queries and skipping. ...
These two metrics have a large impact on the operational costs of the search engine and the overall user satisfaction, which determine the revenue of the search engine. ...
This work was done while the second author was an intern at Yahoo! ...
doi:10.1145/2492189.2492201
fatcat:uwasxhngrfgntemkhawyv3te64
Introduction to information retrieval
2009
ChoiceReviews
Document partitioning is found to be superior, at least when the distribution of terms is skewed, as it typically is in practice. ...
The Robots Exclusion Protocol standard is described at http://www.robotstxt.org/wc/exclusion.html. ...
the context of TREC. ...
doi:10.5860/choice.46-2715
fatcat:ruwoe46pgzcupjygnwbnit4z3u
« Previous
Showing results 1 — 15 out of 107 results