Filters








107 Hits in 5.2 sec

Natural Language Processing as a Foundation of the Semantic Web

Yorick Wilks, Christopher Brewster
2007 Foundations and Trends® in Web Science  
This perspective places NLP and its associated techniques like Information Extraction at the core of the Semantic Web/Web Science enterprise.  ...  Those who continue to maintain that "universal words" are not the English words they look most like, must at least tell us which of the senses of the real word closest to the "universal word" they intend  ... 
doi:10.1561/1800000002 fatcat:n2xfw3qdhverrokidb2globwyq

Statistical source expansion for question answering

Nico Schlaefer, Jennifer Chu-Carroll, Eric Nyberg, James Fan, Wlodek Zadrozny, David Ferrucci
2011 Proceedings of the 20th ACM international conference on Information and knowledge management - CIKM '11  
and 67% on TREC questions. iv First of all, I would like to thank my advisor Eric Nyberg for his support and guidance throughout my studies at Carnegie Mellon.  ...  quiz show and TREC evaluations. By expanding various seed corpora with web search results, we were able to improve the QA accuracy of Watson from 66% to 71% on regular Jeopardy!  ...  ., , 2007 developed at Carnegie Mellon University and previously at Universität Karlsruhe is a modular and extensible framework that supports the integration of different QA algorithms and knowledge resources  ... 
doi:10.1145/2063576.2063632 dblp:conf/cikm/SchlaeferCNFZF11 fatcat:whoy62klazctbdo4p57wbevkdu

Entity Centric Information Retrieval

Xitong Liu
2016 SIGIR Forum  
Motivated by the importance of the enterprise search, the enterprise track in TREC has been launched since 2005 [8, 10, 46, 134] .  ...  the search quality of enterprise search.  ... 
doi:10.1145/2964797.2964815 fatcat:qdmhwfminnaefonienqyggckbm

Fast candidate generation for real-time tweet search with bloom filter chains

Nima Asadi, Jimmy Lin
2013 ACM Transactions on Information Systems  
at the time the query is issued.  ...  The rise of social media and other forms of user-generated content have created the demand for real-time search: against a high-velocity stream of incoming documents, users desire a list of relevant results  ...  Asadi, Department of Computer Science, University of Maryland at College Park; email: nima@cs.umd.edu; J.  ... 
doi:10.1145/2493175.2493178 fatcat:lp3unrp4bvgwzbs4d72bpqyupy

Building Watson: An Overview of the DeepQA Project

David Ferrucci, Eric Brown, Jennifer Chu-Carroll, James Fan, David Gondek, Aditya A. Kalyanpur, Adam Lally, J. William Murdock, Eric Nyberg, John Prager, Nico Schlaefer, Chris Welty
2010 The AI Magazine  
After 3 years of intense research and development by a core team of about 20 researches, Watson is performing at human expert-levels in terms of precision, confidence and speed at the Jeopardy!  ...  IBM Research undertook a challenge to build a computer system that could compete at the human champion level in real time on the American TV Quiz show, Jeopardy!  ...  Acknowledgements We would like to acknowledge the talented team of research scientists and engineers at IBM and at partner universities, listed below, for the incredible work they are doing to influence  ... 
doi:10.1609/aimag.v31i3.2303 fatcat:bqzu6kcak5gnhpabvyv6vuc7n4

Term-Based Models for Entity Ranking [chapter]

Krisztian Balog
2018 Advanced Topics in Information Retrieval  
In this chapter, we adhere to the "single search box" paradigm, which accepts "free text" search queries, and simply treat queries as sequences of words, referred to hereinafter as terms.  ...  Returning specific entities, instead of a mere list of documents, can provide better answers to a broad range of information needs.  ...  TREC Enterprise The TREC (Text Retrieval Conference) 2005-2008 Enterprise track [8, 25] featured an expert finding task, where a single type of entity was sought: people, who are experts on a given topic  ... 
doi:10.1007/978-3-319-93935-3_3 fatcat:dtyxpoiqsbfhtm6m5q24oxp5ii

Remedies against the Vocabulary Gap in Information Retrieval [article]

Christophe Van Gysel
2017 arXiv   pre-print
Search engines rely heavily on term-based approaches that represent queries and documents as bags of words.  ...  When presented with a search query, the engine then ranks documents according to their relevance scores by computing, among other things, the matching degrees between query and document terms.  ...  and published by the New York Times between 1987 and 2007.  ... 
arXiv:1711.06004v1 fatcat:6vkhvfby3zbzrepgopunm7gie4

Semantic question answering on big data

Marta Tatu, Steven Werner, Mithun Balakrishna, Tatiana Erekhinskaya, Dan Moldovan
2016 Proceedings of the International Workshop on Semantic Big Data - SBD '16  
Traditionally, the task of answering natural language questions has involved a keyword-based document retrieval step, followed by in-depth processing of candidate answer documents and paragraphs.  ...  The improvements in performance over a regular free text search index-based question answering engine prove that QA systems can benefit greatly from the addition and consumption of deep semantic information  ...  Mithun Balakrishna received his PhD in Computer Science from The University of Texas at Dallas in 2007.  ... 
doi:10.1145/2928294.2928302 dblp:conf/sigmod/TatuWBEM16 fatcat:vordb2ihurcmbbgda5lto54ozi

Information Retrieval with Verbose Queries

Manish Gupta, Michael Bendersky
2015 Proceedings of the 38th International ACM SIGIR Conference on Research and Development in Information Retrieval - SIGIR '15  
We also discussed various applications where supporting search for verbose queries can make a significant difference.  ...  CERC is the CSIRO Enterprise Research Collection (CERC), a crawl of *.csiro.au (public) web sites conducted in March 2007 and used in the 2007 edition of the TREC Enterprise track.  ...  "In 1981, Obama transferred to Columbia University in New York City, where he majored in political science with a specialty in international relations".  ... 
doi:10.1145/2766462.2767877 dblp:conf/sigir/GuptaB15 fatcat:tgjnvqbbwfepjiggyecfaupqsa

Content Facets For Individual Information Needs In Media

Elisabeth Lex, Stefanie Lindstaedt, Michael Granitzer, Harald Kosch
2018 Zenodo  
This enables any media consumer to search for information according to their personal information need. This is a substantial improvement to individualize search in media.  ...  Several proposed content facets have successfully been implemented in APA Labs, a Web-based framework for faceted search in traditional and social me- dia.  ...  Enterprise Information Retrieval Enterprise Information Retrieval is the task of searching and filtering document collections from a company's knowledge base.  ... 
doi:10.5281/zenodo.1196397 fatcat:udr3736ejbek5lzl34tu4g4ppq

Content Facets For Individual Information Needs In Media

Elisabeth Lex, Stefanie Lindstaedt, Michael Granitzer, Harald Kosch
2018 Zenodo  
This enables any media consumer to search for information according to their personal information need. This is a substantial improvement to individualize search in media.  ...  Several proposed content facets have successfully been implemented in APA Labs, a Web-based framework for faceted search in traditional and social me- dia.  ...  Enterprise Information Retrieval Enterprise Information Retrieval is the task of searching and filtering document collections from a company's knowledge base.  ... 
doi:10.5281/zenodo.1195993 fatcat:ce3ljnthjfhkpir3y4atnnlicy

Selective Search

Anagha Kulkarni, Jamie Callan
2015 ACM Transactions on Information Systems  
An increasing number of organizations and enterprises need search solutions that can process large volumes of data.  ...  This search technique first partitions the corpus, based on documents' similarity, into topic-based shards.  ...  Callan, Carnegie Mellon University, 5000 Forbes Ave, Pittsburgh, PA 15213. email: callan@cs.cmu.edu 1 Part of this work was done when the first author was a doctoral student at Carnegie Mellon University  ... 
doi:10.1145/2738035 fatcat:fistpgm5abemdeecnpiqmt4szi

Query modeling for entity search based on terms, categories, and examples

Krisztian Balog, Marc Bron, Maarten De Rijke
2011 ACM Transactions on Information Systems  
Users often search for entities instead of documents, and in this setting, are willing to provide extra input, in addition to a series of query terms, such as category information and example entities.  ...  Query modeling for entity search based on terms, categories, and examples.  ...  The TREC 2005 TREC -2008 Enterprise track ] featured an expert finding task: given a topic, return a ranked list of experts on the topic.  ... 
doi:10.1145/2037661.2037667 fatcat:gotilqrajfdrtl2wykg72agiia

Efficient query processing in distributed search engines

Simon Jonassen
2012 SIGIR Forum  
For the second approach, we follow an alternative direction and look at document-at-a-time processing of sub-queries and skipping.  ...  These two metrics have a large impact on the operational costs of the search engine and the overall user satisfaction, which determine the revenue of the search engine.  ...  This work was done while the second author was an intern at Yahoo!  ... 
doi:10.1145/2492189.2492201 fatcat:uwasxhngrfgntemkhawyv3te64

Introduction to information retrieval

2009 ChoiceReviews  
Document partitioning is found to be superior, at least when the distribution of terms is skewed, as it typically is in practice.  ...  The Robots Exclusion Protocol standard is described at http://www.robotstxt.org/wc/exclusion.html.  ...  the context of TREC.  ... 
doi:10.5860/choice.46-2715 fatcat:ruwoe46pgzcupjygnwbnit4z3u
« Previous Showing results 1 — 15 out of 107 results