Filters








330,603 Hits in 4.3 sec

Peer-to-peer similarity search over widely distributed document collections

Christos Doulkeridis, Kjetil Nørvåg, Michalis Vazirgiannis
2008 Proceeding of the 2008 ACM workshop on Large-Scale distributed systems for information retrieval - LSDS-IR '08  
Such an application is retrieval of the top-k most similar documents in a widely distributed document collection, as in the case of digital libraries.  ...  Peer-to-peer (P2P) systems emerge as a promising solution to delve with content management in cases of highly distributed data collections.  ...  is not scalable for a large P2P system.  ... 
doi:10.1145/1458469.1458477 dblp:conf/cikm/DoulkeridisNV08 fatcat:rajsws3sevhlzo4pzploisjdbu

TREC

Ellen M. Voorhees
2007 Communications of the ACM  
The fundamental goal of a retrieval system is to help its users find information contained in large stores of free text.  ...  Large-scale test collections drive improvement in search technology to help users find information in free text. was the subject of an entire book Information Retrieval Experiment, edited by Karen Spärck  ...  Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage  ... 
doi:10.1145/1297797.1297822 fatcat:dnx7pk5wfbg3zolb64wr2avema

Building Better Search Engines by Measuring Search Quality

Ellen M. Voorhees, Paul Over, Ian Soboroff
2014 IT Professional Magazine  
Search engines help users locate particular information within large stores of content developed for human consumption.  ...  Origins of TREC Today we take search for text documents in our native language for granted, but web search engines such as Yahoo, Google, and Bing were not built in a day, nor is web content the only area  ...  collections) could effectively retrieve documents from "large" collections.  ... 
doi:10.1109/mitp.2013.105 fatcat:hk3zocjbxjawhfuye4k7gkcvqq

The TREC experiments and their impact on Europe

A. F. Smeaton, D. Harman
1997 Journal of information science  
For the most part, the evaluation of IR systems has been carried out on relatively small collections of documents, queries and relevance assessments.  ...  Information retrieval (IR) research on text collections has concentrated on improving the effectiveness of the indexing and retrieval operations.  ...  collection environment, documents, queries and relevance assessments, has meant cross-system comparisons on such large collections can be accommodated.  ... 
doi:10.1177/016555159702300302 fatcat:k237c67jznhfjfrnn63bfzgloq

The TREC experiments and their impact on Europe

Alan F. Smeaton, Donna Harman
1997 Journal of information science  
For the most part, the evaluation of IR systems has been carried out on relatively small collections of documents, queries and relevance assessments.  ...  Information retrieval (IR) research on text collections has concentrated on improving the effectiveness of the indexing and retrieval operations.  ...  collection environment, documents, queries and relevance assessments, has meant cross-system comparisons on such large collections can be accommodated.  ... 
doi:10.1177/016555159702300208 fatcat:5tionsnxuvcg3fn7wto4rtgyfe

Reverse annotation based retrieval from large document image collections

Pramod Sankar K.
2010 Proceeding of the 33rd international ACM SIGIR conference on Research and development in information retrieval - SIGIR '10  
to large collections is hard.  ...  In this work, we aim toward building a retrieval system over 120K document images coming from 1000 scanned books of Telugu literature.  ...  The true recall of the retrieval system cannot be computed, since it is impossible to identify every occurrence of the given query in such large data.  ... 
doi:10.1145/1835449.1835694 dblp:conf/sigir/Sankar10 fatcat:6xqmyuetpbfclps6ced75fbsbm

Content-based document image retrieval in complex document collections

G. Agam, S. Argamon, O. Frieder, D. Grossman, D. Lewis, Xiaofan Lin, Berrin A. Yanikoglu
2007 Document Recognition and Retrieval XIV  
We address important research issues concerning content-based document image retrieval and describe a prototype for integrated retrieval and aggregation of diverse information contained in scanned paper  ...  Large collections of such complex documents are commonly found in legal and security investigations.  ...  A large number of people have provided resources, information, and suggestions on this work, and we likewise thank them.  ... 
doi:10.1117/12.703163 dblp:conf/drr/ArgamonFGL07 fatcat:v73nietpbvh6ldxxb6ouzi6u6i

Granular Computing for the Design of Information Retrieval Support Systems [chapter]

Y. Y. Yao
2004 Network Theory and Applications  
To a large extent, information retrieval can still be viewed as document retrieval by substituting 'document' for 'information', as pointed out by van Rijsbergen long time ago [45] .  ...  An IR system is designed with the objective to provide useful and only useful documents from a large document collection [3, 39, 45] .  ... 
doi:10.1007/978-1-4613-0227-8_10 fatcat:fefy2x2u5jdmnfx7ez3g7tcmzu

Information Retrieval System Assigning Context to Documents by Relevance Feedback

Narina Thakur, Deepti Mehrotra, Abhay Bansal
2012 International Journal of Computer Applications  
Need for user profile and relevance of information while searching and extracting information, from information retrieval system is highlighted.  ...  The documents are re-ranked based on the user profile and his feedback. Proposed Information retrieval system uses vector space model and expert system.  ...  Information retrieval is finding information of an unstructured nature that satisfies an information need from within large collection.  ... 
doi:10.5120/9401-3815 fatcat:oqv7eupyhnendcycoorr2kwvga

A comparative study on the Assortment of Information Retrieval systems

L. Senthilvadivu
2018 International Journal of Scientific Research in Computer Sciences and Engineering  
With the advent of computers, it became possible to store large amounts of information; and finding useful information from such collections became a necessity.  ...  Several Information Retrieval (IR) systems are used on an everyday basis by a wide variety of users.  ...  Information retrieval (IR) is finding material usually documents of an unstructured nature such as text that satisfies an information need from within large collections which is stored on computers.  ... 
doi:10.26438/ijsrcse/v6i2.109112 fatcat:5du75zj3ezh3rlzq5dgbgtufhu

Building a test collection for complex document information processing

D. Lewis, G. Agam, S. Argamon, O. Frieder, D. Grossman, J. Heard
2006 Proceedings of the 29th annual international ACM SIGIR conference on Research and development in information retrieval - SIGIR '06  
Research and development of information access technology for scanned paper documents has been hampered by the lack of public test collections of realistic scope and complexity.  ...  As part of a project to create a prototype system for search and mining of masses of document images, we are assembling a 1.5 terabyte dataset to support evaluation of both end-to-end complex document  ...  Elsewhere we describe experience with a prototype CDIP system for retrieval and data mining on scanned documents [1] .  ... 
doi:10.1145/1148170.1148307 dblp:conf/sigir/LewisAAFGH06 fatcat:x54og6arovbuxh5pvy55sfzlxm

Continuous improvement of knowledge management systems using Six Sigma methodology

ChiaJou Lin, F. Frank Chen, Hung-da Wan, Yuh Min Chen, Glenn Kuriger
2013 Robotics and Computer-Integrated Manufacturing  
Besides, ambiguous query is also an important factor for the performance of knowledge retrieval systems.  ...  The knowledge retrieval evaluation mechanism allows system developers to maintain the knowledge retrieval system with ease and meanwhile enhance the accuracy.  ...  The most commonly used methodology of information retrieval systems evaluation needs a test collection, which contains a document collection, a set of topical queries and a set of relevance assessments  ... 
doi:10.1016/j.rcim.2012.04.018 fatcat:vpe5egyfcvgjzef3iyujglryuy

Best bets

Giuseppe Attardi, Andrea Esuli, Maria Simi
2004 Alternate track papers & posters of the 13th international conference on World Wide Web - WWW Alt. '04  
information retrieval.  ...  We developed techniques for implementing Best Bets systems addressing performance issues for large scale deployment as efficient query search, incremental updates and dynamic ranking.  ...  Document search In Information Retrieval (IR), given a document collection D, the task is to retrieve all or the top k best ranking documents satisfying a given query q, i.e. search(q, D) = { d ∈ D | match  ... 
doi:10.1145/1010432.1010571 fatcat:q2nydxldtzdpbopdzwirdarjuy

Full-text information retrieval: Further analysis and clarification

David C. Blair, M.E. Maron
1990 Information Processing & Management  
In 1985, an article by Blair and Maron described a detailed evaluation of the effectiveness of an operational full text retrieval system used to support the defense of a large corporate lawsuit.  ...  The most critical problem for Information Retrieval research now is to give us an effective model for how large, operational retrieval systems work.  ...  (Simple full-text retrieval systems command by far the greatest market share of new, large-scale document retrieval systems.  ... 
doi:10.1016/0306-4573(90)90102-8 fatcat:6w5pgfgivbeivln2kdpsle2qqi

STAIRS redux: Thoughts on the STAIRS evaluation, ten years after

David C. Blair
1996 Journal of the American Society for Information Science  
studies on large documents retrieval systems.  ...  to benchmark a comparatively large, commercial information retrieval system.  ... 
doi:10.1002/(sici)1097-4571(199601)47:1<4::aid-asi2>3.3.co;2-5 fatcat:5it3y7yujjetpicnlottaypxre
« Previous Showing results 1 — 15 out of 330,603 results