Filters








27,507 Hits in 6.5 sec

On the Separation of Logical and Physical Ranking Models for Text Retrieval Applications

Jimmy Lin, Xueguang Ma, Joel Mackenzie, Antonio Mallia
2021 Biennial Conference on Design of Experimental Search & Information Retrieval Systems  
Viewed in database terms, this captures a tight coupling between the "logical" aspects of ranking (i.e., term weighting) and the "physical" aspects of ranking (query evaluation).  ...  We argue that explicitly decoupling these two aspects offers a framework for thinking about the relationship between sparse retrieval techniques and the rapidly growing literature on dense retrieval techniques  ...  Acknowledgements We'd like to thank Arjen de Vries for helpful comments on an earlier draft of this piece.  ... 
dblp:conf/desires/LinMMM21 fatcat:neuo5rks6fgvzlo5stgryffuza

The Multi-model DBMS Architecture and XML Information Retrieval [chapter]

Arjen P. de Vries, Johan A. List, Henk Ernst Blok
2003 Lecture Notes in Computer Science  
Since long, computer science has distinguished between information retrieval and data retrieval, where information retrieval entails the problem of ranking textual documents on their content (with the  ...  semi-structured documents instead of plain text, with usage scenarios that require the combination of 'conventional' ranking with other query constraints; based on the structure of text documents, on  ...  Another advantage of the separation of concerns in the Multi-Model DBMS architecture is that it allows IR researchers to concentrate on retrieval models and reduce the effort of implementation involved  ... 
doi:10.1007/978-3-540-45194-5_12 fatcat:qtjesgap3zh3dbivuzrzbgn4ee

A Proposed Conceptual Framework for a Representational Approach to Information Retrieval [article]

Jimmy Lin
2021 arXiv   pre-print
I propose a representational approach that breaks the core text retrieval problem into a logical scoring model and a physical retrieval model.  ...  The physical retrieval model defines how a system produces the top-k scoring documents from an arbitrarily large corpus with respect to a query.  ...  Acknowledgements This research was supported in part by the Natural Sciences and Engineering Research Council (NSERC) of Canada.  ... 
arXiv:2110.01529v2 fatcat:iluzpawvjbbwdan3ei2aanywm4

Challenging Ubiquitous Inverted Files

Arjen P. de Vries
2000 DELOS Workshops / Conferences  
We propose to base the development of retrieval systems on 'the database approach': mapping high-level declarative specifications of the retrieval process into efficient query plans.  ...  Stand-alone ranking systems based on highly optimized inverted file structures are generally considered 'the' solution for building search engines.  ...  Acknowledgements Annita Wilschut inspired this research with her work on Moa and GIS. Djoerd Hiemstra has been very supportive and helpful with IR experiments.  ... 
dblp:conf/delos/Vries00 fatcat:e6fnfwyl25eubioznekaahjtbe

An Attribute-based Model for Semantic Retrieval

Hany Azzam, Thomas Roelleke
2010 Lernen, Wissen, Daten, Analysen  
The framework facilitates the transformation of "term-only" retrieval models into "semantic-aware" retrieval models that consist of semantic propositions, such as relationships and classification of objects  ...  The modelling approach represents both semantic and textual data in one unifying framework, referred to as the probabilistic object-relational content modelling framework.  ...  Acknowledgments We would like to thank Jinyoung Kim of the University of Massachusetts Amherst for providing us with the collection and the queries.  ... 
dblp:conf/lwa/AzzamR10 fatcat:euiyz3prdvglnlhhkh356vapyi

TIJAH: Embracing IR Methods in XML Databases

Johan List, Vojkan Mihajlović, Georgina RamÍrez, Arjen P. de Vries, Djoerd Hiemstra, Henk Ernst Blok
2005 Information retrieval (Boston)  
TIJAH's system design follows a 'standard' layered database architecture, carefully separating the conceptual, logical and physical levels.  ...  This paper discusses our participation in INEX (the Initiative for the Evaluation of XML Retrieval) using the TIJAH XML-IR system.  ...  Acknowledgements The authors are grateful to the Netherlands Organization for Scientific Research (NWO), for funding the research described in this paper (grant number 612.061.210).  ... 
doi:10.1007/s10791-005-0747-2 fatcat:titzb6xnifciphso5gux73arvu

A review of structured document retrieval (SDR) technology to improve information access performance in engineering document management

S. Liu, C.A. McMahon, S.J. Culley
2008 Computers in industry (Print)  
This paper reviews the work carried out from the inception to the development and application of SDR in engineering document management.  ...  The paper concludes with the expectation that SDR will make a positive impact on the process of engineering document management from document construction to its delivery in the future, and undoubtedly  ...  Acknowledgements The research reported in this paper was funded by the UK Engineering and Physical Sciences Research Council (EPSRC) under grant number EP/C534220/1 for the project of ''Immortal Information  ... 
doi:10.1016/j.compind.2007.08.001 fatcat:vtoia3k6vbdcdpowrsj5i3tvxq

DOLORES

Norbert Fuhr, Norbert Gövert, Thomas Rölleke
1998 Proceedings of the 21st annual international ACM SIGIR conference on Research and development in information retrieval - SIGIR '98  
We describe the design and implementation of a system for logic-based multimedia retrieval.  ...  The overall architecture and the flexibility of each layer supports logic-based methods for multimedia information retrieval. 1 We regard hypermedia documents as the most general case, subsuming multimedia  ...  Here we separate the logical and the physical level of our IR system: Depending on the availability and the usage of access structures, there are different subclasses of pred-search: ps-direct uses an  ... 
doi:10.1145/290941.291005 dblp:conf/sigir/FuhrGR98 fatcat:geg4kpqmw5eaje3xf2am25535u

Dependencies: Formalising Semantic Catenae for Information Retrieval [article]

Christina Lioma
2017 arXiv   pre-print
A prerequisite for processing text semantics, common to the above examples, is having some computational representation of text as an abstract object.  ...  This dissertation contributes a series of such tools, diverse in their mathematical formulation, but common in their application to model semantic inferences when machines process text.  ...  State of the art benchmark datasets are used for both text reordering and retrieval (500GB in total), and effectiveness (accuracy, mean reciprocal rank, expected reciprocal rank) is measured against state  ... 
arXiv:1709.03742v1 fatcat:4fdrnsmwdnb4pe37b6ritmvnme

SIGIR 2005 Doctoral Consortium

David J. Harper
2005 Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval - SIGIR '05  
The framework is able to support a wide range of structured IR queries, transparent instantiations of different retrieval models, and different physical implementations.  ...  It is based on so-called score region algebra (SRA) that can express the following four essential ranked retrieval aspects for structured IR: term and element selection, element relevance score computation  ...  The application of the idea of text regions to XML documents is straightforward.  ... 
doi:10.1145/1076034.1230135 fatcat:tcyce6wmqvax3lvfxm5ffiujoe

Automated document content characterization for a multimedia document retrieval system

Maija Koivusaari, Jaakko J. Sauvola, Matti Pietikaeinen, C.-C. Jay Kuo, Shih-Fu Chang, Venkat N. Gudivada
1997 Multimedia Storage and Archiving Systems II  
In this context the documents consist of text, picture and other media (possibly embedded) data. Documents are stored in the database as document, page and region objects.  ...  In this paper the object-oriented storage model and the database system are presented in formal and functional domains.  ...  ACKNOWLEDGEMENTS The financial support provided by the Academy of Finland and Technology Development Center is gratefully acknowledged.  ... 
doi:10.1117/12.290337 fatcat:d7vmjmvzcvc7hc5ag6t5vkzcfm

A Survey on Document Image Analysis and Retrieval System

Umesh D. Dixit, Shirdhonkar M.S
2015 International Journal on Cybernetics & Informatics  
Signature, Logo and Layout of the documents present convincing evidence and provide an important form of indexing for effective document image retrieval in a variety of applications.  ...  The digitization of documents and their availability over the network demands solution toward content based document image analysis, indexing, searching and retrieval.  ...  The layout based retrieval could be again depending on logical, physical or functional structures. Physical structures are colors, fonts, block types etc.  ... 
doi:10.5121/ijci.2015.4225 fatcat:qy5zdi2mmfgqhjoa5qreq3rr7q

Geometric and quantum methods for information retrieval

Yaoyong Li, Hamish Cunningham
2008 SIGIR Forum  
It also presents the applications of the concepts and methods in quantum mechanics such as quantum logic and tensor product to document retrieval and meaning of composite words, respectively.  ...  The purpose of the paper is to give the state of the art on and to draw attention of the IR community to the geometric and quantum methods and their potential applications in IR and NLP.  ...  Acknowledgements: We would like to thank our colleague Adam Funk for help improving the English of the manuscript. This work was supported by the EU-founded project LarKC, http://www.larkc.eu/.  ... 
doi:10.1145/1480506.1480510 fatcat:touvnagimjfcvfzfj7z3yjlivi

XML Information Retrieval:An overview [article]

Suma D., U. Dinesh Acharya, Geetha M., Raviraja Holla M
2014 arXiv   pre-print
Locating and distilling the valuable relevant information continued to be the major challenges of Information Retrieval (IR) Systems owing to the explosive growth of online web information.  ...  Meanwhile literatures reveal development of the rapid and intelligent IR systems.  ...  The publications on formalization of ontologies cover a wide spectrum from algebraic approaches and logic-based languages for modeling ontologies to ontologies for conceptual data modeling and data interpretation  ... 
arXiv:1410.7654v1 fatcat:kxr2v2ezb5a4fppup5u5kzukzy

Preliminary experiments using subjective logic for the polyrepresentation of information needs

Christina Lioma, Birger Larsen, Peter Ingwersen
2012 Proceedings of the 4th Information Interaction in Context Symposium on - IIIX '12  
We focus on the polyrepresentation of different types of context relating to user information needs (i.e. work task, user background knowledge, ideal answer) and show that the subjective logic model can  ...  No experimental evidence or practical application has so far validated this model. We extend the work of Lioma et al. (2010) [15], by providing a practical application and analysis of the model.  ...  We do not weight separately any of the query fields; we simply concatenate all text into one query.  ... 
doi:10.1145/2362724.2362755 dblp:conf/iiix/LiomaLI12 fatcat:yyqf7xw34nau3ishkpntfabcyi
« Previous Showing results 1 — 15 out of 27,507 results