Filters








220,243 Hits in 7.9 sec

Layout Based Information Retrieval from Document Images

D Shobana
2012 IOSR Journal of Computer Engineering  
from the document images.  ...  Later when a query image has been supplied, it retrieves the similar layout images from the layout databases. 3.  ...  A Survey Based on the Research: Accessing collections of document text is a problem that has been addressed by the information retrieval (IR) community for many years.  ... 
doi:10.9790/0661-0443135 fatcat:t56rxok7szgppgsm7jh2ntdd6u

Text-based Question Answering from Information Retrieval and Deep Neural Network Perspectives: A Survey [article]

Zahra Abbasiantaeb, Saeedeh Momtazi
2020 arXiv   pre-print
In this paper, we provide a comprehensive overview of different models proposed for the QA task, including both traditional information retrieval perspective, and more recent deep neural network perspective  ...  We also introduce well-known datasets for the task and present available results from the literature to have a comparison between different techniques.  ...  In the query formulation step, a query for a given question is generated for retrieving relevant documents by employing an Information Retrieval (IR) engine.  ... 
arXiv:2002.06612v2 fatcat:wndvk257bncfphgtzxz7sr4o4e

A survey on question answering technology from an information retrieval perspective

Oleksandr Kolomiyets, Marie-Francine Moens
2011 Information Sciences  
It presents the question answering task from an information retrieval perspective and emphasises the importance of retrieval models, i.e., representations of queries and information documents, and retrieval  ...  The survey suggests a general question answering architecture that steadily increases the complexity of the representation level of questions and information objects.  ...  The Future of Question Answering In this survey we considered question answering as an information retrieval task, in which users receive direct answers extracted from documents to their natural language  ... 
doi:10.1016/j.ins.2011.07.047 fatcat:se6dcorj2jbidath35go5wi5yy

On "deep" knowledge extraction from documents

Udo Hahn, Martin Romacker
2000 Open research Areas in Information Retrieval  
S y n D i K A Te com prises a fam ily o f natural language understanding systems for automatically acquiring know l edge from real-w orld texts (e.g., information technology test reports, medical finding  ...  It is also crucial for any inform ation system application m aking use o f automatically generated text know ledge bases in a reliable way.  ...  Martin Romacker is supported by a grant from DFG (Ha 2097/5-1).  ... 
dblp:conf/riao/HahnR00 fatcat:f7coedcddbabjngfs4iwzegv5i

Information Retrieval - From Information Access to Contextual Retrieval

Norbert Fuhr
2020 Zenodo  
Information Retrieval (IR) deals with uncertainty and vagueness in in-formation systems.  ...  Uncertainty is caused by the problem of representing the semantics of text and other media, which cannot be done in a perfect way.  ...  First, a user has to discover potentially relevant sources, from which s/he can retrieve the documents s/he is looking for.  ... 
doi:10.5281/zenodo.4136995 fatcat:ufop5acffbaczf2po7c5hntsti

Requests for information from a film archive: a case study of multimedia retrieval

Morten Hertzum
2003 Journal of Documentation  
Multimedia retrieval is a complex and to some extent still unexplored area.  ...  Based on a full year of e-mail requests addressed to a large film archive this study analyses what types of information needs real users have and how these needs are expressed.  ...  information from a film archive improved performance.  ... 
doi:10.1108/00220410310463473 fatcat:vl5d5zo7v5govitpvmwmueudpa

A Survey of Multilingual Document Clustering

Kavita Moholkar
2017 International Journal Of Engineering And Computer Science  
Classification of documents for the languages without labeled training data set is a major challenge.  ...  The amount of multilingual documents generated on internet, is increasing day by day. Multilingual document clustering (MDC) is a technique of classifying documents in different languages.  ...  Saraswathi [8] proposed a system for information retrieval on festival domain for English and Tamil.  ... 
doi:10.18535/ijecs/v6i4.21 fatcat:tcls565sqnfxxj3bx722smrviq

Contextual Search: From Information Behaviour to Information Retrieval

Luanne Freund, Elaine G. Toms
2013 Proceedings of the Annual Conference of CAIS / Actes du congrès annuel de l'ACSI  
We present research that combines information behaviour and information retrieval approaches to develop a contextual search system for a software engineering work domain.Le contexte influence le comportement  ...  Context influences information seeking behaviour; however, search systems have not made much use of contextual information to date.  ...  Document Analysis: We compiled a proportional sample of 200 documents from 11 core document repositories to identify features of the information space that could be exploited by a contextual search system  ... 
doi:10.29173/cais280 fatcat:fdfri3nvmfbnvf5uqchjt2ttze

An Information Retrieval Pipeline for Legislative Documents from the Brazilian Chamber of Deputies [chapter]

Ellen Souza, Douglas Vitório, Gyovana Moriyama, Luiz Santos, Lucas Martins, Mariana Souza, Márcio Fonseca, Nádia Félix, André C.P.L.F. Carvalho, Hidelberg O. Albuquerque, Adriano L.I. Oliveira
2021 Frontiers in Artificial Intelligence and Applications  
Retrieving the bill that was originated from a specific job request, the BM25L with Savoy stemmer reached a R@20 of 0.7356.  ...  This work investigates information retrieval methods to address the existing difficulties on the Preliminary Search, part of the law making process from the Brazilian Chamber of Deputies.  ...  3233/FAIA210326 An Information Retrieval Pipeline for gislative Documents from the Brazilian Chamber of Deputies SOUZA a,b,1 , Douglas VITÓRIO a,d , Gyovana MORIYAMA b , Luiz SANTOS  ... 
doi:10.3233/faia210326 fatcat:ttz45uzisbf2dhihh2rsrkgbza

A Survey on Document Image Analysis

Dr. S. Vijayarani
2018 International Journal for Research in Applied Science and Engineering Technology  
Information in these document images are more structured and presented in a natural language with the help of a grammar and a script.  ...  This paper discusses document image analysis, applications of document images, challenges and issues for handling document images.  ...  Information retrieval from the document images is a challenging task, hence number of techniques and procedures are used for document image processing [3] .  ... 
doi:10.22214/ijraset.2018.4257 fatcat:ptngdhmucjd4pffgymu5vi5zay

Summarization from medical documents: a survey

Stergos Afantenos, Vangelis Karkaletsis, Panagiotis Stamatopoulos
2005 Artificial Intelligence in Medicine  
Methodology: This survey gives first a general background on documents summarization, presenting the factors that summarization depends upon, discussing evaluation issues and describing briefly the various  ...  Objective: The aim of this paper is to survey the recent work in medical documents summarization.  ...  The retrieved documents are checked for possible relevance by a text passage retrieval component. Irrelevant documents are discarded.  ... 
doi:10.1016/j.artmed.2004.07.017 pmid:15811783 fatcat:n7u6ji5t2rgkvjktacjf4rdire

Automatic document processing: A survey

Yuan Y. Tang, Seong-Whan Lee, Ching Y. Suen
1996 Pattern Recognition  
Surveys of the basic concepts and underlying techniques are presented in this paper.  ...  It provides capabilities for automatically indexing form document for storage/ retrieval to/from a document library, and for capturing information from scanned form images using OCR.  ...  The acquisition of knowledge from such documents by an information system can involve an extensive amount of handcrafting.  ... 
doi:10.1016/s0031-3203(96)00044-1 fatcat:zt3437y3qbfrvhkaxrjgyubiuy

Hairpins in bookstacks: Information retrieval from biomedical text

H. Shatkay
2005 Briefings in Bioinformatics  
She is an active member of the biomedical text-mining community, and one of the first researchers in the area of text mining and information retrieval for bioinformatics.  ...  The following sections provide a survey of basic concepts and methods in information retrieval, discuss the way they are applied in the biomedical domain, and demonstrate the use of information retrieval  ...  INFORMATION RETRIEVAL: THE BASICS Information retrieval is concerned with identifying, within a large document collection, a subset of documents whose content is most relevant to a user's need.  ... 
doi:10.1093/bib/6.3.222 pmid:16212771 fatcat:2liwpccfq5dhza5mmdw66q7m2m

System components for embedded information retrieval from multiple disparate information sources

Ramana Rao, Daniel M. Russell, Jock D. Mackinlay
1993 Proceedings of the 6th annual ACM symposium on User interface software and technology - UIST '93  
The tlrst is a design for a user/system interaction model for retrieval from multiple, disparate information sources.  ...  Current information retrieval interfaces only address a small pant of the reality of rich interactions amongst user, task, and information sources.  ...  Acknowledgements A number of our colleagues have contributed to this work in various ways.  ... 
doi:10.1145/168642.168645 dblp:conf/uist/RaoRM93 fatcat:gklclcoqhndojmlfoy4cjtey34

A Survey of Historical Document Image Datasets [article]

Konstantina Nikolaidou, Mathias Seuret, Hamam Mokayed, Marcus Liwicki
2022 arXiv   pre-print
We present the statistics, document type, language, tasks, input visual aspects, and ground truth information for every dataset.  ...  This paper presents a systematic literature review of image datasets for document image analysis, focusing on historical documents, such as handwritten manuscripts and early prints.  ...  Conclusion We demonstrated a survey of historical document image datasets following a systematic literature review methodology.  ... 
arXiv:2203.08504v2 fatcat:ilgqqgylfzejnpccrsg7vfsncm
« Previous Showing results 1 — 15 out of 220,243 results