Layout Based Information Retrieval from Document Images
2012
IOSR Journal of Computer Engineering
from the document images. ...
Later, when a query image is supplied, it retrieves images with similar layouts from the layout databases. ...
A Survey Based on the Research: Accessing collections of document text is a problem that has been addressed by the information retrieval (IR) community for many years. ...
doi:10.9790/0661-0443135
fatcat:t56rxok7szgppgsm7jh2ntdd6u
Text-based Question Answering from Information Retrieval and Deep Neural Network Perspectives: A Survey
[article]
2020
arXiv
pre-print
In this paper, we provide a comprehensive overview of different models proposed for the QA task, including both the traditional information retrieval perspective and the more recent deep neural network perspective ...
We also introduce well-known datasets for the task and present available results from the literature to have a comparison between different techniques. ...
In the query formulation step, a query for a given question is generated for retrieving relevant documents by employing an Information Retrieval (IR) engine. ...
arXiv:2002.06612v2
fatcat:wndvk257bncfphgtzxz7sr4o4e
A survey on question answering technology from an information retrieval perspective
2011
Information Sciences
It presents the question answering task from an information retrieval perspective and emphasises the importance of retrieval models, i.e., representations of queries and information documents, and retrieval ...
The survey suggests a general question answering architecture that steadily increases the complexity of the representation level of questions and information objects. ...
The Future of Question Answering: In this survey we considered question answering as an information retrieval task, in which users receive direct answers extracted from documents to their natural language ...
doi:10.1016/j.ins.2011.07.047
fatcat:se6dcorj2jbidath35go5wi5yy
On "deep" knowledge extraction from documents
2000
Open research Areas in Information Retrieval
SynDiKATe comprises a family of natural language understanding systems for automatically acquiring knowledge from real-world texts (e.g., information technology test reports, medical finding ...
It is also crucial for any information system application making use of automatically generated text knowledge bases in a reliable way. ...
Martin Romacker is supported by a grant from DFG (Ha 2097/5-1). ...
dblp:conf/riao/HahnR00
fatcat:f7coedcddbabjngfs4iwzegv5i
Information Retrieval - From Information Access to Contextual Retrieval
2020
Zenodo
Information Retrieval (IR) deals with uncertainty and vagueness in information systems. ...
Uncertainty is caused by the problem of representing the semantics of text and other media, which cannot be done in a perfect way. ...
First, a user has to discover potentially relevant sources, from which s/he can retrieve the documents s/he is looking for. ...
doi:10.5281/zenodo.4136995
fatcat:ufop5acffbaczf2po7c5hntsti
Requests for information from a film archive: a case study of multimedia retrieval
2003
Journal of Documentation
Multimedia retrieval is a complex and to some extent still unexplored area. ...
Based on a full year of e-mail requests addressed to a large film archive this study analyses what types of information needs real users have and how these needs are expressed. ...
information from a film archive improved performance. ...
doi:10.1108/00220410310463473
fatcat:vl5d5zo7v5govitpvmwmueudpa
A Survey of Multilingual Document Clustering
2017
International Journal Of Engineering And Computer Science
Classification of documents for languages without a labeled training data set is a major challenge. ...
The number of multilingual documents generated on the internet is increasing day by day. Multilingual document clustering (MDC) is a technique for classifying documents in different languages. ...
Saraswathi [8] proposed a system for information retrieval on festival domain for English and Tamil. ...
doi:10.18535/ijecs/v6i4.21
fatcat:tcls565sqnfxxj3bx722smrviq
Contextual Search: From Information Behaviour to Information Retrieval
2013
Proceedings of the Annual Conference of CAIS / Actes du congrès annuel de l'ACSI
We present research that combines information behaviour and information retrieval approaches to develop a contextual search system for a software engineering work domain. ...
Context influences information seeking behaviour; however, search systems have not made much use of contextual information to date. ...
Document Analysis: We compiled a proportional sample of 200 documents from 11 core document repositories to identify features of the information space that could be exploited by a contextual search system ...
doi:10.29173/cais280
fatcat:fdfri3nvmfbnvf5uqchjt2ttze
An Information Retrieval Pipeline for Legislative Documents from the Brazilian Chamber of Deputies
[chapter]
2021
Frontiers in Artificial Intelligence and Applications
When retrieving the bill that originated from a specific job request, BM25L with the Savoy stemmer reached an R@20 of 0.7356. ...
This work investigates information retrieval methods to address the existing difficulties on the Preliminary Search, part of the law making process from the Brazilian Chamber of Deputies. ...
... SOUZA, Douglas VITÓRIO, Gyovana MORIYAMA, Luiz SANTOS ...
doi:10.3233/faia210326
fatcat:ttz45uzisbf2dhihh2rsrkgbza
A Survey on Document Image Analysis
2018
International Journal for Research in Applied Science and Engineering Technology
Information in these document images is more structured and presented in a natural language with the help of a grammar and a script. ...
This paper discusses document image analysis, applications of document images, challenges and issues for handling document images. ...
Information retrieval from document images is a challenging task, hence a number of techniques and procedures are used for document image processing [3]. ...
doi:10.22214/ijraset.2018.4257
fatcat:ptngdhmucjd4pffgymu5vi5zay
Summarization from medical documents: a survey
2005
Artificial Intelligence in Medicine
Methodology: This survey gives first a general background on documents summarization, presenting the factors that summarization depends upon, discussing evaluation issues and describing briefly the various ...
Objective: The aim of this paper is to survey the recent work in medical documents summarization. ...
The retrieved documents are checked for possible relevance by a text passage retrieval component. Irrelevant documents are discarded. ...
doi:10.1016/j.artmed.2004.07.017
pmid:15811783
fatcat:n7u6ji5t2rgkvjktacjf4rdire
Automatic document processing: A survey
1996
Pattern Recognition
Surveys of the basic concepts and underlying techniques are presented in this paper. ...
It provides capabilities for automatically indexing form documents for storage in and retrieval from a document library, and for capturing information from scanned form images using OCR. ...
The acquisition of knowledge from such documents by an information system can involve an extensive amount of handcrafting. ...
doi:10.1016/s0031-3203(96)00044-1
fatcat:zt3437y3qbfrvhkaxrjgyubiuy
Hairpins in bookstacks: Information retrieval from biomedical text
2005
Briefings in Bioinformatics
She is an active member of the biomedical text-mining community, and one of the first researchers in the area of text mining and information retrieval for bioinformatics. ...
The following sections provide a survey of basic concepts and methods in information retrieval, discuss the way they are applied in the biomedical domain, and demonstrate the use of information retrieval ...
INFORMATION RETRIEVAL: THE BASICS Information retrieval is concerned with identifying, within a large document collection, a subset of documents whose content is most relevant to a user's need. ...
doi:10.1093/bib/6.3.222
pmid:16212771
fatcat:2liwpccfq5dhza5mmdw66q7m2m
System components for embedded information retrieval from multiple disparate information sources
1993
Proceedings of the 6th annual ACM symposium on User interface software and technology - UIST '93
The first is a design for a user/system interaction model for retrieval from multiple, disparate information sources. ...
Current information retrieval interfaces only address a small part of the reality of rich interactions amongst user, task, and information sources. ...
Acknowledgements: A number of our colleagues have contributed to this work in various ways. ...
doi:10.1145/168642.168645
dblp:conf/uist/RaoRM93
fatcat:gklclcoqhndojmlfoy4cjtey34
A Survey of Historical Document Image Datasets
[article]
2022
arXiv
pre-print
We present the statistics, document type, language, tasks, input visual aspects, and ground truth information for every dataset. ...
This paper presents a systematic literature review of image datasets for document image analysis, focusing on historical documents, such as handwritten manuscripts and early prints. ...
Conclusion: We presented a survey of historical document image datasets following a systematic literature review methodology. ...
arXiv:2203.08504v2
fatcat:ilgqqgylfzejnpccrsg7vfsncm