Implicit Semantic Relations Identification through Distributed Representations for Effective Text Retrieval

Rajendra Prasath
2011 DESIDOC Journal of Library & Information Technology  
This paper discusses the use of a distributed representation, namely random indexing, for effective retrieval of relevant text documents.  ...  Modern commercial search methods have proved the efficiency of information retrieval (IR) technologies and made knowledge dissemination simpler, whereas finding the relevant text documents, given a  ...  Therefore, the higher the normalisation factor for a document, the lower the chances of retrieval of that document [14].  ... 
doi:10.14429/djlit.31.4.1104 fatcat:j53tsw7davdfjerfxlrlxpo4ve

Enhanced E-recruitment using Semantic Retrieval of Modeled Serialized Documents

Alaba T. Owoseni, Olatunbosun Olabode, B. A. Ojokoh
2017 International Journal of Mathematical Sciences and Computing  
This paper presents a 3-tier system that models serialized documents of the applicants' worth, which are analyzed using document retrieval and natural language processing techniques for a human-like  ...  Owoseni is a member of the International Association of Engineers and a few of its societies and is currently awaiting his approval as a member of the Nigeria Computer Society.  ...  Ojokoh, for their support during the course of preparing this work and to others that might have in one way or the other contributed to this work.  ... 
doi:10.5815/ijmsc.2017.01.01 fatcat:t7myjzef5zbptly3fakmtqz2ne

Natural Language Information Retrieval: TREC-8 Report

Tomek Strzalkowski, Jose Perez Carballo, Jussi Karlgren, Anette Hulth, Pasi Tapanainen, Timo Lahtinen
1999 Text Retrieval Conference  
As in previous years, we performed a full linguistic analysis of the entire corpus, and used the results of the analysis to provide index terms on a higher level of abstraction than can be provided by  ...  We made use of two different query expansion techniques, one automatic and one manual, both developed for TREC-8. 3.  ...  The topic expansion interaction proceeds as follows: 1. The initial natural language topic statement is submitted to a standard retrieval engine via a Query Expansion Tool (QET) interface.  ... 
dblp:conf/trec/StrzalkowskiCKHTL99 fatcat:wimymfecbjepvlvhaxiiaaaxnu

Page 149 of Journal of Research and Practice in Information Technology Vol. 13, Issue 4 [page]

1981 Journal of Research and Practice in Information Technology  
For example, IDMS (Kroenke, 1977) has a data dictionary system which provides cross-reference documentation about sets, records, data items, subschemata and user programs.  ...  The languages providing data dictionary facilities are listed in Figure 7. 3.4 Simplicity: A query should be expressible in a form as simple as possible; thus users should not be required to learn pro-  ... 

A Text Similarity Approach for Precedence Retrieval from Legal Documents

D. Thenmozhi, Kawshik Kannan, Chandrabose Aravindan
2017 Forum for Information Retrieval Evaluation  
In this paper, we propose a text similarity approach for precedence retrieval to retrieve older cases that are similar to a given case from a set of legal documents.  ...  Precedence retrieval of legal documents is an information retrieval task to retrieve prior case documents that are related to a given case document.  ...  ACKNOWLEDGMENTS We would like to thank the management of SSN Institutions for funding the High Performance Computing (HPC) lab where this work is being carried out.  ... 
dblp:conf/fire/ThenmozhiKA17 fatcat:wj5l65oyajekpb6ndexocip7ny

Structured document handling---a case for integrating databases and information retrieval

Klemens Böhm, Adrian Müller, Erich Neuhold
1994 Proceedings of the third international conference on Information and knowledge management - CIKM '94  
It will be shown that storage and retrieval of such documents will best be handled by an integration of database and information retrieval technologies.  ...  logic-based models of information retrieval to truly combine structure and content information about the documents in question.  ...  Acknowledgements We would like to thank Karl Aberer and Ulrich Thiel for their comments and suggestions, which have greatly improved the quality of this work.  ... 
doi:10.1145/191246.191271 dblp:conf/cikm/BohmMN94 fatcat:ylxhegh4svg7nomd6eluikf5di

A word spotting framework for historical machine-printed documents

A. L. Kesidis, E. Galiotou, B. Gatos, I. Pratikakis
2010 International Journal on Document Analysis and Recognition  
In this paper, we propose a word spotting framework for accessing the content of historical machine-printed documents without the use of an optical character recognition engine.  ...  morphological generator that enables searching in documents using only a base word-form for locating all the corresponding inflected word-forms and a synonym dictionary that further facilitates  ...  Acknowledgments The research leading to these results has received funding from the Greek Ministry of Research funded R&D (POLY-TIMO project) as well as from the European Community's Seventh Framework  ... 
doi:10.1007/s10032-010-0134-4 fatcat:2vqu3k6qjzbclagqmebyszmt4y

Hierarchical Neural Language Models for Joint Representation of Streaming Documents and their Content

Nemanja Djuric, Hao Wu, Vladan Radosavljevic, Mihajlo Grbovic, Narayan Bhamidipati
2015 Proceedings of the 24th International Conference on World Wide Web - WWW '15  
The documents are represented as low-dimensional vectors and are jointly learned with distributed vector representations of word tokens using a hierarchical framework with two embedded neural language  ...  The models learn continuous vector representations for both word tokens and documents such that semantically similar documents and words are close in a common vector space.  ...  for similar keywords to expand the query (useful in the search product); 2) given a keyword, search for relevant documents such as news stories (useful in document retrieval); 3) given a document, retrieve  ... 
doi:10.1145/2736277.2741643 dblp:conf/www/DjuricWRGB15 fatcat:ikxtmpjscbennohgqm5w46tqvu

Image tag clarity

Aixin Sun, Sourav S. Bhowmick
2009 Proceedings of the first SIGMM workshop on Social media - WSM '09  
Tags associated with images in various social media sharing web sites are a valuable information source for superior image retrieval experiences.  ...  It is measured by computing the zero-mean normalized distance between the tag language model estimated from the images annotated by the tag and the collection language model.  ...  This is a challenging issue for the following reason. In textual documents, keywords in a query literally appear in the retrieved documents.  ... 
doi:10.1145/1631144.1631150 dblp:conf/mm/SunB09 fatcat:qn42ucvgbzgkdjq34ghrxsqdkq

Accessing the content of Greek historical documents

Anastasios Kesidis, Eleni Galiotou, Basilis Gatos, Aristomenis Lampropoulos, Ioannis Pratikakis, Ioanna Manolessou, Angela Ralli
2009 Proceedings of The Third Workshop on Analytics for Noisy Unstructured Text Data - AND '09  
In order to improve the efficiency of accessing and searching, we have used natural language processing techniques that comprise (i) a morphological generator for early Modern Greek which provides the  ...  In this paper, we propose an alternative method for accessing the content of Greek historical documents printed during the 17th and 18th centuries by searching words directly in digitized documents based  ...  The proposed workflow for keyword searching in historical printed documents is presented in the sequel.  ... 
doi:10.1145/1568296.1568307 dblp:conf/and/KesidisGGLPMR09 fatcat:vr35uojehbazlhadhx5wdyuauy

Movie recommender systems using hybrid model based on graphs with co-rated, genre, and closed caption features

Putra Pandu Adikara, Yuita Arum Sari, Sigit Adinugroho, Budi Darma Setiawan
2021 Register: Jurnal Ilmiah Teknologi Sistem Informasi  
as sequel or prequel.  ...  This situation is not good for the movie business too.  ...  A movie with similar content to the query has a higher rank such as sequel or prequel movies.  ... 
doi:10.26594/register.v7i1.2081 fatcat:zlcvqvzrbrhozeow2n5e67snra

Library Classification in Computer Age

P. Dhyani
1999 DESIDOC Bulletin of Information Technology  
Dewey pioneered in devising a scheme of classification for the documentation utility of organised knowledge.  ...  This paper attempts to delve into the state of the art of library classification in the new computer age.  ...  To them the use of such a relational structure may even be able to provide the basis for a common retrieval language as suggested by Ranganathan.  ... 
doi:10.14429/dbit.19.3.3484 fatcat:smwwouxyxbat3ibkyu32jdw6dm

Plagiarism Detection Based on Citing Sentences [chapter]

Sidik Soleman, Atsushi Fujii
2017 Lecture Notes in Computer Science  
For the English language, ParaMaker is examined against six known methods with standard PAN2014 datasets.  ...  In the Persian language, statements of suspicious documents are examined compared to an exact search approach.  ...  An existing solution for this problem is to use the resource retrieval techniques that apply search engines to retrieve the potential sources of plagiarism for a suspicious document.  ... 
doi:10.1007/978-3-319-67008-9_38 fatcat:vjf67csttbg4fch7rra427xh5u

SQL multimedia and application packages (SQL/MM)

Jim Melton, Andrew Eisenberg
2001 SIGMOD record  
have been formally adopted as American National Standards, they will be available at the NCITS web store for very reasonable prices.  ...  a specification for a language called SFQL (Structured Full-text Query Language).  ...  We could retrieve from that table the identifier of documents about full-text searching that contain words closely related to "standard" in the same paragraph as words that sound like "sequel" by using  ... 
doi:10.1145/604264.604280 fatcat:mdey5kgbhfdevefqehd7jizsim

Stemmers for Tamil Language: Performance Analysis [article]

M.Thangarasu, R.Manavalan
2013 arXiv   pre-print
The experimental results clearly show that the proposed light stemmer for the Tamil language performs better than the suffix removal stemmer and is also more effective in an Information Retrieval System (IRS)  ...  The performance of the proposed approach is compared to a rule-based suffix removal stemmer based on correctly and incorrectly predicted.  ...  likely wants to retrieve documents containing the terms searching (தேடும்) and searched (தேடிய) etc. as well.  ... 
arXiv:1310.0754v1 fatcat:e7u276twnfahjcsvkf5wbxyg6q