718 Hits in 7.2 sec

Extraction of complex index terms in non-English IR: A shallow parsing based approach

Jesús Vilares, Miguel A. Alonso, Manuel Vilares
2008 Information Processing & Management  
Such dependencies are obtained through a shallow parser based on cascades of finite-state transducers in order to reduce as far as possible the overhead due to this parsing process.  ...  index terms.  ...  After a short period as a lecturer in the Univ. of Vigo (Spain), in 2005 he obtained a PhD. in Computer Science from the Univ. of A Coruña. He is currently an assistant professor at this university.  ... 
doi:10.1016/j.ipm.2007.12.005 fatcat:nwpsidwtlrfjnc7ymjywyrwn2y

RePaLi Participation to CLEF eHealth IR Challenge 2014: Leveraging Term Variation

Vincent Claveau, Thierry Hamon, Natalia Grabar, Sébastien Le Maguer
2014 Conference and Labs of the Evaluation Forum  
For this first participation, our approach relies on a state-of-theart IR system called Indri, based on statistical language modeling, and on semantic resources.  ...  This paper describes the participation of RePaLi, a team composed with members of IRISA, LIMSI and STL, to the biomedical information retrieval challenge proposed in the framework of CLEF eHealth.  ...  IR model The IR system at the heart of our runs is based on statistical language modeling (LM) as implemented by Indri, a toolkit for LM-based IR [18] .  ... 
dblp:conf/clef/ClaveauHGM14 fatcat:dbni5iuwnzge7dqi6kgrffj2n4

Managing syntactic variation in text retrieval

Jesús Vilares, Carlos Gómez-Rodríguez, Miguel A. Alonso
2005 Proceedings of the 2005 ACM symposium on Document engineering - DocEng '05  
The use of Natural Language Processing techniques to manage this problem has been studied for a long time, but mainly focusing on English.  ...  In this paper we deal with European languages, taking Spanish as a case in point.  ...  Shallow parsing has shown itself to be useful in several NLP application fields, particularly in Information Extraction [7] , although its application in IR has not yet been studied in depth.  ... 
doi:10.1145/1096601.1096643 dblp:conf/doceng/VilaresGA05 fatcat:z6khtwza25ca3pqnhayivefyau

Morphological and Syntactic Processing for Text Retrieval [chapter]

Jesús Vilares, Miguel A. Alonso, Manuel Vilares
2004 Lecture Notes in Computer Science  
This article describes the application of lemmatization and shallow parsing as a linguistically-based alternative to stemming in Text Retrieval, with the aim of managing linguistic variation at both word  ...  Several alternatives for selecting the index terms among the syntactic dependencies detected by the parser are evaluated.  ...  The other two approaches employ shallow parsing to manage syntactic variation by using syntactic dependencies as complex index terms.  ... 
doi:10.1007/978-3-540-30075-5_36 fatcat:b6xn6age4rfflainp2flj6qlqi

COLE Experiments at CLEF 2003 in the Spanish Monolingual Track [chapter]

Jesús Vilares, Miguel A. Alonso, Francisco J. Ribadas
2004 Lecture Notes in Computer Science  
This process is performed in two steps: firstly, the text is parsed by means of a shallow parser and, secondly, the syntactic dependencies are extracted and conflated into index terms.  ...  For this purpose, we will extract the pairs of words related through syntactic dependencies in order to use them as complex index terms.  ...  Acknowledgements The research described in this paper has been supported in part by Ministerio de Ciencia y Tecnología (TIC2000-0370-C02-01, HP2001-0044 and HF2002-81), FPU grants of Secretaría de Estado  ... 
doi:10.1007/978-3-540-30222-3_33 fatcat:4nkb2qf4kredriljtbbykhfr64

Noun-Phrase Analysis in Unrestricted Text for Information Retrieval [article]

David A. Evans, Chengxiang Zhai
1996 arXiv   pre-print
In particular, we describe a hybrid approach to the extraction of meaningful (continuous or discontinuous) subcompounds from complex noun phrases using both corpus statistics and linguistic heuristics.  ...  Results of experiments show that indexing based on such extracted subcompounds improves both recall and precision in an information retrieval system.  ...  The evaluation of the experimental results would have been impossible without the help of Robert Lefferts and Nataša Milić-Frayling at CLARITECH Corporation.  ... 
arXiv:cmp-lg/9605019v1 fatcat:dirzxqywgjgoxihkvplbchy5py

Noun-phrase analysis in unrestricted text for information retrieval

David A. Evans, Chengxiang Zhai
1996 Proceedings of the 34th annual meeting on Association for Computational Linguistics -  
In particular, we describe a hybrid approach to the extraction of meaningful (continuous or discontinuous) subcompounds from complex noun phrases using both corpus statistics and linguistic heuristics.  ...  Results of experiments show that indexing based on such extracted subcompounds improves both recall and precision in an information retrieval system.  ...  The evaluation of the experimental results would have been impossible without the help of Robert Lefferts and Nata~a Mili4-Frayling at CLARITECH Corporation.  ... 
doi:10.3115/981863.981866 dblp:conf/acl/EvansZ96 fatcat:bw7dkol5o5hizgup5sizccdmqi

Towards Supporting Feature Location Using Syntactic Analysis

Jinshui Wang, Xingsi Xue, Shu-Chuan Chu
2016 Journal of Information Hiding and Multimedia Signal Processing  
in software artifacts, and improve the accuracy of IR-based feature location methods.  ...  In particular, the proposed approach firstly analyzes how terms have been used in the text through syntactic analysis.  ...  Chunking recognition, also known as shallow parsing or partial parsing, is used to identify the constituents of a sentence and generate partial (shallow) analysis of sentences rather than a full parse.  ... 
dblp:journals/jihmsp/WangXC16 fatcat:2gvtcvidqzaovhiyx3nof3o5ri

Natural Language Interface Using Shallow Parsing

Rajendra Akerkar, Manish Joshi
2008 International Journal of Computer Science and Applications  
In this paper, we will present rules to tackle linguistic phenomena using shallow parsing and discuss advantages of a novel Natural Language Interface comprising of shallow parsing based algorithms in  ...  Experimental results show that this approach can analyze a wide range of questions with high accuracy and produce reasonable textual responses.  ...  Lexicons are utilized in IR system to ensure that a common vocabulary is used in selecting appropriate indexing or searching terms / phrases.  ... 
dblp:journals/ijcsa/AkerkarJ08 fatcat:bj42fakiajco3p66pbxn7etiby

Efficient Question Answering with Question Decomposition and Multiple Answer Streams [chapter]

Sven Hartrumpf, Ingo Glöckner, Johannes Leveling
2009 Lecture Notes in Computer Science  
IRSAW was introduced in 2007, by integrating the deep answer producer InSicht, several shallow answer producers, and a logical validator.  ...  Using RAVE for merging the results of the answer producers, monolingual German runs and bilingual runs with source language English and Spanish were produced by applying a machine translation web service  ...  The actual extraction performance achieved by the answer producers of the shallow subsystem of IR-SAW has also been investigated, see Table 3 .  ... 
doi:10.1007/978-3-642-04447-2_49 fatcat:6grdchnn7va4vl5k7zoo2ncsly

On the Usefulness of Extracting Syntactic Dependencies for Text Indexing [chapter]

Miguel A. Alonso, Jesús Vilares, Víctor M. Darriba
2002 Lecture Notes in Computer Science  
In this paper we study the impact of using such information, in the form of syntactic dependency pairs, in the performance of a text retrieval system for a Romance language, Spanish.  ...  In particular, different degrees of phrase-level syntactic information have been incorporated in information retrieval systems working on English or Germanic languages such as Dutch.  ...  The research reported in this article has been supported in part by Plan Nacional de Investigación Científica, Desarrollo e Innovación Tecnológica (Grant TIC2000-0370-C02-01), Ministerio de Ciencia y Tecnología  ... 
doi:10.1007/3-540-45750-x_1 fatcat:lmvop4sbazgyzgc6p3d4wlbl6m

Information Extraction: Past, Present and Future [chapter]

Jakub Piskorski, Roman Yangarber
2012 Multi-source, Multilingual Information Extraction and Summarization  
In this chapter we present a brief overview of Information Extraction, which is an area of natural language processing that deals with finding factual information in free text.  ...  In formal terms, facts are structured objects, such as database records.  ...  [58] describes an approach to segmentation and classification of a wider range of names in tweets based on CRFs (using POS and shallow parsing features) and Labeled Latent Dirichlet Allocation respectively  ... 
doi:10.1007/978-3-642-28569-1_2 dblp:series/tanlp/PiskorskiY13 fatcat:aoc7stoinzf6jc2dengl5ltwte

Grammatical Relation Extraction in Arabic Language

2012 Journal of Computer Science  
Conclusion: The main achievement of this study is development of Arabic grammatical relation extractions based ob rule-based approaches.  ...  Approach: We had proposed a rule based production method to recognize Grammatical Relations (GRs), as the rule-based approach had been successfully used in developing many natural language processing systems  ...  Synthesis and Recognition (SSR), Machine Translation (MT), Index Term Generation (ITG), Rule-Based approaches are witnessing a renewed interest in NLP applications in an attempt to solve common problems  ... 
doi:10.3844/jcssp.2012.891.898 fatcat:nmifgtufbnghrgbfuvycplqjua


Bassam Hammo, Hani Abu-Salem, Steven Lytinen
2002 Proceedings of the ACL-02 workshop on Computational approaches to semitic languages -  
The system's primary source of knowledge is a collection of Arabic newspaper text extracted from Al-Raya, a newspaper published in Qatar.  ...  We are tackling this problem for Arabic using traditional Information Retrieval (IR) techniques coupled with a sophisticated Natural Language Processing (NLP) approach.  ...  Among these techniques are: part-of-speech tagging, shallow parsing, query type identification and named entity recognition.  ... 
doi:10.3115/1118637.1118644 dblp:conf/acl-semitic/HammoALE02 fatcat:75oieazukbbppahtnrlmzqzeam

A Survey of Text Question Answering Techniques

Poonam Gupta, Vishal Gupta
2012 International Journal of Computer Applications  
QA systems give the ability to answer questions posed in natural language by extracting, from a repository of documents, fragments of documents that contain material relevant to the answer.  ...  Question classification play primary role in QA system to categorize the question based upon on the type of its entity.  ...  IR systems are usually based on the segmentation of documents and queries into index terms, and their relevance is computed according to the index terms they have in common, as well as according to other  ... 
doi:10.5120/8406-2030 fatcat:pzhcq4v44rhltjsnesw7cqan24
« Previous Showing results 1 — 15 out of 718 results