Filters








638 Hits in 5.3 sec

Using Syntactic Dependency-Pairs Conflation to Improve Retrieval Performance in Spanish [chapter]

Jesús Vilares, Fco. Mario Barcala, Miguel A. Alonso
2000 Lecture Notes in Computer Science  
At sentence level, an approximate grammar is used to conflate syntactic and morphosyntactic variants of a given multi-word term into a common base form.  ...  At word level, productive derivational morphology is used to conflate semantically related words.  ...  FNL: multi-word term conflation via syntactic dependency-pairs and lemmatization. FNF : multi-word term conflation via syntactic dependency-pairs and morphological families.  ... 
doi:10.1007/3-540-45715-1_40 fatcat:6glnqt5t6fb3hfxmechbwwx6di

Managing syntactic variation in text retrieval

Jesús Vilares, Carlos Gómez-Rodríguez, Miguel A. Alonso
2005 Proceedings of the 2005 ACM symposium on Document engineering - DocEng '05  
Two different sources of syntactic information, queries and documents, are studied in order to increase the performance of Information Retrieval systems.  ...  In this paper we deal with European languages, taking Spanish as a case in point.  ...  CONCLUSIONS Throughout this article we have studied the use of syntactic dependencies as complex index terms in an attempt to improve the performance of Information Retrieval systems by, on the one hand  ... 
doi:10.1145/1096601.1096643 dblp:conf/doceng/VilaresGA05 fatcat:z6khtwza25ca3pqnhayivefyau

On the Usefulness of Extracting Syntactic Dependencies for Text Indexing [chapter]

Miguel A. Alonso, Jesús Vilares, Víctor M. Darriba
2002 Lecture Notes in Computer Science  
In this paper we study the impact of using such information, in the form of syntactic dependency pairs, in the performance of a text retrieval system for a Romance language, Spanish.  ...  analysis to syntactic parsing to conceptual-level semantic analysis.  ...  The research reported in this article has been supported in part by Plan Nacional de Investigación Científica, Desarrollo e Innovación Tecnológica (Grant TIC2000-0370-C02-01), Ministerio de Ciencia y Tecnología  ... 
doi:10.1007/3-540-45750-x_1 fatcat:lmvop4sbazgyzgc6p3d4wlbl6m

COLE Experiments in the CLEF 2002 Spanish Monolingual Track [chapter]

Jesús Vilares, Miguel A. Alonso, Francisco J. Ribadas, Manuel Vilares
2003 Lecture Notes in Computer Science  
with syntactic dependencies to deal with the syntactic content of the document.  ...  In this our first participation in CLEF, we applied Natural Language Processing techniques for single word and multiword term conflation.  ...  Acknowledgements The research reported in this article has been supported in part by Plan Nacional de Investigación Científica, Desarrollo e Innovación Tecnológica (grant TIC2000-0370-C02-01), Ministerio  ... 
doi:10.1007/978-3-540-45237-9_22 fatcat:rvesai4nwzfztbg2ulp6wawgga

Towards the Development of Heuristics for Automatic Query Expansion [chapter]

Jesüs Vilares, Manuel Vilares, Miguel A. Alonso
2001 Lecture Notes in Computer Science  
In this paper we study the performance of linguisticallymotivated conflation techniques for Information Retrieval in Spanish.  ...  In particular, we have studied the application of productive derivational morphology for single word term conflation and the extraction of syntactic dependency pairs for multi-word term conflation.  ...  Firstly, we identify relations between pairs of content words inside the multi-word term, to conflate it into syntactic dependency-pairs.  ... 
doi:10.1007/3-540-44759-8_86 fatcat:axl7yjyrh5erpm4v2hvtujifwm

COLE Experiments at CLEF 2003 in the Spanish Monolingual Track [chapter]

Jesús Vilares, Miguel A. Alonso, Francisco J. Ribadas
2004 Lecture Notes in Computer Science  
For this purpose, we will extract the pairs of words related through syntactic dependencies in order to use them as complex index terms.  ...  This process is performed in two steps: firstly, the text is parsed by means of a shallow parser and, secondly, the syntactic dependencies are extracted and conflated into index terms.  ...  The authors also would like to thank Darrin Dimmick, from NIST, for giving us the opportunity to use the ZPrise system, and Fernando Martínez, from Universidad de Jaén, for helping us to make it operative  ... 
doi:10.1007/978-3-540-30222-3_33 fatcat:4nkb2qf4kredriljtbbykhfr64

Morphological and Syntactic Processing for Text Retrieval [chapter]

Jesús Vilares, Miguel A. Alonso, Manuel Vilares
2004 Lecture Notes in Computer Science  
Though this article focuses on Spanish, this approach is extensible to other languages by simply adapting the grammar used by the parser.  ...  This article describes the application of lemmatization and shallow parsing as a linguistically-based alternative to stemming in Text Retrieval, with the aim of managing linguistic variation at both word  ...  In our case, we have used a conflation technique based on the employment of morphological relations in order to improve the management of syntactic variation [18] .  ... 
doi:10.1007/978-3-540-30075-5_36 fatcat:b6xn6age4rfflainp2flj6qlqi

Practical NLP-Based Text Indexing [chapter]

Jésus Vilares, F. Mario Barcala, Miguel A. Alonso, Jorge Graña, Manuel Vilares
2002 Lecture Notes in Computer Science  
to extract syntactic-dependency pairs.  ...  We propose to use these techniques in order to improve the performance of standard indexing engines.  ...  Acknowledgements Supported in part by Plan Nacional de Investigación Científica, Desarrollo e Innovación Tecnológica (TIC2000-0370-C02-01), Ministerio de Ciencia y Tecnología (HP2001-0044) and Xunta de  ... 
doi:10.1007/3-540-36131-6_65 fatcat:im34iy5qmjb67lh36h2jxtfo5u

Extraction of complex index terms in non-English IR: A shallow parsing based approach

Jesús Vilares, Miguel A. Alonso, Manuel Vilares
2008 Information Processing & Management  
We propose the use of syntactic dependencies as complex index terms in an attempt to solve the problems deriving from both syntactic and morpho-syntactic variation and, in this way, to obtain more precise  ...  The performance of Information Retrieval systems is limited by the linguistic variation present in natural language texts.  ...  His research work focuses on Natural Language Processing, Information Retrieval and Extraction and Question Answering.  ... 
doi:10.1016/j.ipm.2007.12.005 fatcat:nwpsidwtlrfjnc7ymjywyrwn2y

MIRACLE's 2005 Approach to Cross-Lingual Question Answering

César de Pablo-Sánchez, Ana González-Ledesma, José Luis Martínez-Fernández, José María Guirao, Paloma Martínez, Antonio Moreno-Sandoval
2005 Conference and Labs of the Evaluation Forum  
This paper presents the 2005 MIRACLE's team approach to CLEF QA with Spanish as a target task using miraQA system.  ...  A thorough error analysis has been carried out to spot critical points for improvement.  ...  Acknowledgements This work has been partially supported by the Spanish R+D National Plan, by means of the project RIMMEL Multilingual and Multimedia Information Retrieval, and its Evaluation), TIN2004-  ... 
dblp:conf/clef/Pablo-SanchezGMGMM05a fatcat:vyky4dlctbbnlmu3jxrw5x5xka

Textual Similarity Measurement Approaches: A Survey (1)

Amira Abo-Elghit, Aya Al-Zoghby, Taher Hamza
2020 The Egyptian Journal of Language Engineering  
Measuring textual similarity tends to have an increasingly important turn in related topics like text classification, recovery of specific information from data, clustering, topic retrieval, subject tracking  ...  Survey research is appropriate and necessary to address certain research question types. This paper aims to provide a general overview of the textual similarity in the literature.  ...  [6] examined the performance of the bigram and trigram techniques in the context of Arabic free text retrieval since the N-grams 9 conflation scheme uses to transform a word into a chain of N-grams.  ... 
doi:10.21608/ejle.2020.42018.1012 fatcat:a2fhtkub7nazlkgzqewqbb7koi

How Effective is Stemming and Decompounding for German Text Retrieval?

Martin Braschler, Bärbel Ripplinger
2004 Information retrieval (Boston)  
Information retrieval systems operating on free text face difficulties when word forms used in the query and documents do not match.  ...  The usual solution is the use of a "stemming component" that reduces related word forms to a common stem.  ...  Acknowledgments We would like to thank IAI for the opportunity to use MPRO.  ... 
doi:10.1023/b:inrt.0000011208.60754.a1 fatcat:efvxo34cxjawrn4l2erhaex3qi

Expansion of multi-word terms for indexing and retrieval using morphology and syntax

Christian Jacquemin, Judith L. Klavans, Evelyne Tzoukermann
1997 Proceedings of the 35th annual meeting on Association for Computational Linguistics -  
This includes a finite-state part of speech tagger, a derivational morphological processor for analysis and generation, and a unificationbased shallow-level parser using transformational rules over syntactic  ...  The contribution of this research is the successful combination of parsing over a seed term list coupled with derivational morphology to achieve greater coverage of multi-word terms for indexing and retrieval  ...  should improve retrieval resuits accordingly. used for the tuning of the metagrammar and a test corpus [AGR] (AGR, 1995) used for evaluation.  ... 
doi:10.3115/976909.979621 dblp:conf/acl/JacqueminKT97 fatcat:ij2a6wxg7fdtfm7pdagrmas4ga

On the usefulness of lexical and syntactic processing in polarity classification of Twitter messages

David Vilares, Miguel A. Alonso, Carlos Gómez-Rodríguez
2015 Journal of the Association for Information Science and Technology  
Empirical results show an improved performance of syntactic approaches over pure lexical models when using large training sets to create a classifier, but this tendency is reversed when small training  ...  This article explores how relating lexical, syntactic and psychometric information can be helpful to perform polarity classification on Spanish tweets.  ...  Acknowledgments Research reported in this article has been partially funded by Ministerio de Economía y Competitividad and FEDER (Grant TIN2010-18552-C03-02) and by Xunta de Galicia (Grants CN2012/008,  ... 
doi:10.1002/asi.23284 fatcat:homckuzndfg35beujkghyn5vvi

The Influence of Context during the Categorization and Discrimination of Spanish and Portuguese Person Names

Zornitsa Kozareva, Sonia Vázquez, Andrés Montoyo
2007 Revista de Procesamiento de Lenguaje Natural (SEPLN)  
The performance of this method ranges from 51 % to 73 % depending on the pair of named entities that have to be disambiguated.  ...  Named Entity Data Set In order to evaluate our method, we have used two languages: Spanish and Portuguese.  ... 
dblp:journals/pdln/KozarevaVM07 fatcat:zr3op46cijhqtcmvzz3xebx2rq
« Previous Showing results 1 — 15 out of 638 results