A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2011; you can also visit the original URL.
The file type is application/pdf
.
Filters
Extraction of complex index terms in non-English IR: A shallow parsing based approach
2008
Information Processing & Management
Such dependencies are obtained through a shallow parser based on cascades of finite-state transducers in order to reduce as far as possible the overhead due to this parsing process. ...
index terms. ...
After a short period as a lecturer in the Univ. of Vigo (Spain), in 2005 he obtained a PhD. in Computer Science from the Univ. of A Coruña. He is currently an assistant professor at this university. ...
doi:10.1016/j.ipm.2007.12.005
fatcat:nwpsidwtlrfjnc7ymjywyrwn2y
Managing syntactic variation in text retrieval
2005
Proceedings of the 2005 ACM symposium on Document engineering - DocEng '05
The use of Natural Language Processing techniques to manage this problem has been studied for a long time, but mainly focusing on English. ...
In this paper we deal with European languages, taking Spanish as a case in point. ...
Shallow parsing has shown itself to be useful in several NLP application fields, particularly in Information Extraction [7] , although its application in IR has not yet been studied in depth. ...
doi:10.1145/1096601.1096643
dblp:conf/doceng/VilaresGA05
fatcat:z6khtwza25ca3pqnhayivefyau
Morphological and Syntactic Processing for Text Retrieval
[chapter]
2004
Lecture Notes in Computer Science
This article describes the application of lemmatization and shallow parsing as a linguistically-based alternative to stemming in Text Retrieval, with the aim of managing linguistic variation at both word ...
Several alternatives for selecting the index terms among the syntactic dependencies detected by the parser are evaluated. ...
The other two approaches employ shallow parsing to manage syntactic variation by using syntactic dependencies as complex index terms. ...
doi:10.1007/978-3-540-30075-5_36
fatcat:b6xn6age4rfflainp2flj6qlqi
COLE Experiments at CLEF 2003 in the Spanish Monolingual Track
[chapter]
2004
Lecture Notes in Computer Science
This process is performed in two steps: firstly, the text is parsed by means of a shallow parser and, secondly, the syntactic dependencies are extracted and conflated into index terms. ...
For this purpose, we will extract the pairs of words related through syntactic dependencies in order to use them as complex index terms. ...
Acknowledgements The research described in this paper has been supported in part by Ministerio de Ciencia y Tecnología (TIC2000-0370-C02-01, HP2001-0044 and HF2002-81), FPU grants of Secretaría de Estado ...
doi:10.1007/978-3-540-30222-3_33
fatcat:4nkb2qf4kredriljtbbykhfr64
Noun-Phrase Analysis in Unrestricted Text for Information Retrieval
[article]
1996
arXiv
pre-print
In particular, we describe a hybrid approach to the extraction of meaningful (continuous or discontinuous) subcompounds from complex noun phrases using both corpus statistics and linguistic heuristics. ...
Results of experiments show that indexing based on such extracted subcompounds improves both recall and precision in an information retrieval system. ...
The evaluation of the experimental results would have been impossible without the help of Robert Lefferts and Nataša Milić-Frayling at CLARITECH Corporation. ...
arXiv:cmp-lg/9605019v1
fatcat:dirzxqywgjgoxihkvplbchy5py
Noun-phrase analysis in unrestricted text for information retrieval
1996
Proceedings of the 34th annual meeting on Association for Computational Linguistics -
In particular, we describe a hybrid approach to the extraction of meaningful (continuous or discontinuous) subcompounds from complex noun phrases using both corpus statistics and linguistic heuristics. ...
Results of experiments show that indexing based on such extracted subcompounds improves both recall and precision in an information retrieval system. ...
The evaluation of the experimental results would have been impossible without the help of Robert Lefferts and Nata~a Mili4-Frayling at CLARITECH Corporation. ...
doi:10.3115/981863.981866
dblp:conf/acl/EvansZ96
fatcat:bw7dkol5o5hizgup5sizccdmqi
Efficient Question Answering with Question Decomposition and Multiple Answer Streams
[chapter]
2009
Lecture Notes in Computer Science
IRSAW was introduced in 2007, by integrating the deep answer producer InSicht, several shallow answer producers, and a logical validator. ...
Using RAVE for merging the results of the answer producers, monolingual German runs and bilingual runs with source language English and Spanish were produced by applying a machine translation web service ...
The actual extraction performance achieved by the answer producers of the shallow subsystem of IR-SAW has also been investigated, see Table 3 . ...
doi:10.1007/978-3-642-04447-2_49
fatcat:6grdchnn7va4vl5k7zoo2ncsly
On the Usefulness of Extracting Syntactic Dependencies for Text Indexing
[chapter]
2002
Lecture Notes in Computer Science
In this paper we study the impact of using such information, in the form of syntactic dependency pairs, in the performance of a text retrieval system for a Romance language, Spanish. ...
In particular, different degrees of phrase-level syntactic information have been incorporated in information retrieval systems working on English or Germanic languages such as Dutch. ...
The research reported in this article has been supported in part by Plan Nacional de Investigación Científica, Desarrollo e Innovación Tecnológica (Grant TIC2000-0370-C02-01), Ministerio de Ciencia y Tecnología ...
doi:10.1007/3-540-45750-x_1
fatcat:lmvop4sbazgyzgc6p3d4wlbl6m
Information Extraction: Past, Present and Future
[chapter]
2012
Multi-source, Multilingual Information Extraction and Summarization
In this chapter we present a brief overview of Information Extraction, which is an area of natural language processing that deals with finding factual information in free text. ...
In formal terms, facts are structured objects, such as database records. ...
[58] describes an approach to segmentation and classification of a wider range of names in tweets based on CRFs (using POS and shallow parsing features) and Labeled Latent Dirichlet Allocation respectively ...
doi:10.1007/978-3-642-28569-1_2
dblp:series/tanlp/PiskorskiY13
fatcat:aoc7stoinzf6jc2dengl5ltwte
Grammatical Relation Extraction in Arabic Language
2012
Journal of Computer Science
Conclusion: The main achievement of this study is development of Arabic grammatical relation extractions based ob rule-based approaches. ...
Approach: We had proposed a rule based production method to recognize Grammatical Relations (GRs), as the rule-based approach had been successfully used in developing many natural language processing systems ...
Synthesis and Recognition (SSR), Machine Translation (MT), Index Term Generation (ITG), Rule-Based approaches are witnessing a renewed interest in NLP applications in an attempt to solve common problems ...
doi:10.3844/jcssp.2012.891.898
fatcat:nmifgtufbnghrgbfuvycplqjua
The system's primary source of knowledge is a collection of Arabic newspaper text extracted from Al-Raya, a newspaper published in Qatar. ...
We are tackling this problem for Arabic using traditional Information Retrieval (IR) techniques coupled with a sophisticated Natural Language Processing (NLP) approach. ...
Among these techniques are: part-of-speech tagging, shallow parsing, query type identification and named entity recognition. ...
doi:10.3115/1118637.1118644
dblp:conf/acl-semitic/HammoALE02
fatcat:75oieazukbbppahtnrlmzqzeam
A Survey of Text Question Answering Techniques
2012
International Journal of Computer Applications
QA systems give the ability to answer questions posed in natural language by extracting, from a repository of documents, fragments of documents that contain material relevant to the answer. ...
Question classification play primary role in QA system to categorize the question based upon on the type of its entity. ...
IR systems are usually based on the segmentation of documents and queries into index terms, and their relevance is computed according to the index terms they have in common, as well as according to other ...
doi:10.5120/8406-2030
fatcat:pzhcq4v44rhltjsnesw7cqan24
Is question answering an acquired skill?
2004
Proceedings of the 13th conference on World Wide Web - WWW '04
Compare the QA situation with IR engines, which are largely based on the now-standard vector-space model and TFIDF ranking, a declarative specification of what is a good matching document. ...
We built our system in only a few person-months using offthe-shelf components: a part-of-speech tagger, a shallow parser, a lexical network, and a few well-known supervised learning algorithms. ...
Acknowledgments: This research was supported in part by Tata Consultancy Services, IBM Research, and NEC Research. ...
doi:10.1145/988672.988688
dblp:conf/www/RamakrishnanCPB04
fatcat:udcmmrk7efb4dpndbyh6kgt5pq
Classification of Natural Language Processing Techniques for Requirements Engineering
[article]
2022
arXiv
pre-print
However, in spite of the progress, our recent survey shows that there is still a lack of systematic understanding and organization of commonly used NLP techniques in RE. ...
We believe these two ways of classification are complementary, contributing to a better understanding of the NLP techniques in RE and such understanding is crucial to the development of better NLP tools ...
Related Term: Semantic parsing, semantic trees, shallow parsing, and shallow semantic analysis. ...
arXiv:2204.04282v1
fatcat:i2xxczzl7veuxohe25x3anujha
An Initial Proposal for Cooperative Evaluation on Information Retrieval in Portuguese
[chapter]
2003
Lecture Notes in Computer Science
In this paper we discuss evaluation of information retrieval, Web search and question answering systems, paving the way for the organization of an evaluation contest on IR for Portuguese. ...
Inspired by current international setups, we motivate the need to study the specific problems posed by Portuguese, suggesting a collection suitable for multiple tasks. ...
For researchers of the first profile, it is advantageous to identify the challenges posed by IR in Portuguese, especially where an English-based architecture will be likely to miss the point. ...
doi:10.1007/3-540-45011-4_36
fatcat:4etymvhfhjbqreu4xbh3pjmv2m
« Previous
Showing results 1 — 15 out of 677 results