West Group at CLEF 2000: Non-English Monolingual Retrieval [chapter]

Isabelle Moulinier, J. Andrew McCulloh, Elizabeth Lund
2001 Lecture Notes in Computer Science  
Our primary interest was to investigate whether retrieval of German or French documents was any different from the retrieval of English documents.  ...  We focused on two aspects: stemming for both languages and compound breaking for German, and studied several query formulations to take advantage of compounds.  ...  We compare performing no stemming, stemming using an inflectional morphological analyzer, and stemming using a rule-based algorithm similar to Porter's English stemmer.  ... 
doi:10.1007/3-540-44645-1_25 fatcat:3qsaum5ndbh5bfdyxfic3emcdy

The performance evaluation of a knowledge-based information retrieval system

Dongwook Shin, Yongun Yoon, Keysun Choi
1993 Microprocessing and Microprogramming  
HYKIS (HYbrid Knowledge-based Information retrieval System) is a knowledge-based information retrieval system that allows a naive user to retrieve relevant documents effectively, without any help of a  ...  According to an experiment with the CACM data set, it turns out that HYKIS achieves much higher recall and precision rates, compared to a thesaurus-based retrieval system.  ...  We also notice that poor stemming and single-term indexing prevent a retrieval system from finding proper documents. In a query q6, HYKIS produces low retrieval rate.  ... 
doi:10.1016/0165-6074(93)90082-v fatcat:ocoohup6ujepzoqk73yu7kxgfu

Extended Semantic based Boolean Information Retrieval Algorithm for User-driven Query

Vachhani Upama, Prof. S. M. Shah
2015 International Journal of Engineering Research and  
An existing SBIR algorithm uses the lexical database WordNet to find synonyms of a single-word query term, considering that the absence of the given term in a document does not necessarily mean that the document  ...  is not relevant. In this paper, a new algorithm is proposed which works with compound terms and uses a modified Porter Stemming Algorithm to solve some stemming errors found in the Porter Stemmer Algorithm  ...  Hence, a new rule has been added for words ending with "ise" to solve the above under-stemming error.  ... 
doi:10.17577/ijertv4is050514 fatcat:v67cgbliojdttk556ed5etojoq

AN EFFICIENT INFORMATION RETRIEVAL ONTOLOGY SYSTEM BASED INDEXING FOR CONTEXT

G. Krishna Raju
2015 International Journal of Research in Engineering and Technology  
It is very difficult to retrieve the relevant context in its original format since we use the pre-processing step, which helps to retrieve context.  ...  In this paper, we utilize the WordNet ontology to retrieve the relevant contexts from the document repository.  ...  Performance Analysis using Evaluation Metrics The performance of the proposed document retrieval system is evaluated based on the input query keywords to the WordNet ontology using the Precision, recall  ... 
doi:10.15623/ijret.2015.0405082 fatcat:5htjg7mgifdy3kk3m56aesrixy

Comparing Crowd-Based, Game-Based, and Machine-Based Approaches in Initial Query and Query Refinement Tasks [chapter]

Christopher G. Harris, Padmini Srinivasan
2013 Lecture Notes in Computer Science  
Traditional web interface users are provided feedback on their initial queries and asked to use this information to reformulate their original queries.  ...  Game interface users are provided with instant scoring and asked to refine their queries based on their scores.  ...  We anticipate additional work to examine what aspects of games can improve initial query and query refinement performance and look at how this can be integrated to make the user experience in search more  ... 
doi:10.1007/978-3-642-36973-5_42 fatcat:btnuqmcw35c67fopt2wns6k3qi

SS4MCT: A Statistical Stemmer for Morphologically Complex Texts [article]

Javid Dadashkarimi, Hossein Nasr Esfahani, Heshaam Faili, Azadeh Shakery
2016 arXiv   pre-print
These rules are used to statistically stem words and can be used in different text mining tasks.  ...  There have been multiple attempts to resolve various inflection matching problems in information retrieval. Stemming is a common approach to this end.  ...  Therefore, in dictionary-based CLIR, retrieval systems are obliged either to stem documents and queries, or to leave them intact [8, 4, 12], or expand the query with inflections.  ... 
arXiv:1605.07852v2 fatcat:auh37y2445exrcly5balol2zt4

Two-Stage Refinement of Transitive Query Translation with English Disambiguation for Cross-Language Information Retrieval: An Experiment at CLEF 2004 [chapter]

Kazuaki Kishida, Noriko Kando, Kuang-Hua Chen
2005 Lecture Notes in Computer Science  
Thus transitive translation of queries using English as a pivot language was used to search French document collections for German queries without any direct bilingual dictionary or MT system of these  ...  This paper reports experimental results of cross-language information retrieval (CLIR) from German to French.  ...  with the same procedure applied to texts of documents and queries.  ... 
doi:10.1007/11519645_13 fatcat:4irdm2uprrabtef2rpmix2yx2m

A Hybrid Browsing Mechanism Using Conceptual Scales [chapter]

Mihye Kim, Paul Compton
2006 Lecture Notes in Computer Science  
We are developing an approach to domain specific information retrieval that makes much greater use of domain expert knowledge.  ...  Domain-specific information retrieval normally depends on general search engines which make no use of domain knowledge and require a user to look at a linear display of loosely organised search results  ...  When a query is entered, stopwords are first eliminated and the remaining query stemmed using the stemming classes. Next, the system decides whether the query exists in the set of keywords.  ... 
doi:10.1007/11961239_12 fatcat:4pkccgfrsbeajp4jj6p3qxf3ja

Indexing the Indonesian Web: Language Identification and Miscellaneous Issues

Vinsensius Berlian Vega SN, Stéphane Bressan
2001 The Web Conference  
Information retrieval tools and search engines have mainly been leveraging research results and technologies developed for the English language.  ...  The results include original contributions such as a grammar for stemming Indonesian words and a self-improving language identification algorithm.  ...  The resulting terms are used to index the reference of the document (URL). Queries are processed similarly.  ... 
dblp:conf/www/SNB01 fatcat:z4wvkp6evbcldncdemrda5dvc4

Thomson Legal and Regulatory at CLEF 2001: Monolingual and Bilingual Experiments [chapter]

Hugo Molina-Salgado, Isabelle Moulinier, Mark Knudson, Elizabeth Lund, Kirat Sekhon
2002 Lecture Notes in Computer Science  
Our monolingual runs for Dutch, Spanish and Italian use settings and rules derived from our runs in French and German last year.  ...  Our bilingual runs compared merging strategies for query translation resources.  ...  WIN has also been modified to support non-English document retrieval. This included localization of tokenization rules (for instance, handling elision for French and Italian) and stemming.  ... 
doi:10.1007/3-540-45691-0_20 fatcat:7dys27zixrf2lawffgeqzvdvy4

Exploring Automatic Query Refinement for Text-Based Video Retrieval

Timo Volkmer, Apostol Natsev
2006 2006 IEEE International Conference on Multimedia and Expo  
We evaluate these approaches in the context of the TRECVID 2005 Video Retrieval Benchmark using a baseline approach without any refinement.  ...  In this paper, we explore several automatic query refinement methods to address these issues.  ...  With pseudo-relevance feedback, for example, the original query is used to retrieve the top N matching documents.  ... 
doi:10.1109/icme.2006.262951 dblp:conf/icmcs/VolkmerN06 fatcat:ir7rhu3bwzfcvholnt3nj24lj4

TRECVID 2010 Known-item Search (KIS) Task by I2R

Lekha Chaisorn, Kong-Wah Wan, Yan-Tao Zheng, Yongwei Zhu, Tian-Shiang Kok, Hui Li Tan, Zixiang Fu, Susanna Bolling
2010 TREC Video Retrieval Evaluation  
By collecting a number of relevant videos, the searchers can perform relevance feedback to refine the retrieval and continue the search.  ...  Locating the unique video for a query, however, poses new challenges over existing information retrieval approaches.  ...  For a given query, we use a heuristic rule-based approach to determine which HLF concepts are relevant.  ... 
dblp:conf/trecvid/ChaisornWZZKTFB10 fatcat:orpcfvvcmnbk3dxg5cwdwsmijy

Oromo-English Information Retrieval Experiments at CLEF 2007

Kula Kekeba Tune, Vasudeva Varma
2007 Conference and Labs of the Evaluation Forum  
The experiments differ from one another in terms of topic fields used for query construction and the application of stemmer for normalization of query terms.  ...  In this paper we describe our Oromo-English retrieval experiments that we have conducted at IIIT-Hyderabad (India) and submitted to the ad hoc retrieval task of CLEF 2007.  ...  We feel these relatively good improvements are due to the enhancement of our lexical resources and refinements of the rules of our stemming algorithm.  ... 
dblp:conf/clef/TuneV07 fatcat:ci54cbtxynhlhngblsesoxukua

A Survey on Cross Language Information Retrieval

Monika Sharma, Sudha Morwal
2015 IJARCCE  
CLIR can be used to enhance the ability of users to search and retrieve documents in many languages.  ...  Cross language information retrieval (CLIR), whose goal is to find relevant information written in a language different from the language of query.  ...  For example, the stemming rules for the word "see" might return just "s" by stemming and "see" or "saw" by lemmatization [14]. 3 Using the dictionary-based translation is a traditional approach in cross-lingual  ... 
doi:10.17148/ijarcce.2015.4287 fatcat:4345kzo2xfaivaea7qnankkpqi

Corpus-Based Arabic Stemming Using N-Grams [chapter]

Abdelaziz Zitouni, Asma Damankesh, Foroogh Barakati, Maha Atari, Mohamed Watfa, Farhad Oroumchian
2010 Lecture Notes in Computer Science  
The experiments show that 3-gram stemming using the dice distance for clustering and the EM similarity measure for refinement performs better than using no stemming; but slightly worse than Light-10 stemmer  ...  We propose a change in the corpus-based stemming approach proposed by Xu and Croft for English and Spanish languages in order to stem Arabic words.  ...  Stemming improves the information retrieval by reducing the word mismatch between the query and the document. This will result in returning more relevant documents to the query.  ... 
doi:10.1007/978-3-642-17187-1_27 fatcat:kpovw73hcjhltnnxstc4bwxwky