Creating and exploiting a comparable corpus in cross-language information retrieval

Tuomas Talvensaari, Jorma Laurikkala, Kalervo Järvelin, Martti Juhola, Heikki Keskustalo
2007 ACM Transactions on Information Systems  
We present a method for creating a comparable text corpus from two document collections in different languages.  ...  The collections can be very different in origin: in this study we build a comparable corpus from articles by a Swedish news agency and a U.S. newspaper.  ...  . & Juhola M. (2007) "Creating and exploiting a comparable corpus in cross-language information retrieval" to appear in ACM Transactions on Information Systems (ACM TOIS) 25 (2007).  ... 
doi:10.1145/1198296.1198300 fatcat:ajomowrcl5agphti32eltqcmfy

Exploiting Parallel Sentences and Cosine Similarity for Identifying Target Language Translation

Vijay Kumar Sharma, Namita Mittal
2016 Procedia Computer Science  
Cross-Language Information Retrieval (CLIR) provides a solution for this language barrier where a user can search the required information in his regional language.  ...  In recent times, the Internet has become a huge information resource which contains information in multiple languages.  ...  A Hindi-English parallel corpus is exploited in both PLM and TFM approaches. Vector space model is used for indexing and retrieval.  ... 
doi:10.1016/j.procs.2016.06.092 fatcat:qmhsfxlh3rbytdrss4fx5rpema

Cross Language Information Retrieval (CLIR): A Survey of Approaches for Exploring Web Across Languages

2020 VOLUME-8 ISSUE-10, AUGUST 2019, REGULAR ISSUE  
The need of information retrieval systems to become multilingual has given rise to the research in Cross Language Information Retrieval (CLIR) which can cross the language barriers and retrieve more relevant  ...  Most of the present monolingual information retrieval systems can retrieve documents in the language of query only, missing the information in other languages that may be more relevant to the user.  ...  The Cross Language Information Retrieval abbreviated as CLIR which is a sub domain of Information Retrieval can overcome the language barriers and help retrieving documents in languages that are different  ... 
doi:10.35940/ijitee.k7833.1110120 fatcat:qnfk26zmnndt7loi2kujjfiwr4

Exploiting Wikipedia API for Hindi-english Cross-language Information Retrieval

Vijay Kumar Sharma, Namita Mittal
2016 Procedia Computer Science  
Cross-Language Information Retrieval (CLIR) provides a solution for that language barrier which allows a user to ask a query in the native language and get the relevant documents in the different language  ...  Wikipedia title and inter-wiki link features are exploited for query translation. Target language documents are retrieved using vector space retrieval model and BM25 retrieval algorithm.  ...  Introduction Information Retrieval (IR) is a reasoning process that is used for storing, searching and retrieving the relevant information between a document and the user needs.  ... 
doi:10.1016/j.procs.2016.06.094 fatcat:n6ojd2ubvra3hh4nia6u5ezjhm

Cross-Language Information Retrieval with Latent Topic Models Trained on a Comparable Corpus [chapter]

Ivan Vulić, Wim De Smet, Marie-Francine Moens
2011 Lecture Notes in Computer Science  
In this paper we study cross-language information retrieval using a bilingual topic model trained on comparable corpora such as Wikipedia articles.  ...  The probabilistic interlingual representation is incorporated in a statistical language model for information retrieval.  ...  Cross-language information retrieval deals with the retrieval of documents written in a language different from the language of the user's query.  ... 
doi:10.1007/978-3-642-25631-8_4 fatcat:exnq7ybqdjanpjp7iesfs6xltm

What happened in CLEF 2008: Introduction to the Working Notes

Carol Peters
2008 Conference and Labs of the Evaluation Forum  
This is done through the organisation of annual evaluation campaigns in which a series of tracks designed to test different aspects of mono- and cross-language information retrieval (IR) are offered.  ...  The objective of the Cross Language Evaluation Forum is to promote research in the field of multilingual system development.  ...  Cross-Language Video Retrieval (VideoCLEF): The VideoCLEF Vid2RSS feed task was a classification task performed on a video corpus containing episodes of a dual language television program in Dutch and  ... 
dblp:conf/clef/Peters08a fatcat:sguzidsq25hincctifanop2sz4

CACAO Project at the TEL@CLEF 2008 Task

Alessio Bosca, Luca Dini
2008 Conference and Labs of the Evaluation Forum  
CACAO proposes the development of an infrastructure for multilingual access to digital content, including an information retrieval system able to search for books and texts in all the available languages  ...  Results evidenced a poor initial performance of the system, however since the project started few months ago they can constitute a valuable baseline in order to measure the future advancement of the system  ...  Acknowledgments This work has been supported and funded by CACAO EU project (ECP 2006 DILI 510035). References  ... 
dblp:conf/clef/BoscaD08a fatcat:o3cqflhgefcbvf4voxmn6q5uhi

A Cross-Language Approach to Historic Document Retrieval [chapter]

Marijn Koolen, Frans Adriaans, Jaap Kamps, Maarten de Rijke
2006 Lecture Notes in Computer Science  
using cross-language information retrieval techniques.  ...  We propose a cross-language approach to historic document retrieval, and investigate (1) the automatic construction of translation resources for historic languages, and (2) the retrieval of historic documents  ...  the retrieval of historic documents using cross-language information retrieval techniques.  ... 
doi:10.1007/11735106_36 fatcat:yp7ckxapwnhilcb6mb6ih44h5m

Page 386 of Computational Linguistics Vol. 29, Issue 3 [page]

2003 Computational Linguistics  
In previous studies, parallel texts have been exploited in several ways: using a pseudofeedback approach, capturing global cross-language term associations, transposing to a language-independent semantic  ...  Using a pseudofeedback approach. In Yang et al. (1998) parallel texts are used as follows. A given query in the source language is first used to retrieve a subset of texts from the parallel corpus.  ... 

CACAO Project at the TEL@CLEF 2009 Task

Alessio Bosca, Luca Dini
2009 Conference and Labs of the Evaluation Forum  
CACAO (Cross-language Access to Catalogues And On-line libraries) is an EU project devoted to enabling cross-language access to the contents of a federation of digital libraries with a set of software  ...  The prototype showed good performances with respect to the other participants, resulting among the best 5 in the French Monolingual subtrack, and quite a significative enhancement in comparison with the  ...  In fact, in a context of interlaced cross-border libraries, such as the ones proposed by META OPAC, the absence of a cross-language perspective is likely to cause a substantial impasse: if a user wanted  ... 
dblp:conf/clef/BoscaD09b fatcat:5dhhv46iy5bl7fhthf4t5pl3ee

A Novel Method for Cross-Language Retrieval of Chunks Using Monolingual and Bilingual Corpora [chapter]

Tayebeh Mosavi Miangah, Amin Nezarat
2010 Communications in Computer and Information Science  
Cross-language information retrieval (CLIR) refers to a kind of information retrieval in which the language of the query and that of searched document are different.  ...  Information retrieval (IR) is a crucial area of natural language processing (NLP) and can be defined as finding documents whose content is relevant to the query need of a user.  ...  Chen has also examined effect of phrase translation in cross-language information retrieval between Chinese and English.  ... 
doi:10.1007/978-3-642-15766-0_45 fatcat:j2e7aoc6obbt3hkjvfvw6dhftq

Creating a Persian-English Comparable Corpus [chapter]

Homa Baradaran Hashemi, Azadeh Shakery, Heshaam Faili
2010 Lecture Notes in Computer Science  
Multilingual corpora are valuable resources for cross-language information retrieval and are available in many language pairs.  ...  In this study, we build a Persian-English comparable corpus from two independent news collections: BBC News in English and Hamshahri news in Persian.  ...  Cross-Language Experiments In the next step of our research, we intend to do cross-language information retrieval using the obtained cross-lingual word associations from the comparable corpus.  ... 
doi:10.1007/978-3-642-15998-5_5 fatcat:dpbwztz4z5axtimn77bqtktghy

A Review on the Cross and Multilingual Information Retrieval

Pothula Sujatha, Dhavachelvan
2011 International journal of Web & Semantic Technology  
CLIR deals with asking questions in one language and retrieving documents in different language.  ...  In this paper we explore some of the most important areas of information retrieval. In particular, Crosslingual Information Retrieval (CLIR) and Multilingual Information Retrieval (MLIR).  ...  Participated in NTCIR-2 Japanese/English cross-language (J-E and E-J) and multi-lingual (J-JE and E-JE) information retrieval tasks.  ... 
doi:10.5121/ijwest.2011.2409 fatcat:lpz4c77okfhzdc54za66zf2r6e

A Method for Cross-Language Retrieval of Chunks Using Monolingual and Bilingual Corpora

Tayebeh Mosavi Miangah, Amin Nezarat
2010 International Journal of Computer Applications  
Cross-language information retrieval (CLIR) refers to a kind of information retrieval in which the language of the query and that of searched document are different.  ...  Information retrieval (IR) is a crucial area of natural language processing (NLP) and can be defined as finding documents whose content is relevant to the query need of a user.  ...  Chen has also examined effect of phrase translation in cross-language information retrieval between Chinese and English.  ... 
doi:10.5120/1518-1902 fatcat:qf4mdwxduvfcpl2mjpytoj4jye

Multimedia Search Without Visual Analysis: The Value of Linguistic and Contextual Information

Franciska M. G. de Jong, Thijs Westerveld, Arjen P. de Vries
2007 IEEE transactions on circuits and systems for video technology (Print)  
He has worked on a variety of research topics, including (multimedia) information retrieval, database architecture, query processing, retrieval system evaluation, and ambient intelligence.  ...  A number of techniques are presented, including the time-alignment of textual resources, audio and speech processing, content reduction and reasoning tools, and the exploitation of surface features.  ...  More recently in 2005 the Cross-Language Evaluation Forum (CLEF) has started a speech retrieval track on spoken interviews [33].  ... 
doi:10.1109/tcsvt.2007.890834 fatcat:fiybaunbjzdvddksqewudk7n7m