Filters








18,599 Hits in 4.1 sec

Cross-language information retrieval models based on latent topic models trained with document-aligned comparable corpora

Ivan Vulić, Wim De Smet, Marie-Francine Moens
2012 Information retrieval (Boston)  
In this paper, we study different applications of cross-language latent topic models trained on comparable corpora. The first focus lies on the task of cross-language information retrieval (CLIR).  ...  The second focus lies on the methods for extracting translation candidates and semantically related words using only per-topic word distributions of the cross-language latent topic model.  ...  LDA-Based Cross-Language Information Retrieval (Act I) This section provides a theoretical insight into statistical cross-language information retrieval models relying on per-topic word distributions and  ... 
doi:10.1007/s10791-012-9200-5 fatcat:ednqhlfih5dcphmyagg4hzz37i

Japanese-English Cross Language Information Retrieval based on Comparable Corpora and Bilingual Dictionary

AKITOSHI OKUMURA, KAI ISHIKAWA, KENJI SATOH
1998 Journal of Natural Language Processing  
This paper proposes a method to translate query terms for cross-language information retrieval(CUR).CLIR is generally performed by query translation and information retrieval(IR).CLIR is less precise than  ...  KeyWords:Cross Language Information Retrieval,Comparable Corpora,Query Generation <num>Number:012 <dom>Domain:Environment <title>Topic:Water Pollution <desc>Description: Document is about the pollution  ... 
doi:10.5715/jnlp.5.4_77 fatcat:pe2qx5efqra6rk52htxxzfucaq

Enhancing cross-language information retrieval by an automatic acquisition of bilingual terminology from comparable corpora

Fatiha Sadat, Masatoshi Yoshikawa, Shunsuke Uemura
2003 Proceedings of the 26th annual international ACM SIGIR conference on Research and development in informaion retrieval - SIGIR '03  
This paper presents an approach to bilingual lexicon extraction from comparable corpora and evaluations on Cross-Language Information Retrieval.  ...  pruning to be highly effective in Cross-Language Information Retrieval.  ...  Evaluations on Cross-Language Information Retrieval (CLIR), which consists of retrieving documents written in one language using queries written in another language, is another interest.  ... 
doi:10.1145/860435.860519 dblp:conf/sigir/SadatYU03 fatcat:xydcdi2i5nhtxogt5qyvtx5muq

Bilingual terminology acquisition from comparable corpora and phrasal translation to cross-language information retrieval

Fatiha Sadat, Masatoshi Yoshikawa, Shunsuke Uemura
2003 Proceedings of the 41st Annual Meeting on Association for Computational Linguistics - ACL '03  
The present paper will seek to present an approach to bilingual lexicon extraction from non-aligned comparable corpora, phrasal translation as well as evaluations on Cross-Language Information Retrieval  ...  A two-stages translation model is proposed for the acquisition of bilingual terminology from comparable corpora, disambiguation and selection of best translation alternatives according to their linguistics-based  ...  as evaluations on Cross-Language Information Retrieval (CLIR).  ... 
doi:10.3115/1075178.1075201 dblp:conf/acl/SadatYU03 fatcat:3ffo3vcownhq7kb2qw3x22e5r4

Cross Language Information Retrieval (CLIR): A Survey of Approaches for Exploring Web Across Languages

2020 VOLUME-8 ISSUE-10, AUGUST 2019, REGULAR ISSUE  
The need of information retrieval systems to become multilingual has given rise to the research in Cross Language Information Retrieval (CLIR) which can cross the language barriers and retrieve more relevant  ...  This exponential growth of information on the internet has also induced several challenges before the information retrieval systems.  ...  Resolving translation ambiguity of queries is one of the important aspects of cross-language information retrieval.  ... 
doi:10.35940/ijitee.k7833.1110120 fatcat:qnfk26zmnndt7loi2kujjfiwr4

Cross-Language Information Retrieval with Latent Topic Models Trained on a Comparable Corpus [chapter]

Ivan Vulić, Wim De Smet, Marie-Francine Moens
2011 Lecture Notes in Computer Science  
In this paper we study cross-language information retrieval using a bilingual topic model trained on comparable corpora such as Wikipedia articles.  ...  on pre-existing translation dictionaries that are hand-built or constructed based on parallel corpora.  ...  In addition, the cross-language knowledge transfer based on the proposed generative topic models that are trained on comparable corpora might be useful in many other multilingual information management  ... 
doi:10.1007/978-3-642-25631-8_4 fatcat:exnq7ybqdjanpjp7iesfs6xltm

Cross-Lingual Semantic Similarity Measure for Comparable Articles [chapter]

Motaz Saad, David Langlois, Kamel Smaïli
2014 Lecture Notes in Computer Science  
Then, we compare LSI approaches to the dictionary-based approach on several English-Arabic parallel and comparable corpora.  ...  In this paper, we use the LSI in two ways to retrieve Arabic-English comparable articles.  ...  These methods are based on bilingual dictionaries [10, 16, 19] , or based on cross-lingual Information retrieval (CL-IR) [7, 1, 21] , or based on cross-lingual Latent Semantic Indexing (CL-LSI) system  ... 
doi:10.1007/978-3-319-10888-9_11 fatcat:zlzqkwoiwvglljzhetq3xmgwva

A comprehensive survey on cross-language information retrieval system

Gouranga Charan Jena, Siddharth Swarup Rautaray
2019 Indonesian Journal of Electrical Engineering and Computer Science  
Cross language information retrieval (CLIR) is a retrieval process in which the user fires queries in one language to retrieve information from another (different) language.  ...  Information Retrieval can be classified into different classes such as Monolingual information retrieval, Bi-Lingual Information Retrieval, Multilingual information retrieval and Cross language information  ...  Cross-language information retrieval is a retrieval process in which the user fires queries in one language to retrieve information from another language.  ... 
doi:10.11591/ijeecs.v14.i1.pp127-134 fatcat:bg3kk7o5sbcrbbxklsjbfw7aue

Information Retrieval System and Machine Translation: A Review

Mangala Madankar, M.B. Chandak, Nekita Chavhan
2016 Procedia Computer Science  
Cross-lingual Information Retrieval (CLIR), Multi-lingual Information Retrieval (MLIR), Machine translation approaches and techniques.  ...  CLIR deals with asking queries in one language and demand for retrieving documents in another language.  ...  Most crucial challenges in Cross lingual information retrieval and Multilingual Information Retrieval are the retrieval of relevant information for a query fired in native language.  ... 
doi:10.1016/j.procs.2016.02.071 fatcat:y252g5yn2vcbvkdbobrvrhoxna

Mining comparable bilingual text corpora for cross-language information integration

Tao Tao, ChengXiang Zhai
2005 Proceeding of the eleventh ACM SIGKDD international conference on Knowledge discovery in data mining - KDD '05  
Since our method only relies on naturally available comparable corpora, it is generally applicable to any language pairs as long as we have comparable corpora.  ...  In this paper, we propose a general cross-lingual text mining method that does not rely on any of these resources, but can exploit comparable bilingual text corpora to discover mappings between words and  ...  Currently, cross-lingual information integration is often achieved through performing cross-lingual information retrieval(CLIR) [17] , which allows a user to retrieve documents in language A with a query  ... 
doi:10.1145/1081870.1081958 dblp:conf/kdd/TaoZ05 fatcat:o2uv6s3c5bdvdgcjka52ckvfs4

CROSS LANGUAGE INFORMATION RETRIEVAL: IN INDIAN LANGUAGE PERSPECTIVE

Pratibha Bajpai .
2014 International Journal of Research in Engineering and Technology  
This makes cross-language information retrieval (CLIR) and multilingual information retrieval (MLIR) for Web applications a valuable need of the day.  ...  Multilingual information is overflowing on internet these days.  ...  Corpus Based Cross Lingual Information Retrieval Corpus based CLIR methods use multilingual terminology derived from parallel or comparable corpora for query translation and expansion.  ... 
doi:10.15623/ijret.2014.0322010 fatcat:zcjebdivyfcivnzjfvxpk2ec5u

Mining named entity transliteration equivalents from comparable corpora

Raghavendra Udupa, K. Saravanan, A. Kumaran, Jagadeesh Jagarlamudi
2008 Proceeding of the 17th ACM conference on Information and knowledge mining - CIKM '08  
INTRODUCTION Named Entities (NEs) form a significant fraction of query terms in Information Retrieval (IR) systems and have a substantial impact on their retrieval performance.  ...  NEs are even more important in Cross Language Information Retrieval (CLIR), as in addition to being a significant component of query terms, any errors in their translations act as noise affecting adversely  ... 
doi:10.1145/1458082.1458313 dblp:conf/cikm/UdupaSKJ08 fatcat:kzxksuoccnc6poaukpyfdcdkvm

Cross-language plagiarism detection

Martin Potthast, Alberto Barrón-Cedeño, Benno Stein, Paolo Rosso
2010 Language Resources and Evaluation  
, (ii) state-of-the-art solutions for two important subtasks are reviewed, (iii) retrieval models for the assessment of cross-language similarity are surveyed, and, (iv) the three models CL-CNG, CL-ESA  ...  Our contributions in this field are as follows: (i) a comprehensive retrieval process for cross-language plagiarism detection is introduced, highlighting the differences to monolingual plagiarism detection  ...  Figure 4 4 based on parallel corpora: CL-ASA, CL-LSI, CL-KCCA based on dictionaries: CL-VSM, Eurovoc-based based on comparable corpora: CL-ESA based on syntax:CL-CNG Taxonomy of retrieval models for cross-language  ... 
doi:10.1007/s10579-009-9114-z fatcat:r2wyqrlqezdpjne3d5xhxvur74

A Review on the Cross and Multilingual Information Retrieval

PothulaSujatha, Dhavachelvan
2011 International journal of Web & Semantic Technology  
MLIR deals with asking questions in one or more languages and retrieving documents in one or more different languages.  ...  CLIR deals with asking questions in one language and retrieving documents in different language.  ...  Parallel Corpora When compared to dictionary based corpus based translation typically gives much better performance, as [5] found.  ... 
doi:10.5121/ijwest.2011.2409 fatcat:lpz4c77okfhzdc54za66zf2r6e

A Novel Method for Cross-Language Retrieval of Chunks Using Monolingual and Bilingual Corpora [chapter]

Tayebeh Mosavi Miangah, Amin Nezarat
2010 Communications in Computer and Information Science  
Cross-language information retrieval (CLIR) refers to a kind of information retrieval in which the language of the query and that of searched document are different.  ...  Information retrieval (IR) is a crucial area of natural language processing (NLP) and can be defined as finding documents whose content is relevant to the query need of a user.  ...  The traditional method for cross-language retrieval was based on one or more bilingual dictionary used in the time of searching and displaying the outputs.  ... 
doi:10.1007/978-3-642-15766-0_45 fatcat:j2e7aoc6obbt3hkjvfvw6dhftq
« Previous Showing results 1 — 15 out of 18,599 results