Cross-Lingual Knowledge Discovery: Chinese-to-English Article Linking in Wikipedia [chapter]

Ling-Xiang Tang, Andrew Trotman, Shlomo Geva, Yue Xu
2012 Lecture Notes in Computer Science  
In this paper we examine automated Chinese to English link discovery in Wikipedia and the effects of Chinese segmentation and Chinese to English translation on the hyperlink recommendation.  ...  Our experimental results show that the implemented link discovery framework can effectively recommend Chinese-to-English cross-lingual links.  ...  In the collection, just over half of the Chinese articles (170,637), but only 5% of the English articles (169,974), were language cross-linked. 2.2 Links In Wikipedia Anchored Mono-Lingual Links  ... 
doi:10.1007/978-3-642-35341-3_24 fatcat:emmpnlqvhfa4dh73trkopux6yy

XLORE2: Large-scale Cross-lingual Knowledge Graph Construction and Application

Hailong Jin, Chengjiang Li, Jing Zhang, Lei Hou, Juanzi Li, Peng Zhang
2019 Data Intelligence  
Existing cross-lingual knowledge linking discovery methods heavily depend on the number of existing cross-lingual links. It is a fact that the cross-lingual links in Wikipedia are quite sparse.  ...  Although XLORE is an English-Chinese bilingual knowledge graph, there are only 423,974 cross-lingual links between English instances and Chinese instances.  ...  Cross-lingual Link Discovery: We utilize a logistic regression model to find new cross-linguals between Chinese and English instances.  ... 
doi:10.1162/dint_a_00003 dblp:journals/dint/JinLZHLZ19 fatcat:zdcq2gfyirc2ba4s6scxtytsna

C-BiLDA extracting cross-lingual topics from non-parallel texts by distinguishing shared from unshared content

Geert Heyman, Ivan Vulić, Marie-Francine Moens
2015 Data mining and knowledge discovery  
We study the problem of extracting cross-lingual topics from non-parallel multilingual text datasets with partially overlapping thematic content (e.g., aligned Wikipedia articles in two different languages  ...  We present a full overview of C-BiLDA, and show its utility in the task of cross-lingual knowledge transfer for multi-class document classification on two benchmarking datasets for three language pairs  ...  Compliance with Ethical Standards Funding: The research presented in this article has been carried out in context of the SCATE (SBO-130047) research project financed by the (Flemish) agency for Innovation  ... 
doi:10.1007/s10618-015-0442-x fatcat:763xooijw5ardh4qs4frn5fsxu

Building a Large-Scale Cross-Lingual Knowledge Base from Heterogeneous Online Wikis [chapter]

Mingyang Li, Yao Shi, Zhigang Wang, Yongbin Liu
2015 Lecture Notes in Computer Science  
Particularly, XLORE integrates four online wikis including English Wikipedia, Chinese Wikipedia, Baidu Baike and Hudong Baike to balance the knowledge volume in different languages, employs a link-discovery  ...  However, there are few Chinese-English knowledge bases due to the following reasons: 1) the scarcity of Chinese knowledge in existing cross-lingual knowledge bases; 2) the limited number of cross-lingual  ...  In Wikipedia, cross-lingual links help readers switch to preferred languages. If an entity has both a Chinese article a_z and an English article a_e, then a_z and a_e are cross-linked.  ... 
doi:10.1007/978-3-319-25207-0_37 fatcat:n244qtdjibcplbwgznp5tcf63m

Cross-lingual Entity Alignment with Incidental Supervision [article]

Muhao Chen, Weijia Shi, Ben Zhou, Dan Roth
2021 arXiv   pre-print
Much research effort has been put into multilingual knowledge graph (KG) embedding methods to address the entity alignment task, which seeks to match entities in different language-specific KGs that refer  ...  Then, two learning processes are conducted: (i) an embedding learning process to encode the KG and text of each language in one embedding space, and (ii) a self-learning based alignment learning process  ...  This research is supported in part by the Office of the Director of National Intelligence (ODNI), Intelligence Advanced Research Projects Activity (  ... 
arXiv:2005.00171v2 fatcat:xr7rs2y6vvakjkyz7we7xzckyy

The Effectiveness of Cross-lingual Link Discovery

Ling-Xiang Tang, Kelly Y. Itakura, Shlomo Geva, Andrew Trotman, Yue Xu
2011 NTCIR Conference on Evaluation of Information Access Technologies  
Cross-lingual link discovery is a way of automatically finding prospective links between documents in different languages, which is particularly helpful for knowledge discovery of different language domains  ...  This paper describes the evaluation in benchmarking the effectiveness of cross-lingual link discovery (CLLD).  ...  But there are four types of links in Wikipedia: mono-lingual article-to-article (see also) links; mono-lingual anchor-to-article links; cross-lingual article-to-article (language) links; cross-lingual  ... 
dblp:conf/ntcir/TangIGTX11 fatcat:acoaju4ppbenfnbdnum5saamrm

Automated Cross-lingual Link Discovery in Wikipedia

Ling-Xiang Tang, Daniel Cavanagh, Andrew Trotman, Shlomo Geva, Yue Xu, Laurianne Sitbon
2011 NTCIR Conference on Evaluation of Information Access Technologies  
In this paper we describe our approaches to discovering Chinese, Japanese, and Korean (CJK) cross-lingual links for English documents in Wikipedia.  ...  At NTCIR-9, we participated in the cross-lingual link discovery (Crosslink) task.  ...  CJK CROSS-LINGUAL LINKING To locate CJK cross-lingual links for English Wikipedia articles, we separate the link discovery into two phases: 1) detecting prospective anchors in the source document; and  ... 
dblp:conf/ntcir/TangCTGXS11 fatcat:dckmsjohozdcho4uxlsprpgunu

An evaluation framework for cross-lingual link discovery

Ling-Xiang Tang, Shlomo Geva, Andrew Trotman, Yue Xu, Kelly Y. Itakura
2014 Information Processing & Management  
Several English mono-lingual link discovery tools exist (e.g. Wikipedia Miner; Tang, Geva, Trotman, Xu, & Itakura, 2011).  ...  These tools help Wikipedia content creators and curators discover and maintain appropriate links for their articles. But none of these tools currently support cross-lingual link discovery.  ...  Together CrossLink and CrossLink-2 provide a real opportunity to enhance Wikipedia cross language linking and we look forward to reporting on that work in the future.  ... 
doi:10.1016/j.ipm.2013.07.003 fatcat:27tuyqp4brfjbpfdavjxuzr54q

KECIR at NTCIR-10 Cross-Lingual Link Discovery Task

Jianxi Zheng, Yu Bai, Cheng Guo, Dongfeng Cai
2013 NTCIR Conference on Evaluation of Information Access Technologies  
This paper presents the methods of KECIR at NTCIR-10 Cross-Lingual Link Discovery Task.  ...  Four runs were submitted, and in the A2F evaluation with Manual Assessment results, the KECIR A2F C2E 01 FSCLIR achieved the highest score of LMAP and R-Prec in Chinese to English task.  ...  We plan to build a united framework for Cross-lingual Link between Chinese articles and English ones in future.  ... 
dblp:conf/ntcir/ZhengBGC13 fatcat:wxjyjz2el5codd65k522yohnlq

Overview of the NTCIR-9 Crosslink Task: Cross-lingual Link Discovery

Ling-Xiang Tang, Shlomo Geva, Andrew Trotman, Yue Xu, Kelly Y. Itakura
2011 NTCIR Conference on Evaluation of Information Access Technologies  
Cross-lingual link discovery (CLLD) is a way of automatically finding potential links between documents in different languages.  ...  This paper presents an overview of NTCIR-9 Cross-lingual Link Discovery (Crosslink) task.  ...  To tackle this problem, cross-lingual link discovery aims to break this language barrier.  ... 
dblp:conf/ntcir/TangGTXI11 fatcat:vfa7yc4atfb7rcjhr6agsytiku

Overview of the NTCIR-10 Cross-Lingual Link Discovery Task

Ling-Xiang Tang, In-Su Kang, Fuminori Kimura, Yi-Hsun Lee, Andrew Trotman, Shlomo Geva, Yue Xu
2013 NTCIR Conference on Evaluation of Information Access Technologies  
This paper presents an overview of NTCIR-10 Cross-lingual Link Discovery (CrossLink-2) task. For the task, we continued using the evaluation framework developed for the NTCIR-9 CrossLink-1 task.  ...  Overall, recommended links were evaluated at two levels (file-to-file and anchor-to-file); and system performance was evaluated with metrics: LMAP, R-Prec and P@N.  ...  Cross-lingual Linking in Wikipedia The remainder of this paper is organized as follows: First, the new cross-lingual link discovery tasks are defined in Section 2.  ... 
dblp:conf/ntcir/TangKKLTGX13 fatcat:mnkt6i3ntjea5n7wq7euvtwfwa

DCU at NTCIR-10 Cross-lingual Link Discovery (CrossLink-2) Task

Shu Chen, Gareth J. F. Jones, Noel E. O'Connor
2013 NTCIR Conference on Evaluation of Information Access Technologies  
DCU participated in the English to Chinese (E2C) and Chinese to English (C2E) subtasks of the NTCIR 10 CrossLink-2 Cross-lingual Link Discovery (CLLD) task.  ...  links for each anchor.  ...  INTRODUCTION The cross-lingual link discovery (CLLD) task aims to detect potentially important semantic links between documents in different languages.  ... 
dblp:conf/ntcir/ChenJO13 fatcat:kk2sbeklcneyvhzjpw6ync6yaa

KMI, The Open University at NTCIR-9 CrossLink: Cross-Lingual Link Discovery in Wikipedia Using Explicit Semantic Analysis

Petr Knoth, Lukás Zilka, Zdenek Zdráhal
2011 NTCIR Conference on Evaluation of Information Access Technologies  
This paper describes the methods used in the submission of Knowledge Media Institute (KMI), The Open University to the NTCIR-9 Cross-Lingual Link Discovery (CLLD) task entitled CrossLink.  ...  Three of the runs are based on exploiting the existing cross-lingual mapping between different versions of Wikipedia articles.  ...  In this paper, we present Cross-Lingual Link Discovery (CLLD) methods that can be used to suggest a set of cross-lingual links from an English Wikipedia article to articles in another language.  ... 
dblp:conf/ntcir/KnothZZ11 fatcat:hpu46su6wbgtxkjr7d5ogsabjq

Entity Matching across Heterogeneous Sources

Yang Yang, Yizhou Sun, Jie Tang, Bo Ma, Juanzi Li
2015 Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining - KDD '15  
However, the method would inevitably fail if the two domains have little or no overlap in content. An extreme case is that the source domain is in English and the target domain is in Chinese.  ...  Specifically, for handling the text disjointing problem, we use a cross-sampling process in our model to extract topics with terms coming from all the sources, and leverage existing matching relations  ...  The data set is collected as follows: we first randomly select an English article A with a cross-lingual link to a Chinese article B from Wikipedia.  ... 
doi:10.1145/2783258.2783353 dblp:conf/kdd/YangSTML15 fatcat:6bba6rzqvvbdjh3k7h4a43mude

Simple Yet Effective Methods for Cross-Lingual Link Discovery (CLLD) - KMI @ NTCIR-10 CrossLink-2

Petr Knoth, Drahomira Herrmannova
2013 NTCIR Conference on Evaluation of Information Access Technologies  
Cross-Lingual Link Discovery (CLLD) aims to automatically find links between documents written in different languages.  ...  Our methods (team KMI) achieved in the NTCIR-10 CrossLink-2 evaluation the best overall results in the English to Chinese, Japanese and Korean (E2CJK) task and were the top performers in the Chinese, Japanese  ...  In Section 2, we present CLLD methods designed by team KMI that can be used to suggest a set of cross-lingual links from an English Wikipedia article to articles in Chinese, Japanese and Korean (English  ... 
dblp:conf/ntcir/KnothH13 fatcat:p4ysm4gccfeqlog5yy2tghk7ba