CogNet: A Large-Scale Cognate Database

Khuyagbaatar Batsuren, Gabor Bella, Fausto Giunchiglia
2019 Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics  
This paper introduces CogNet, a new, large-scale lexical database that provides cognates-words of common origin and meaning-across languages. The database currently contains 3.1 million cognate pairs across 338 languages using 35 writing systems. The paper also describes the automated method by which cognates were computed from publicly available wordnets, with an accuracy evaluated to 94%. Finally, statistics and early insights about the cognate data are presented, hinting at a possible future
more » ... exploitation of the resource 1 by various fields of lingustics.
doi:10.18653/v1/p19-1302 dblp:conf/acl/BatsurenBG19 fatcat:7wudx56nt5dk5kqxqufod7cvz4