Automatic bilingual lexicon acquisition using random indexing of parallel corpora

2005 Natural Language Engineering  
This paper presents a simple and effective approach to automatic bilingual lexicon acquisition. The approach is co-occurrence-based and applies the Random Indexing vector space methodology to aligned bilingual data. It is efficient and scalable, and generates promising results when compared to a manually compiled lexicon. The paper also discusses some of the methodological problems with the preferred evaluation procedure.
doi:10.1017/s1351324905003876 fatcat:5xamcayjcvd3zpx5pgvkwp5pkq
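
The abstract describes the method only in outline, so the following is a minimal sketch of how Random Indexing might be applied to aligned bilingual data, not the authors' implementation. It assumes that each aligned sentence pair receives one sparse random index vector, that a word's context vector is the sum of the index vectors of the pairs it occurs in, and that translation candidates are ranked by cosine similarity. The function names, the dimensionality (512), the number of non-zero entries (8), and the toy English-Swedish corpus are all hypothetical choices made for illustration.

```python
import numpy as np


def index_vector(rng, dim, nonzeros):
    """Return a sparse ternary vector: a few random +1/-1 entries, zeros elsewhere."""
    vec = np.zeros(dim)
    positions = rng.choice(dim, size=nonzeros, replace=False)
    vec[positions] = rng.choice([-1.0, 1.0], size=nonzeros)
    return vec


def build_context_vectors(aligned_pairs, dim=512, nonzeros=8, seed=0):
    """Accumulate one context vector per word in each language.

    Assumption: the co-occurrence context is the aligned sentence pair,
    so words from both languages that share a pair share its index vector.
    """
    rng = np.random.default_rng(seed)
    src_vecs, tgt_vecs = {}, {}
    for src_sentence, tgt_sentence in aligned_pairs:
        region = index_vector(rng, dim, nonzeros)
        for word in set(src_sentence.lower().split()):
            src_vecs[word] = src_vecs.get(word, np.zeros(dim)) + region
        for word in set(tgt_sentence.lower().split()):
            tgt_vecs[word] = tgt_vecs.get(word, np.zeros(dim)) + region
    return src_vecs, tgt_vecs


def cosine(a, b):
    """Cosine similarity between two non-zero vectors."""
    return float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b)))


def translate(word, src_vecs, tgt_vecs, top_n=1):
    """Rank target-language words by similarity to the source word's context vector."""
    query = src_vecs[word]
    ranked = sorted(tgt_vecs, key=lambda t: cosine(query, tgt_vecs[t]), reverse=True)
    return ranked[:top_n]


# Hypothetical toy corpus of aligned sentence pairs (English, Swedish).
toy_corpus = [
    ("a dog sleeps", "en hund sover"),
    ("a cat sleeps", "en katt sover"),
    ("a dog runs", "en hund springer"),
    ("a cat runs", "en katt springer"),
]

src_vecs, tgt_vecs = build_context_vectors(toy_corpus)
for english_word in ("dog", "cat", "sleeps"):
    print(english_word, translate(english_word, src_vecs, tgt_vecs))
```

In this toy setting a word and its translation occur in exactly the same aligned pairs, so their context vectors coincide and the translation ranks first. On real corpora the match is only approximate, which is why an extracted lexicon would need to be evaluated against a manually compiled one, as the abstract notes.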