A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2021; you can also visit the original URL.
The file type is application/pdf
.
MKQA: A Linguistically Diverse Benchmark for Multilingual Open Domain Question Answering
2021
Transactions of the Association for Computational Linguistics
Progress in cross-lingual modeling depends on challenging, realistic, and diverse evaluation sets. We introduce Multilingual Knowledge Questions and Answers (MKQA), an open- domain question answering evaluation set comprising 10k question-answer pairs aligned across 26 typologically diverse languages (260k question-answer pairs in total). Answers are based on heavily curated, language- independent data representation, making results comparable across languages and independent of
doi:10.1162/tacl_a_00433
fatcat:hef2fubo7jfjjftxt6juo5uuf4