A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2021; you can also visit the original URL.
The file type is application/pdf
.
The Intonation System of Tajik: Is it Identical to Persian?
2018
The 6th Intl. Workshop on Spoken Language Technologies for Under-Resourced Languages
Recent work considered how images paired with speech can be used as supervision for building speech systems when transcriptions are not available. We ask whether visual grounding can be used for cross-lingual keyword spotting: given a text keyword in one language, the task is to retrieve spoken utterances containing that keyword in another language. This could enable searching through speech in a low-resource language using text queries in a high-resource language. As a proof-of-concept, we use
doi:10.21437/sltu.2018-53
dblp:conf/sltu/KamperR18
fatcat:tpy6o46sizb5xjjip5yg3ni42a