Learning when to trust distant supervision: An application to low-resource POS tagging using cross-lingual projection [article]

Meng Fang, Trevor Cohn
<span title="2016-07-05">2016</span> <i > arXiv </i> &nbsp; <span class="release-stage" >pre-print</span>
Cross-lingual projection of linguistic annotation suffers from many sources of bias and noise, leading to unreliable annotations that cannot be used directly. In this paper, we introduce a novel approach to sequence tagging that learns to correct the errors from cross-lingual projection using an explicit debiasing layer. This is framed as joint learning over two corpora, one tagged with gold-standard tags and the other with projected tags. We evaluated with only 1,000 tokens tagged with gold tags, along with more plentiful parallel data. Our system equals or exceeds the state of the art on eight simulated low-resource settings, as well as two real low-resource languages, Malagasy and Kinyarwanda.
<span class="external-identifiers"> <a target="_blank" rel="external noopener" href="https://arxiv.org/abs/1607.01133v1">arXiv:1607.01133v1</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/dwz6vcqxi5extb5te42evdl2ly">fatcat:dwz6vcqxi5extb5te42evdl2ly</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20200907040325/https://arxiv.org/pdf/1607.01133v1.pdf" title="fulltext PDF download">Web Archive [PDF]</a>