A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2017; you can also visit the original URL.
The file type is application/pdf
.
Part-of-speech Taggers for Low-resource Languages using CCA Features
2015
Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing
In this paper, we address the challenge of creating accurate and robust partof-speech taggers for low-resource languages. We propose a method that leverages existing parallel data between the target language and a large set of resourcerich languages without ancillary resources such as tag dictionaries. Crucially, we use CCA to induce latent word representations that incorporate cross-genre distributional cues, as well as projected tags from a full array of resource-rich languages. We develop a
doi:10.18653/v1/d15-1150
dblp:conf/emnlp/KimSS15
fatcat:6faju5ipmjf7hojldmcm4lmaou