A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2020; you can also visit the original URL.
The file type is application/pdf
.
Hallym: Named Entity Recognition on Twitter with Word Representation
2015
Proceedings of the Workshop on Noisy User-generated Text
Twitter is a type of social media that contains diverse user-generated texts. Traditional models are not applicable to tweet data because the text style is not as grammaticalized as that of newswire. In this paper, we construct word embeddings via canonical correlation analysis (CCA) on a considerable amount of tweet data and show the efficacy of word representation. Besides word embedding, we use partof-speech (POS) tags, chunks, and brown clusters induced from Wikipedia as features. Here, we
doi:10.18653/v1/w15-4310
dblp:conf/aclnut/YangK15
fatcat:p4esrps7pbhz7ecjqyvio5vcvi