Word Ordering as Unsupervised Learning Towards Syntactically Plausible Word Representations

Noriki Nishida, Hideki Nakayama
2017 International Joint Conference on Natural Language Processing  
The research question we explore in this study is how to obtain syntactically plausible word representations without using human annotations. Our underlying hypothesis is that word ordering tests, or linearizations, is suitable for learning syntactic knowledge about words. To verify this hypothesis, we develop a differentiable model called Word Ordering Network (WON) that explicitly learns to recover correct word order while implicitly acquiring word embeddings representing syntactic knowledge.
more » ... We evaluate the word embeddings produced by the proposed method on downstream syntax-related tasks such as partof-speech tagging and dependency parsing. The experimental results demonstrate that the WON consistently outperforms both order-insensitive and order-sensitive baselines on these tasks.
dblp:conf/ijcnlp/NishidaN17 fatcat:minavrybifegxiy2hsk6w7iif4