Discourse Sense Classification from Scratch using Focused RNNs

Gregor Weiss, Marko Bajec
2016 Proceedings of the CoNLL-16 shared task  
The subtask of CoNLL 2016 Shared Task focuses on sense classification of multilingual shallow discourse relations. Existing systems rely heavily on external resources, hand-engineered features, patterns, and complex pipelines fine-tuned for the English language. In this paper we describe a different approach and system inspired by end-to-end training of deep neural networks. Its input consists of only sequences of tokens, which are processed by our novel focused RNNs layer, and followed by a
more » ... se neural network for classification. Neural networks implicitly learn latent features useful for discourse relation sense classification, make the approach almost language-agnostic and independent of prior linguistic knowledge. In the closed-track sense classification task our system achieved overall 0.5246 F 1 -measure on English blind dataset and achieved the new state-of-the-art of 0.7292 F 1 -measure on Chinese blind dataset.
doi:10.18653/v1/k16-2006 dblp:conf/conll/WeissB16 fatcat:hwwz7nyairejvjjnpyamhlgoly