Improving NLP through Marginalization of Hidden Syntactic Structure

Jason Naradowsky, Sebastian Riedel, David A. Smith
2012 Conference on Empirical Methods in Natural Language Processing  
Many NLP tasks make predictions that are inherently coupled to syntactic relations, but for many languages the resources required to provide such syntactic annotations are unavailable. For others it is unclear exactly how much of the syntactic annotations can be effectively leveraged with current models, and what structures in the syntactic trees are most relevant to the current task. We propose a novel method which avoids the need for any syntactically annotated data when predicting a related
more » ... LP task. Our method couples latent syntactic representations, constrained to form valid dependency graphs or constituency parses, with the prediction task via specialized factors in a Markov random field. At both training and test time we marginalize over this hidden structure, learning the optimal latent representations for the problem. Results show that this approach provides significant gains over a syntactically uninformed baseline, outperforming models that observe syntax on an English relation extraction task, and performing comparably to them in semantic role labeling.
dblp:conf/emnlp/NaradowskyRS12 fatcat:axhvgqmxnbeureo6jbtmwjwyni