A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2017; you can also visit the original URL.
The file type is application/pdf
.
Re-estimation of lexical parameters for treebank PCFGs
2008
Proceedings of the 22nd International Conference on Computational Linguistics - COLING '08
unpublished
We present procedures which pool lexical information estimated from unlabeled data via the Inside-Outside algorithm, with lexical information from a treebank PCFG. The procedures produce substantial improvements (up to 31.6% error reduction) on the task of determining subcategorization frames of novel verbs, relative to a smoothed Penn Treebank-trained PCFG. Even with relatively small quantities of unlabeled training data, the re-estimated models show promising improvements in labeled
doi:10.3115/1599081.1599106
fatcat:3ekcxxu4enc5vipm3s4ym4vrc4