Incorporating information status into generation ranking

Aoife Cahill, Arndt Riester
2009 Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP: Volume 2 - ACL-IJCNLP '09   unpublished
We investigate the influence of information status (IS) on constituent order in German, and integrate our findings into a loglinear surface realisation ranking model. We show that the distribution of pairs of IS categories is strongly asymmetric. Moreover, each category is correlated with morphosyntactic features, which can be automatically detected. We build a loglinear model that incorporates these asymmetries for ranking German string realisations from input LFG F-structures. We show that it
more » ... achieves a statistically significantly higher BLEU score than the baseline system without these features.
doi:10.3115/1690219.1690261 fatcat:bmugt2ejr5avbl3njkvbgui2qm