Estimating Grammar Correctness for a Priori Estimation of Machine Translation Post-Editing Effort

Nicholas H. Kirk, Guchun Zhang, Georg Groh
2014 Proceedings of the EACL 2014 Workshop on Humans and Computer-assisted Translation  
We present a supervised learning pilot application for estimating Machine Translation (MT) output reusability, in view of supporting a human post-editor of MT content. We train our model on typed dependencies (labeled grammar relationships) extracted from human reference and raw MT data, to then predict grammar relationship correctness values that we aggregate to provide a binary segmentlevel evaluation. In view of scaling up to larger data, we provide implemented Naïve Bayes and Stochastic
more » ... and Stochastic Gradient Descent with Support Vector Machine loss function approaches and their evaluation, and verify the correlation of predicted values with human judgement.
doi:10.3115/v1/w14-0303 dblp:conf/eacl/KirkZG14 fatcat:qlr6hhr3jbhhdcod72qwhiyqj4