A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2022; you can also visit the original URL.
The file type is application/pdf
.
Better Hypothesis Testing for Statistical Machine Translation: Controlling for Optimizer Instability
2011
Annual Meeting of the Association for Computational Linguistics
In statistical machine translation, a researcher seeks to determine whether some innovation (e.g., a new feature, model, or inference algorithm) improves translation quality in comparison to a baseline system. To answer this question, he runs an experiment to evaluate the behavior of the two systems on held-out data. In this paper, we consider how to make such experiments more statistically reliable. We provide a systematic analysis of the effects of optimizer instability-an extraneous variable
dblp:conf/acl/ClarkDLS11
fatcat:5jgr7xk5pbfujazl5r7b7iu4xq