The file type is application/pdf.
There's No Comparison: Reference-less Evaluation Metrics in Grammatical Error Correction
[article]
2016
arXiv
pre-print
Current methods for automatically evaluating grammatical error correction (GEC) systems rely on gold-standard references. However, these methods penalize grammatical edits that are correct but absent from the gold standard. We show that reference-less grammaticality metrics correlate very strongly with human judgments and are competitive with the leading reference-based evaluation metrics. By interpolating both methods, we achieve state-of-the-art correlation with human judgments.
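The abstract does not spell out how the two kinds of metric are interpolated. As a minimal sketch, assuming a simple linear interpolation of two scores that have already been scaled to a comparable range, the idea could look like the following. The function name `interpolate`, the weight `lam`, and the example scores are hypothetical and are not taken from the paper.

```python
def interpolate(ref_based: float, ref_less: float, lam: float) -> float:
    """Linearly combine a reference-based and a reference-less score.

    lam is the weight on the reference-less score (an assumed convention):
    lam = 0 uses only the reference-based metric, lam = 1 only the
    reference-less one. Both scores are assumed to share a common scale.
    """
    if not 0.0 <= lam <= 1.0:
        raise ValueError("lam must be in [0, 1]")
    return (1.0 - lam) * ref_based + lam * ref_less


def rank_systems(scores: dict, lam: float) -> list:
    """Rank system names by interpolated score, best first."""
    return sorted(
        scores,
        key=lambda name: interpolate(*scores[name], lam=lam),
        reverse=True,
    )


# Made-up (reference-based, reference-less) scores for two hypothetical systems.
example_scores = {"A": (0.42, 0.71), "B": (0.45, 0.64)}
```

With these made-up numbers, `rank_systems(example_scores, lam=0.0)` prefers system B, while `lam=0.5` prefers system A, which illustrates how the weight can change a ranking. In practice the weight would have to be tuned, for example against human judgments.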
arXiv:1610.02124v1
fatcat:aqwiwawz4zeozjg7jo2h6jul3m