A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2022; you can also visit the original URL.
The file type is application/pdf
.
Counterfactual Interventions Reveal the Causal Effect of Relative Clause Representations on Agreement Prediction
2021
Proceedings of the 25th Conference on Computational Natural Language Learning
unpublished
When language models process syntactically complex sentences, do they use their representations of syntax in a manner that is consistent with the grammar of the language? We propose AlterRep, an intervention-based method to address this question. For any linguistic feature of a given sentence, AlterRep generates counterfactual representations by altering how the feature is encoded, while leaving intact all other aspects of the original representation. By measuring the change in a model's word
doi:10.18653/v1/2021.conll-1.15
fatcat:zxtykv7rs5h2ba6jbimltqj3gi