A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2021; you can also visit the original URL.
The file type is application/pdf
.
Statistically Significant Detection of Semantic Shifts using Contextual Word Embeddings
2021
Proceedings of the 2nd Workshop on Evaluation and Comparison of NLP Systems
unpublished
Detecting lexical semantic change in smaller data sets, e.g. in historical linguistics and digital humanities, is challenging due to a lack of statistical power. This issue is exacerbated by non-contextual embedding models that produce one embedding per word and, therefore, mask the variability present in the data. In this article, we propose an approach to estimate semantic shift by combining contextual word embeddings with permutation-based statistical tests. We use the false discovery rate
doi:10.18653/v1/2021.eval4nlp-1.11
fatcat:z2tt4hdnsrbgflwisdxlls4e5m