UWB at SemEval-2020 Task 1: Lexical Semantic Change Detection

Ondřej Pražák, Pavel Přibáň, Stephen Taylor, Jakub Sido
2020 Proceedings of the Fourteenth Workshop on Semantic Evaluation   unpublished
In this paper, we describe our method for detection of lexical semantic change, i.e., word sense changes over time. We examine semantic differences between specific words in two corpora, chosen from different time periods, for English, German, Latin, and Swedish. Our method was created for the SemEval 2020 Task 1: Unsupervised Lexical Semantic Change Detection. We ranked 1 st in Sub-task 1: binary change detection, and 4 th in Sub-task 2: ranked change detection. Our method is fully
more » ... and language independent. It consists of preparing a semantic vector space for each corpus, earlier and later; computing a linear transformation between earlier and later spaces, using Canonical Correlation Analysis and Orthogonal Transformation; and measuring the cosines between the transformed vector for the target word from the earlier corpus and the vector for the target word in the later corpus.
doi:10.18653/v1/2020.semeval-1.30 fatcat:4qkaywkm6jfxxfjbeutdmwc4q4