Empirical Linguistic Study of Sentence Embeddings

Katarzyna Krasnowska-Kieraś, Alina Wróblewska
2019 Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics  
The purpose of the research is to answer the question whether linguistic information is retained in vector representations of sentences. We introduce a method of analysing the content of sentence embeddings based on universal probing tasks, along with the classification datasets for two contrasting languages. We perform a series of probing and downstream experiments with different types of sentence embeddings, followed by a thorough analysis of the experimental results. Aside from dependency
more » ... ser-based embeddings, linguistic information is retained best in the recently proposed LASER sentence embeddings.
doi:10.18653/v1/p19-1573 dblp:conf/acl/Krasnowska-Kieras19 fatcat:5kwxe5f3urfehhmlk7u6d3tbpe