Source Code Level Word Embeddings in Aiding Semantic Test-to-Code Traceability

Viktor Csuvik, Andras Kicsi, Laszlo Vidacs
2019 2019 IEEE/ACM 10th International Symposium on Software and Systems Traceability (SST)  
Proper recovery of test-to-code traceability links from source code could considerably aid software maintenance. Scientific research has already shown that this can be achieved to an extent with a range of techniques relying on various information sources. This includes information retrieval which considers the natural language aspects of the source code. Latent Semantic Indexing (LSI) is widely looked upon as the mainstream technique of this approach. Techniques utilizing word embedding
more » ... tion however also use similar data and nowadays enjoy immense popularity in several fields of study. In this work, we present our evaluation of both LSI and word embeddings in aiding class level test-to-code traceability of 4 open source software systems, the assessment relying on naming convention information.
doi:10.1109/sst.2019.00016 dblp:conf/icse/CsuvikKV19 fatcat:6rqs2txdwzhjrbm7lhzthro5mm