MITRE: Seven Systems for Semantic Similarity in Tweets

Guido Zarrella, John Henderson, Elizabeth M. Merkhofer, Laura Strickhart
2015 Proceedings of the 9th International Workshop on Semantic Evaluation (SemEval 2015)  
This paper describes MITRE's participation in the Paraphrase and Semantic Similarity in Twitter task (SemEval-2015 Task 1). This effort placed first in Semantic Similarity and second in Paraphrase Identification with scores of Pearson's r of 61.9%, F1 of 66.7%, and maxF1 of 72.4%. We detail the approaches we explored including mixtures of string matching metrics, alignments using tweet-specific distributed word representations, recurrent neural networks for modeling similarity with those
more » ... y with those alignments, and distance measurements on pooled latent semantic features. Logistic regression is used to tie the systems together into the ensembles submitted for evaluation.
doi:10.18653/v1/s15-2002 dblp:conf/semeval/ZarrellaHMS15 fatcat:qejkqan2rrbqxfynqxzds7usfe