A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2017; you can also visit the original URL.
The file type is
In this paper we present the evaluation of different features for multiligual and crosslevel semantic textual similarity. ... Three different types of features were used: lexical, knowledge-based and corpus-based. ... In this paper, we evaluate different features for determining those that obtain the best performances for calculating both, cross-level semantic similarity and multilingual semantic textual similarity. ...doi:10.3115/v1/s14-2022 dblp:conf/semeval/VilarinoPLTB14 fatcat:u2uju352n5cdjmrjh7bzel756a
This paper presents the second round of the task on Cross-lingual Textual Entailment for Content Synchronization, organized within SemEval-2013. ... We report on the training and test data used for evaluation, the process of their creation, the participating systems (six teams, 61 runs), the approaches adopted and the results achieved. ... The authors would also like to acknowledge Pamela Forner and Giovanni Moretti from CELCT, and the volunteer translators that contributed to the creation of the dataset: Giusi Calo, Victoria Díaz, Bianca ...dblp:conf/semeval/NegriMMBG13 fatcat:n5god77ng5auxkspgooihr7vbe
This paper presents the first round of the task on Cross-lingual Textual Entailment for Content Synchronization, organized within SemEval-2012. ... We report on the training and test data used for evaluation, the process of their creation, the participating systems (10 teams, 92 runs), the approaches adopted and the results achieved. ... The authors would also like to acknowledge Giovanni Moretti from CELCT for evaluation scripts and technical assistance, and the volunteer translators that contributed to the creation of the dataset: María ...dblp:conf/semeval/NegriMMBG12 fatcat:uba2fngi55bnldjlao7elr6wkq
As models continue to scale up in parameters, training data, and compute, understanding memorization in language models is both important from a learning-theoretical point of view, and is practically crucial ... We further estimate the influence of each training example on the validation set and on generated texts, and show that this can provide direct evidence of the source of memorization at test time. ... Moreover, it will be necessary to construct datasets where such kind of semantically similar but textually different document pairs exist. ...arXiv:2112.12938v1 fatcat:72hizr3uajfyhjad7gssqjxsxi