3 Hits in 2.6 sec

LIDIOMS: A Multilingual Linked Idioms Data Set [article]

Diego Moussallem, Mohamed Ahmed Sherif, Diego Esteves, Marcos Zampieri, Axel-Cyrille Ngonga Ngomo
2018 arXiv   pre-print
In this paper, we describe the LIDIOMS data set, a multilingual RDF representation of idioms currently containing five languages: English, German, Italian, Portuguese, and Russian.  ...  We also provide the details of linking LIDIOMS to well-known multilingual data sets such as BabelNet.  ...  Special thanks to John McCrae, Jorge Gracia and Gilles Sérasset for their constructive discussions about modelling and best practices to create our data set.  ... 
arXiv:1802.08148v1 fatcat:dockltoiuvai3ao4rqycahy26u

IBERT: Idiom Cloze-style reading comprehension with Attention [article]

Ruiyang Qin, Haozheng Luo, Zheheng Fan, Ziang Ren
2021 arXiv   pre-print
The idiom cloze task is a challenge problem in Natural Language Processing (NLP) research problem.  ...  Our model uses XLNET as the encoder and RoBERTa for choosing the most probable idiom for a given context.  ...  LIDIOMS: A Multilingual Linked Idioms Data Set.  ... 
arXiv:2112.02994v1 fatcat:yff2qwiwx5cw5gjuxyexhn5yfm

Il dato empirico in lessicografia: dizionari tradizionali e collaborativi a confronto

M-C Bibliografia De Marneffe, C Potts, E Marello, C Corino, Onesti
2009 Jezek, E   unpublished
twofold: on the one hand, it deals with the main methodological problems related to the study of Italian compound words using linguistic corpora; on the other hand, it aims at evaluating to what extent data  ...  Bisetto, A. (1999), Note sui composti VN dell'italiano, in P. Benincà, A. Mioni, L. Vanelli (a c. di), Fonologia e morfologia dell'italiano e dei dialetti d'Italia, Roma, Bulzoni, pp. 503-38.  ...  On the other hand, each valency-capable word in the treebanks is linked to a frame entry in the lexicon.  ...