A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2022; you can also visit the original URL.
The file type is application/pdf
.
Comparing Two Basic Methods for Discriminating Between Similar Languages and Varieties
2016
Workshop on NLP for Similar Languages, Varieties and Dialects
This article describes the systems submitted by the Citius Ixa Imaxin team to the Discriminating Similar Languages Shared Task 2016. The systems are based on two different strategies: classification with ranked dictionaries and Naive Bayes classifiers. The results of the evaluation show that ranking dictionaries are more sound and stable across different domains while basic bayesian models perform reasonably well on in-domain datasets, but their performance drops when they are applied on out-of-domain texts.
dblp:conf/vardial/GamalloACA16
fatcat:yzl5jwfhcvdghnqk3u4ascikqa