A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2017; you can also visit the original URL.
The file type is application/pdf
.
Automatic identification of arabic dialects in social media
2014
Proceedings of the first international workshop on Social media retrieval and analysis - SoMeRA '14
Modern Standard Arabic (MSA) is the formal language in most Arabic countries. Arabic Dialects (AD) or daily language differs from MSA especially in social media communication. However, most Arabic social media texts have mixed forms and many variations especially between MSA and AD. This paper aims to bridge the gap between MSA and AD by providing a framework for AD classification using probabilistic models across social media datasets. We present a set of experiments using the character n-gram
doi:10.1145/2632188.2632207
dblp:conf/sigir/SadatKF14
fatcat:rmrj4okd5zblrm534rl24dnia4