Arabic Document Classification Using Multiword Features
2013
International Journal of Computer and Communication Engineering
We investigate the use of multiword features to improve Arabic document classification. The Arabic language is both morphologically rich and highly inflected. Accordingly, it presents additional challenges when trying to bring Arabic information retrieval to a level comparable to English. The multiword features are modeled as combinations of words appearing within windows of varying sizes. Our experiments show that multiword features combined with the Dice similarity distance outperform the cosine similarity
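As a rough illustration of the idea described in the abstract, the sketch below builds multiword features as ordered word pairs that co-occur inside a sliding window and then compares two documents with a Dice coefficient and with cosine similarity. The abstract does not give the paper's exact feature definition, weighting scheme, or Dice formulation, so the function names (`multiword_features`, `dice_similarity`, `cosine_similarity`), the use of word pairs, and the set-based form of Dice are all assumptions made for this example.

```python
from collections import Counter
from math import sqrt


def multiword_features(tokens, window):
    """Count ordered word pairs whose members lie within `window` tokens.

    Assumption: a window of size 2 yields adjacent pairs only; larger
    windows also pair each word with words further to its right.
    """
    features = Counter()
    for i, left in enumerate(tokens):
        for right in tokens[i + 1:i + window]:
            features[(left, right)] += 1
    return features


def dice_similarity(a, b):
    """Set-based Dice coefficient: 2|A and B| / (|A| + |B|) (assumed form)."""
    if not a and not b:
        return 0.0
    shared = len(a.keys() & b.keys())
    return 2 * shared / (len(a) + len(b))


def cosine_similarity(a, b):
    """Cosine similarity over raw pair counts."""
    dot = sum(a[k] * b[k] for k in a.keys() & b.keys())
    norm_a = sqrt(sum(v * v for v in a.values()))
    norm_b = sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


if __name__ == "__main__":
    # Placeholder tokens; a real pipeline would tokenize and normalize Arabic text first.
    doc1 = multiword_features(["w1", "w2", "w3", "w4"], window=3)
    doc2 = multiword_features(["w1", "w2", "w5", "w4"], window=3)
    print("dice:", dice_similarity(doc1, doc2))
    print("cosine:", cosine_similarity(doc1, doc2))
```

In a classifier, such scores could be used to assign a document to the class whose feature profile it most resembles; varying `window` corresponds to the "windows of varying sizes" mentioned above.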
doi:10.7763/ijcce.2013.v2.269
fatcat:bmtidmfzerdipeddjlc5ipfznu