Filters








977 Hits in 3.0 sec

Rational Kernels for Arabic Stemming and Text Classification [article]

Attia Nehar and Djelloul Ziadi and Hadda Cherroun
2015 arXiv   pre-print
This document representation allows us to use and explore rational kernels as a framework for Arabic Text Classification.  ...  In this paper, we address the problems of Arabic Text Classification and stemming using Transducers and Rational Kernels.  ...  Rational Kernels for Arabic Text Classification Our ATC system is divided into three stages: 1. preprocessing step. 2. feature extraction: the previous transducer is applied on each word of the document  ... 
arXiv:1502.07504v1 fatcat:2uhwov34zrfwjfhytjykwxbt7y

Rational kernels for Arabic Root Extraction and Text Classification

Attia Nehar, Djelloul Ziadi, Hadda Cherroun
2016 Journal of King Saud University: Computer and Information Sciences  
This document representation allows us to use and explore rational kernels as a framework for Arabic Text Classification.  ...  In this paper, we address the problems of Arabic Text Classification and root extraction using transducers and rational kernels.  ...  Rational kernels for Arabic Text Classification Our ATC system is structured as follows: 1. Preprocessing step. 2.  ... 
doi:10.1016/j.jksuci.2015.11.004 fatcat:y3nddtuu5jc4hoqa2o5vdtvbxa

Investigating the Effect of Different Kernel Functions on the Performance of SVM for Recognizing Arabic Characters

Sayed Fadel, Said Ghoniemy, Mohamed Abdallah, Hussein Abu
2016 International Journal of Advanced Computer Science and Applications  
This paper studies the effect of different kernel functions on the performance of SVMs for recognizing Arabic characters. Eleven different kernel functions are used throughout this study.  ...  The resulting kernel functions can be considered as base for future studies aiming at enhancing their performance.  ...  ((x, y) + c) Many authors tried the investigation of using SVMs and similar tools for recognizing Arabic characters and categorizing Arabic text.  ... 
doi:10.14569/ijacsa.2016.070160 fatcat:nwf24c76afecba4z6hc3nf752y

Text mining: A survey of Arabic root extraction algorithms

Hamza et al., Department of Computer and Self Development, Prince Sattam bin Abdulaziz University, Al-Kharj, Saudi Arabia, Faculty of Computer Science and Information Technology, Omdurman Islamic University, Omdurman, Sudan
2021 International Journal of Advanced and Applied Sciences  
This paper will present a brief background for a number of stemming algorithms on how to extracting the root and stem of the Arabic word, then make a comparison and discussion of a number of selected algorithms  ...  This paper will present a brief background and comprehensive presentation of a number of algorithms that handle the Arabic text to extract the word root in its light, heavy, hybrid, leading, and Markovian  ...  Root extraction using Rational Kernels and text classification.  ... 
doi:10.21833/ijaas.2021.01.002 fatcat:4girld4edfc2zc2bceukd5xpp4

End-Shape Recognition for Arabic Handwritten Text Segmentation [chapter]

Amani T. Jamal, Nicola Nobile, Ching Y. Suen
2014 Lecture Notes in Computer Science  
Text segmentation is an essential pre-processing stage for many systems such as text recognition and word spotting. However, few methods have been published for Arabic text segmentation.  ...  In Arabic handwritten documents, separating text into words is challenging due to the enormous different Arabic handwriting styles.  ...  LibSVM uses a Radial Basis Function (RBF) kernel for mapping a nonlinear sample into a higher sample space.  ... 
doi:10.1007/978-3-319-11656-3_21 fatcat:r74agajthrbkbfw3rglukfeowm

ARAACOM: ARAbic Algerian Corpus for Opinion Mining [article]

Zitouni Abdelhafid, Abdelhafid Zitouni
2020 arXiv   pre-print
In this paper, we propose our approach, for opinion mining in Arabic Algerian news paper. CCS CONCEPTS ∙Information systems Sentiment analysis ∙ Computing methodologies Natural language processing  ...  So, this makes for us a lot of data which need powerful mean to exploit.  ...  Naïve Bayes The naive bayes classifier is a well known algorithm used in text classification.  ... 
arXiv:2001.08010v1 fatcat:wgiendgbjzgpbdc73tqhbsh7ea

A Study of Sindhi Related and Arabic Script Adapted languages Recognition [article]

Dil Nawaz Hakro, A. Z. Talib, Zeeshan Bhatti, G. N. Moja
2014 arXiv   pre-print
Arabic script is also one of mature script from OCR perspective. The adaptive languages which share Arabic script or its extended characters; still lacking the OCRs for their language.  ...  A large number of publications are available for the Optical Character Recognition (OCR). Significant researches, as well as articles are present for the Latin, Chinese and Japanese scripts.  ...  The techniques include text line segmentation, word and character segmentation and classification.  ... 
arXiv:1412.4217v1 fatcat:a5k6co7ilrd3njsc2dnlfh3yx4

Estimating Intelligence Quotient Using Stylometry and Machine Learning Techniques: A Review

Glory O. Adebayo, Roman V. Yampolskiy
2022 Big Data Mining and Analytics  
The task of trying to quantify a person's intelligence has been a goal of psychologists for over a century.  ...  The unavailability of large datasets in this area of research has led to very few publications in IQ estimation from written text.  ...  Using Farasa (a fast and accurate text processing toolkit for Arabic) they tokenized the long text into tokens. Table 9 Gender classification results for loose classification [48] .  ... 
doi:10.26599/bdma.2022.9020002 fatcat:426awpckabcvto3svay7cab6wu

Comparison of Feature Extraction Techniques for Pattern Classification

Binu P Chacko
2021 International Journal for Research in Applied Science and Engineering Technology  
In this article, a pattern recognition problem for handwritten Malayalam character is presented. This system goes through two different stages of HCR namely, feature extraction and classification.  ...  Pattern recognition is a challenging task in research field for the last few decades.  ...  It has also been proved to be very successful in many other applications such as handwritten character recognition, image classification, face detection, object detection, and text classification.  ... 
doi:10.22214/ijraset.2021.36214 fatcat:tii3ayi5wrcarbrhd5u76drjna

Creation of Arabic Ontology for Hadith Science

Abdelkarim Abdelkader, Umm Al-Qura University, KSA
2019 International Journal of Advanced Trends in Computer Science and Engineering  
The kernel of the ontology is created with Protégé. The result is an OWL/XML document.  ...  The main objective of this paper is to build and implement an ontology for all concepts and main knowledge of Hadith Science.  ...  them for hadith semantic annotation, information retrieval and classification.  ... 
doi:10.30534/ijatcse/2019/96862019 fatcat:cn4p3dyma5h7ziprp5czwptohe

Arabic Fake News Detection: Comparative Study of Neural Networks and Transformer-Based Approaches

Maha Al-Yahya, Hend Al-Khalifa, Heyam Al-Baity, Duaa AlSaeed, Amr Essam, M. Irfan Uddin
2021 Complexity  
This paper presents a comprehensive comparative study of neural network and transformer-based language models used for Arabic FND.  ...  We examine the use of neural networks and transformer-based language models for Arabic FND and show their performance compared to each other.  ...  all models for classification.  ... 
doi:10.1155/2021/5516945 fatcat:4of6srfkkbfsxazmpv5pojmnw4

Arabic Opinion Mining Using a Hybrid Recommender System Approach [article]

Fouzi Harrag, Abdulmalik Salman Al-Salman, Alaa Alquahtani
2020 arXiv   pre-print
This research focuses especially on Arabic reviews, where the model is evaluated using Opinion Corpus for Arabic (OCA) dataset.  ...  research proposed a hybrid approach combining sentiment analysis and recommender systems to tackle the problem of data sparsity problems by predicting the rating of products from users reviews using text  ...  Train ration Our Model Baseline.  ... 
arXiv:2009.07397v1 fatcat:3xutk3oys5bb3fvsjwrvxtrb7i

Automatic Language Identification in Texts: A Survey [article]

Tommi Jauhiainen, Marco Lui, Marcos Zampieri, Timothy Baldwin, Krister Lindén
2018 arXiv   pre-print
Today, LI is a key part of many text processing pipelines, as text processing techniques generally assume that the language of the input text is known.  ...  Automatic LI has been extensively researched for over fifty years.  ...  We would like to thank Kimmo Koskenniemi for many valuable discussions and comments concerning the early phases of the features and the methods sections.  ... 
arXiv:1804.08186v2 fatcat:4rmixp4i5fb55itb7ze5avkgqy

Automatic Language Identification in Texts: A Survey

Tommi Jauhiainen, Marco Lui, Marcos Zampieri, Timothy Baldwin, Krister Lindén
2019 The Journal of Artificial Intelligence Research  
Today, LI is a key part of many text processing pipelines, as text processing techniques generally assume that the language of the input text is known.  ...  Automatic LI has been extensively researched for over fifty years.  ...  We would like to thank Kimmo Koskenniemi for many valuable discussions and comments concerning the early phases of the features and the methods sections.  ... 
doi:10.1613/jair.1.11675 fatcat:axugpuogyne3nptvamgd3zwgty

Fast transpose methods for kernel learning on sparse data

Patrick Haffner
2006 Proceedings of the 23rd international conference on Machine learning - ICML '06  
On very large natural language tasks (tagging, translation, text classification) with sparse feature representations, a 20 to 80-fold speedup over LIBSVM is observed using the same SMO algorithm.  ...  Caching and shrinking are also optimized for sparsity.  ...  This type of input can be used for text classification (Joachims, 1998b) , Machine Translation, text annotation and tagging (Bangalore & Joshi, 1999) .  ... 
doi:10.1145/1143844.1143893 dblp:conf/icml/Haffner06 fatcat:4d2xrk5hyvbdxlaww53k6xcara
« Previous Showing results 1 — 15 out of 977 results