Filters








14,389 Hits in 7.6 sec

Feature Reduction Based on Genetic Algorithm and Hybrid Model for Opinion Mining

P. Kalaivani, K. L. Shunmuganathan
2015 Scientific Programming  
In this paper, we proposed an optimized feature reduction that incorporates an ensemble method of machine learning approaches that uses information gain and genetic algorithm as feature reduction techniques  ...  The effectiveness of single classifiers Naïve Bayes, logistic regression, support vector machine, and ensemble technique for opinion mining are compared on five datasets.  ...  Conflict of Interests The authors declare that there is no conflict of interests regarding the publication of this paper.  ... 
doi:10.1155/2015/961454 fatcat:frcm7uipabhptd2t4spvsys4gm

Mining Newsgroups Using Ensemble Classifiers in Social Network Analysis

M. Govindarajan
2017 International Journal of Engineering Science Advanced Computing and Bio-Technology  
A Classifier ensemble is designed using Naive Bayes (NB), Support Vector Machine (SVM) and Genetic Algorithm (GA) as base classifiers.  ...  The ability to accurately perform a classification task depends on the representations of documents to be classified.  ...  The authors in (N. Priyadharshini et al., 2013) used an approach used to segment image document and classify the document regions as text, image, drawings and table.  ... 
doi:10.26674/ijesacbt/2017/49177 fatcat:h6b7nxrorva4ri6ooro6k343qu

Effectiveness of web search results for genre and sentiment classification

Jin-Cheon Na, Tun Thura Thet
2009 Journal of information science  
In this study we used only the snippets rather than their original full-text documents, and applied a common machine learning technique, SVM (Support Vector Machine), and heuristic approaches to investigate  ...  For genre classification, the hybrid approach which made use of both the machine learning approach using n-gram terms and a heuristic approach using the Title, Summary Text and the URL performed slightly  ...  Thus we apply an effective machine learning algorithm, SVM (Support Vector Machine) [3] , and heuristic and hybrid approaches to investigate how effectively the snippets can be used for genre and sentiment  ... 
doi:10.1177/0165551509104233 fatcat:i6rrqbvwvfgrhmpqxhbocjfkim

Text Document Pre-Processing Using the Bayes Formula for Classification Based on the Vector Space Model

Dino Isa, Lee Lam Hong, V. P. Kallimani, R. Rajkumar
2008 Computer and Information Science  
Using this probability distribution as the vectors to represent the document, the text classification algorithms based on the vector space model, such as the Support Vector Machine (SVM) and Self-Organizing  ...  The Bayes formula gives a range of probabilities to which the document can be assigned according to a pre determined set of topics (categories).  ...  The main problem associated with using the support vector machine for document classification is the effort needed to transform text data to numerical data.  ... 
doi:10.5539/cis.v1n4p79 fatcat:s7m2ra53jnhm5btnua3q4ys3vq

A hybrid BSO-Chi2-SVM approach to Arabic text categorization

Riadh Belkebir, Ahmed Guessoum
2013 2013 ACS International Conference on Computer Systems and Applications (AICCSA)  
In this paper, we present the results of Arabic text categorization based on three different approaches: artificial neural networks, support vector machines (SVMs) and a hybrid approach BSO-CHI-SVM.  ...  Automatic categorization of documents consists in assigning a category to a text based on the information it contains. It aims to automate the association of a document with a category.  ...  These modes are the same for all approaches presented in this paper (neural network, support vector machine and our hybrid approach BSO-CHI-SVM). 1) Stemming: For the representation of the documents  ... 
doi:10.1109/aiccsa.2013.6616437 dblp:conf/aiccsa/BelkebirG13 fatcat:eja6fh6pwzhfbiawt4a2m5qnge

A Review of Artificial Intelligence Algorithms in Document Classification

Adrian Bilski
2011 International Journal of Electronics and Telecommunications  
A proper classification of e-documents, various Internet information, blogs, emails and digital libraries requires application of data mining and machine learning algorithms to retrieve the desired data  ...  With the evolution of Internet, the meaning and accessibility of text documents and electronic information has increased.  ...  In [42] authors introduced a new hybrid approach to classify web documents, built on graph and vector representations. The k-NN algorithm shows that this approach properly classifies documents.  ... 
doi:10.2478/v10177-011-0035-6 fatcat:wlxgznossrccrlaxlebhvhl4py

Plagiarism Detection through Internet using Hybrid Artificial Neural Network and Support Vectors Machine

Imam Much Ibnu Subroto, Ali Selamat
2014 TELKOMNIKA (Telecommunication Computing Electronics and Control)  
Machine learning methods like knearest neighbors (KNN), support vector machine (SVM), artificial neural networks (ANN) is a technique that is commonly used in solving the problem based on statistical data  ...  The data collection method in this work using an Internet search to ensure that a document is in the detection is up-to-date.  ...  Two features are enough to detect the presence of elements in a document plagiarism. , ∑ (2) Hybrid Machine Learning Machine learning KNN, SVM, ANN has two inputs in the form of training dataset and  ... 
doi:10.12928/telkomnika.v12i1.4 fatcat:vqu56jf26ngrrct7z3p6466dvq

Plagiarism Detection through Internet using Hybrid Artificial Neural Network and Support Vectors Machine

Imam Much Ibnu Subroto, Ali Selamat
2014 TELKOMNIKA (Telecommunication Computing Electronics and Control)  
Machine learning methods like knearest neighbors (KNN), support vector machine (SVM), artificial neural networks (ANN) is a technique that is commonly used in solving the problem based on statistical data  ...  The data collection method in this work using an Internet search to ensure that a document is in the detection is up-to-date.  ...  Two features are enough to detect the presence of elements in a document plagiarism. , ∑ (2) Hybrid Machine Learning Machine learning KNN, SVM, ANN has two inputs in the form of training dataset and  ... 
doi:10.12928/telkomnika.v12i1.648 fatcat:5aftpgmtdzckjpbw7wzt5vxrg4

Security-level classification for confidential documents by using adaptive neuro-fuzzy inference systems

Erdem Alparslan, Adem Karahoca, Hayretdin Bahşi
2012 Expert systems  
In the third approach we have developed a hybrid solution consists of support vector machines and adaptive neuro-fuzzy inference systems.  ...  For each security problem, according to the nature of the business processes of documents for each organization, support vector phase of our hybrid approach may be reorganized.  ...  Sub-classification areas like document type, area or format are other distinctive properties of documents.  ... 
doi:10.1111/j.1468-0394.2012.00634.x fatcat:rxma2t67h5dgbiqycs674gwjbu

Machine Learning for Web Page Classification: A Survey

safae lassri, EL HABIB BENLAHMAR, Abderrahim TRAGHA
2019 International Journal of Information Science and Technology  
In this paper, we present the characteristics of web page classification, we produce a literature review by summarizing and evaluating all sources related to web page classification crawled automatically  ...  To exploit this data, a Web information retrieval system and a categorization of internet content based on the classification of web pages are essential.  ...  Links Based Kernel to Enrich Support Vector Machine for Web Page Classification Hybrid Dimensionality Reduction Approach for Web Page Classification Classification System Based on Anchor Graph Hashing  ... 
doaj:483a4b9f259046a29c57adc3021a50d0 fatcat:hdznsdeotnhwpgpuigi7iovhja

A Regularized Linear Classifier for Effective Text Classification [chapter]

Sharad Nandanwar, M. Narasimha Murty
2012 Lecture Notes in Computer Science  
In document community support vector machines and naïve bayes classifier are known for their simplistic yet excellent performance.  ...  The essence of this paper is a linear classifier, very similar to these two. We propose a novel way of combining these two approaches, which synthesizes best of them into a hybrid model.  ...  Background Theory Support Vector Machine Support Vector Machine (SVM) [1] is a supervised learning algorithm which is based on the principle of structural risk minimization [2] .  ... 
doi:10.1007/978-3-642-34481-7_27 fatcat:76fpcw7f6ze3piq6cphjrz4tx4

A Review of Machine Learning Algorithms for Text-Documents Classification

Baharum Baharudin, Lam Hong Lee, Khairullah Khan
2010 Journal of Advances in Information Technology  
The aim of this paper is to highlight the important techniques and methodologies that are employed in text documents classification, while at the same time making awareness of some of the interesting challenges  ...  This paper provides a review of the theory and methods of document classification and text mining, focusing on the existing literature.  ...  The authors in [95] suggest a new hybrid approach to web document classification built upon both, graph and vector representations.  ... 
doi:10.4304/jait.1.1.4-20 fatcat:nx23oqf3gbgiha45s2enn2hqqq

Arabic Text Classification Based on Features Reduction Using Artificial Neural Networks

F. A. Zaghoul, S. Al-Dhaheri
2013 2013 UKSim 15th International Conference on Computer Modelling and Simulation  
Since the number of unique words in the collection set is big, features reduction methods have been used to select the most relevant features for the classification.  ...  In this paper, we present and analyze the results of the application of Artificial Neural Network (ANN) for the classification of Arabic language documents.  ...  Sebastiani (2002) , conducted a survey that explains the main machine learning approaches to TC, and stated that the machine learning is the dominant approach to TC in the research community [18] .  ... 
doi:10.1109/uksim.2013.135 dblp:conf/uksim/ZaghoulA13 fatcat:3h67ywpzhbdtrgey3stbuywhuu

A Content Vector Model For Text Classification

Eric Jiang
2008 Zenodo  
As a popular rank-reduced vector space approach, Latent Semantic Indexing (LSI) has been used in information retrieval and other applications.  ...  The proposed classifier has been applied to email classification and its experiments on a benchmark spam testing corpus (PU1) have shown that the approach represents a competitive alternative to other  ...  As a comparison to some other popular classifiers, our content vector model is evaluated against the Support Vector Machines (SVM) and naïve Bayes (NB) approaches.  ... 
doi:10.5281/zenodo.1078288 fatcat:ptd4hleikrgrnn77chuq6qwp5a

Text Mining: Classification of Text Documents using Granular Hybrid Classification Technique

Shiva Prasad KM, Dr.T Hanumantha Reddy
2019 International Journal of Research in Advent Technology  
There are also cases in classification where instead of classifying a category in the target function, we classify a code.  ...  In our paper, we study the repercussions of a corpus which outgrows memory after vectorizing and perform a comparative analysis of various algorithms used during the process with our algorithm.  ...  Support Vector Machine Classifier "Support Vector Machine" (SVM) is a supervised machine learning algorithm which can be used for both classification or regression challenges.SVM (Support Vector Machine  ... 
doi:10.32622/ijrat.76201910 fatcat:sjpaeob3bzf3xmq4fi27sfp4n4
« Previous Showing results 1 — 15 out of 14,389 results