Classifying clinically actionable genetic mutations using KNN and SVM

Rohit Chivukula, T. Jaya Lakshmi, Sanku Satya Uday, Satti Thanuja Pavani
2021 Indonesian Journal of Electrical Engineering and Computer Science  
Cancer is one of the major causes of death in humans. Early diagnosis of the genetic mutations that drive tumor growth enables personalized treatment of the disease and can save the lives of the majority of patients. With this aim, Kaggle conducted a competition to classify clinically actionable gene mutations based on clinical evidence and other features related to the mutations. The dataset contains 3321 training data points, each belonging to one of 9 classes. In this work, an attempt is made to classify these data points using K-nearest neighbors (KNN) and linear support vector machines (SVM) in a multi-class setting. As the features are categorical, one-hot encoding as well as response coding are applied to make them suitable for the classifiers. The prediction performance is evaluated using log loss, and KNN performed better, with a log loss of 1.10 compared with 1.24 for SVM.
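The abstract names two encodings for categorical features and one evaluation metric without showing them. The following is a minimal pure-Python sketch of each, not the authors' implementation. The function names, the toy values (`geneA`, `geneB`, `geneC`), the three-class toy labels, and the Laplace smoothing parameter `alpha` are all assumptions made for illustration; the abstract does not say how response coding was smoothed, or whether it was smoothed at all.

```python
import math
from collections import Counter, defaultdict


def one_hot(values):
    """Encode each categorical value as a 0/1 indicator vector.

    Returns the encoded rows and the sorted vocabulary that defines
    the column order.
    """
    vocab = sorted(set(values))
    index = {v: i for i, v in enumerate(vocab)}
    rows = [[1 if index[v] == j else 0 for j in range(len(vocab))]
            for v in values]
    return rows, vocab


def response_code(values, labels, n_classes, alpha=1.0):
    """Map each category to smoothed class probabilities P(class | value).

    alpha is an assumed Laplace smoothing constant; it keeps a class
    that never co-occurs with a value from getting probability zero.
    """
    counts = defaultdict(Counter)
    for v, y in zip(values, labels):
        counts[v][y] += 1
    table = {}
    for v, c in counts.items():
        total = sum(c.values()) + alpha * n_classes
        table[v] = [(c[k] + alpha) / total for k in range(n_classes)]
    return table


def log_loss(y_true, probs, eps=1e-15):
    """Multi-class log loss: mean negative log probability of the true class.

    Probabilities are clipped to [eps, 1 - eps] to avoid log(0).
    """
    total = 0.0
    for y, p in zip(y_true, probs):
        clipped = min(max(p[y], eps), 1 - eps)
        total -= math.log(clipped)
    return total / len(y_true)


# Toy data (hypothetical): four points, three classes labeled 0..2.
genes = ["geneA", "geneB", "geneA", "geneC"]
labels = [0, 1, 0, 2]

encoded, vocab = one_hot(genes)
table = response_code(genes, labels, n_classes=3)

# Baseline: a classifier that always predicts a uniform distribution
# over 9 classes scores ln(9) regardless of the true labels.
uniform_9 = [[1.0 / 9] * 9 for _ in range(4)]
baseline = log_loss([0, 1, 2, 3], uniform_9)
```

One-hot encoding produces one column per distinct value, while response coding always produces one column per class (9 for this dataset), which is why it is often preferred when a feature has many distinct values. The uniform baseline of ln(9), roughly 2.197, gives a reference point for the reported log loss values of 1.10 and 1.24.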
doi:10.11591/ijeecs.v24.i3.pp1672-1679 fatcat:uap5wbi3ybdujkbtwjjrdmoqti