Intelligent mining of large-scale bio-data: Bioinformatics applications

Farahnaz Sadat Golestan Hashemi, Mohd Razi Ismail, Mohd Rafii Yusop, Mahboobe Sadat Golestan Hashemi, Mohammad Hossein Nadimi Shahraki, Hamid Rastegari, Gous Miah, Farzad Aslani
2017 Biotechnology & Biotechnological Equipment  
Today, there is a collection of a tremendous amount of bio-data because of the computerized applications worldwide. Therefore, scholars have been encouraged to develop effective methods to extract the hidden knowledge in these data. Consequently, a challenging and valuable area for research in artificial intelligence has been created. Bioinformatics creates heuristic approaches and complex algorithms using artificial intelligence and information technology in order to solve biological problems.
more » ... Intelligent implication of the data can accelerate biological knowledge discovery. Data mining, as biology intelligence, attempts to find reliable, new, useful and meaningful patterns in huge amounts of data. Hence, there is a high potential to raise the interaction between artificial intelligence and bio-data mining. The present paper argues how artificial intelligence can assist bio-data analysis and gives an up-to-date review of different applications of bio-data mining. It also highlights some future perspectives of data mining in bioinformatics that can inspire further developments of data mining instruments. Important and new techniques are critically discussed for intelligent knowledge discovery of different types of row datasets with applicable examples in human, plant and animal sciences. Finally, a broad perception of this hot topic in data science is given. KEYWORDS Bioinformatics; data mining; artificial intelligence; intelligent knowledge discovery; bio-data analysis; heuristic algorithms Abbreviations AUC area under the curve BADH betaine aldehyde dehydrogenase CRM customer relationship management GABA 4-Aminobutyric acid GABald g-aminobutyraldehyde GB glycine betaine HBH-AMADHs high BADH homology aminoaldehyde dehydrogenases MAS marker assisted selection MAPK mitogen-activated protein kinase NAD nicotinamide adenine dinucleotide NB naive Bayes OLAP on-line analytic processing Put putrescine QTL quantitative trait loci ROC receiver operating characteristic ROS reactive oxygen species SMG selection marker gene Spd spermidine Spm spermine 2AP 2-acetyl-1-pyrroline
doi:10.1080/13102818.2017.1364977 fatcat:qmbiss53wfggtc7ayj2ysgt5rq