Hierarchical Clustering on RNA Dependent RNA Polymerase using Machine Learning
[article]
2021
bioRxiv
pre-print
RNA-dependent RNA polymerase (RdRP) catalyzes the replication of RNA from an RNA template and is found mostly in viruses. We collected over 161 viral RdRP FASTA sequences from the NCBI protein database using a Python script. Each sequence was transformed with TfidfVectorizer from the sklearn module, using one-letter words, because each letter corresponds to one amino acid. The transformed data were passed to hierarchical clustering using the scipy library and visualized data using
doi:10.1101/2021.08.23.457366
fatcat:pg6t3yivzrc6bmwzkonnyfuw4m
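
The pipeline outlined in the abstract (one-letter TF-IDF features followed by hierarchical clustering) might look roughly like the sketch below. It is an illustration under stated assumptions, not the authors' script: the four sequences are made-up toy strings rather than NCBI records, and the Ward linkage, the two-cluster cut, and the variable names are choices made here because the abstract does not specify them.

```python
from scipy.cluster.hierarchy import dendrogram, fcluster, linkage
from sklearn.feature_extraction.text import TfidfVectorizer

# Toy, made-up peptide strings standing in for RdRP sequences (not real data).
sequences = {
    "seq_a": "MKTAYIAKQR",
    "seq_b": "MKTAYIAKQK",
    "seq_c": "GGSGGSGGSG",
    "seq_d": "GGSGGSGGSP",
}
names = list(sequences.keys())

# analyzer="char" with the default ngram_range=(1, 1) treats every single
# letter (one amino acid) as a token; lowercase=False keeps the letters as-is.
vectorizer = TfidfVectorizer(analyzer="char", lowercase=False)
X = vectorizer.fit_transform(list(sequences.values()))

# Agglomerative clustering on the dense TF-IDF matrix.
# Assumption: Ward linkage on Euclidean distance; the abstract names no method.
Z = linkage(X.toarray(), method="ward")

# Cut the tree into at most two flat clusters (an arbitrary choice for the toy data).
labels = fcluster(Z, t=2, criterion="maxclust")

# Compute the dendrogram layout without drawing it, so no plotting library is needed.
tree = dendrogram(Z, labels=names, no_plot=True)

for name, label in zip(names, labels):
    print(name, label)
print("leaf order:", tree["ivl"])
```

With real data, the toy dictionary would be replaced by sequences parsed from FASTA files, and the cut level would be chosen by inspecting the dendrogram rather than fixed in advance.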