IDENTIFYING NEUROTOXINS IN SNAKE VENOMS THROUGH AUTOMATED GENERATION OF PROBABILISTIC CONSENSUS MOTIFS

Akash Nag, Sunil Karforma
unpublished
In this paper, we propose an algorithm that automatically generates a consensus motif identifying a protein family given an alignment of a set of protein sequences. The algorithm also assigns probabilistic scores to each amino-acid choice at each position in the sequence that will better aid in matching the motif against a protein database. Using aligned neurotoxin sequences from 91 species of Elapid snakes, the neurotoxin-motif generated by our algorithm successfully matched 175 proteins,
more » ... ding the 91 input proteins, in the UniProt database, most of which were also neurotoxins from other snake venoms.
fatcat:lteyzi3xxbfi3ki7vyw3fb2fcu