Protein Multiple Alignment Incorporating Primary and Secondary Structure Information

Nak-Kyeong Kim, Jun Xie
2006 Journal of Computational Biology  
Identifying common local segments, also called motifs, in multiple protein sequences plays an important role for establishing homology between proteins. Homology is easy to establish when sequences are similar (sharing an identity > 25%). However, for distant proteins, it is much more difficult to align motifs that are not similar in sequences but still share common structures or functions. This paper is a first attempt to align multiple protein sequences using both primary and secondary
more » ... nd secondary structure information. A new sequence model is proposed so that the model assigns high probabilities not only to motifs that contain conserved amino acids but also to motifs that present common secondary structures. The proposed method is tested in a structural alignment database BAliBASE. We show that information brought by the predicted secondary structures greatly improves motif identification. A website of this program is available at http://www.stat.purdue.edu/∼junxie/2ndmodel/sov.html.
doi:10.1089/cmb.2006.13.1735 pmid:17238842 fatcat:ncxydwkdhjcmtlksw6dpiyn5fm