Triage protein fold prediction

Hongxian He, Gregory McAllister, Temple F. Smith
2002 Proteins: Structure, Function, and Bioinformatics  
We have constructed, in a completely automated fashion, a new structure template library for threading that represents 358 distinct SCOP folds, where each model is mathematically represented as a hidden Markov model (HMM). Because the large number of models in the library can potentially dilute the prediction measure, a new triage method for fold prediction is employed. In the first step of the triage method, the most probable structural class is predicted using a set of manually constructed, high-level, generalized structural HMMs that represent seven general protein structural classes: all-α, all-β, α/β, α+β, irregular small metal-binding, transmembrane β-barrel, and transmembrane α-helical. In the second step, only those fold models belonging to the determined structural class are selected for the final fold prediction. This triage method gave more predictions as well as more correct predictions compared with a simple prediction method that lacks the initial classification step. Two different schemes of assigning Bayesian model priors are presented and discussed. Proteins 2002;48:654-663.
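The two-step triage described above can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function names, the log-likelihood values, the fold-to-class mapping, the uniform priors, and the 0.5 acceptance threshold are all assumptions made for illustration, and the two prior schemes discussed in the paper are not reproduced here. The sketch only shows how restricting the candidate folds to the predicted structural class can keep the posterior from being diluted across a large library.

```python
import math

def posteriors(log_likelihoods, log_priors=None):
    """Normalize log-likelihoods (plus optional log-priors) into posteriors.

    Uniform priors are assumed when none are given (an illustrative choice).
    """
    if log_priors is None:
        log_priors = {name: 0.0 for name in log_likelihoods}
    scores = {n: ll + log_priors[n] for n, ll in log_likelihoods.items()}
    top = max(scores.values())  # subtract the maximum for numerical stability
    total = sum(math.exp(s - top) for s in scores.values())
    return {n: math.exp(s - top) / total for n, s in scores.items()}

def triage_predict(class_loglik, fold_loglik, fold_to_class, threshold=0.5):
    """Step 1: pick the most probable structural class.
    Step 2: score only the fold models that belong to that class.

    Returns (class, fold or None, posterior of the best fold).
    The threshold is a hypothetical acceptance cutoff.
    """
    class_post = posteriors(class_loglik)
    best_class = max(class_post, key=class_post.get)
    candidates = {f: ll for f, ll in fold_loglik.items()
                  if fold_to_class[f] == best_class}
    if not candidates:
        return best_class, None, 0.0
    fold_post = posteriors(candidates)
    best_fold = max(fold_post, key=fold_post.get)
    confidence = fold_post[best_fold]
    return best_class, (best_fold if confidence >= threshold else None), confidence

# Made-up scores for one query sequence (hypothetical values).
class_loglik = {"all-alpha": -120.0, "all-beta": -135.0, "alpha/beta": -128.0}
fold_loglik = {
    "globin-like": -100.0,
    "four-helical bundle": -103.0,
    "immunoglobulin-like": -101.0,
    "TIM barrel": -110.0,
}
fold_to_class = {
    "globin-like": "all-alpha",
    "four-helical bundle": "all-alpha",
    "immunoglobulin-like": "all-beta",
    "TIM barrel": "alpha/beta",
}

triaged = triage_predict(class_loglik, fold_loglik, fold_to_class)
flat = posteriors(fold_loglik)  # simple method: no initial classification step
```

In this toy example the posterior of the best fold is higher under triage than under the flat comparison, because competing models from other structural classes are excluded before normalization. Whether that holds for real data depends on how reliable the class prediction in step 1 is.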
doi:10.1002/prot.10194 pmid:12211033 fatcat:ubkspdktzneqvawwv55kyqjaje