Clustering Proteins and Reconstructing Evolutionary Events [chapter]

Boris Mirkin
2010 Studies in Classification, Data Analysis, and Knowledge Organization  
The issue of clustering proteins into homologous families has attracted considerable attention from researchers. On the one hand, many databases of protein families have been developed using relatively simple clustering methods and extensive manual curation. On the other hand, more elaborate clustering approaches have been used, yet with very limited success. This paper advocates an approach to clustering protein families that uses knowledge of protein functions to adjust the
... ter of similarity scale shift. We proceed to reconstruct HPF evolutionary histories both to further narrow the choice of cluster solution and to interpret the clusters.
doi:10.1007/978-3-642-10745-0_4 fatcat:m6tpbbidmnghrf53hobg4ruziu