Large scale hierarchical clustering of protein sequences
2005
BMC Bioinformatics
Searching a biological sequence database with a query sequence to find homologues has become a routine operation in computational biology. In spite of the high degree of sophistication of currently available search routines, it is still virtually impossible to quickly and clearly identify the group of sequences to which a given query sequence belongs. We report on our developments in grouping all known protein sequences hierarchically into superfamily and family clusters. Our graph-based
doi:10.1186/1471-2105-6-15
pmid:15663796
pmcid:PMC547898
fatcat:az3blh77ifa6doinrtch4cap4i