S4: structure-based sequence alignments of SCOP superfamilies

J. Casbon
2004 Nucleic Acids Research  
S4 is an automatically generated database of multiple structure-based sequence alignments of protein superfamilies in the SCOP database. All structural domains that do not share more than 40% sequence identity as defined by the ASTRAL compendium of protein structures are included. The alignments are constructed using pairwise structural alignments to generate residue equivalences that are then integrated into multiple alignments using sequence alignment tools. We describe the database and give
more » ... xamples showing how the automatically generated S4 alignments compare favourably to hand-crafted alignments. Available at:
doi:10.1093/nar/gki043 pmid:15608181 pmcid:PMC539997 fatcat:wluivkaxlnha7gif4nn3hiwawi