The accuracy of several multiple sequence alignment programs for proteins

Paulo A S Nuin, Zhouzhi Wang, Elisabeth R M Tillier
<span title="2006-10-24">2006</span> <i title="Springer (Biomed Central Ltd.)"> <a target="_blank" rel="noopener" href="https://fatcat.wiki/container/n5zrklrhlzhtdorf4rk4rmeo3i" style="color: black;">BMC Bioinformatics</a> </i> &nbsp;
There have been many algorithms and software programs implemented for the inference of multiple sequence alignments of protein and DNA sequences. The "true" alignment is usually unknown due to the incomplete knowledge of the evolutionary history of the sequences, making it difficult to gauge the relative accuracy of the programs. We tested nine of the most often used protein alignment programs and compared their results using sequences generated with the simulation software Simprot which
more &raquo; ... known alignments under realistic and controlled evolutionary scenarios. We have simulated more than 30,000 alignment sets using various evolutionary histories in order to define strengths and weaknesses of each program tested. We found that alignment accuracy is extremely dependent on the number of insertions and deletions in the sequences, and that indel size has a weaker effect. We also considered benchmark alignments from the latest version of BAliBASE and the results relative to BAliBASE- and Simprot-generated data sets were consistent in most cases. Our results indicate that employing Simprot's simulated sequences allows the creation of a more flexible and broader range of alignment classes than the usual methods for alignment accuracy assessment. Simprot also allows for a quick and efficient analysis of a wider range of possible evolutionary histories that might not be present in currently available alignment sets. Among the nine programs tested, the iterative approach available in Mafft (L-INS-i) and ProbCons were consistently the most accurate, with Mafft being the faster of the two.
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1186/1471-2105-7-471">doi:10.1186/1471-2105-7-471</a> <a target="_blank" rel="external noopener" href="https://www.ncbi.nlm.nih.gov/pubmed/17062146">pmid:17062146</a> <a target="_blank" rel="external noopener" href="https://pubmed.ncbi.nlm.nih.gov/PMC1633746/">pmcid:PMC1633746</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/2slzmchfdvccpko6jkxfkelvha">fatcat:2slzmchfdvccpko6jkxfkelvha</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20190427182052/https://bmcbioinformatics.biomedcentral.com/track/pdf/10.1186/1471-2105-7-471" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext"> <button class="ui simple right pointing dropdown compact black labeled icon button serp-button"> <i class="icon ia-icon"></i> Web Archive [PDF] <div class="menu fulltext-thumbnail"> <img src="https://blobs.fatcat.wiki/thumbnail/pdf/7d/3d/7d3df7bca6dc758307267f2ad7d2eda05375e006.180px.jpg" alt="fulltext thumbnail" loading="lazy"> </div> </button> </a> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1186/1471-2105-7-471"> <button class="ui left aligned compact blue labeled icon button serp-button"> <i class="unlock alternate icon" style="background-color: #fb971f;"></i> springer.com </button> </a> <a target="_blank" rel="external noopener" href="https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1633746" title="pubmed link"> <button class="ui compact blue labeled icon button serp-button"> <i class="file alternate outline icon"></i> pubmed.gov </button> </a>