A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2022; you can also visit the original URL.
The file type is application/pdf
.
fastMSA: Accelerating Multiple Sequence Alignment with Dense Retrieval on Protein Language
[article]
2021
bioRxiv
pre-print
Evolutionarily related sequences provide information for the protein structure and function. Multiple sequence alignment, which includes homolog searching from large databases and sequence alignment, is efficient to dig out the information and assist protein structure and function prediction, whose efficiency has been proved by AlphaFold. Despite the existing tools for multiple sequence alignment, searching homologs from the entire UniProt is still time-consuming. Considering the success of
doi:10.1101/2021.12.20.473431
fatcat:jw5fk4pjljezpg4ncvrtfoxojq