A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2020; you can also visit the original URL.
The file type is application/pdf
.
End-to-end multitask learning, from protein language to protein features without alignments
[article]
2019
bioRxiv
pre-print
AbstractCorrectly predicting features of protein structure and function from amino acid sequence alone remains a supreme challenge for computational biology. For almost three decades, state-of-the-art approaches combined machine learning and evolutionary information from multiple sequence alignments. Exponentially growing sequence databases make it infeasible to gather evolutionary information for entire microbiomes or metaproteomics. On top, for many important proteins (e.g. dark proteome and
doi:10.1101/864405
fatcat:6i5sr524zrd6nk27xf7tbas3ye