PaperBLAST: Text Mining Papers for Information about Homologs
2017
mSystems
Large-scale genome sequencing has identified millions of protein-coding genes whose function is unknown. Many of these proteins are similar to characterized proteins from other organisms, but much of this information is missing from annotation databases and is hidden in the scientific literature. To make this information accessible, PaperBLAST uses EuropePMC to search the full text of scientific articles for references to genes. PaperBLAST also takes advantage of curated resources (Swiss-Prot,
doi:10.1128/msystems.00039-17
pmid:28845458
pmcid:PMC5557654
fatcat:xpcwjz2qsnfarg5emlgnnwoyte