A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2022; you can also visit the original URL.
The file type is application/pdf
.
Infrastructure for Semantic Annotation in the Genomics Domain
2020
International Conference on Language Resources and Evaluation
We describe a novel super-infrastructure for biomedical text mining which incorporates an end-to-end pipeline for the collection, annotation, storage, retrieval and analysis of biomedical and life sciences literature, combining NLP and corpus linguistics methods. The infrastructure permits extreme-scale research on the open access PubMed Central archive. It combines an updatable Gene Ontology Semantic Tagger (GOST) for entity identification and semantic markup in the literature, with a NLP
dblp:conf/lrec/El-HajRCEPIKPMR20
fatcat:cpon3hcvjrffrfon2wnmu5kzne