A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2017; you can also visit the original URL.
The file type is
Proceedings of the HLT-NAACL BioNLP Workshop on Linking Natural Language and Biology - LNLBioNLP '06
The identification of genes in biomedical text typically consists of two stages: identifying gene mentions and normalization of gene names. We have created an automated process that takes the output of named entity recognition (NER) systems designed to identify genes and normalizes them to standard referents. The system identifies human gene synonyms from online databases to generate an extensive synonym lexicon. The lexicon is then compared to a list of candidate gene mentions using variousdoi:10.3115/1654415.1654423 fatcat:gpzzssscgnejphbiottilgzlpm