A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2022; you can also visit the original URL.
The file type is application/pdf
.
What's in a gene name? Automated refinement of gene name dictionaries
2007
Workshop on Biomedical Natural Language Processing
Many approaches for named entity recognition rely on dictionaries gathered from curated databases (such as Entrez Gene for gene names.) Strategies for matching entries in a dictionary against arbitrary text use either inexact string matching that allows for known deviations, dictionaries enriched according to some observed rules, or a combination of both. Such refined dictionaries cover potential structural, lexical, orthographical, or morphological variations. In this paper, we present an
dblp:conf/bionlp/Hakenberg07
fatcat:dcj5lyu6tnh7fp2uwrykohinlu