A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2017; you can also visit the original URL.
The file type is application/pdf
.
Incorporating rich background knowledge for gene named entity classification and recognition
2009
BMC Bioinformatics
Gene named entity classification and recognition are crucial preliminary steps of text mining in biomedical literature. Machine learning based methods have been used in this area with great success. In most state-of-the-art systems, elaborately designed lexical features, such as words, n-grams, and morphology patterns, have played a central part. However, this type of feature tends to cause extreme sparseness in feature space. As a result, out-of-vocabulary (OOV) terms in the training data are
doi:10.1186/1471-2105-10-223
pmid:19615051
pmcid:PMC2725142
fatcat:z7h5n4r3uva5bno3pfxefjaina