One model per entity: using hundreds of machine learning models to recognize and normalize biomedical names in text

Victor Bellon, Raul Rodriguez-Esteban
2017 Proceedings of the Biomedical NLP Workshop  
We explored a new approach to named entity recognition based on hundreds of machine learning models, each trained to distinguish a single entity, and showed its application to gene name identification (GNI). The rationale for our approach, which we named "one model per entity" (OMPE), was that increasing the number of models would make the learning task easier for each individual model. Our training strategy leveraged freely available database annotations instead of manually annotated corpora.
... While its performance in our proof-of-concept was disappointing, we believe that there is enough room for improvement that such approaches could reach competitive performance while eliminating the cost of creating training corpora.
doi:10.26615/978-954-452-044-1_007 dblp:conf/ranlp/BellonR17 fatcat:uwhum2pm3jhztf5yp4apmbqzc4