Discovering Informative Syntactic Relationships between Named Entities in Biomedical Literature

Annalisa Appice, Michelangelo Ceci, Corrado Loglisci
2010 2010 Second International Conference on Advances in Databases, Knowledge, and Data Applications  
The discovery of new and potentially meaningful relationships between named entities in biomedical literature can take great advantage from the application of multirelational data mining approaches in text mining. This is motivated by the peculiarity of multi-relational data mining to be able to express and manipulate relationships between entities. We investigate the application of such an approach to address the task of identifying informative syntactic structures, which are frequent in
more » ... ical abstract corpora. Initially, named entities are annotated in text corpora according to some biomedical dictionary (e.g. MeSH taxonomy). Tagged entities are then integrated in syntactic structures with the role of subject and/or object of the corresponding verb. These structures are represented in a first-order language. Multi-relational approach to frequent pattern discovery allows to identify the verb-based relationships between the named entities which frequently occur in the corpora. Preliminary experiments with a collection of abstracts obtained by querying Medline on a specific disease are reported.
doi:10.1109/dbkda.2010.14 dblp:conf/dbkda/AppiceCL10 fatcat:kyowhpzeynelfcym76klqp3r2a