Literature-Based Discovery beyond the ABC paradigm: a contrastive approach [article]

Erwan Moreau, Orla Hardiman, Mark Heverin, Declan O'Sullivan
2021 bioRxiv   pre-print
Literature-Based Discovery (LBD) aims to help researchers to identify relations between concepts which are worthy of further investigation by text-mining the biomedical literature. The vast majority of the LBD research follows the ABC model: a relation (A,C) is a candidate for discovery if there is some intermediate concept B which is related to both A and C. The ABC model has been successful in applications where the search space is strongly constrained, but there is limited evidence about its
more » ... usefulness when applied in a broader context. Through a case study of 8 recent discoveries related to neurodegenerative diseases (NDs), we show the limitations of the ABC model in an open-ended context. The study emphasizes the impact of the choice of source data and extraction method on the resulting knowledge base: different "views" of the biomedical literature offer different levels of accuracy and coverage. We propose a novel contrastive approach which leverages these differences between "views" in order to target relations between concepts of interest. We explore various parameters and demonstrate the relevance of our approach through quantitative evaluation on the 8 target discoveries. The source data used in this article are publicly available. The different parts of the software used to process the data are published under open-source license and provided with detailed instructions. A prototype of the system is also provided as an online exploration tool.
doi:10.1101/2021.09.22.461375 fatcat:43e32dfhnrhmvab4y627ymimxe