Towards a Semantic Data Harmonization Federated Infrastructure [chapter]

Catalina Martinez-Costa, Francisco Abad-Navarro
2021 Studies in Health Technology and Informatics  
Data integration is an increasing need in medical informatics projects like the EU Precise4Q project, in which multidisciplinary semantically and syntactically heterogeneous data across several institutions needs to be integrated. Besides, data sharing agreements often allow a virtual data integration only, because data cannot leave the source repository. We propose a data harmonization infrastructure in which data is virtually integrated by sharing a semantically rich common data
more » ... that allows their homogeneous querying. This common data model integrates content from well-known biomedical ontologies like SNOMED CT by using the BTL2 upper level ontology, and is imported into a graph database. We successfully integrated three datasets and made some test queries showing the feasibility of the approach.
doi:10.3233/shti210116 pmid:34042701 fatcat:jxttu4mqc5a5bjgqrtszlav5ku