Uncertain Groupings: Probabilistic Combination of Grouping Data [chapter]

Brend Wanders, Maurice van Keulen, Paul van der Vet
2015 Lecture Notes in Computer Science  
Probabilistic approaches for data integration have much potential [7]. We view data integration as an iterative process where data understanding gradually increases as the data scientist continuously refines his view on how to deal with learned intricacies like data conflicts. This paper presents a probabilistic approach for integrating data on groupings. We focus on a bio-informatics use case concerning homology. A bio-informatician has a large number of homology data sources to choose from.
To enable querying combined knowledge contained in these sources, they need to be integrated. We validate our approach by integrating three real-world biological databases on homology in three iterations.
doi:10.1007/978-3-319-22849-5_17 fatcat:hbekaz3nffe4di6hqxcox6ufh4