Semantic Description of Data Mining Datasets: An Ontology-Based Annotation Schema [chapter]

Ana Kostovska, Sašo Džeroski, Panče Panov
2020 Lecture Notes in Computer Science  
With the pervasiveness of data mining (DM) in many areas of our society, the management of digital data that is readily available for analysis has become increasingly important. Consequently, nearly all community-accepted guidelines and principles (e.g., FAIR and TRUST) for publishing such data in the digital ecosystem stress the importance of semantic data enhancement. Rich semantic annotation of DM datasets would support the data mining process at various choice points, such as data ...ding, automatic identification of the analysis task, and reasoning over the obtained results. In this paper, we report on the development of an ontology-based annotation schema for the semantic description of DM datasets. The annotation schema combines three aspects of semantic annotation: provenance, data mining-specific information, and domain-specific information. We demonstrate the utility of these annotations in two use cases: semantic annotation of remote sensing data and of data about neurodegenerative diseases.
doi:10.1007/978-3-030-61527-7_10 fatcat:7trft2tijffplbfc5xjqsz56i4