From Scientific Workflow Patterns to 5-star Linked Open Data

Alban Gaignard, Hala Skaf-Molli, Audrey Bihouée
2016 Workshop on the Theory and Practice of Provenance  
Scientific Workflow management systems have been largely adopted by data-intensive science communities. Many efforts have been dedicated to the representation and exploitation of provenance to improve reproducibility in data-intensive sciences. However, few works address the mining of provenance graphs to annotate the produced data with domain-specific context for better interpretation and sharing of results. In this paper, we propose PoeM, a lightweight framework for mining provenance in
more » ... ific workflows. PoeM allows to produce linked in silico experiment reports based on workflow runs. PoeM leverages semantic web technologies and reference vocabularies (PROV-O, P-Plan) to generate provenance mining rules and finally assemble linked scientific experiment reports (Micropublications, Experimental Factor Ontology). Preliminary experiments demonstrate that PoeM enables the querying and sharing of Galaxy 1 -processed genomic data as 5-star linked datasets.
dblp:conf/tapp/GaignardSB16 fatcat:ygbcgyve5fdnreeikcpjoc4awu