A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2020; you can also visit the original URL.
The file type is application/pdf
.
ProvenanceWeek 2020: Machine Learning Pipelines: Provenance, Reproducibility and FAIR Data Principles
[article]
2020
Figshare
This presentation is given in ProvenanceWeek2020.In this presentation, we describe our goals and initial steps in supporting the end-to-end reproducibility of ML pipelines. We investigate which factors beyond the availability of source code and datasets influence reproducibility of ML experiments. We propose ways to apply FAIR data practices to ML workflows. We present our preliminary results on the role of our tool, ProvBook, in capturing and comparing provenance of ML experiments and their
doi:10.6084/m9.figshare.12529634.v1
fatcat:6ls7f7hlqbcuzo27zkzfsdsuhi