Data Conservancy Provenance, Context, and Lineage Services: Key Components for Data Preservation and Curation
2013
Data Science Journal
The DCS provenance, context, and lineage services cross the four layers in the DCS data curation stack model: storage, archiving, preservation, and curation. ...
Among the key services that institutional data management infrastructures must provide are provenance and lineage tracking and the ability to associate data with contextual information needed for understanding ...
Funding for the Data Conservancy and the Johns Hopkins University Data Management Services is provided by the JHU Sheridan Libraries. We acknowledge contributions from our Data Conservancy colleagues. ...
doi:10.2481/dsj.12-039
fatcat:dwxeoonmjzfrjegp5tw34xg3zi
A data model and architecture for long-term preservation
2008
Proceedings of the 8th ACM/IEEE-CS joint conference on Digital libraries - JCDL '08
benefit is that it serves as a fallback strategy against, and as a foundation for, more sophisticated (and costly) preservation strategies. ...
The National Geospatial Digital Archive, one of eight initial projects funded under the Library of Congress's NDIIPP program, has been researching how geospatial data can be preserved on a national scale ...
ACKNOWLEDGMENTS The authors would like to thank Catherine Masi of the Map & Imagery Laboratory and David Valentine of SDSC for their contributions to this work.
doi:10.1145/1378889.1378912
dblp:conf/jcdl/JaneeMF08
fatcat:izm7epd5evgcxji6uypvpmdecu
Variables As Currency: Linking Meta-Analysis Research and Data Paths in Sciences
2014
Data Science Journal
The framework focuses on key variables that represent primary/secondary datasets or derived socio-ecological data, contexts of use, and the data transformations that are applied. ...
Some important meta-analytic tasks, such as the selection of relevant studies for review and the integration of research datasets or findings, are not well supported in current data curation systems. ...
We gratefully acknowledge Mary Marlino and Karon Kelly at NCAR Library and Integrated Information Systems, UCAR; Katie Dickinson and Lawrence Buja at the Research Applications Laboratory at NCAR; Aaron ...
doi:10.2481/dsj.14-030
fatcat:4rvfbomk45hzviuz6xx3doj5lu
Lessons from a Marine Spatial Planning data management process for Ireland
2020
International Journal of Digital Earth
This paper presents a framework containing ten components to deliver a data management process for the storage and management of data used for Marine Spatial Planning (MSP) in Ireland. ...
The process presents a means of managing data and metadata to ensure data lineage is optimised by carrying information about the origin of and the processing applied to the data; to evaluate the quality ...
Acknowledgement This work is part supported by the Irish Government and the European Maritime & Fisheries Fund as part of the EMFF Operational Programme for 2014-2020. ...
doi:10.1080/17538947.2020.1808720
fatcat:77d7laol7banlhl7jk67bkpqba
Informative Provenance for Repurposed Data: A Case Study using Clinical Research Data
2013
International Journal of Digital Curation
For a provenance-based approach to be reusable and supportable with software tools, the provenance records must use a well-defined model of the transformation process. ...
Records of provenance that document how this process has taken place will enable users of the data warehouse to utilise the data appropriately and ensure that future data added from another source is transformed ...
The work was also part funded and supported by the National Institute for Health Research (NIHR) Biomedical Research Centre, based at Guy's and St Thomas' NHS Foundation Trust and King's College London ...
doi:10.2218/ijdc.v8i2.262
fatcat:iot3dqphyfeedmos7v7m67yymy
Workflow Provenance in the Lifecycle of Scientific Machine Learning
[article]
2021
arXiv
pre-print
In these domains, users need to perform comprehensive data analyses combining scientific data and ML models to provide for critical requirements, such as reproducibility, model explainability, and experiment ...
We contribute with (i) characterization of the lifecycle and taxonomy for data analyses; (ii) design principles to build this view, with a W3C PROV compliant data representation and a reference system ...
Provenance (also referred to as lineage) data management techniques help reproduce, trace, assess, understand, and explain data, models, and their transformation processes [11], [12], [13]. ...
arXiv:2010.00330v2
fatcat:4xfpfcsocjajxbq4qumu57swya
Recommendations for connecting molecular sequence and biodiversity research infrastructures through ELIXIR
2021
F1000Research
As a research infrastructure developing services and technical solutions that help integrate and coordinate life science resources across Europe, ELIXIR is a key player. ...
and services that connect molecular and wider biodiversity domains. ...
In the context of infraspecific diversity conserved in plant, forest, and animal genetic resources, several projects are developing common recommendations and metadata standards to improve the conservation ...
doi:10.12688/f1000research.73825.1
fatcat:wphybjxe4bd3dgdewjqobb2hfe
Data and Metadata Management for Better VGI Reusability
[chapter]
2017
Mapping and the Citizen Sensor
This VGI may be of huge value for institutions, individuals and decision-makers, but only if it can be discovered, evaluated for quality and fitness-for-purpose and combined with data from other sources ...
open source technologies which can underpin robust and sustainable data management for VGI. ...
These requirements for repeatability, transparency and independent evaluation inevitably suggest a need to curate and preserve data collections. ...
doi:10.5334/bbf.k
fatcat:blw7ofx5s5ft3edbal7kipuj6u
A Unified Framework for Measuring Stewardship Practices Applied to Digital Environmental Datasets
2015
Data Science Journal
The rationale for each key component and its maturity levels is described. ...
Nine key components are identified based on requirements imposed on digital environmental data and information that are cared for and disseminated by U.S. ...
Transparency/traceability This key component measures the degrees of transparency and traceability via availability of information on data provenance and data processing systems. ...
doi:10.2481/dsj.14-049
fatcat:dbbaavhelzelbngng3iu5fdxsa
D6.1 State of the Art and Community Needs Report from Use Cases
2020
Zenodo
and needs. ...
Finally, some options are suggested in terms of common actions and developments. ...
: "Defining procedures/service to enforce data provenance for thematic communities and beyond" Provenance management is a key component in order to guarantee scientific data discovery, reproducibility ...
doi:10.5281/zenodo.3894677
fatcat:ekb6e4ol3zaubp52itar52y7ju
Preservation, Characterization and Exploitation of Microbial Biodiversity: The Perspective of the Italian Network of Culture Collections
2019
Microorganisms
Data sharing and web services as well as the tight interconnection between CCs and the biotechnological industry are highlighted. ...
In this context, culture collections (CCs) and microbial biological resource centres (mBRCs) are crucial for the safeguarding and circulation of biological resources, as well as for the progress of life ...
Acknowledgments: The authors are grateful to all the members of the JRU MIRRI-IT for providing data related to their own collections. ...
doi:10.3390/microorganisms7120685
pmid:31842279
fatcat:m7d7skb7mzgzdfcxvql53dolau
From the Field to the Cloud: A Review of Three Approaches to Sharing Historical Data From Field Stations Using Principles From Data Science
2018
Frontiers in Environmental Science
This article responds to several calls stressing the importance of empirical historical materials and urges their preservation and accessibility. ...
and shepherding historical data into the twenty-first century. ...
ACKNOWLEDGMENTS We would first like to acknowledge support from the Institute for the Study of Ecological and Evolutionary Climate Impacts (ISEECI) in funding the historical ecology working group that ...
doi:10.3389/fenvs.2018.00088
fatcat:ihtewxlg7jci3mt4uagoq2367q
From writing to reading the encyclopedia of life
2016
Philosophical Transactions of the Royal Society of London. Biological Sciences
We are very thankful to Ann McCain Evans and Chris Evans for their generosity in defraying the Open Access charges for this special issue. ...
We also thank Suzanne Bateson, Sujeevan Ratnasingham and Dirk Steinke for their aid in generating the figures. ...
contexts [119]. ...
doi:10.1098/rstb.2015.0321
pmid:27481778
pmcid:PMC4971178
fatcat:rntzqdszhve3dj7hljkkxx7wwq
Toward an Integrated Set of Surface Meteorological Observations for Climate Science and Applications
2017
Bulletin of The American Meteorological Society - (BAMS)
To ensure provenance (data lineage), these collected data should be as close to their original format and source as possible. ...
Venema et al. 2012), to create datasets and data products. The data lineage (provenance) of observations should be carried through the entire data archive, and each stage of processing recorded in ...
doi:10.1175/bams-d-16-0165.1
fatcat:6l5i7tux3bc7nddyvde73szg2u
Studies on Monitoring and Tracking Genetic Resources: An Executive Summary
2009
Standards in Genomic Sciences
Acknowledgements This work was commissioned under a contract from the UN Secretariat on the Convention on Biological Diversity to Michigan State University, Contract No. 8-26-17306 and is reprinted with ...
These efforts are well supported by publicly available tools and highly curated data sets of aligned 16S rRNA [17], [18], [19]. ...
The three principal objectives of the CBD are: conservation of biological resources, sustainable use of its components, and fair and equitable sharing of benefits arising out of their utilization. ...
doi:10.4056/sigs.1491
pmid:21304641
pmcid:PMC3035216
fatcat:ly4yyt7d4rdfnfsimey6vj2dxe