Data Conservancy Provenance, Context, and Lineage Services: Key Components for Data Preservation and Curation

Matthew S. Mayernik, Tim DiLauro, Ruth Duerr, Elliot Metsger, Anne E. Thessen, G. Sayeed Choudhury
2013 Data Science Journal  
The DCS provenance, context, and lineage services cross the four layers in the DCS data curation stack model: storage, archiving, preservation, and curation.  ...  Among the key services that institutional data management infrastructures must provide are provenance and lineage tracking and the ability to associate data with contextual information needed for understanding  ...  Funding for the Data Conservancy and the Johns Hopkins University Data Management Services is provided by the JHU Sheridan Libraries. We acknowledge contributions from our Data Conservancy colleagues.  ... 
doi:10.2481/dsj.12-039 fatcat:dwxeoonmjzfrjegp5tw34xg3zi

A data model and architecture for long-term preservation

Greg Janée, Justin Mathena, James Frew
2008 Proceedings of the 8th ACM/IEEE-CS joint conference on Digital libraries - JCDL '08  
benefit is that it serves as a fallback strategy against, and as a foundation for, more sophisticated (and costly) preservation strategies.  ...  The National Geospatial Digital Archive, one of eight initial projects funded under the Library of Congress's NDIIPP program, has been researching how geospatial data can be preserved on a national scale  ...  ACKNOWLEDGMENTS The authors would like to thank Catherine Masi of the Map & Imagery Laboratory and David Valentine of SDSC for their contributions to this work.  ... 
doi:10.1145/1378889.1378912 dblp:conf/jcdl/JaneeMF08 fatcat:izm7epd5evgcxji6uypvpmdecu

Variables As Currency: Linking Meta-Analysis Research and Data Paths in Sciences

Hua Qin, Lynne Davis, Matthew Mayernik, Patricia Romero Lankao, John D'Ignazio, Peter Alston
2014 Data Science Journal  
The framework focuses on key variables that represent primary/secondary datasets or derived socio-ecological data, contexts of use, and the data transformations that are applied.  ...  Some important meta-analytic tasks, such as the selection of relevant studies for review and the integration of research datasets or findings, are not well supported in current data curation systems.  ...  We gratefully acknowledge Mary Marlino and Karon Kelly at NCAR Library and Integrated Information Systems, UCAR; Katie Dickinson and Lawrence Buja at the Research Applications Laboratory at NCAR; Aaron  ... 
doi:10.2481/dsj.14-030 fatcat:4rvfbomk45hzviuz6xx3doj5lu

Lessons from a Marine Spatial Planning data management process for Ireland

Sarah Flynn, Will Meaney, Adam M. Leadbetter, Jeffrey P. Fisher, Caitriona Nic Aonghusa
2020 International Journal of Digital Earth  
This paper presents a framework containing ten components to deliver a data management process for the storage and management of data used for Marine Spatial Planning (MSP) in Ireland.  ...  The process presents a means of managing data and metadata to ensure data lineage is optimised by carrying information about the origin of and the processing applied to the data; to evaluate the quality  ...  Acknowledgement This work is part supported by the Irish Government and the European Maritime & Fisheries Fund as part of the EMFF Operational Programme for 2014-2020.  ... 
doi:10.1080/17538947.2020.1808720 fatcat:77d7laol7banlhl7jk67bkpqba

Informative Provenance for Repurposed Data: A Case Study using Clinical Research Data

Richard Bache, Simon Miles, Bolaji Coker, Adel Taweel
2013 International Journal of Digital Curation  
For a provenance-based approach to be reusable and supportable with software tools, the provenance records must use a well-defined model of the transformation process.  ...  Records of provenance that document how this process has taken place will enable users of the data warehouse to utilise the data appropriately and ensure that future data added from another source is transformed  ...  The work was also part funded and supported by the National Institute for Health Research (NIHR) Biomedical Research Centre, based at Guy's and St Thomas' NHS Foundation Trust and King's College London  ... 
doi:10.2218/ijdc.v8i2.262 fatcat:iot3dqphyfeedmos7v7m67yymy

Workflow Provenance in the Lifecycle of Scientific Machine Learning [article]

Renan Souza, Leonardo G. Azevedo, Vítor Lourenço, Elton Soares, Raphael Thiago, Rafael Brandão, Daniel Civitarese, Emilio Vital Brazil, Marcio Moreno, Patrick Valduriez, Marta Mattoso, Renato Cerqueira (+1 others)
2021 arXiv pre-print
In these domains, users need to perform comprehensive data analyses combining scientific data and ML models to provide for critical requirements, such as reproducibility, model explainability, and experiment  ...  We contribute with (i) characterization of the lifecycle and taxonomy for data analyses; (ii) design principles to build this view, with a W3C PROV compliant data representation and a reference system  ...  Provenance (also referred to as lineage) data management techniques help reproduce, trace, assess, understand, and explain data, models, and their transformation processes [11], [12], [13].  ... 
arXiv:2010.00330v2 fatcat:4xfpfcsocjajxbq4qumu57swya

Recommendations for connecting molecular sequence and biodiversity research infrastructures through ELIXIR

Robert M. Waterhouse, Anne-Françoise Adam-Blondon, Donat Agosti, Petr Baldrian, Bachir Balech, Erwan Corre, Robert P. Davey, Henrik Lantz, Graziano Pesole, Christian Quast, Frank Oliver Glöckner, Niels Raes (+7 others)
2021 F1000Research  
As a research infrastructure developing services and technical solutions that help integrate and coordinate life science resources across Europe, ELIXIR is a key player.  ...  and services that connect molecular and wider biodiversity domains.  ...  In the context of infraspecific diversity conserved in plant, forest, and animal genetic resources, several projects are developing common recommendations and metadata standards to improve the conservation  ... 
doi:10.12688/f1000research.73825.1 fatcat:wphybjxe4bd3dgdewjqobb2hfe

Data and Metadata Management for Better VGI Reusability [chapter]

2017 Mapping and the Citizen Sensor  
This VGI may be of huge value for institutions, individuals and decision-makers, but only if it can be discovered, evaluated for quality and fitness-for-purpose and combined with data from other sources  ...  open source technologies which can underpin robust and sustainable data management for VGI.  ...  These requirements for repeatability, transparency and independent evaluation inevitably suggest a need to curate and preserve data collections.  ... 
doi:10.5334/bbf.k fatcat:blw7ofx5s5ft3edbal7kipuj6u

A Unified Framework for Measuring Stewardship Practices Applied to Digital Environmental Datasets

Ge Peng, Jeffrey L Privette, Edward J Kearns, Nancy A Ritchey, Steve Ansari
2015 Data Science Journal  
The rationale for each key component and its maturity levels is described.  ...  Nine key components are identified based on requirements imposed on digital environmental data and information that are cared for and disseminated by U.S.  ...  Transparency/traceability This key component measures the degrees of transparency and traceability via availability of information on data provenance and data processing systems.  ... 
doi:10.2481/dsj.14-049 fatcat:dbbaavhelzelbngng3iu5fdxsa

D6.1 State of the Art and Community Needs Report from Use Cases

Alessandro Rizzo, Federica Tanlongo, Fulvio Galeazzi, Christelle Pierkot
2020 Zenodo  
and needs.  ...  Finally, some options are suggested in terms of common actions and developments.  ...  : "Defining procedures/service to enforce data provenance for thematic communities and beyond" Provenance management is a key component in order to guarantee scientific data discovery, reproducibility  ... 
doi:10.5281/zenodo.3894677 fatcat:ekb6e4ol3zaubp52itar52y7ju

Preservation, Characterization and Exploitation of Microbial Biodiversity: The Perspective of the Italian Network of Culture Collections

Luciana De Vero, Maria Beatrice Boniotti, Marilena Budroni, Pietro Buzzini, Stefano Cassanelli, Roberta Comunian, Maria Gullo, Antonio F. Logrieco, Ilaria Mannazzu, Rosario Musumeci, Iolanda Perugini, Giancarlo Perrone (+4 others)
2019 Microorganisms  
Data sharing and web services as well as the tight interconnection between CCs and the biotechnological industry are highlighted.  ...  In this context, culture collections (CCs) and microbial biological resource centres (mBRCs) are crucial for the safeguarding and circulation of biological resources, as well as for the progress of life  ...  Acknowledgments: The authors are grateful to all the members of the JRU MIRRI-IT for providing data related to their own collections.  ... 
doi:10.3390/microorganisms7120685 pmid:31842279 fatcat:m7d7skb7mzgzdfcxvql53dolau

From the Field to the Cloud: A Review of Three Approaches to Sharing Historical Data From Field Stations Using Principles From Data Science

Kelly Easterday, Tim Paulson, Proxima DasMohapatra, Peter Alagona, Shane Feirer, Maggi Kelly
2018 Frontiers in Environmental Science  
This article responds to several calls stressing the importance of empirical historical materials and urges their preservation and accessibility.  ...  and shepherding historical data into the twenty-first century.  ...  ACKNOWLEDGMENTS We would first like to acknowledge support from the Institute for the Study of Ecological and Evolutionary Climate Impacts (ISEECI) in funding the historical ecology working group that  ... 
doi:10.3389/fenvs.2018.00088 fatcat:ihtewxlg7jci3mt4uagoq2367q

From writing to reading the encyclopedia of life

Paul D. N. Hebert, Peter M. Hollingsworth, Mehrdad Hajibabaei
2016 Philosophical Transactions of the Royal Society of London. Biological Sciences  
We are very thankful to Ann McCain Evans and Chris Evans for their generosity in defraying the Open Access charges for this special issue.  ...  We also thank Suzanne Bateson, Sujeevan Ratnasingham and Dirk Steinke for their aid in generating the figures.  ...  contexts [119] .  ... 
doi:10.1098/rstb.2015.0321 pmid:27481778 pmcid:PMC4971178 fatcat:rntzqdszhve3dj7hljkkxx7wwq

Toward an Integrated Set of Surface Meteorological Observations for Climate Science and Applications

P. W. Thorne, R. J. Allan, L. Ashcroft, P. Brohan, R. J. H Dunn, M. J. Menne, P. R. Pearce, J. Picas, K. M. Willett, M. Benoy, S. Bronnimann, P. O. Canziani (+29 others)
2017 Bulletin of The American Meteorological Society - (BAMS)  
To ensure provenance (data lineage), these collected data should be as close to their original format and source as possible.  ...  Venema et al. 2012), to create datasets and data products. 17 The data lineage (provenance) of observations should be carried through the entire data archive, and each stage of processing recorded in  ... 
doi:10.1175/bams-d-16-0165.1 fatcat:6l5i7tux3bc7nddyvde73szg2u

Studies on Monitoring and Tracking Genetic Resources: An Executive Summary

George M. Garrity, Lorraine M. Thompson, David W. Ussery, Norman Paskin, Dwight Baker, Philippe Desmeth, D.E. Schindel, P.S. Ong
2009 Standards in Genomic Sciences  
Acknowledgements This work was commissioned under a contract from the UN Secretariat on the Convention on Biological Diversity to Michigan State University, Contract No. 8-26-17306 and is reprinted with  ...  These efforts are well supported by publicly available tools and highly curated data sets of aligned 16S rRNA [17] [18] [19] .  ...  The three principal objectives of the CBD are: conservation of biological resources, sustainable use of its components, and fair and equitable sharing of benefits arising out of their utilization.  ... 
doi:10.4056/sigs.1491 pmid:21304641 pmcid:PMC3035216 fatcat:ly4yyt7d4rdfnfsimey6vj2dxe