Filters








12,680 Hits in 7.9 sec

Integrity Protection for Scientific Workflow Data

Mats Rynge, William L. Poehlman, F. Alex Feltus, Karan Vahi, Ewa Deelman, Anirban Mandal, Ilya Baldin, Omkar Bhide, Randy Heiland, Von Welch, Raquel Hill
2019 Proceedings of the Practice and Experience in Advanced Research Computing on Rise of the Machines (learning) - PEARC '19  
assure the integrity of the scientific data.  ...  With the continued rise of scientific computing and the enormous increases in the size of data being processed, scientists must consider whether the processes for transmitting and storing data sufficiently  ...  ACKNOWLEDGMENTS The Scientific Workflow Integrity with Pegasus (SWIP) Project is supported by the National Science Foundation under grant 1642070, 1642053, and 1642090.  ... 
doi:10.1145/3332186.3332222 dblp:conf/xsede/RyngeVDMBBHWHPF19 fatcat:qibvfgo5hzbbxncte2gan5eitq

Addressing big data issues in Scientific Data Infrastructure

Yuri Demchenko, Paola Grosso, Cees de Laat, Peter Membrey
2013 2013 International Conference on Collaboration Technologies and Systems (CTS)  
for Big Data Infrastructure.  ...  This paper discusses the challenges that are imposed by Big Data on the modern and future Scientific Data Infrastructure (SDI).  ...  The following types of scientific data are defined [13] :  Raw data collected from observation and from experiment (according to an initial research model)  Structured data and datasets that went through  ... 
doi:10.1109/cts.2013.6567203 dblp:conf/cts/DemchenkoGLM13 fatcat:rlpqy5evdrcm5exjbjawmhzoeu

Data Curation Policies and Data Provenance in EUDAT Collaborative Data Infrastructure [chapter]

Vasily Bunakov, Alexander Atamas, Alexia de Casanove, Pascal Dugénie, Rene van Horik, Simon Lambert, Javier Quinteros, Linda Reijnhoudt
2018 Communications in Computer and Information Science  
"Data curation policies and data provenance in EUDAT Collaborative Data Infrastructure."  ...  Practical use cases are described, as well as results of defining and implementing data curation policies and data provenance patterns.  ...  The views expressed are those of authors and not necessarily of the project.  ... 
doi:10.1007/978-3-319-96553-6_18 fatcat:mkbogmcy3zfuzgf5rfc2u34eju

Thoughtful artificial intelligence: Forging a new partnership for data science and scientific discovery

Yolanda Gil, Michel Dumontier
2017 Data Science  
We present a personal perspective on a research agenda for thoughtful artificial intelligence, and discuss its potential for data science and scientific discovery.  ...  While in recent years computers have propelled science by crunching through data and leading to a data science revolution, qualitatively different scientific advances will result from advanced intelligent  ...  We would like to thank Daniel Garijo, Hiroaki Kitano, and Parag Mallick for many thoughtful discussions.  ... 
doi:10.3233/ds-170011 dblp:journals/datasci/Gil17 fatcat:aammbgnvvrfp7giupju76xcuqi

Data Curation and Preservation [chapter]

Keith Jeffery
2020 Lecture Notes in Computer Science  
Data is a valuable resource. In some scientific disciplines, experiments can be redone to reproduce the data.  ...  Digital curation establishes, maintains and adds value to repositories of digital data for present and future use.  ...  This work was supported by the European Union's Horizon 2020 research and innovation programme via the ENVRIplus project under grant agreement No. 654182.  ... 
doi:10.1007/978-3-030-52829-4_7 fatcat:tdk2tqcf3veb5mxfzcg4oc6zxy

Data Management Plan 2

Ulrich Goldmann, Gerhard Ecker, Vitaly Sedlyarov, Lia Scarabottolo, Vania Manolova, Claire Colas
2020 Zenodo  
Initial version of DMP after 6 months, an updated version latest mid-term and a final version  ...  format) Data Source Processing workflow for peak picking and integration applied to mass spectra obtained in LC-MS/MS metabolomics above.  ...  (open, human readable format) Data Source Processing workflow for peak picking, identification and integration applied to mass spectra obtained in LC-MS/MS metabolomics above.  ... 
doi:10.5281/zenodo.5179815 fatcat:v26u2p6fl5ajlgsld4wxvlhyrm

Incorporation of Synthetic Data Generation Techniques within a Controlled Data Processing Workflow in the Health and Wellbeing Domain

Mikel Hernandez, Gorka Epelde, Andoni Beristain, Roberto Álvarez, Cristina Molina, Xabat Larrea, Ane Alberdi, Michalis Timoleon, Panagiotis Bamidis, Evdokimos Konstantinidis
2022 Electronics  
analysis or experiment testing workflow.  ...  In this paper, we present the initial design and implementation of our synthetic data generation approach in the context of VITALISE Living Lab controlled data processing workflow, together with identified  ...  Implementation of this workflow for privacy-preserving data processing can also be motivated by intellectual property rights protection and the uninterrupted operation of basic necessities services in  ... 
doi:10.3390/electronics11050812 fatcat:vowg2z2tjvhu5egygyya4lxqeq

Securing the Intermediate Data of Scientific Workflows in Clouds with ACISO

Yawen Wang, Yunfei Guo, Zehua Guo, Wenyan Liu, Chao Yang
2019 IEEE Access  
A scientific workflow is a complicated scientific computing task consisting of many sub-tasks, and each sub-task execution can generate the intermediate data used for the successor sub-task execution.  ...  For these problems, we propose ACISO scheme to secure the intermediate data by improving its availability, confidentiality, and integrity.  ...  BACKGROUND AND MOTIVATION A.  ... 
doi:10.1109/access.2019.2938823 fatcat:b6dnkoju5beehbvghfxgiklt2a

Process Data Infrastructure and Data Services

Reginald Cushing, Onno Valkering, Adam Belloum, Souley Madougou, Martin Bobak, Ondrej Habala, Viet Tran, Jan Meizner, Piotr Nowakowski, Mara Graziani, Henning Müller
2020 Computing and informatics  
In this paper we propose a scalable and programmable data infrastructure that is easy to deploy and can be tuned to support various data-intensive scientific applications.  ...  architecture and solutions are well positioned within the European computing and data management landscape namely PRACE, EGI, and EUDAT.  ...  programme under grant agreement No. 777533, by the project APVV-17-0619 (U-COMP) "Urgent Computing for Exascale Data" and by the VEGA project "New Methods and Approaches for Distributed Scalable Computing  ... 
doi:10.31577/cai_2020_4_724 fatcat:6gskonylcnhkfdkzn35cemm3je

Trustworthy Pre-Processing of Sensor Data in Data On-chaining Workflows for Blockchain-based IoT Applications [article]

Jonathan Heiss, Anselm Busse, Stefan Tai
2021 arXiv   pre-print
In this paper, we propose trustworthy pre-processing as enabler for end-to-end sensor data integrity in data on-chaining workflows.  ...  We define requirements for trustworthy pre-processing, present a model and common workflow for data on-chaining, select off-chain computation utilizing Zero-knowledge Proofs (ZKPs) and Trusted Execution  ...  Experiments Given our proof-of-concept implementations, we can now conduct initial experiments to obtain the first practical insights into trustworthy pre-processing with zkSNARKs and TEEs.  ... 
arXiv:2110.15869v1 fatcat:ojlp2q6iuba45fq3soleywafay

Data Infrastructure for Medical Research

Thomas Heinis, Anastasia Ailamaki
2017 Foundations and Trends in Databases  
MRI), new sources of structured data like activity trackers, the wide-spread use of electronic health records and many others.  ...  References 95 Abstract While we are witnessing rapid growth in data across the sciences and in many applications, this growth is particularly remarkable in the medical domain, be it because of higher resolution  ...  Initially developed to support gravitational wave experiments, Triana Majithia et al. [2004] has been developed into a complete workflow management system designed for scientific applications.  ... 
doi:10.1561/1900000050 fatcat:fakmpm37lfcelokzvlx6tcld6m

Enabling Quantitative Data Analysis Through e-Infrastructure

Koon Leai Larry Tan, Paul S. Lambert, Ken J. Turner, Jesse Blum, Vernon Gayle, Simon B. Jones, Richard O. Sinnott, Guy Warner
2009 Social science computer review  
This article discusses how quantitative data analysis in the social sciences can engage with and exploit an e-Infrastructure.  ...  We conclude by discussing how these issues are relevant to the Data Management through e-Social Science (DAMES) research Node, an ongoing project that aims to develop e-Infrastructural resources for quantitative  ...  Butchart, Chen, Wassermann, & Price, 2005) for scientific workflows.  ... 
doi:10.1177/0894439309332647 fatcat:fgmc332ounerpk5z7ubkkd6xpu

Challenges and Opportunities of Open Data in Ecology

O. J. Reichman, M. B. Jones, M. P. Schildhauer
2011 Science  
Reproducibility of analyses is also important, and executable workflows are addressing this issue by capturing data provenance.  ...  Sociological challenges, including inadequate rewards for sharing data, must also be resolved.  ...  Researchers discover and access data from the federation and then (B) integrate and process the data in an analysis workflow, resulting in derived data products, visualizations, and scholarly papers that  ... 
doi:10.1126/science.1197962 pmid:21311007 fatcat:34i7z7medjc5bcsasvrje4ddc4

Data Curation through Catalogs: A Repository-Independent Model for Data Discovery

Helenmary Sheridan, Anthony J. Dellureficio, Melissa A. Ratajeski, Sara Mannheimer, Terrie R. Wheeler
2021 Journal of eScience Librarianship  
The article also reports on the development of a community of practice for data catalogs and data discovery initiatives.  ...  Data catalogs—metadata-only indices of research data that provide detailed access instructions and conditions for use—are one potential solution, and may be especially suitable for "challenging" datasets  ...  Acknowledgements The authors would like to thank Ian Lamb for writing the software code for the original data catalog that was used by the Data Catalog Collaboration Project (DCCP) which became the DDC  ... 
doi:10.7191/jeslib.2021.1203 fatcat:ak7ztu72lbby5lh3a4whmqkd4a

Defining architecture components of the Big Data Ecosystem

Yuri Demchenko, Cees de Laat, Peter Membrey
2014 2014 International Conference on Collaboration Technologies and Systems (CTS)  
Big Data are becoming a new technology focus both in science and in industry and motivate technology shift to data centric architecture and operational models.  ...  The presented work intends to provide a consolidated view of the Big Data phenomena and related challenges to modern technologies, and initiate wide discussion.  ...  The authors are also looking into defining data structures for high performance streaming applications and developing new types of disk based stream oriented data bases, continuing the work started from  ... 
doi:10.1109/cts.2014.6867550 dblp:conf/cts/DemchenkoLM14 fatcat:c4dcnflvyvhc5do3xvkqxapney
« Previous Showing results 1 — 15 out of 12,680 results