Filters








2,158 Hits in 3.9 sec

A Provenance-Based Fault Tolerance Mechanism for Scientific Workflows [chapter]

Daniel Crawl, Ilkay Altintas
2008 Lecture Notes in Computer Science  
Capturing provenance information in scientific workflows is not only useful for determining data-dependencies, but also for a wide range of queries including fault tolerance and usage statistics.  ...  It also describes how the collected information based on these dependencies could be used for a fault tolerance framework in different models of computation.  ...  DE-FC02-07ER25811 for SDM Center, NSF Award No. DBI 0619060 for REAP, and NSF Award OCI-0722079 for Kepler CORE.  ... 
doi:10.1007/978-3-540-89965-5_17 fatcat:6b67bm3q55dc7ca5vgnxivbaa4

Towards a Grid Platform for Scientific Workflows Management

Alexandru Costan, Corina Stratan, Eliana-Dina Tirsa, Mugurel Ionut Andreica, Valentin Cristea
2009 arXiv   pre-print
fault tolerance support.  ...  Although a wide range of workflow engines are developed in enterprise environments, the open source engines available for scientific applications lack some functionalities or are too difficult to use for  ...  In Section 4 we present the most commonly used fault tolerance mechanisms and our approach for this aspect.  ... 
arXiv:0910.0626v1 fatcat:d6hh6e3v6vfeln7ep5rex4buey

Dynamic execution of scientific workflows in cloud

E. Kail, J. Kovacs, M. Kozlovszky, P. Kacsuk
2016 2016 39th International Convention on Information and Communication Technology, Electronics and Microelectronics (MIPRO)  
Scientific workflows have emerged in the past decade as a new solution for representing complex scientific experiments.  ...  In this paper we investigate dynamic execution capabilities, focused on fault tolerance behavior in the Occopus framework which was developed by SZTAKI and was targeted to provide automatic features for  ...  and propose a fault tolerant mechanism which is based on the workflow structure and replication technique.  ... 
doi:10.1109/mipro.2016.7522162 dblp:conf/mipro/KailKKK16 fatcat:vvtb7itkz5dsbpasqrndnisffy

Theoretical enzyme design using the Kepler scientific workflows on the Grid

Jianwu Wang, Prakashan Korambath, Seonah Kim, Scott Johnson, Kejian Jin, Daniel Crawl, Ilkay Altintas, Shava Smallen, Bill Labate, Kendall N. Houk
2010 Procedia Computer Science  
parallelism, provenance tracking, fault tolerance and workflow reuse, which results in an automated, pipelined, efficient, extensible, stable, and easy-to-use computational process for enzyme design. many  ...  This paper presents a scientific workflow based approach to facilitate the inside-out enzyme design process in the Grid execution environment by providing features such as resource consolidation, task  ...  This work was supported by NSF SDCI Award OCI-0722079 for Kepler/CORE, NSF CEO:P Award No. DBI 0619060 for REAP, DOE SciDac Award No. DE-FC02-07ER25811 for SDM Center, and UCGRID Project.  ... 
doi:10.1016/j.procs.2010.04.131 fatcat:yvu4usd4r5bandir4yztbcnp3u

Trident: Scientific Workflow Workbench for Oceanography

R.S. Barga, J. Jackson, N. Araujo, D. Guo, N. Gautam, K. Grochow, E. Lazowska
2008 2008 IEEE Congress on Services - Part I  
Trident is being developed in collaboration with the scientific community for use in a number of ongoing escience projects that make use of scientific workflows.  ...  We introduce Trident, a scientific workflow workbench that is built on top of a commercial workflow system to leverage existing functionality.  ...  that schedules workflows on HPC clusters, and a fault-tolerance and recovery service for workflows.  ... 
doi:10.1109/services-1.2008.101 dblp:conf/services/BargaJAGGGL08 fatcat:2hnwrhdjwfhsxkdchimhplqf6q

The Trident Scientific Workflow Workbench

Roger Barga, Jared Jackson, Nelson Araujo, Dean Guo, Nitin Gautam, Yogesh Simmhan
2008 2008 IEEE Fourth International Conference on eScience  
Trident is being developed in collaboration with the scientific community for use in a number of ongoing escience projects that make use of scientific workflows.  ...  We introduce Trident, a scientific workflow workbench that is built on top of a commercial workflow system to leverage existing functionality.  ...  that schedules workflows on HPC clusters, and a fault-tolerance and recovery service for workflows.  ... 
doi:10.1109/escience.2008.126 dblp:conf/eScience/BargaJAGGS08 fatcat:kwfpke3nwneujj73hfpkzt57b4

Improving Many-Task computing in scientific workflows using P2P techniques

Jonas Dias, Eduardo Ogasawara, Daniel de Oliveira, Esther Pacitti, Marta Mattoso
2010 2010 3rd Workshop on Many-Task Computing on Grids and Supercomputers  
We present a fault tolerance mechanism, a dynamic resource management and a hierarchical organization of computing nodes to handle workflow instances execution properly.  ...  Large-scale scientific experiments are usually supported by scientific workflows that may demand high performance computing infrastructure.  ...  To improve MTC scheduling and execution for scientific workflows, P2P techniques may help providing fault tolerant mechanisms and dynamic resource management.  ... 
doi:10.1109/mtags.2010.5699430 dblp:conf/sc/DiasO0PM10 fatcat:bjilkiz6tnasrf744ejrnzfwqy

Lattice QCD Workflows: A Case Study

Luciano Piccoli, James B. Kowalkowski, James N. Simone, Xian-He Sun, Hui Jin, Donald J. Holmgren, Nirmal Seenu, Amitoj G. Singh
2008 2008 IEEE Fourth International Conference on eScience  
This paper discusses the application of existing workflow management systems to a real world science application (LQCD). Typical workflows and execution environment used in production are described.  ...  Requirements for the LQCD production system are discussed. The workflow management systems Askalon and Swift were tested by implementing the LQCD workflows and evaluated against the requirements.  ...  Among the most relevant requirements the data provenance and fault tolerance for participants and workflows have the highest priority.  ... 
doi:10.1109/escience.2008.168 dblp:conf/eScience/PiccoliKSSJHSS08 fatcat:nelztlz4hfeybgu7odyrhaan6m

Provenance Based Checkpointing Method for Dynamic Health Care Smart System

Eszter Kail, Krisztian Karoczkai, Peter Kacsuk, Miklos Kozlovszky
2016 Scalable Computing : Practice and Experience  
In this work we give a brief overview of the different checkpointing techniques and propose two new provenance based checkpointing algorithms which uses the information stored in the workflow structure  ...  In HPC environment high number of failures can arise during health care smart systems workflow enactment, so the use of fault tolerance techniques is unavoidable.  ...  This work was supported by EU project SCI-BUS (SCIentific gateway Based User Support).  ... 
doi:10.12694/scpe.v17i2.1162 fatcat:jo66bmpfuzhl5i2bf2uyelewpi

FT-Grid: a system for achieving fault tolerance in grids

Jie Xu, Paul Townend, Nik Looker, Paul Groth
2008 Concurrency and Computation  
We then illustrate a potential problem with voting-based fault-tolerance schemes in the service-oriented paradigm-namely that individual channels within a fault-tolerant system, supposed to be independent  ...  The FT-Grid system introduces a fault-tolerance framework that allows faults occurring in serviceoriented systems to be tolerated, thus increasing the dependability of such systems.  ...  to provide a fault-tolerance mechanism.  ... 
doi:10.1002/cpe.1266 fatcat:l4drae2s7bg57piin5tpad72xa

Fault-Tolerant Scheduling for Scientific Workflows in Cloud Environments

K. Vinay, S.M. Dilip Kumar
2017 2017 IEEE 7th International Advance Computing Conference (IACC)  
In this paper, a new heuristic called Cluster based Heterogeneous Earliest Finish Time (C-HEFT) algorithm to enhance the scheduling and fault tolerance mechanism for SWf in highly distributed cloud environments  ...  Executing clustered tasks has proven to be an efficient method to improve the computation of Scientific Workflows (SWf) on clouds.  ...  In this paper, a new heuristic called Cluster based Heterogeneous Earliest Finish Time (C-HEFT) algorithm to enhance the scheduling and fault tolerance mechanism for SWf in highly distributed cloud environments  ... 
doi:10.1109/iacc.2017.0043 fatcat:qbxxgmveqner3kr3juqicsnelu

Scalable Workflow-Driven Hydrologic Analysis in HydroFrame [chapter]

Shweta Purawat, Cathie Olschanowsky, Laura E. Condon, Reed Maxwell, Ilkay Altintas
2020 Lecture Notes in Computer Science  
The Kepler workflow ensures complete reproducibility through a built-in provenance framework that collects workflow specific parameters, software versions, and hardware system configuration.  ...  As a part of HydroFrame, we seek to design innovative workflow solutions that create pathways to enable hydrologic analysis for three target user groups: the modeler, the analyzer, and the domain science  ...  This work is supported by NSF OAC CSSI 1835855, and DOE DE-SC0012630 for IPPD.  ... 
doi:10.1007/978-3-030-50371-0_20 fatcat:c6632vj24nd25dcdq73i6bxhyu

Facilitating e-Science Discovery Using Scientific Workflows on the Grid [chapter]

Jianwu Wang, Prakashan Korambath, Seonah Kim, Scott Johnson, Kejian Jin, Daniel Crawl, Ilkay Altintas, Shava Smallen, Bill Labate, Kendall N. Houk
2011 Guide to e-Science  
, provenance tracking, fault tolerance, and workflow reuse.  ...  This chapter explains how scientific workflow systems can facilitate e-Science discovery in Grid environments by providing features including scientific process automation, resource consolidation, parallelism  ...  This work was supported by NSF SDCI Award OCI-0722079 for Kepler/ CORE, NSF CEO:P Award No. DBI 0619060 for REAP, DOE SciDac Award No. DE-FC02-07ER25811 for SDM Center, and UCGRID Project.  ... 
doi:10.1007/978-0-85729-439-5_13 dblp:series/ccn/WangKKJJCASLH11 fatcat:66f2iqtcabbe3dp6ijprsavjgi

An Architectural Model for a Grid based Workflow Management Platform in Scientific Applications [article]

Alexandru Costan, Florin Pop, Corina Stratan, Ciprian Dobre, Catalin Leordeanu, Valentin Cristea
2011 arXiv   pre-print
fault tolerance support.  ...  In this work we propose an architectural model for a grid based workflow management platform providing features like an intuitive way to describe workflows, efficient data handling mechanisms and flexible  ...  mechanisms and flexible fault tolerance support.  ... 
arXiv:1106.5846v1 fatcat:2gung2hfb5egjd4iofvripuaui

A Survey of Data-Intensive Scientific Workflow Management

Ji Liu, Esther Pacitti, Patrick Valduriez, Marta Mattoso
2015 Journal of Grid Computing  
A data-intensive scientific workflow is useful for modeling such process.  ...  Finally, we identify research issues for improving the execution of data-intensive scientific workflows in a multisite cloud.  ...  The fault tolerance functionality of Kepler can be achieved by a framework that provides three complementary mechanisms.  ... 
doi:10.1007/s10723-015-9329-8 fatcat:5urst5aphjftbli3pukmnbutri
« Previous Showing results 1 — 15 out of 2,158 results