A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2018; you can also visit the original URL.
The file type is application/pdf
.
Rethinking data management for big data scientific workflows
2013
2013 IEEE International Conference on Big Data
Scientific workflows consist of tasks that operate on input data to generate new data products that are used by subsequent tasks. Workflow management systems typically stage data to computational sites before invoking the necessary computations. In some cases data may be accessed using remote I/O. There are limitations with these approaches, however. First, the storage at a computational site may be limited and not able to accommodate the necessary input and intermediate data. Second, even if
doi:10.1109/bigdata.2013.6691724
dblp:conf/bigdataconf/VahiRJMD13
fatcat:3varrrwktven7kgonqlzqp4eiq