Positioning Dynamic Storage Caches for Transient Data

Sudharshan S. Vazhkudai, Douglas Thain, Xiaosong Ma, Vincent W. Freeh
2006 IEEE International Conference on Cluster Computing
Simulations, experiments, and observatories are generating a deluge of scientific data. Even more staggering is the ever-growing application demand to process and assimilate these datasets. Application users perform a range of data operations, and they collaborate and share data in many novel ways. The current storage landscape is struggling to keep up with these trends in scientific data processing. Application users pay the price due to over-crowded shared filesystems, or expensive storage area ..., or not enough local storage, or high-latency archival or wide-area transfers. In order to sustain and maximize I/O bandwidth relative to increasing CPU speeds, applications must take advantage of large amounts of intermediate commodity storage. However, intermediate storage presents new challenges above and beyond the traditional distributed filesystem paradigm: persistent scheduling, storage/CPU coallocation, namespace management, lifetime management, and novel application interfaces. In this paper, we describe applications that require intermediate storage management, suggest several open research problems, and illustrate two systems, Freeloader and Tactical Storage, that attack different aspects of these problems.
doi:10.1109/clustr.2006.311900 dblp:conf/cluster/VazhkudaiTMF06 fatcat:crlajnv54rf3jjfhkbbgrtp4se