A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2017; you can also visit the original URL.
The file type is application/pdf
.
ERMS: An Elastic Replication Management System for HDFS
2012
2012 IEEE International Conference on Cluster Computing Workshops
The Hadoop Distributed File System (HDFS) is a distributed storage system that stores large-scale data sets reliably and streams those data sets to applications at high bandwidth. HDFS provides high performance, reliability and availability by replicating data, typically three copies of every data. The data in HDFS changes in popularity over time. To get better performance and higher disk utilization, the replication policy of HDFS should be elastic and adapt to data popularity. In this paper,
doi:10.1109/clusterw.2012.25
dblp:conf/cluster/ChengLMXQRZG12
fatcat:lht766l5knb4jennanzzlyg6dq