A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2015; you can also visit the original URL.
The file type is application/pdf
.
Optimizing intermediate data management in MapReduce computations
2011
Proceedings of the First International Workshop on Cloud Computing Platforms - CloudCP '11
Many cloud computations process large datasets. Programming paradigms have been proposed to design this type of applications, so as to take advantage of the huge processing and storage options the cloud holds, but at the same time, to provide the user with a clean and easy to use interface. Among these programming models, we consider the MapReduce paradigm and its reference implementation, the Hadoop framework. We focus on the aspect of intermediate data, that is data produced and transferred
doi:10.1145/1967422.1967427
fatcat:4hiyvk6yujbovmhqrjnw7aqewy