A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2017; you can also visit the original URL.
The file type is application/pdf
.
A performance evaluation of Hive for scientific data management
2013
2013 IEEE International Conference on Big Data
It is very important to evaluate the MapReducebased frameworks for scientific data processing applications. Scientists need a low-cost, scalable, easy-to-use and faulttolerance platform for large volume data processing eagerly. This paper presents an implementation of a scientific data management benchmark, SSDB, on Hive, a MapReduce-based data warehouse. A complete strategy of migrating SSDB to Hive is described in detail including query HQL implementation, data partition schema and
doi:10.1109/bigdata.2013.6691696
dblp:conf/bigdataconf/LiuLLL13
fatcat:mvgsipp2dbeujh6r3t3caexzca