A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2018; you can also visit the original URL.
The file type is application/pdf
.
A Comparison of ORC-Compress Performance with Big Data Workload on Virtualization
2016
Applied Mechanics and Materials
Big Data is widely used in many organizations nowadays. Hive is an open source data warehouse system for managing large data set. It provides a SQL-like interface to Hadoop over Map-Reduce framework. Currently, Big Data solution starts to adopt HiveQL tool to improve execution time of relational information. In this paper, we investigate on an execution time of query processing issues comparing two algorithm of ORC file: ZLIB and SNAPPY. The results show that ZLIB can compress data up to 87%
doi:10.4028/www.scientific.net/amm.855.153
fatcat:vdy2ovsjbzamhly4zkwpiywtgm