A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2019; you can also visit the original URL.
The file type is application/pdf
.
Architectural Impact on Performance of In-memory Data Analytics: Apache Spark Case Study
[article]
2016
arXiv
pre-print
While cluster computing frameworks are continuously evolving to provide real-time data analysis capabilities, Apache Spark has managed to be at the forefront of big data analytics for being a unified framework for both, batch and stream data processing. However, recent studies on micro-architectural characterization of in-memory data analytics are limited to only batch processing workloads. We compare micro-architectural performance of batch processing and stream processing workloads in Apache
arXiv:1604.08484v1
fatcat:tp3yp5g32nek3d2ndr73vaavf4