Comparison of streaming big data platforms [article]

Michail Vrachasotakis, National Technological University Of Athens
2018
The sheer volume of Big Data and the rate at which it is continuously generated have made distributed streaming engines a necessity. Many open-source tools exist for this purpose. Two of the most heavily used, Apache Spark and Apache Flink, are compared here to determine which is better suited to a particular streaming scenario. The experiment runs on a small cluster using open benchmarks, and the metrics are the median latency and the percentage of data that contributes to the highest latency.
doi:10.26240/heal.ntua.15438 fatcat:rlglcxgjajek5kpmpcv52uq62m