Benchmarking dataflow systems for scalable machine learning

Christoph Boden, Technische Universität Berlin, Volker Markl
Our results show that, while able to scale robustly with increasing data set sizes, current state-of-the-art dataflow systems are surprisingly inefficient at coping with high-dimensional data, which ... It does not cover machine learning workloads. • TPC Express Benchmark™ HS (TPCx-HS) is a benchmark developed to evaluate commercial Apache Hadoop File System API compatible software distributions such as ... tuples (k2, v2). ...
doi:10.14279/depositonce-7532 fatcat:mwjd6bnzvjaknbjnfohjokkj4y