BRAID - A Hybrid Processing Architecture for Big Data

Corinna Giebler, Christoph Stach, Holger Schwarz, Bernhard Mitschang
2018 Proceedings of the 7th International Conference on Data Science, Technology and Applications  
The Internet of Things is applied in many domains and collects vast amounts of data. This data provides access to a lot of knowledge when analyzed comprehensively. However, advanced analysis techniques such as predictive or prescriptive analytics require access to both history data (i.e., long-term persisted data) and real-time data, as well as a joint view of both types of data. State-of-the-art hybrid processing architectures for big data, namely the Lambda and the Kappa Architecture, support the processing of history data and real-time data. However, they lack a tight coupling of the two processing modes. That is, the user has to do a lot of work manually in order to enable a comprehensive analysis of the data. For instance, the user has to combine the results of both processing modes or apply knowledge from one processing mode to the other. Therefore, we introduce a novel hybrid processing architecture for big data, called BRAID. BRAID intertwines the processing of history data and real-time data by adding communication channels between the batch engine and the stream engine. This makes it possible to carry out comprehensive analyses automatically at a reasonable overhead.
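The idea of communication channels between a batch engine and a stream engine can be illustrated with a small toy sketch. It is not the BRAID implementation: the class names `BatchEngine` and `StreamEngine`, the two in-memory queues, the mean-based baseline, and the threshold rule are all assumptions chosen for illustration. The batch side derives knowledge from history data and pushes it to the stream side, while the stream side forwards every real-time value back so that it becomes part of the history.

```python
from collections import deque
from statistics import mean


class BatchEngine:
    """Hypothetical batch engine working on long-term persisted (history) data."""

    def __init__(self, history):
        self.history = list(history)

    def compute_baseline(self):
        # Knowledge derived from history data: here simply the mean value.
        return mean(self.history)

    def ingest(self, channel):
        # Drain the stream-to-batch channel into the history store.
        while channel:
            self.history.append(channel.popleft())


class StreamEngine:
    """Hypothetical stream engine working on real-time data."""

    def __init__(self, threshold):
        self.threshold = threshold
        self.baseline = None

    def receive_baseline(self, channel):
        # Drain the batch-to-stream channel; the latest baseline wins.
        while channel:
            self.baseline = channel.popleft()

    def process(self, value, to_batch):
        # Forward the raw value so it becomes history data later on.
        to_batch.append(value)
        if self.baseline is None:
            return False  # no batch knowledge received yet
        # Apply batch knowledge to the real-time value.
        return abs(value - self.baseline) > self.threshold


def run_demo():
    # Two channels (assumed to be simple in-memory queues) couple the engines.
    batch_to_stream = deque()
    stream_to_batch = deque()

    batch = BatchEngine([20.0, 21.0, 19.0, 20.0])
    stream = StreamEngine(threshold=5.0)

    # Batch -> stream: publish the baseline learned from history data.
    batch_to_stream.append(batch.compute_baseline())
    stream.receive_baseline(batch_to_stream)

    # Stream processing flags values that deviate from the baseline.
    flags = [stream.process(v, stream_to_batch) for v in [20.5, 30.0, 19.5]]

    # Stream -> batch: real-time values are persisted as new history data.
    batch.ingest(stream_to_batch)
    return flags, len(batch.history), batch.compute_baseline()


if __name__ == "__main__":
    print(run_demo())
```

In this sketch the user does not combine the results of the two processing modes by hand; the channels carry the information in both directions. A real deployment would presumably replace the queues with a messaging system and the engines with actual batch and stream processing frameworks.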
doi:10.5220/0006861802940301 dblp:conf/data/GieblerSSM18 fatcat:7daepor6eraclebzrlk2b6enzm