Data Life Cycle: Towards a Reference Architecture

Mohammed EL Arass
2020 International Journal of Advanced Trends in Computer Science and Engineering  
Data management becomes highly complex with the emergence of Big Data era. Different organizations lean to produce high quality frameworks to manage data throughout their lifecycle like the developed architecture for Big Data named NIST Big Data Reference Architecture (NBD-RA). This paper aims to extend NBD-RA by adding phases to its main component, Big Data Application Provider, in order to fit with Big Data requirements. Also, the enhanced version enriches the NIST architecture and could be
more » ... open reference architecture allowing companies that want to create value from their collected data "Big" and manage it in order to transform them into "Smart" Data. To achieve this purpose, we have followed a methodology that aims to study first the foundation of NBD-RA then identify and analyze the most relevant data lifecycles. Then we define the phases that enrich the NIST architecture. We validated the proposed architecture through a case study of a company that wants to manage the huge amount of information and events produced by all the IT infrastructure including designing, implementing and testing a security information and events system POC (Proof Of Concept) made up of a Big Data platform and open source security tools.
doi:10.30534/ijatcse/2020/215942020 fatcat:rbnyq5fsv5hfpo2oknvet62t7y