Graph Based Data Governance Model for Real Time Data Ingestion

Hiren Dutta
2016 International Journal of Information Technology and Computer Science  
Data governance is one of the strongest pillars in Data management program which goes hand in hand with data quality. In industrial Data Lake huge amount of unstructured data is getting ingested at high velocity fro m different source systems. Similarly, through mult iple channels of data are getting queried and transformed fro m Data Lake. Based on 3Vs of big data it's a real challenge to set up a rule based on traditional data governance system for an Enterprise. In today's world governance
more » ... semi structured or unstructured data on Industrial Data lake is a real issue to the Enterprise in terms of query, create, maintain and storage effectively and secured way. On the other hand different stakeholders i.e. Business, IT and Policy team want to visualize the same data in different view to analyze, imposes constraints, and to place effective workflow mechanis m for approval to the policy makers. In this paper author proposed property graph based governance architecture and process model so that real time unstructured data can effectively govern, visualize, manage and queried from Industrial Data Lake.
doi:10.5815/ijitcs.2016.10.07 fatcat:77k2lfjg5zdk5hrbv5xkqyjkbe