From Data Quality to Big Data Quality

Carlo Batini, Anisa Rula, Monica Scannapieco, Gianluigi Viscusi
2015 Journal of Database Management  
This article investigates the evolution of data quality issues from traditional structured data managed in relational databases to Big Data. In particular, The paper examines the nature of the relationship between Data Quality and several research coordinates that are relevant in Big Data, such as the variety of data types, data sources and application domains, focusing on maps, semistructured texts, linked open data, sensor & sensor networks and official statistics. Consequently a set of
more » ... ural characteristics is identified and a systematization of the a posteriori correlation between them and quality dimensions is provided. Finally, Big Data quality issues are considered in a conceptual framework suitable to map the evolution of the quality paradigm according to three core coordinates that are significant in the context of the Big Data phenomenon: the data type considered, the source of data, and the application domain. Thus, the framework allows ascertaining the relevant changes in data quality emerging with the Big Data phenomenon, through an integrative and theoretical literature review.
doi:10.4018/jdm.2015010103 fatcat:6qmqgd4kgrehbafk5aqq7u2tuq