Data Profiling
2017
Proceedings of the 2017 ACM International Conference on Management of Data - SIGMOD '17
One of the crucial requirements before consuming datasets for any application is to understand the dataset at hand and its metadata. The process of metadata discovery is known as data profiling. Profiling activities range from ad-hoc approaches, such as eye-balling random subsets of the data or formulating aggregation queries, to systematic inference of structural information and statistics of a dataset using dedicated profiling tools. In this tutorial, we highlight the importance of data
doi:10.1145/3035918.3054772
dblp:conf/sigmod/AbedjanGN17
fatcat:dwqqb6w6pzfu7l5nkz3m67oxsq