Cell-based Causality for Data Repairs

Maxime Debosschere, Floris Geerts
2015 Workshop on the Theory and Practice of Provenance  
In recent work, Salimi and Bertossi provide a tight connection between causality and tuple-based data repairs. We investigate this connection between causality and two other kinds of repair models. First, we consider cell-based V -repairs, i.e., repairs that are obtained by modifying cells in the data. In contrast, tuple-based repairs only allow for the deletion of tuples. Second, we introduce a new notion of repairs, called chase repairs, that take into account the procedural (chase) steps
more » ... lead to a repair. We establish a connection between causes (and the associated notion of responsibility) and V -repairs, and analyse the complexity of verifying whether a cell is a cause and whether its responsibility is above a certain threshold. Our understanding of chase repairs is still very preliminary, and we argue that provenance models that are specifically targeted to data repairs and data quality in general are needed to make formal connections between causality and chase repairs.
dblp:conf/tapp/DebosschereG15 fatcat:qq3aksobvfbwdlasfkdvnin5fm