Repair-oriented relational schemas for multidimensional databases

Mahkameh Yaghmaie, Leopoldo Bertossi, Sina Ariyan
2012 Proceedings of the 15th International Conference on Extending Database Technology - EDBT '12  
Summarizability in a multidimensional (MD) database refers to the correct reusability of pre-computed aggregate queries (or views) when computing higher-level aggregations or rollups. A dimension instance has this property if and only if it is strict and homogeneous. A dimension instance may fail to satisfy either of these two semantics conditions, and has to be repaired, restoring strictness and homogeneity. In this work, we take a relational approach to the problem of repairing dimension
more » ... nces. A dimension repair is obtained by translating the dimension instance into a relational instance, repairing the latter using established techniques in the relational framework, and properly inverting the process. We show that the common relational star and snowflake schemas for MD databases are not the best choice for this process. Actually, for this purpose, we propose and formalize the path relational schema, which becomes the basis for obtaining dimensional repairs. The path schema turns out to have useful properties in general, as a basis for a relational representation and implementation of MD databases and data warehouses. It is also particularly suitable for restoring MD summarizability through relational repairs. We compare the dimension repairs so obtained with existing repair approaches for MD databases.
doi:10.1145/2247596.2247644 dblp:conf/edbt/YaghmaieBA12 fatcat:2irhe6fuxvcuhmv4f22w2lcyc4