Discovering functional dependencies for multidimensional design

Oscar Romero, Diego Calvanese, Alberto Abelló, Mariano Rodríguez-Muro
2009 Proceeding of the ACM twelfth international workshop on Data warehousing and OLAP - DOLAP '09  
Nowadays, it is widely accepted that the data warehouse design task should be largely automated. Furthermore, the data warehouse conceptual schema must be structured according to the multidimensional model and as a consequence, the most common way to automatically look for subjects and dimensions of analysis is by discovering functional dependencies (as dimensions functionally depend of the fact) over the data sources. Most advanced methods for automating the design of the data warehouse carry
more » ... ut this process from relational OLTP systems, assuming that a RDBMS is the most common kind of data source we may find, and taking as starting point a relational schema. In contrast, in our approach we propose to rely instead on a conceptual representation of the domain of interest formalized through a domain ontology expressed in the DL-Lite Description Logic. In our approach, we propose an algorithm to discover functional dependencies from the domain ontology that exploits the inference capabilities of DL-Lite, thus fully taking into account the semantics of the domain. We also provide an evaluation of our approach in a real-world scenario.
doi:10.1145/1651291.1651293 dblp:conf/dolap/RomeroCAR09 fatcat:krfswm3arbhdpifnyzqqj4g4nq