Integration and dimensional modeling approaches for complex data warehousing

O. Boussaid, Adrian Tanasescu, Fadila Bentayeb, Jérôme Darmont
2006 Journal of Global Optimization  
With the broad development of the World Wide Web, various kinds of heterogeneous data (including multimedia data) are now available to decision support tasks. A data warehousing approach is often adopted to prepare data for relevant analysis. Data integration and dimensional modeling indeed allow the creation of appropriate analysis contexts. However, the existing data warehousing tools are wellsuited to classical, numerical data. They cannot handle complex data. In our approach, we adapt the
more » ... ree main phases of the data warehousing process to complex data. In this paper, we particularly focus on two main steps in complex data warehousing. The first step is data integration. We define a generic UML model that helps representing a wide range of complex data, including their possible semantic properties. Complex data are then stored in XML documents generated by a piece of software we designed. The second important phase we address is the preparation of data for dimensional modeling. We propose an approach that exploits data mining techniques to assist users in building relevant dimensional models.
doi:10.1007/s10898-006-9064-6 fatcat:3j7a775airddbdnbym2wrkrsky