Transforming statistical linked data for use in OLAP systems

Benedikt Kämpgen, Andreas Harth
2011 Proceedings of the 7th International Conference on Semantic Systems - I-Semantics '11  
The amount of available Linked Data on the Web is increasing, and data providers start to publish statistical datasets that comprise numerical data. Such statistical datasets differ significantly from the currently predominant network-style data published on the Web. We explore the possibility of integrating statistical data from multiple Linked Data sources. We provide a mapping from statistical Linked Data into the Multidimensional Model used in data warehouses. We use an ... (ETL) pipeline to convert statistical Linked Data into a format suitable for loading into an open-source OLAP system, and thus demonstrate how standard OLAP infrastructure can be used for elaborate querying and visualisation of integrated statistical Linked Data. We discuss lessons learned from three experiments and identify areas which require future work to ultimately arrive at a well-interlinked set of statistical data from multiple sources which is processable with standard OLAP systems.
doi:10.1145/2063518.2063523 dblp:conf/i-semantics/KampgenH11 fatcat:ncxsu5o6bvgbnmshojcihjq6rm