Filters








993 Hits in 2.2 sec

A taxonomy of ETL activities

Panos Vassiliadis, Alkis Simitsis, Eftychia Baikousi
2009 Proceeding of the ACM twelfth international workshop on Data warehousing and OLAP - DOLAP '09  
In doing so, we follow a black-box approach and provide a taxonomy that characterizes ETL activities in terms of the relationship of their input to their output and provide a normal form that is based  ...  However, each one of them follows a different approach for the modeling of ETL activities; i.e., of the building blocks of an ETL workflow.  ...  A RATIONALE FOR THE TAXONOMY An ETL workflow can be seen as a directed graph. The nodes of this graph are activities and recordsets.  ... 
doi:10.1145/1651291.1651297 dblp:conf/dolap/VassiliadisSB09 fatcat:scr4tns4bzeudd7qeaghfrkgre

Representing Interoperable Provenance Descriptions for ETL Workflows [chapter]

André Freitas, Benedikt Kämpgen, João Gabriel Oliveira, Seán O'Riain, Edward Curry
2015 Lecture Notes in Computer Science  
This paper proposes the convergence of this two Web data management concerns, introducing a principled provenance model for ETL processes in the form of a vocabulary based on the Open Provenance Model  ...  The proposed ETL provenance model is instantiated in a real-world sustainability reporting scenario.  ...  [16] investigate generic properties present in ETL activities across different ETL implementations. These properties build the base for the construction of a taxonomy.  ... 
doi:10.1007/978-3-662-46641-4_4 fatcat:o7d5txmqa5ai7iiohmxwezr7qm

ETL queues for active data warehousing

Alexandros Karakasidis, Panos Vassiliadis, Evaggelia Pitoura
2005 Proceedings of the 2nd international workshop on Information quality in information systems - IQIS '05  
In our framework, we have implemented ETL activities over queue networks and employ queue theory for the prediction of the performance and the tuning of the operation of the overall refreshment process  ...  In this paper, we propose a framework for the implementation of active data warehousing, with the following goals: (a) minimal changes in the software configuration of the source, (b) minimal overhead  ...  A Taxonomy of ETL Activities Each ETL queue can direct customers to more than one subsequent queue, depending on the type of operation it performs.  ... 
doi:10.1145/1077501.1077509 dblp:conf/iqis/KarakasidisVP05 fatcat:exuh5ynz7raobi3rhbjmaze4hm

A Review of Contemporary Data Quality Issues in Data Warehouse ETL Environment

Rupali Gill, Jaiteg Singh
2014 Journal on Today s Ideas-Tomorrow s Technologies  
The task of carrying out the eTl process is potentially a complex, hard and time consuming. Organisations now -a-days are concerned about vast qualities of data.  ...  This work proposes a framework for quality of extraction transformation and loading of data into a warehouse.  ...  The researcher provided a taxonomy that characterizes eTl activities in terms of the relationship of their input to their output and the proposed taxonomy can be used in the construction of larger modules  ... 
doi:10.15415/jotitt.2014.22012 fatcat:ytxrw4p6kjeixjy5fmohmywd4q

An Open Source ETL Tool - Medium and Small Scale Enterprise ETL(MaSSEETL)

Rupali Gill, Jaiteg Singh
2014 International Journal of Computer Applications  
In order to bring all the data together in a standard, homogeneous environment, Extraction-transformationloading (ETL) tools are used.  ...  In Data Warehouse (DW) environment, Extraction-Transformation-Loading (ETL) processes consumes up to 70% of resources.  ...  The researcher provided a taxonomy that characterizes ETL activities in terms of the relationship of their input to their output and the proposed taxonomy that can be used in the construction of larger  ... 
doi:10.5120/18899-0190 fatcat:4ucwdlgqeva5fjzi7x3k2c7btm

Business Intelligence in Environmental Reporting Powered by XBRL

Michal Hodinka, Michael Štencl, Jiří Hřebíček, Oldřich Trenz
2014 Acta Universitatis Agriculturae et Silviculturae Mendelianae Brunensis  
This phenomenon commonly named as "Big Data", has transformed from a vague description of massive corporate data to a household term that refers to not just volume but the diversity of data and velocity  ...  The methodology refl ects today's technical standards of XBRL accordingly to the application via ETL process.  ...  Name of the Project: Construction of Methods for Multi Factor Assessment of Company Complex Performance in Selected Sectors. Registration No. P403/11/2085.  ... 
doi:10.11118/actaun201462020355 fatcat:bofryuvx3jbkfeax7skygdoshi

Towards a Taxonomy of Real-Time Business Intelligence Systems

Mario Nadj, Christian Schieder
2017 European Conference on Information Systems  
For practice, our taxonomy helps organizations to evaluate their RTBI systems or conceive the challenges of building such a system either from scratch or as an update of their existing infrastructure.  ...  Our taxonomy may serve as a foundational step to incorporate a broader theoretical perspective to integrate concepts and findings across all seven dimensions.  ...  Löser, A., S. Lutter, P. Düssel and V. Markl (2009) Conference on  ... 
dblp:conf/ecis/NadjS17 fatcat:sfyw2rjtynb2tora7dztbm2stm

Introduction to the special issue on data quality

Mourad Ouzzani, Paolo Papotti, Erhard Rahm
2013 Information Systems  
The authors deal with the problem of scheduling the execution of ETL activities, such as transformations and data quality operations, with the goal of minimizing execution time and allocated memory.  ...  The last paper is a survey by Dinusha Vatsalan, Peter Christen, and Vassilios S. Verykios titled ''A taxonomy of privacy-preserving record linkage techniques''.  ... 
doi:10.1016/j.is.2013.03.001 fatcat:aot256o53vaptctbwexdlk65ei

A Framework for the Design of ETL Scenarios [chapter]

Panos Vassiliadis, Alkis Simitsis, Panos Georgantas, Manolis Terrovitis
2003 Lecture Notes in Computer Science  
Moreover, we present a palette of several templates, representing frequently used ETL activities along with their semantics and their interconnection.  ...  We describe a framework for the declarative specification of ETL scenarios with two main characteristics: genericity and customization.  ...  Generic Model of ETL Activities The purpose of this section is to present a formal logical model for the activities of an ETL environment.  ... 
doi:10.1007/3-540-45017-3_35 fatcat:ctwnnuvio5gzzmfm2xtxewyrhq

Benchmarking ETL Workflows [chapter]

Alkis Simitsis, Panos Vassiliadis, Umeshwar Dayal, Anastasios Karagiannis, Vasiliki Tziovara
2009 Lecture Notes in Computer Science  
A plethora of ETL tools is currently available constituting a multi-million dollar market.  ...  In this paper, we identify common characteristics of ETL workflows in an effort of proposing a unified evaluation method for ETL.  ...  Since such constructs are based on the classification of ETL activities discussed before, they form a taxonomy as aid for designing or understanding complex ETL workflows.  ... 
doi:10.1007/978-3-642-10424-4_15 fatcat:mzyiwt6iwbgghkpsbgxhp7yo2y

Meeting the Need for ETL Documentation: A Model-driven Framework for Customizable Documentation Generation

Frieder Jacobi, Robert Krawatzeck, Marcus Hofmann
2012 Americas Conference on Information Systems  
To keep costs and expenditure of time for maintenance and evolution of those systems slight, ETL processes should be documented.  ...  Within Business Intelligence systems (BI systems), ETL (extract, transform and load) processes move numerous data from heterogeneous sources to a data warehouse and become more complex with growing enterprise  ...  A Model-driven Framework for ETL Documentation Generation Proceedings of the Eighteenth Americas Conference on Information Systems, Seattle, Washington, August 9-12, 2012.  ... 
dblp:conf/amcis/JacobiKH12 fatcat:3rfdql4r25beveqtnfny3ytyja

Integration of Plot-based Ecology Data: A Semantic Approach

Siddeswara Guru, Simon Cox, Edmond Chuc
2019 International Semantic Web Conference  
Typically, use of these data for analysis is confined to a jurisdiction from where the data was collected.  ...  There is a large amount of plot-based ecological data collected by different agencies and at different jurisdictions.  ...  The core structure of the TERN-Plot ontology consists of classes and properties to describe plots, sampling activities that happen within a plot, and an observation or collection of observations which  ... 
dblp:conf/semweb/GuruCC19 fatcat:knheimi43naujebmseijbmccy4

Integrating Big Data: A Semantic Extract-Transform-Load Framework

Srividya K. Bansal, Sebastian Kagemann
2015 Computer  
effective data integration, facilitating the creation of smart urban apps for smarter living.  ...  The proposed Semantic Extract-Transform-Load (ETL) framework that uses semantic technologies to integrate and publish data from multiple sources as open linked data provides an extensible solution for  ...  One of the popular approaches to data integration has been ETL as shown in, which describe the taxonomy of activities in ETL and a framework using a workflow approach to design ETL activities.  ... 
doi:10.1109/mc.2015.76 fatcat:qmwmjxtdrza7ncgbun6g65s5fa

Quality Measures for ETL Processes [chapter]

Vasileios Theodorou, Alberto Abelló, Wolfgang Lehner
2014 Lecture Notes in Computer Science  
ETL processes play an increasingly important role for the support of modern business operations.  ...  The apparent complexity of these activities has been examined through the prism of Business Process Management, mainly focusing on functional requirements and performance optimization.  ...  This way we reviewed a commonly accepted, generic taxonomy of software quality attributes, while at the same time avoiding the adherence to more recent, strictly defined standards for practical industrial  ... 
doi:10.1007/978-3-319-10160-6_2 fatcat:qxlnb2ncgfhorixmr2je3jzioy

Deciding the physical implementation of ETL workflows

Vasiliki Tziovara, Panos Vassiliadis, Alkis Simitsis
2007 Proceedings of the ACM tenth international workshop on Data warehousing and OLAP - DOLAP '07  
In this paper, we deal with the problem of determining the best possible physical implementation of an ETL workflow, given its logical-level description and an appropriate cost model as inputs.  ...  We experimentally assess our method based on a principled organization of test suites.  ...  Finally, due to the lack of any standard, commonly agreed set of test suites for ETL workflows, we build upon a taxonomy for ETL workflows that classifies typical real-world ETL workflows in different  ... 
doi:10.1145/1317331.1317341 dblp:conf/dolap/TziovaraVS07 fatcat:ljnyn233g5cdhptah5rqmmbhae
« Previous Showing results 1 — 15 out of 993 results