1,871 Hits in 4.4 sec

Rule-Based Management of Schema Changes at ETL Sources [chapter]

George Papastefanatos, Panos Vassiliadis, Alkis Simitsis, Timos Sellis, Yannis Vassiliou
2010 Lecture Notes in Computer Science  
ETL activities and its sources are uniformly modeled as a graph that is annotated with rules for the management of evolution events.  ...  In this paper, we visit the problem of the management of inconsistencies emerging on ETL processes as results of evolution operations occurring at their sources.  ...  The goal is to provide a mechanism to the designer for the smooth adaptation of ETL scenarios to evolution changes occurring at their sources as well as for the early detection of vulnerable parts in the  ... 
doi:10.1007/978-3-642-12082-4_8 fatcat:wvsr27zzbjh6bdiqabn5fyniiq

LOD for Data Warehouses: Managing the Ecosystem Co-Evolution

Selma Khouri, Ladjel Bellatreche
2018 Information  
The particularity of LOD is that they contribute to evolving the DW at several levels: (i) source level, (ii) DW schema level, and (iii) DW design-cycle constructs.  ...  However, the incorporation of LOD in the DW must be accompanied by careful management.  ...  The FEDER-PLAIBDE project is directed by the lead manager of aYaline ( company, and the laboratories partners: L3i ( laboratory of La Rochelle University  ... 
doi:10.3390/info9070174 fatcat:f6tmnoodfbaijczogp3uqxos6e

Verifying Data Integration Configurations for Semantical Correctness and Completeness

Mark R Stöhr, Andreas Günther, Raphael W Majeed
2019 Studies in Health Technology and Informatics  
The technical part of data integration is based on rules interpreted by software. These rules define how to perform the translation of source database schemata into the target database schema.  ...  Data integration is the problem of combining data residing at different sources and providing the user with a unified view of these data.  ...  Second, data managers configuring the ETL mappings need a reflection of changes in the metadata that affect their configurations.  ... 
doi:10.3233/shti190807 pmid:31483256 fatcat:yuwrvikm3bbpvghmorivfmvmpe

Meta-data version and configuration management in multi-vendor environments

John R Friedrich
2005 Proceedings of the 2005 ACM SIGMOD international conference on Management of data - SIGMOD '05  
This article is based upon the real challenges found in these complicated meta-data environments, and identifies the often overlooked distinctions and importance of meta-data version and configuration  ...  management (CM), including the extensive use of automated meta-data comparison, mapping comparison, mapping generation and mapping update functions, which comprise a complete meta-data CM environment.  ...  and interfaces • A change of transformation, validation or generation rules.  ... 
doi:10.1145/1066157.1066251 dblp:conf/sigmod/Friedrich05 fatcat:v3fhqup2hfdxjj6c5za5m7ugcy

Toward Evolution Models for Data Warehouses

Saïd Taktak, Jamel Feki, Gilles Zurfluh
2014 Proceedings of the 2nd International Conference on Model-Driven Engineering and Software Development  
Hence, evolutions of the DS schema need to be propagated to the DW schema and content. This paper presents a model-driven approach for the evolution of a multidimensional DW.  ...  A Data warehouse (DW) is characterized by a complex architecture, designed in order to integrate data derived from operational data sources (DS), hence providing advanced analytical tools of these data  ...  It allows the management of changes in the decision information system levels, namely Data Warehousing and ETL.  ... 
doi:10.5220/0004877304720479 dblp:conf/modelsward/TaktakFZ14 fatcat:dibpuc3myngrxhsoxe64gjf3li

Special Issue on: Evolution and Versioning in Semantic Data Integration Systems

Ladjel Bellatreche, Robert Wrembel
2013 Journal on Data Semantics  
Acknowledgments The guest editors would like to acknowledge the help of all involved in the review process of this special issue of the Journal on Data Semantics. The reviewers provided comprehensive,  ...  Still open issues in this area concern: modeling ETL workflows, designing taxonomy and rules for ETL evolution, deploying these rules, and plugging in the evolution techniques into existing ETL engines  ...  From our experience, schemas of EDSs may change even more frequently. For example, telecommunication data sources changed their schemas every 7-13 days, on the average.  ... 
doi:10.1007/s13740-013-0020-6 fatcat:owcwpo3viff63f32ms7g2f7q6m

Conceptual workflow for complex data integration using AXML

Rashed Salem, Omar Boussai'd, Jerome Darmont
2010 2010 International Conference on Machine and Web Intelligence  
The workflow of integration tasks is organized via a set of rules, managed and controlled by ECA and AXML engines.  ...  Notice that not all encountered changes at data sources are relevant. Such changes are called irrelevant changes and should not affect the input schema.  ... 
doi:10.1109/icmwi.2010.5648085 fatcat:4gr2yldnevfwzg3ptzuzzptgmm

An Active XML-based framework for integrating complex data

Rashed Salem, Omar Boussaïd, Jérôme Darmont
2012 Proceedings of the 27th Annual ACM Symposium on Applied Computing - SAC '12  
Secondly, beside warehousing logged events into event repository, it exploits active rules and framework events mining to self-manage, automate and activate different data integration tasks.  ...  Current data integration systems also lack of self-managing capabilities. Therefore, we propose a data integration framework for integrating complex data actively.  ...  browsing changes of data sources, and defining ECA rules.  ... 
doi:10.1145/2245276.2245449 dblp:conf/sac/SalemBD12 fatcat:lzwake4awbcnhid33cszxqpir4

A Meta Data Vault Approach for Evolutionary Integration of Big Data Sets : Case Study Using the NCBI Database for Genetic Variation

Zaineb Naamane, Vladan Jovanovic
2017 International Journal of Computer Science & Information Technology (IJCSIT)  
However, research in the problem of DW evolution has focused mainly on managing changes in the dimensional model while other aspects related to the ETL, and maintaining the history of changes has not been  ...  The DW or the data mart that is based on these data sources needs to reflect these changes.  ...  Research in the data warehouse environment has not adequately addressed the management of schema changes to source databases.  ... 
doi:10.5121/ijcsit.2017.9307 fatcat:4tsespm53jdyljpbp77imsgrj4

E-ETL: Framework for Managing Evolving ETL Workflows

Artur Wojciechowski
2013 Foundations of Computing and Decision Sciences  
Data warehouses integrate external data sources (EDSs), which very often change their data structures (schemas).  ...  Detection of changes in EDSs causes a repa- ration of the fragment of ETL workow which interacts with the changed EDSs.  ...  for managing the changes, • define rules for the evolution of ETL workflow, • present to the user the impact analyses of the ETL workflow, • store versions of the ETL workflow and history of EDS changes  ... 
doi:10.2478/fcds-2013-0005 fatcat:fswtj3wbl5ckra4446ulbo5afm

A Multi-Agent Framework for Data Extraction,Transformation and Loading in Data Warehouse

Ramzan Talib, Muhammad Kashif, Fakeeha Fatima, Shaeela Ayesha
2016 International Journal of Advanced Computer Science and Applications  
The identification of errors at different stages of ETL process become easy. This was difficult and time consuming in traditional ETL process.  ...  Extraction, Transformation and Loading (ETL) process gather data from different sources and integrate it into data warehouse.  ...  Set of rules to translate coded values and to derive new values are applied to clean and transmit the data. At this phase, schema and instance level mapping are performed to standardize the data.  ... 
doi:10.14569/ijacsa.2016.071146 fatcat:rbjgq2wuc5ez7h5i3ue7yt3lm4

Leveraging Knowledge Representation to Maintain Immunization Clinical Decision Support

Janos L Mathe, Scott D Nelson, Stuart T Weinberg, Christoph U Lehmann, Andras Nadas, Asli O Weitkamp
2018 AMIA Annual Symposium Proceedings  
processing of the available immunization content to implement mature knowledge lifecycle management practices locally.  ...  We demonstrate the creation of a tool that enables content curators to visualize, track, and implement immunization changes.  ...  For convenience, COMET computes an ETL plan-based version noting any changes to the sources processed.  ... 
pmid:30815121 pmcid:PMC6371352 fatcat:6bzgltbl7rcypjguruszja4f5u

A proposed model for data warehouse ETL processes

Shaker H. Ali El-Sappagh, Abdeltawab M. Ahmed Hendawi, Ali Hamed El Bastawissy
2011 Journal of King Saud University: Computer and Information Sciences  
Extraction-transformation-loading (ETL) tools are pieces of software responsible for the extraction of data from several sources, its cleansing, customization, reformatting, integration, and insertion  ...  The source area has standard models such as entity relationship diagram, and the destination area has standard models such as star schema, but the mapping area has not a standard model till now.  ...  It includes management rules of an enterprise's applications, data spread rules for concerned applications, and data conversion rules.  ... 
doi:10.1016/j.jksuci.2011.05.005 fatcat:fpkriw5wanb7vi7jt7gfdujcqy

Uclean: A Requirement Based Object-Oriented Etl Framework

Payal Pahwa, Shweta Taneja, Garima Thakur
2011 International Journal of Computer Science & Engineering Survey  
This framework takes into account the concept of requirements of the users .The data is extracted from different UML sources and is converted into a multidimensional model.  ...  This ETL process is the key to the success of a data warehouse. In this paper we propose a conceptual ETL framework for an object oriented data warehouse design, the framework is called UCLEAN.  ...  system Figure 3 . 3 Snowflake schema for University Management System Table 2 . 2 Metadata at different levels 4.  ... 
doi:10.5121/ijcses.2011.2404 fatcat:nr5mvvxmrje4pffpvnd4et7axq

Data management research at the Knowledge and Database Systems Lab

Timos Sellis, Yannis Vassiliou
2006 SIGMOD record  
The research achievements at KDBS Lab is the result of the work of many present and past members of the group.  ...  Sources store and manage their data locally, revealing only part of their schemas to the rest of the peers.  ...  At each routing step, the query is rewritten to the schema of its new host based on the respective acquaintance mappings.  ... 
doi:10.1145/1147376.1147389 fatcat:2oytqxfkynamvl6yxza7eanzhi
« Previous Showing results 1 — 15 out of 1,871 results