XML based Mediated Query Re-writing Framework

Jahangir Khan, Muhammad Ahmed, Muhammad Khalid Khan
2011 International Journal of Computer Applications  
Information Integration is the task of combining information from heterogeneous data sources and presenting it to users in a unified representation. Many application architectures have been designed for Enterprise Information Integration to address the problems of semantic heterogeneity (the modeling problem) and query optimization (the querying problem). These include mediator-based architectures, in which information comes from disseminated sources; agent-based architectures, in which software agents specialized in specific tasks work together to provide integration services; and federated architectures, in which data is integrated through message-oriented middleware. Enterprise Information Integration depends on sophisticated technologies and complex architectures. Query optimization and management is nevertheless the major area of research in XML-based integration systems, since XQuery, which deals with both structured and semi-structured data sources, precludes the features of traditional SQL or OQL. The focus of this paper is to present a solution to the problem of query optimization in XML-based data integration in a hybrid peer-to-peer data management environment. The contributions of this paper are: a conceptual framework for an Information Integration System based on an XML query language, the formulation of a rewriting algorithm for XML queries, and an implementation of the proposed algorithm.

General Terms: Data Integration Services, distributed database management system, enterprise and Information Services.

Access: The framework should be able to provide access to virtually all kinds of data sources, including relational databases, flat files, legacy mainframe systems, XML web data, and even packaged data from Enterprise Resource Planning (ERP) systems.

Integrate: The framework should be flexible enough to deploy across all architectures, including client/server, Web/application server (XML, ADO/ASP, etc.), and distributed computing, supporting the major component models of Microsoft COM, EJB, and CORBA. To integrate disparate data into information, the framework should offer key data integration features, including a global business metadata catalog, ontology, or global schema.

Manage: The framework should be predictable to maintain. It should include centralized management technologies, including named services through LDAP or Microsoft Active Directory Services.
Secure: The framework must address the security issues that an enterprise faces today or can expect to face in the future. The framework can enable centralized control of data access, resource utilization, and security. It should support multilevel authentication at the database, application, host, and system levels.

Scalability: The framework must deliver the performance and scalability required by online systems, and should account for the new demands of e-business.

We now discuss some benefits of the information integration framework in terms of business and technology aspects.
doi:10.5120/1769-2427 fatcat:szghb74re5glxeywghqkhzz2um