Principles of P2P Data Integration

Maurizio Lenzerini
2004 International Workshop on Data Integration over the Web  
Integrating heterogeneous computational resources and databases, which are distributed over highly dynamic computer networks, is one of the crucial challenges at the current evolutionary stage of Information Technology infrastructures. Large enterprises, business organizations, e-government systems, and, in short, any kind of internetworking community today need integrated and virtualized access to distributed information resources, which grow in number, kind, and complexity. Most of the formal approaches to data integration refer to an architecture based on a global schema and a set of sources. The sources contain the real data, while the global schema provides a reconciled, integrated, and virtual view of the underlying sources. As observed in several contexts, this centralized architecture is not the best choice for supporting data integration, cooperation, and coordination in highly dynamic computer networks. A more appealing architecture is the one based on peer-to-peer systems. In these systems, every peer acts as both client and server, and provides part of the overall information available from a distributed environment, without relying on a single global view. In this paper, we study the problem of data integration in peer-to-peer systems, with the aim of singling out the principles that should form the basis for the design of data integration systems in this architecture. Particular emphasis is given to the problem of assigning formal semantics to peer-to-peer data integration. We discuss two different methods for defining such a semantics, and we compare them with respect to the above-mentioned principles.
1 Technically, we resort to epistemic FOL with standard names, and therefore with a fixed domain and rigid interpretation of constants [22].
dblp:conf/diweb/Lenzerini04 fatcat:r6bgxgs2wnd2nn62ykpy3wjgia