Metadata for Managing Grid Resources in Data Mining Applications

Carlo Mastroianni, Domenico Talia, Paolo Trunfio
2004 Journal of Grid Computing  
The Grid is an infrastructure for resource sharing and coordinated use of those resources in dynamic heterogeneous distributed environments. The effective use of a Grid requires the definition of metadata for managing the heterogeneity of involved resources that include computers, data, network facilities, and software tools provided by different organizations. Metadata management becomes a key issue when complex applications, such as dataintensive simulations and data mining applications, are
more » ... xecuted on a Grid. This paper discusses metadata models for heterogeneous resource management in Grid-based data mining applications. In particular, it discusses how resources are represented and managed in the KNOWLEDGE GRID, a framework for Grid-enabled distributed data mining. The paper illustrates how XML-based metadata is used to describe data mining tools, data sources, mining models, and execution plans, and how metadata is used for the design and execution of distributed knowledge discovery applications on Grids.
doi:10.1007/s10723-004-2809-x fatcat:5x7vr3nvgnfstayaa6zsarldfy