Large-Scale Peer-to-Peer Autonomic Monitoring

Joao Leitao, Liliana Rosa, Luis Rodrigues
2008 IEEE Globecom Workshops
The increasing scale and complexity of distributed systems motivate the need for autonomous management. A key aspect of managing distributed systems is component monitoring. Component monitoring is particularly challenging in large-scale dynamic systems, given the need to ensure that each component is monitored by at least one non-faulty component, despite joins, leaves, and failures, both at the node level and at the network level. This paper proposes that components
... nize in an unstructured overlay network of constant degree to ensure that each component is always monitored by at least a threshold number of other components. This work was partially funded by FCT project REDICO - Dynamic Reconfiguration of Communication Protocols - (PTDC/EIA/
doi:10.1109/glocomw.2008.ecp.18 fatcat:loy5zqzrxbctnl77rcgo2ivebq