Scheduling data-intensive bags of tasks in P2P grids with BitTorrent-enabled data distribution

Cyril Briquet, Xavier Dalem, Sébastien Jodogne, Pierre-Arnoul de Marneffe
2007 Proceedings of the second workshop on Use of P2P, GRID and agents for the development of content networks - UPGRADE '07  
Scheduling Data-Intensive Bags of Tasks in P2P Grids leads to transfers of large input data files, which cause delays in completion times. We propose to combine several existing technologies and patterns to perform efficient data-aware scheduling: (1) use of the BitTorrent P2P file sharing protocol to transfer data, (2) data caching on computational Resources, (3) use of a data-aware Resource selection scheduling algorithm similar to Storage Affinity, (4) a new Task selection scheduling algorithm (Temporal Tasks Grouping), based on the temporally grouped scheduling of Tasks sharing input data files. Data replication is also discussed. The proposed approach does not need an overlay network or Predictive Communications Ordering, making our operational implementation of a P2P Grid middleware easily deployable in unstructured P2P networks. Experiments show that performance gains are achieved by combining BitTorrent, caching, Storage Affinity and Temporal Tasks Grouping. This work can be summarized as combining P2P Grid computing and P2P data transfer technologies.
doi:10.1145/1272980.1272990 dblp:conf/hpdc/BriquetDJM07 fatcat:ao7eqpipwjdrxhfoeuqem2m3h4
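
The abstract names two scheduling ideas without detailing them: selecting Resources that already cache a Task's input files, and scheduling Tasks that share input files close together in time. The following is a minimal illustrative sketch of how those two ideas might fit together. It is not the authors' implementation. All names (`Task`, `Resource`, `group_by_shared_inputs`, `select_resource`, `schedule`) are hypothetical, grouping is simplified to Tasks with identical input sets, and Resource load, BitTorrent transfers and data replication are ignored.

```python
from dataclasses import dataclass, field


@dataclass
class Task:
    name: str
    inputs: frozenset  # names of the input data files this Task reads


@dataclass
class Resource:
    name: str
    cache: set = field(default_factory=set)  # files already cached locally


def group_by_shared_inputs(tasks):
    """Reorder Tasks so that Tasks with identical input sets are adjacent.

    Simplifying assumption: "sharing input data files" is reduced to
    "having exactly the same set of input files".
    """
    groups = {}
    for task in tasks:
        groups.setdefault(task.inputs, []).append(task)
    # Dicts keep insertion order, so groups appear in first-seen order.
    return [task for group in groups.values() for task in group]


def cached_bytes(task, resource, sizes):
    """Bytes of the Task's inputs that the Resource already holds."""
    return sum(sizes[f] for f in task.inputs if f in resource.cache)


def select_resource(task, resources, sizes):
    """Pick the Resource holding the most input bytes (first one on ties)."""
    return max(resources, key=lambda r: cached_bytes(task, r, sizes))


def schedule(tasks, resources, sizes):
    """Return a list of (task name, resource name) assignments."""
    plan = []
    for task in group_by_shared_inputs(tasks):
        chosen = select_resource(task, resources, sizes)
        # Assumption: once a Task has run, its inputs stay in the cache.
        chosen.cache |= task.inputs
        plan.append((task.name, chosen.name))
    return plan
```

Because grouped Tasks are scheduled back to back, the first Task of a group populates a cache and the following Tasks of that group are steered to the same Resource, which avoids repeated transfers of the shared files. A practical scheduler would also have to balance load across Resources, which this sketch deliberately leaves out.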