Grid Workflow Software for a High-Throughput Proteome Annotation Pipeline [chapter]

Adam Birnbaum, James Hayes, Wilfred W. Li, Mark A. Miller, Peter W. Arzberger, Phililp E. Bourne, Henri Casanova
2005 Lecture Notes in Computer Science  
The goal of the Encyclopedia of Life (EOL) Project is to predict structural information for all proteins, in all organisms. This calculation presents challenges both in terms of the scale of the computational resources required (approximately 1.8 million CPU hours), as well as in data and workflow management. While tools are available that solve some subsets of these problems, it was necessary for us to build software to integrate and manage the overall Grid application execution. In this
more » ... we present this workflow system, detail its components, and report on the performance of our initial prototype implementation for runs over a large-scale Grid platform during the SC'03 conference. 2
doi:10.1007/978-3-540-32251-1_7 fatcat:ajnovyiy5bhd5dlslkq4pqw3sa