Partitioned schedules for clustered VLIW architectures

M.M. Fernandes, J. Llosa, N. Topham
Proceedings of the First Merged International Parallel Processing Symposium and Symposium on Parallel and Distributed Processing  
This paper presents results on a new approach to partitioning a modulo-scheduled loop for distributed execution on parallel clusters of functional units organized as a VLIW machine. A distinctive characteristic of this architecture is the use of register files organized by means of queues, which results in a number of advantages over conventional schemes, but also requires the development of specific compiling and hardware features. We have investigated a scheme based on copy operations to deal
more » ... with data values to be consumed more than once during loop execution. Experiments with loop unrolling were also performed in order to optimize both loop execution and the use of machine resources. A partitioning algorithm has been implemented to perform some experiments with the clustered architecture model, an organization widely accepted as being essential for very wide issue machines. 1
doi:10.1109/ipps.1998.669945 dblp:conf/ipps/FernandesLT98 fatcat:afy6wpmcmrahzncrkgwtenwqlq