Dynamic access ordering for streamed computations

D.A.B. Weikle, S.I. Hong, M.H. Salinas, R.H. Klenke, J.H. Aylor, W.A. Wulf, S.A. McKee
2000 IEEE transactions on computers  
AbstractÐMemory bandwidth is rapidly becoming the limiting performance factor for many applications, particularly for streaming computations such as scientific vector processing or multimedia (de)compression. Although these computations lack the temporal locality of reference that makes traditional caching schemes effective, they have predictable access patterns. Since most modern DRAM components support modes that make it possible to perform some access sequences faster than others, the
more » ... ability of the stream accesses makes it possible to reorder them to get better memory performance. We describe a Stream Memory Controller (SMC) system that combines compile-time detection of streams with execution-time selection of the access order and issue. The SMC effectively prefetches read-streams, buffers write-streams, and reorders the accesses to exploit the existing memory bandwidth as much as possible. Unlike most other hardware prefetching or stream buffer designs, this system does not increase bandwidth requirements. The SMC is practical to implement, using existing compiler technology and requiring only a modest amount of specialpurpose hardware. We present simulation results for fast-page mode and Rambus DRAM memory systems and we describe a prototype system with which we have observed performance improvements for inner loops by factors of 13 over traditional access methods. Index TermsÐMemory systems architecture, memory latency, memory bandwidth, memory access ordering, memory access scheduling.
doi:10.1109/12.895941 fatcat:gj5dhemidrgv3ewc5nwlht2vzy