Increasing Memory Bandwidth for Vector ComputationsReport
Memory bandwidth is rapidly becoming the performance bottleneck in the application of high performance micro- processors to vector-like algorithms, including the "Grand Challenge" scientific problems. Caching is not the sole solution for these applications due to the poor temporal and spatial locality of their data accesses. Moreover, the nature of memories themselves has changed. Achieving greater bandwidth requires exploiting the characteristics of memory components "on the other side of the cache" - they should not be treated as uniform access-time RAM. This paper describes the use of hardware-assisted access ordering, a technique that combines compile-time detection of memory access patterns with a memory subsystem that decouples the order of requests generated by the processor from that issued to the memory system. This decoupling permits the requests to be issued in an order that optimizes use of the memory system. Our simulations show significant speedup on important scientific ker- nels.
All rights reserved (no additional license for public reuse)
McKee, Sally, Steven Moyer, Wm Wulf, and Charles Hitchcock. "Increasing Memory Bandwidth for Vector Computations." University of Virginia Dept. of Computer Science Tech Report (1993).
University of Virginia, Department of Computer Science