In our experiments with memory access patterns, we saw that good data locality is key to good software performance. Accessing memory sequentially and splitting the data set into small pieces that are processed individually both improve data locality and software speed. In this post, we will present a few techniques to improve the…
All posts tagged loop tiling
Memory Access Pattern and Performance: the Example of Matrix Multiplication
Author: Ivica Bogosavljević. Posted in Computational Performance, Low Level Performance, Performance.
We use a matrix multiplication example to investigate loop interchange and loop tiling as techniques to speed up programs that work with matrices.
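To make the two techniques concrete, here is a minimal sketch in C. It is not the code from the post: the function names (`matmul_ijk`, `matmul_ikj`, `matmul_tiled`), the row-major `double` layout, the square `n` x `n` shape, and the `tile` parameter are all assumptions chosen for illustration. The first function is the textbook loop order, the second applies loop interchange so that the innermost loop walks both `b` and `c` sequentially, and the third adds loop tiling on top of the interchanged order.

```c
#include <assert.h>
#include <stddef.h>

/* Baseline i-j-k order. The innermost loop reads b with stride n
   (column-wise in a row-major layout), which hurts data locality. */
void matmul_ijk(const double *a, const double *b, double *c, size_t n) {
    for (size_t i = 0; i < n; i++) {
        for (size_t j = 0; j < n; j++) {
            double sum = 0.0;
            for (size_t k = 0; k < n; k++)
                sum += a[i * n + k] * b[k * n + j];
            c[i * n + j] = sum;
        }
    }
}

/* Loop interchange: i-k-j order. The innermost loop now reads a row
   of b and updates a row of c, both with stride 1. */
void matmul_ikj(const double *a, const double *b, double *c, size_t n) {
    for (size_t x = 0; x < n * n; x++)
        c[x] = 0.0; /* c is accumulated into, so it must start at zero */
    for (size_t i = 0; i < n; i++) {
        for (size_t k = 0; k < n; k++) {
            double aik = a[i * n + k];
            for (size_t j = 0; j < n; j++)
                c[i * n + j] += aik * b[k * n + j];
        }
    }
}

/* Loop tiling on top of the i-k-j order. Each (ii, kk, jj) step works
   on tile x tile blocks; tile is a tuning parameter (hypothetical here)
   and does not have to divide n evenly. */
void matmul_tiled(const double *a, const double *b, double *c,
                  size_t n, size_t tile) {
    assert(tile > 0);
    for (size_t x = 0; x < n * n; x++)
        c[x] = 0.0;
    for (size_t ii = 0; ii < n; ii += tile) {
        size_t i_end = (ii + tile < n) ? ii + tile : n;
        for (size_t kk = 0; kk < n; kk += tile) {
            size_t k_end = (kk + tile < n) ? kk + tile : n;
            for (size_t jj = 0; jj < n; jj += tile) {
                size_t j_end = (jj + tile < n) ? jj + tile : n;
                for (size_t i = ii; i < i_end; i++) {
                    for (size_t k = kk; k < k_end; k++) {
                        double aik = a[i * n + k];
                        for (size_t j = jj; j < j_end; j++)
                            c[i * n + j] += aik * b[k * n + j];
                    }
                }
            }
        }
    }
}
```

All three functions compute the same product; only the order of memory accesses differs. Whether tiling helps on top of interchange, and which `tile` value works best, depends on the matrix size and the cache hierarchy of the machine, so it is worth measuring rather than assuming.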