In our experiments with the memory access pattern, we have seen that good data locality is a key to good software performance. Accessing memory sequentially and splitting the data set into small-sized pieces which are processed individually improves data locality and software speed. In this post, we will present a few techniques to improve the…
All posts tagged loop interchange
Loop Optimizations: how does the compiler do it?
Posted on Author Ivica BogosavljevićPosted in Help the Compiler, Performance, Toolchain and Performance2 Replies
We investigate what are the techniques your compiler employs to make your loop run faster.
Memory Access Pattern and Performance: the Example of Matrix Multiplication
Posted on Author Ivica BogosavljevićPosted in Computational Performance, Low Level Performance, PerformanceLeave a Reply
We use matrix multiplication example to investigate loop interchange and loop tiling as techniques to speed up your program that works with matrices.