In this post we introduce a few of the most common tools used for debugging memory subsystem performance.

We continue the investigation from the previous post and try to measure how the memory subsystem affects software performance. We write small programs (kernels) to quantify the effects of cache line size, memory latency, the TLB, cache conflicts, vectorization, and branch prediction.
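To give a feel for what such a kernel can look like, here is a minimal sketch in C. It is an illustration, not code from this series: the names `touch_strided` and `ns_per_access` are hypothetical, and timing with `timespec_get` is an assumption chosen for portability. The kernel walks a buffer with a fixed stride and reports the average time per access.

```c
#include <stddef.h>
#include <stdint.h>
#include <stdlib.h>
#include <time.h>

/* Hypothetical kernel: increment every `stride`-th byte of `buf`.
 * Returns the number of bytes touched. `volatile` keeps the compiler
 * from optimizing the memory accesses away. */
size_t touch_strided(volatile uint8_t *buf, size_t len, size_t stride) {
    size_t touched = 0;
    if (stride == 0) {
        return 0; /* a zero stride would never advance */
    }
    for (size_t i = 0; i < len; i += stride) {
        buf[i]++;
        touched++;
    }
    return touched;
}

/* Hypothetical helper: average nanoseconds per access for one pass
 * over a freshly allocated buffer of `len` bytes.
 * Returns -1.0 if the arguments are invalid or allocation fails. */
double ns_per_access(size_t len, size_t stride) {
    if (len == 0 || stride == 0) {
        return -1.0;
    }
    uint8_t *buf = calloc(len, 1);
    if (buf == NULL) {
        return -1.0;
    }

    /* Warm-up pass so that page faults are not part of the measurement. */
    touch_strided(buf, len, stride);

    struct timespec t0, t1;
    timespec_get(&t0, TIME_UTC);
    size_t n = touch_strided(buf, len, stride);
    timespec_get(&t1, TIME_UTC);
    free(buf);

    double ns = (double)(t1.tv_sec - t0.tv_sec) * 1e9
              + (double)(t1.tv_nsec - t0.tv_nsec);
    return ns / (double)n;
}
```

Calling `ns_per_access` with a buffer much larger than the last-level cache and strides of 1, 2, 4, and so on up to a few hundred bytes gives one number per stride. On many machines the cost per access would be expected to grow until the stride reaches the cache line size (commonly 64 bytes) and then level off, but the exact shape depends on the hardware prefetcher and on compiler flags, so treat any single run as noisy.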