Memory Subsystem Performance Archives

An optimizing compiler doesn’t help much with long instruction dependencies

May 31, 2025 | July 2, 2025 | Ivica Bogosavljević | 2 Minute Reads, Memory Subsystem Performance, Performance, Toolchain and Performance | 1 Reply

Does it matter if we are compiling with optimizations off (O0) or optimizations on (O3) if the problem is memory bound? Let’s find out…

Memory Subsystem Optimizations – The Remaining Topics

October 31, 2024 | November 13, 2024 | Ivica Bogosavljević | Low Level Performance, Memory Subsystem Performance, Performance

This is the last memory optimization that we are covering on this blog. You can see the full list of all the memory subsystem optimizations that we covered earlier here. Definitely worth a read for anyone who is trying to improve the performance of memory-intensive software. In this post, we are covering a few remaining optimization techniques…

Latency-Sensitive Application and the Memory Subsystem Part 2: Memory Management Mechanisms

June 28, 2024 | August 30, 2024 | Ivica Bogosavljević | Low Level Performance, Memory Subsystem Performance, Performance | 4 Replies

In this post we talk about the memory mechanisms that increase memory access latency, and we explore techniques to avoid them in latency-sensitive systems.

Latency-Sensitive Applications and the Memory Subsystem: Keeping the Data in the Cache

April 30, 2024 | May 1, 2024 | Ivica Bogosavljević | Low Level Performance, Memory Subsystem Performance, Performance

We explore the performance of latency-sensitive applications, or more specifically, how to avoid evicting your critical data from the data cache.

The pros and cons of explicit software prefetching

March 31, 2024 | April 4, 2024 | Ivica Bogosavljević | Low Level Performance, Memory Subsystem Performance, Performance

We investigate explicit software prefetching, a mechanism software developers can use to fetch data ahead of time so that it is ready when the program needs it.

Unexpected Ways Memory Subsystem Interacts with Branch Prediction

December 26, 2023 | December 30, 2023 | Ivica Bogosavljević | Low Level Performance, Memory Subsystem Performance, Performance | 3 Replies

We investigate the unusual ways the memory subsystem interacts with branch prediction and how this interaction shapes software performance.

Multithreading and the Memory Subsystem

November 30, 2023 | December 3, 2023 | Ivica Bogosavljević | Low Level Performance, Memory Subsystem Performance, Performance

In this post we investigate how the memory subsystem behaves in an environment where several threads compete for memory subsystem resources. We also investigate techniques to improve the performance of multithreaded programs – programs that split the workload onto several CPU cores so that they finish faster.