Performance Archives - Page 2 of 8 - Johnny's Software Lab

Latency-Sensitive Application and the Memory Subsystem Part 2: Memory Management Mechanisms

June 28, 2024 | August 30, 2024 | Ivica Bogosavljević | Low Level Performance, Memory Subsystem Performance, Performance | 4 Replies

In this post we talk about the memory management mechanisms that increase memory access latency, and we explore techniques to avoid them in latency-sensitive systems.

Latency-Sensitive Applications and the Memory Subsystem: Keeping the Data in the Cache

April 30, 2024 | May 1, 2024 | Ivica Bogosavljević | Low Level Performance, Memory Subsystem Performance, Performance | Leave a Reply

We explore the performance of latency-sensitive applications, or more specifically, how to avoid evicting your critical data from the data cache.

The pros and cons of explicit software prefetching

March 31, 2024 | April 4, 2024 | Ivica Bogosavljević | Low Level Performance, Memory Subsystem Performance, Performance | Leave a Reply

We investigate explicit software prefetching, a mechanism software developers can use to fetch data in advance so that it is ready when the program needs it.

A story of a very large loop with a long instruction dependency chain

February 29, 2024 | February 29, 2024 | Ivica Bogosavljević | Computational Performance, Low Level Performance, Performance, Vectorization | Leave a Reply

A story of a very large loop with a long instruction dependency chain.

On Avoiding Register Spills in Vectorized Code with Many Constants

January 31, 2024 | February 7, 2024 | Ivica Bogosavljević | Low Level Performance, Performance, Vectorization | 3 Replies

How to avoid register spilling in vectorized code with many constants?

Unexpected Ways Memory Subsystem Interacts with Branch Prediction

December 26, 2023 | December 30, 2023 | Ivica Bogosavljević | Low Level Performance, Memory Subsystem Performance, Performance | 3 Replies

We investigate the unusual ways the memory subsystem interacts with branch prediction and how this interaction shapes software performance.

Multithreading and the Memory Subsystem

November 30, 2023 | December 3, 2023 | Ivica Bogosavljević | Low Level Performance, Memory Subsystem Performance, Performance | Leave a Reply

In this post we investigate how the memory subsystem behaves when several threads compete for its resources. We also investigate techniques to improve the performance of multithreaded programs, that is, programs that split the workload across several CPU cores so that they finish faster.