Johnny's Software Lab

Johnny's Software Lab

We help you deliver fast software

  • Home
  • Performance
    • 2 Minute Reads
    • C++ Performance
    • Standard Library and Performance
    • Algorithms and Performance
    • Toolchain and Performance
    • Help the Compiler
    • Performance Analysis Tools
    • Computational Performance
    • Low Level Performance
    • Parallelization
    • Multithreaded Performance
    • Performance Contest
  • Debugging
  • Developer Tools
  • Need help?
  • Talks
  • Contact
  • About us
Menu

All posts tagged clang

Tune your program’s speed with profile guided optimizations

Tune your program’s speed with profile guided optimizations

Posted on September 5, 2020March 19, 2022Author Ivica BogosavljevićPosted in Performance, Toolchain and PerformanceLeave a Reply

Profile guided optimizations are a compiler-supported optimization technique that is easy to use and will make your program run faster with little effort. Here you will learn how to enable it on your project and what kind of improvements you can expect.

CPU Dispatching: Make your code both portable and fast

CPU Dispatching: Make your code both portable and fast

Posted on June 28, 2020March 19, 2022Author Ivica BogosavljevićPosted in Low Level Performance, Performance1 Reply

CPU dispatchingh is all about making your code portable and fast. We will talk about how to make your detect features your CPU has at is disposal and select the fastest function for that particular CPU without a need to recompile your software.

Link Time Optimizations: New Way to Do Compiler Optimizations

Link Time Optimizations: New Way to Do Compiler Optimizations

Posted on May 27, 2020March 19, 2022Author Ivica BogosavljevićPosted in Memory Footprint, Performance, Toolchain and Performance3 Replies

Traditional compilation-linking cycle generates binaries that work fine, but in case you need more speed, you need to learn about link time optimizations. Here we talk about what link time optimizations are, how to enable them and what improvements to expect.

Like what you’re reading? Follow us!

  • Frugal Programming: Saving Memory Subsystem Bandwidth
  • What is faster: vec.emplace_back(x) or vec[x] ?
  • When an instruction depends on the previous instruction depends on the previous instructions… : long instruction dependency chains and performance
  • The memory subsystem from the viewpoint of software: how memory subsystem affects software performance 2/3
  • The memory subsystem from the viewpoint of software: how memory subsystem affects software performance 1/3

Recent Posts

  • Frugal Programming: Saving Memory Subsystem Bandwidth
  • What is faster: vec.emplace_back(x) or vec[x] ?
  • When an instruction depends on the previous instruction depends on the previous instructions… : long instruction dependency chains and performance
  • The memory subsystem from the viewpoint of software: how memory subsystem affects software performance 2/3
  • The memory subsystem from the viewpoint of software: how memory subsystem affects software performance 1/3

Recent Comments

  • Ivica Bogosavljević on The quest for the fastest linked list
  • Ivica Bogosavljević on Make your programs run faster by better using the data cache
  • Matthew Tedder on The quest for the fastest linked list
  • etiennemlb on Make your programs run faster by better using the data cache
  • JuYi on Speedscope: visualize what your program is doing and where it is spending time

Archives

  • January 2023
  • October 2022
  • September 2022
  • August 2022
  • July 2022
  • June 2022
  • May 2022
  • April 2022
  • March 2022
  • February 2022
  • January 2022
  • December 2021
  • November 2021
  • October 2021
  • September 2021
  • August 2021
  • July 2021
  • June 2021
  • May 2021
  • April 2021
  • March 2021
  • February 2021
  • January 2021
  • December 2020
  • November 2020
  • October 2020
  • September 2020
  • August 2020
  • July 2020
  • June 2020
  • May 2020

Categories

  • 2 Minute Reads
  • Algorithms and Performance
  • C++ Performance
  • Computational Performance
  • Data Structure Performance
  • Debugging
  • Developer Tools
  • Help the Compiler
  • Kernel Space and Performance
  • Low Level Performance
  • Memory Footprint
  • Memory Subsystem Performance
  • Multithreaded Performance
  • Parallelization
  • Performance
  • Performance Analysis Tools
  • Performance Contest
  • Reliability
  • Standard Library and Performance
  • System Design
  • Toolchain and Performance

Meta

  • Log in
  • Entries feed
  • Comments feed
  • WordPress.org

©2023 Johnny's Software Lab | WordPress Theme by Superb WordPress Themes