cpp loop fusion and fission
After seeing the below tweet.
IMO the two most important transformations for performant code are Array of Structs to Struct of Arrays and Loop Fusion to Loop Fission. The 1st is essential for cache perf and the 2nd breaks data dependencies and often unlocks more optimizations that were not obvious before. pic.twitter.com/hjLvzusExz
— Daniel Holden (@anorangeduck) October 16, 2021
This is super cool! I sense a lot of value for whoever using Thurst
algorithms.