Loop Nest Optimization
High-level loop optimizations that improve performance by better exploiting caches and instruction-level parallelism
- loop interchange
- cache blocking and outer loop unrolling
- loop fusion
- loop fission
- prefetching (limited to array references in well behaved loops)