REDUCER: Elimination of Repetitive Codes for Accelerated Iterative Compilation

Abstract: Existing iterative compilation and machine learning-based optimization techniques have proven very successful in achieving better optimizations than the standard optimization levels of a compiler. However, they were not engineered to support the tuning of a compiler’s optimizer as part of the compiler’s daily development cycle. In this paper, we first establish the required properties that a technique must exhibit to enable such tuning. We then introduce an enhancement to the classic nightly routine testing of compilers, which exhibits all the required properties and thus is capable of driving the improvement and tuning of the compiler’s common optimizer. This is achieved by leveraging resource usage and compilation information collected while systematically exploiting prefixes of the transformations applied at standard optimization levels. Experimental evaluation using the LLVM v6.0.1 compiler demonstrated that the new approach was able to reveal hidden cross-architecture and architecture-dependent potential optimizations on two popular processors: the Intel i5-6300U and the Arm Cortex-A53-based Broadcom BCM2837 used in the Raspberry Pi 3B+. As a case study, we demonstrate how the insights from our approach enabled us to identify and remove a significant shortcoming of the CFG simplification pass of the LLVM v6.0.1 compiler.

Evaluating Iterative Compilation

Languages and Compilers for Parallel Computing - Lecture Notes in Computer Science ◽

10.1007/11596110_24 ◽

2005 ◽

pp. 362-376 ◽

Cited By ~ 14

Author(s):

G. G. Fursin ◽

M. F. P. O’Boyle ◽

P. M. W. Knijnenburg

Keyword(s):

Iterative Compilation

Combining Model and Iterative Compilation for Program Performance Optimization

Journal of Software ◽

10.4304/jsw.4.3.240-247 ◽

2009 ◽

Vol 4 (3) ◽

Author(s):

Pingjing Lu ◽

Yonggang Che ◽

Zhenghua Wang

Keyword(s):

Performance Optimization ◽

Program Performance ◽

Iterative Compilation

A feasibility study in iterative compilation

Lecture Notes in Computer Science - High Performance Computing ◽

10.1007/bfb0094916 ◽

1999 ◽

pp. 121-132 ◽

Cited By ~ 25

Author(s):

Toru Kisuki ◽

Peter M. W. Knijnenburg ◽

Mike F. P. O'Boyle ◽

François Bodin ◽

Harry A. G. Wijshoff

Keyword(s):

Feasibility Study ◽

Iterative Compilation

Cache Models for Iterative Compilation

Euro-Par 2001 Parallel Processing - Lecture Notes in Computer Science ◽

10.1007/3-540-44681-8_37 ◽

2001 ◽

pp. 254-261 ◽

Cited By ~ 2

Author(s):

Peter M. W. Knijnenburg ◽

Toru Kisuki ◽

Kyle Gallivan

Keyword(s):

Iterative Compilation

Iterative Compilation of Multiagent Probabilistic Graphical Models

2006 IEEE/WIC/ACM International Conference on Intelligent Agent Technology ◽

10.1109/iat.2006.82 ◽

2006 ◽

Author(s):

Xiangdong An ◽

Nick Cercone

Keyword(s):

Graphical Models ◽

Probabilistic Graphical Models ◽

Iterative Compilation

Improving Program Performance via Auto-Vectorization of Loops with Conditional Statements with GCC Compiler Setting

Applied Mechanics and Materials ◽

10.4028/www.scientific.net/amm.433-435.1410 ◽

2013 ◽

Vol 433-435 ◽

pp. 1410-1414

Author(s):

Qi Shen Zhu

Keyword(s):

Search Space ◽

Compiler Optimizations ◽

Program Performance ◽

Performance Improvements ◽

Iterative Compilation ◽

Conditional Statements ◽

Speed Up ◽

Performance Ability ◽

Number Of Passes ◽

Selection Of

GCC can auto-vectorize loops, exploiting data parallelism across loop iterations. Tuning GCC's optimization flags for auto-vectorization is a popular way to speed up programs. However, GCC offers many options, and selecting the best combination of them to improve program performance through vectorization is non-trivial because the search space is very large. In this work we focus on the selection of compiler transformations to auto-vectorize loops with conditional statements. The selection is based on the correlation between program features and speed-up, on analysis of the generated code, and on a small number of passes of iterative compilation. Our preliminary experimental results show that the proposed technique attains performance improvements of up to ~6x on loops from the TSVC benchmark suite on the state-of-the-art Intel Core i3 processor.

Combined selection of tile sizes and unroll factors using iterative compilation

Proceedings 2000 International Conference on Parallel Architectures and Compilation Techniques (Cat. No.PR00622) ◽

10.1109/pact.2000.888348 ◽

2002 ◽

Cited By ~ 49

Author(s):

T. Kisuki ◽

P.M.W. Knijnenburg ◽

M.F.P. O'Boyle

Keyword(s):

Iterative Compilation ◽

Combined Selection ◽

Selection Of
