A Holistic Approach to Automatic Mixed-Precision Code Generation and Tuning for Affine Programs
Reducing floating-point (FP) precision trades quality degradation of a numerical program's output for performance. Existing tools implement this optimization in a decoupled way: they first generate all mixed-precision code variants and then search for the best-performing one. In this paper, we present a holistic approach called PrecTuner that consolidates the code generator and the autotuner via a single parameter r. Initialized by automatically sampled values, r is first used to generate several code variants in the polyhedral model, combining this optimization with various loop transformations. These code variants are then used to solve a performance model expressed in terms of r, possibly under a quality degradation budget. The best-performing value of r is finally predicted without evaluating all code variants. Experimental results on the PolyBench benchmarks on CPU demonstrate that PrecTuner outperforms LuIs by 3.28x while achieving smaller errors, and we also validate its effectiveness in optimizing a real-life large-scale application. In addition, PrecTuner obtains mean speedups of 1.81x and 1.52x-1.73x over Pluto on single- and multi-core CPU, respectively, and 1.71x over PPCG on GPU.
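The sample, fit, and predict workflow described in the abstract can be illustrated with a minimal sketch. This is not PrecTuner's implementation: the abstract does not define the meaning of r or the form of the performance model, so the code assumes r lies in [0, 1], assumes runtime and error are both linear in r, and uses made-up measurements. The names `fit_line` and `predict_best_r` are hypothetical.

```python
from statistics import mean


def fit_line(xs, ys):
    """Ordinary least-squares fit of y = a + b*x; returns (a, b)."""
    mx, my = mean(xs), mean(ys)
    sxx = sum((x - mx) ** 2 for x in xs)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    b = sxy / sxx
    return my - b * mx, b


def predict_best_r(samples, candidates, error_budget):
    """Pick the candidate r with the lowest predicted runtime whose
    predicted error stays within the budget.

    samples: (r, runtime, error) triples measured on a few code variants.
    Assumption: both runtime and error are modeled as linear in r.
    """
    rs = [s[0] for s in samples]
    t_a, t_b = fit_line(rs, [s[1] for s in samples])
    e_a, e_b = fit_line(rs, [s[2] for s in samples])
    feasible = [r for r in candidates if e_a + e_b * r <= error_budget]
    if not feasible:
        return None  # no candidate satisfies the quality budget
    return min(feasible, key=lambda r: t_a + t_b * r)


# Hypothetical measurements from three sampled variants (not real data).
samples = [(0.0, 10.0, 0.0), (0.5, 7.5, 1e-6), (1.0, 5.0, 2e-6)]
candidates = [i / 10 for i in range(11)]
best = predict_best_r(samples, candidates, error_budget=1.5e-6)
```

Only three variants are measured here, yet eleven candidate values of r are ranked; this mirrors the idea of predicting the best value without evaluating every variant. A real model would likely be nonlinear and fitted to actual timings.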
Mon 4 Mar (displayed time zone: London)
11:30 - 12:50 | Compilers and Runtimes for Parallel Systems | Main Conference at Moorfoot | Chair(s): Mohamed Riyadh Baghdadi
11:30 | 20m | Talk | Liger: Interleaving Intra- and Inter-Operator Parallelism for Distributed Large Model Inference | Main Conference | Jiangsu Du (Sun Yat-sen University), Jinhui Wei (Sun Yat-sen University), Jiazhi Jiang (Sun Yat-sen University), Shenggan Cheng (National University of Singapore), Zhiguang Chen (Sun Yat-sen University), Dan Huang, Yutong Lu (Sun Yat-sen University)
11:50 | 20m | Talk | A Holistic Approach to Automatic Mixed-Precision Code Generation and Tuning for Affine Programs | Main Conference | Jinchen Xu (Information Engineering University), Guanghui Song (Li Auto Inc.), Bei Zhou (Information Engineering University), Fei Li (Information Engineering University), Jiangwei Hao (Information Engineering University), Jie Zhao (State Key Laboratory of Mathematical Engineering and Advanced Computing)
12:10 | 20m | Talk | Language-Agnostic Static Deadlock Detection for Futures | Main Conference | Stefan K. Muller (Illinois Institute of Technology)
12:30 | 20m | Talk | Recurrence Analysis for Automatic Parallelization of Subscripted Subscripts | Main Conference