Skip to main content
SHARE
Publication

Climbing the Summit and Pushing the Frontier of Mixed Precision Benchmarks at Extreme Scale

Publication Type
Conference Paper
Book Title
SC22: International Conference for High Performance Computing, Networking, Storage and Analysis
Publication Date
Page Numbers
1 to 15
Publisher Location
New Jersey, United States of America
Conference Name
International Conference for High Performance Computing, Networking, Storage and Analysis (SC22)
Conference Location
Dallas TX, Texas, United States of America
Conference Sponsor
OLCF
Conference Date
-

The rise of machine learning (ML) applications and their use of mixed precision to perform interesting science are driving forces behind AI for science on HPC. The convergence of ML and HPC with mixed precision offers the possibility of transformational changes in computational science. The HPL-AI benchmark is designed to measure the performance of mixed precision arithmetic as opposed to the HPL benchmark which measures double precision performance. Pushing the limits of systems at extreme scale is nontrivial -little public literature explores optimization of mixed precision computations at this scale. In this work, we demonstrate how to scale up the HPL-AI benchmark on the pre-exascale Summit and exascale Frontier systems at the Oak Ridge Leadership Computing Facility (OLCF) with a cross-platform design. We present the implementation, performance results, and a guideline of optimization strategies employed for delivering portable performance on both AMD and NVIDIA GPUs at extreme scale.