OpenACC unified programming environment for GPU and FPGA multi-hybrid acceleration

Show authors

Publication Type

Conference Paper

Book Title

13th International Symposium on
High-Level Parallel Programming and Applications (HLPP 2020)

Publication Date

July, 2020

Page Numbers

1 to 20

Publisher Location

New York City, New York, United States of America

Conference Name

13th International Symposium on High-level Parallel Programming and Applications (HLPP)

Conference Location

Porto, Portugal

Conference Sponsor

FCUP, DCC, CRACS, HUAWEI

Conference Date

Jul 9, 2020 - Jul 10, 2020

Abstract

Attached accelerators have been frequently used in recent High Per- formance Computing (HPC) systems because of their high performance/power ratio. In particular, the Graphics Processing Unit (GPU) is the most popu- lar accelerator owing to its high peak FLOPS performance and high memory bandwidth supported by HBM2, etc. However, the performance of GPU depends highly on a large degree of SIMD parallelism and has difficulty sustaining a high performance on programs with frequent branch operations or a partially low degree of parallelism.
By contrast, a Field Programmable Gate Array (FPGA) has received attention as a different type of accelerator than GPU as a fully reconfigurable processor fitting the target applications. The high performance of FPGA is mainly provided by a pipelined operation and optimized circuit suitable for any operation even with frequent conditional branches. We have been focusing on the flexibility of FPGA to compensate for the weakness of GPU. We believe that the coupling of GPU with FPGA can result in one of the most powerful accelerating platforms available.
However, the program coding of GPU and FPGA coupling can be quite difficult for application users. Traditionally, CUDA by NVIDIA has been the most popular programming language with the largest share of GPUs used in HPC, whereas a hardware description language such as Verilog HDL has been used in FPGA programming. OpenCL coding has recently become available even on high-end FPGAs. Moreover, several recent studies have also enabled the OpenACC coding for use in FPGA. In this study, we provide a unified programming system based on OpenACC for a platform equipped with both GPU and FPGA aiming at the next-generation accelerated supercomputer framework. Our programming environment is called Multi-Hybrid OpenACC Translator (MHOAT), and in this paper, we describe the basic concept and prototype system of MHOAT based on an evaluation on the amount of coding required and the performance of a hybrid multi-device accelerated system.

OpenACC unified programming environment for GPU and FPGA multi-hybrid acceleration

Abstract

Researchers

Organizations