Projects per year
Abstract
Engineering is an important domain for supercomputing, with the Alya model being a popular code for undertaking such simulations. With ever increasing demand from users to model larger, more complex systems at reduced time to solution it is important to explore the role that novel hardware technologies, such as FPGAs, can play in accelerating these workloads on future exascale systems.
In this paper we explore the porting of Alya's in-compressible flow matrix assembly kernel, which accounts for a large proportion of the model runtime, onto FPGAs. After describing in detail successful strategies for optimisation at the kernel level, we then explore sharing the workload between the FPGA and host CPU, mapping most appropriate parts of the kernel between these technologies, enabling us to more effectively exploit the FPGA. We then compare the performance of our approach on a Xilinx Alveo U280 against a 24-core Xeon Platinum CPU and Nvidia V100 GPU, with the FPGA significantly out-performing the CPU and performing comparably against the GPU, whilst drawing substantially less power. The result of this work is both an experience report describing appropriate dataflow optimisations which we believe can be applied more widely across HPC codes, and a performance comparison for this specific workload that demonstrates the potential for FPGAs in accelerating HPC engineering simulations.
In this paper we explore the porting of Alya's in-compressible flow matrix assembly kernel, which accounts for a large proportion of the model runtime, onto FPGAs. After describing in detail successful strategies for optimisation at the kernel level, we then explore sharing the workload between the FPGA and host CPU, mapping most appropriate parts of the kernel between these technologies, enabling us to more effectively exploit the FPGA. We then compare the performance of our approach on a Xilinx Alveo U280 against a 24-core Xeon Platinum CPU and Nvidia V100 GPU, with the FPGA significantly out-performing the CPU and performing comparably against the GPU, whilst drawing substantially less power. The result of this work is both an experience report describing appropriate dataflow optimisations which we believe can be applied more widely across HPC codes, and a performance comparison for this specific workload that demonstrates the potential for FPGAs in accelerating HPC engineering simulations.
Original language | English |
---|---|
Title of host publication | 2021 IEEE/ACM International Workshop on Heterogeneous High-performance Reconfigurable Computing (H2RC) |
Publisher | Institute of Electrical and Electronics Engineers |
Pages | 9-20 |
ISBN (Electronic) | 978-1-6654-4664-8 |
ISBN (Print) | 978-1-6654-3071-5 |
DOIs | |
Publication status | Published - 20 Dec 2021 |
Event | Seventh International Workshop on Heterogeneous High-performance Reconfigurable Computing - St. Louis, United States Duration: 15 Nov 2021 → 15 Nov 2021 https://h2rc.cse.sc.edu/ |
Workshop
Workshop | Seventh International Workshop on Heterogeneous High-performance Reconfigurable Computing |
---|---|
Abbreviated title | H2RC21 |
Country/Territory | United States |
City | St. Louis |
Period | 15/11/21 → 15/11/21 |
Internet address |
Keywords / Materials (for Non-textual outputs)
- FPGA
- Xilinx Alveo U280
- High Level Synthesis
- Alya
- engineering simulations
- incompressible flow
Fingerprint
Dive into the research topics of 'Porting incompressible flow matrix assembly to FPGAs for accelerating HPC engineering simulations'. Together they form a unique fingerprint.Projects
- 1 Finished
-
The European Centre of Excellence for Engineering Applications
Parsons, M., Filipiak, M. & Graham, P.
1/12/18 → 31/05/22
Project: Research