Acceleration of tensor-product operations for high-order finite element methods

Świrydowicz, K.; Chalmers, N.; Karakus, A.; Warburton, T.

Date

2017-09

Abstract

This paper is devoted to GPU kernel optimization and performance analysis of three tensor-product operators arising in finite element methods. We provide the mathematical background for these operations along with implementation details. Achieving near-peak performance for these operators requires extensive optimization because of their properties: low arithmetic intensity, tiered structure, and the need to store intermediate results inside the kernel. We give a guided overview of optimization strategies and present a performance model that allows us to compare the efficacy of these optimizations against an empirically calibrated roofline.
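To make the role of arithmetic intensity concrete, here is a minimal sketch of a generic roofline bound in Python. It is not the performance model from the paper, which is calibrated empirically. The function name `roofline_gflops` and all numeric values are hypothetical and chosen only for illustration.

```python
def roofline_gflops(flops, bytes_moved, peak_gflops, bandwidth_gbs):
    """Upper bound on attainable GFLOP/s under a simple two-parameter roofline.

    flops         : floating-point operations performed by the kernel
    bytes_moved   : bytes transferred to and from device memory
    peak_gflops   : peak compute throughput of the device (assumed value)
    bandwidth_gbs : sustained memory bandwidth in GB/s (assumed value)
    """
    intensity = flops / bytes_moved  # arithmetic intensity in FLOP/byte
    # The kernel is limited by whichever resource saturates first.
    return min(peak_gflops, intensity * bandwidth_gbs)


# Hypothetical low-intensity kernel: 2 FLOP per byte moved.
memory_bound = roofline_gflops(2.0e9, 1.0e9, peak_gflops=5000.0, bandwidth_gbs=500.0)

# Hypothetical high-intensity kernel: 100 FLOP per byte moved.
compute_bound = roofline_gflops(100.0e9, 1.0e9, peak_gflops=5000.0, bandwidth_gbs=500.0)

print(memory_bound, compute_bound)
```

With these assumed numbers the low-intensity kernel is capped by memory bandwidth well below the compute peak, which is why kernels of this kind are usually judged against a bandwidth-derived bound rather than against peak FLOP/s.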

Keywords

cs.MS, cs.DC, cs.NA, cs.PF, math.NA

Persistent link

http://hdl.handle.net/10919/81866

Collections

All Faculty Deposits
Scholarly Works, Mathematics
