On the Complexity of Robust Source-to-Source Translation from CUDA to OpenCL

Sathre, Paul Daniel

Files

Sathre_PD_T_2013.pdf (556.34 KB)

Date

2013-06-12

Authors

Sathre, Paul Daniel

Publisher

Virginia Tech

Abstract

The use of hardware accelerators in high-performance computing has grown increasingly prevalent, particularly due to the growth of graphics processing units (GPUs) as general-purpose (GPGPU) accelerators. Much of this growth has been driven by NVIDIA's CUDA ecosystem for developing GPGPU applications on NVIDIA hardware. However, with the increasing diversity of GPUs (including those from AMD, ARM, and Qualcomm), OpenCL has emerged as an open and vendor-agnostic environment for programming GPUs as well as other parallel computing devices such as the CPU (central processing unit), APU (accelerated processing unit), FPGA (field programmable gate array), and DSP (digital signal processor).

These developments, coupled with the broader array of devices supporting OpenCL and the significant conceptual and syntactic overlap between CUDA and OpenCL, motivated the creation of a CUDA-to-OpenCL source-to-source translator. However, the two differ enough to make translation non-trivial, which imposes practical limitations on both manual and automatic translation efforts. In this thesis, the performance, coverage, and reliability of a prototype CUDA-to-OpenCL source translator are addressed via extensive profiling of a large body of sample CUDA applications. An analysis of the sample body of applications is provided, which identifies and characterizes general CUDA source constructs and programming practices that obstruct our translation efforts. This characterization then led to more robust support for the translator, followed by an evaluation that demonstrated that the performance of our automatically translated OpenCL is on par with the original CUDA for a subset of sample applications when executed on the same NVIDIA device.

Keywords

Source Translation, Clang, CUDA, OpenCL, GPU, GPGPU, Compilers

Persistent link

http://hdl.handle.net/10919/52631

Collections

Masters Theses
