On the Complexity of Robust Source-to-Source Translation from CUDA to OpenCL

Sathre, Paul Daniel

On the Complexity of Robust Source-to-Source Translation from CUDA to OpenCL

dc.contributor.author	Sathre, Paul Daniel	en
dc.contributor.committeechair	Feng, Wu-chun	en
dc.contributor.committeemember	Gardner, Mark K.	en
dc.contributor.committeemember	Tilevich, Eli	en
dc.contributor.department	Computer Science	en
dc.date.accessioned	2015-05-27T08:04:17Z	en
dc.date.available	2015-05-27T08:04:17Z	en
dc.date.issued	2013-06-12	en
dc.description.abstract	The use of hardware accelerators in high-performance computing has grown increasingly prevalent, particularly due to the growth of graphics processing units (GPUs) as general-purpose (GPGPU) accelerators. Much of this growth has been driven by NVIDIA's CUDA ecosystem for developing GPGPU applications on NVIDIA hardware. However, with the increasing diversity of GPUs (including those from AMD, ARM, and Qualcomm), OpenCL has emerged as an open and vendor-agnostic environment for programming GPUs as well as other parallel computing devices such as the CPU (central processing unit), APU (accelerated processing unit), FPGA (field programmable gate array), and DSP (digital signal processor). The above, coupled with the broader array of devices supporting OpenCL and the significant conceptual and syntactic overlap between CUDA and OpenCL, motivated the creation of a CUDA-to-OpenCL source-to-source translator. However, there exist sufficient differences that make the translation non-trivial, providing practical limitations to both manual and automatic translation efforts. In this thesis, the performance, coverage, and reliability of a prototype CUDA-to-OpenCL source translator are addressed via extensive profiling of a large body of sample CUDA applications. An analysis of the sample body of applications is provided, which identifies and characterizes general CUDA source constructs and programming practices that obstruct our translation efforts. This characterization then led to more robust support for the translator, followed by an evaluation that demonstrated the performance of our automatically-translated OpenCL is on par with the original CUDA for a subset of sample applications when executed on the same NVIDIA device.	en
dc.description.degree	Master of Science	en
dc.format.medium	ETD	en
dc.identifier.other	vt_gsexam:808	en
dc.identifier.uri	http://hdl.handle.net/10919/52631	en
dc.publisher	Virginia Tech	en
dc.rights	In Copyright	en
dc.rights.uri	http://rightsstatements.org/vocab/InC/1.0/	en
dc.subject	Source Translation	en
dc.subject	Clang	en
dc.subject	CUDA	en
dc.subject	OpenCL	en
dc.subject	GPU	en
dc.subject	GPGPU	en
dc.subject	Compilers	en
dc.title	On the Complexity of Robust Source-to-Source Translation from CUDA to OpenCL	en
dc.type	Thesis	en
thesis.degree.discipline	Computer Science and Applications	en
thesis.degree.grantor	Virginia Polytechnic Institute and State University	en
thesis.degree.level	masters	en
thesis.degree.name	Master of Science	en

Files

Original bundle

Now showing 1 - 1 of 1

Name:: Sathre_PD_T_2013.pdf
Size:: 556.34 KB
Format:: Adobe Portable Document Format

Download

Collections

Masters Theses