Architecture-Aware Optimization on a 1600-core Graphics Processor

dc.contributor.authorDaga, Mayanken
dc.contributor.authorScogland, Thomas R. W.en
dc.contributor.authorFeng, Wu-chunen
dc.contributor.departmentComputer Scienceen
dc.date.accessioned2013-06-19T14:36:02Zen
dc.date.available2013-06-19T14:36:02Zen
dc.date.issued2011-07-01en
dc.description.abstractThe graphics processing unit (GPU) continues to make significant strides as an accelerator in commodity cluster computing for high-performance computing (HPC). For example, three of the top five fastest supercomputers in the world, as ranked by the TOP500, employ GPUs as accelerators. Despite this increasing interest in GPUs, however, optimizing the performance of a GPU-accelerated compute node requires deep technical knowledge of the underlying architecture. Although significant literature exists on how to optimize GPU performance on the more mature NVIDIA CUDA architecture, the converse is true for OpenCL on the AMD GPU. Consequently, we present and evaluate architecture-aware optimizations for the AMD GPU. The most prominent optimizations include (i) explicit use of registers, (ii) use of vector types, (iii) removal of branches, and (iv) use of image memory for global data. We demonstrate the efficacy of our AMD GPU optimizations by applying each optimization in isolation as well as in concert to a large-scale, molecular modeling application called GEM. Via these AMD-specific GPU optimizations, the AMD Radeon HD 5870 GPU delivers 65% better performance than with the wellknown NVIDIA-specific optimizations.en
dc.format.mimetypeapplication/pdfen
dc.identifierhttp://eprints.cs.vt.edu/archive/00001159/en
dc.identifier.sourceurlhttp://eprints.cs.vt.edu/archive/00001159/01/Arch-Aware-Opt.pdfen
dc.identifier.trnumberTR-11-08en
dc.identifier.urihttp://hdl.handle.net/10919/19401en
dc.language.isoenen
dc.publisherDepartment of Computer Science, Virginia Polytechnic Institute & State Universityen
dc.relation.ispartofComputer Science Technical Reportsen
dc.rightsIn Copyrighten
dc.rights.urihttp://rightsstatements.org/vocab/InC/1.0/en
dc.subjectSoftware engineeringen
dc.titleArchitecture-Aware Optimization on a 1600-core Graphics Processoren
dc.typeTechnical reporten
dc.type.dcmitypeTexten

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Arch-Aware-Opt.pdf
Size:
474.04 KB
Format:
Adobe Portable Document Format