An Application-Oriented Approach for Accelerating Data-Parallel Computation with Graphics Processing Unit
dc.contributor.author | Ponce, Sean | en |
dc.contributor.author | Jing, Huang | en |
dc.contributor.author | Park, Seung In | en |
dc.contributor.author | Khoury, Chase | en |
dc.contributor.author | Quek, Francis | en |
dc.contributor.author | Cao, Yong | en |
dc.contributor.department | Computer Science | en |
dc.date.accessioned | 2013-06-19T14:35:53Z | en |
dc.date.available | 2013-06-19T14:35:53Z | en |
dc.date.issued | 2009-03-01 | en |
dc.description.abstract | This paper presents a novel parallelization and quantitative characterization of various optimization strategies for data-parallel computation on a graphics processing unit (GPU) using NVIDIA's new GPU programming framework, Compute Unified Device Architecture (CUDA). CUDA is an easy-to-use development framework that has drawn the attention of many different application areas looking for dramatic speed-ups in their code. However, the performance tradeoffs in CUDA are not yet fully understood, especially for data-parallel applications. Consequently, we study two fundamental mathematical operations that are common in many data-parallel applications: convolution and accumulation. Specifically, we profile and optimize the performance of these operations on a 128-core NVIDIA GPU. We then characterize the impact of these operations on a video-based motion-tracking algorithm called vector coherence mapping, which consists of a series of convolutions and dynamically weighted accumulations, and present a comparison of different implementations and their respective performance profiles. | en |
dc.format.mimetype | application/pdf | en |
dc.identifier | http://eprints.cs.vt.edu/archive/00001064/ | en |
dc.identifier.sourceurl | http://eprints.cs.vt.edu/archive/00001064/01/paper.pdf | en |
dc.identifier.trnumber | TR-09-05 | en |
dc.identifier.uri | http://hdl.handle.net/10919/20162 | en |
dc.language.iso | en | en |
dc.publisher | Department of Computer Science, Virginia Polytechnic Institute & State University | en |
dc.relation.ispartof | Computer Science Technical Reports | en |
dc.rights | In Copyright | en |
dc.rights.uri | http://rightsstatements.org/vocab/InC/1.0/ | en |
dc.subject | Parallel computation | en |
dc.subject | Algorithms | en |
dc.subject | Data structures | en |
dc.title | An Application-Oriented Approach for Accelerating Data-Parallel Computation with Graphics Processing Unit | en |
dc.type | Technical report | en |
dc.type.dcmitype | Text | en |
Files
Original bundle
1 - 1 of 1