IterML: Iterative Machine Learning for Intelligent Parameter Pruning and Tuning in Graphics Processing Units

Cui, Xuewen; Feng, Wu-chun

dc.contributor.author	Cui, Xuewen	en
dc.contributor.author	Feng, Wu-chun	en
dc.date.accessioned	2024-03-04T15:11:06Z	en
dc.date.available	2024-03-04T15:11:06Z	en
dc.date.issued	2020-11-06	en
dc.description.abstract	With the rise of graphics processing units (GPUs), the parallel computing community needs better tools to productively extract performance from the GPU. While modern compilers provide flags to activate different optimizations to improve performance, the effectiveness of such automated optimization has been limited at best. As a consequence, extracting the best performance from an algorithm on a GPU requires significant expertise and manual effort to exploit both spatial and temporal sharing of computing resources. In particular, maximizing the performance of an algorithm on a GPU requires extensive hyperparameter (e.g., thread-block size) selection and tuning. Given the myriad of hyperparameter dimensions to optimize across, the search space of optimizations is extremely large, making it infeasible to exhaustively evaluate. This paper proposes an approach that uses statistical analysis with iterative machine learning (IterML) to prune and tune hyperparameters to achieve better performance. During each iteration, we leverage machine-learning models to guide the pruning and tuning for subsequent iterations. We evaluate our IterML approach on the GPU thread-block size across many benchmarks running on an NVIDIA P100 or V100 GPU. Our experimental results show that our automated IterML approach reduces search effort by 40% to 80% when compared to traditional (non-iterative) ML and that the performance of our (unmodified) GPU applications can improve significantly — between 67% and 95% — simply by changing the thread-block size.	en
dc.description.version	Accepted version	en
dc.format.extent	Pages 391-403	en
dc.format.mimetype	application/pdf	en
dc.identifier.doi	https://doi.org/10.1007/s11265-020-01604-4	en
dc.identifier.eissn	1939-8115	en
dc.identifier.issn	1939-8018	en
dc.identifier.issue	4	en
dc.identifier.orcid	Feng, Wu-chun [0000-0002-6015-0727]	en
dc.identifier.uri	https://hdl.handle.net/10919/118248	en
dc.identifier.volume	93	en
dc.language.iso	en	en
dc.publisher	Springer	en
dc.rights	In Copyright	en
dc.rights.uri	http://rightsstatements.org/vocab/InC/1.0/	en
dc.title	IterML: Iterative Machine Learning for Intelligent Parameter Pruning and Tuning in Graphics Processing Units	en
dc.title.serial	Journal of Signal Processing Systems	en
dc.type	Article - Refereed	en
dc.type.dcmitype	Text	en
dc.type.other	Journal Article	en
pubs.organisational-group	/Virginia Tech	en
pubs.organisational-group	/Virginia Tech/Engineering	en
pubs.organisational-group	/Virginia Tech/Engineering/Computer Science	en
pubs.organisational-group	/Virginia Tech/Faculty of Health Sciences	en
pubs.organisational-group	/Virginia Tech/All T&R Faculty	en
pubs.organisational-group	/Virginia Tech/Engineering/COE T&R Faculty	en

Files

Original bundle

Name: Feng-JSPS-IterML.pdf
Size: 1.21 MB
Format: Adobe Portable Document Format
Description: Accepted version

Collections

All Faculty Deposits
Scholarly Works, Computer Science