Scalability Analysis of Synchronous Data-Parallel Artificial Neural Network (ANN) Learners
dc.contributor.author | Sun, Chang | en |
dc.contributor.committeechair | Plassmann, Paul E. | en |
dc.contributor.committeemember | Patterson, Cameron D. | en |
dc.contributor.committeemember | Jones, Mark T. | en |
dc.contributor.department | Electrical and Computer Engineering | en |
dc.date.accessioned | 2018-09-15T08:00:17Z | en |
dc.date.available | 2018-09-15T08:00:17Z | en |
dc.date.issued | 2018-09-14 | en |
dc.description.abstract | Artificial Neural Networks (ANNs) have been established as one of the most important algorithmic tools in the Machine Learning (ML) toolbox over the past few decades. ANNs' recent rise to widespread acceptance can be attributed to two developments: (1) the availability of large-scale training and testing datasets; and (2) the availability of new computer architectures for which ANN implementations are orders of magnitude more efficient. In this thesis, I present research on two aspects of the second development. First, I present a portable, open source implementation of ANNs in OpenCL and MPI. Second, I present performance and scaling models for ANN algorithms on state-of-the-art Graphics Processing Unit (GPU) based parallel compute clusters. | en |
dc.description.abstractgeneral | Artificial Neural Networks (ANNs) have been established as one of the most important algorithmic tools in the Machine Learning (ML) toolbox over the past few decades. ANNs’ recent rise to widespread acceptance can be attributed to two developments: (1) the availability of large-scale training and testing datasets; and (2) the availability of new computer architectures for which ANN implementations are orders of magnitude more efficient. In this thesis, I present research on two aspects of the second development. First, I present a portable, open source implementation of ANNs in OpenCL and MPI. Second, I present performance and scaling models for ANN algorithms on state-of-the-art Graphics Processing Unit (GPU) based parallel compute clusters. | en |
dc.description.degree | Master of Science | en |
dc.format.medium | ETD | en |
dc.identifier.other | vt_gsexam:17090 | en |
dc.identifier.uri | http://hdl.handle.net/10919/85020 | en |
dc.publisher | Virginia Tech | en |
dc.rights | In Copyright | en |
dc.rights.uri | http://rightsstatements.org/vocab/InC/1.0/ | en |
dc.subject | artificial neural networks | en |
dc.subject | Machine learning | en |
dc.subject | heterogenous computing | en |
dc.subject | parallel computing | en |
dc.title | Scalability Analysis of Synchronous Data-Parallel Artificial Neural Network (ANN) Learners | en |
dc.type | Thesis | en |
thesis.degree.discipline | Computer Engineering | en |
thesis.degree.grantor | Virginia Polytechnic Institute and State University | en |
thesis.degree.level | masters | en |
thesis.degree.name | Master of Science | en |