Achieving More with Less: Learning Generalizable Neural Networks With Less Labeled Data and Computational Overheads

dc.contributor.author: Bu, Jie
dc.contributor.committeechair: Karpatne, Anuj
dc.contributor.committeemember: Sandu, Adrian
dc.contributor.committeemember: Lourentzou, Ismini
dc.contributor.committeemember: Ramakrishnan, Narendran
dc.contributor.committeemember: Arora, Anish
dc.contributor.department: Computer Science and Applications
dc.date.accessioned: 2023-03-16T08:00:17Z
dc.date.available: 2023-03-16T08:00:17Z
dc.date.issued: 2023-03-15
dc.description.abstract: Recent advancements in deep learning have demonstrated its remarkable ability to learn generalizable patterns and relationships automatically from data in a number of mainstream applications. However, the generalization power of deep learning methods largely comes at the cost of working with very large datasets and using highly compute-intensive models. Many applications cannot afford the costs needed to ensure the generalizability of deep learning models. For instance, obtaining labeled data can be costly in scientific applications, and using large models may not be feasible in resource-constrained environments involving portable devices. This dissertation aims to improve efficiency in machine learning by exploring different ways to learn generalizable neural networks that require less labeled data and fewer computational resources. We demonstrate that using physics supervision in scientific problems can reduce the need for labeled data, thereby improving data efficiency without compromising model generalizability. Additionally, we investigate the potential of transfer learning powered by transformers in scientific applications as a promising direction for further improving data efficiency. On the computational efficiency side, we present two efforts for increasing the parameter efficiency of neural networks through novel architectures and structured network pruning.
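To make the physics-supervision idea in the abstract concrete, the following is a minimal, hypothetical sketch (not taken from the dissertation) of a training loop that combines a supervised loss on a small labeled set with a physics-residual penalty on unlabeled points. The model, the governing equation inside physics_residual, and the weight lambda_phys are illustrative assumptions.

# Hypothetical sketch: supervised loss on few labels + physics penalty on
# unlabeled collocation points (a generic physics-guided training setup).
import torch
import torch.nn as nn

model = nn.Sequential(nn.Linear(1, 64), nn.Tanh(), nn.Linear(64, 1))
optimizer = torch.optim.Adam(model.parameters(), lr=1e-3)
mse = nn.MSELoss()

# Toy data: a handful of labeled points and many unlabeled collocation points.
x_labeled = torch.rand(16, 1)
y_labeled = torch.sin(2 * torch.pi * x_labeled)        # stand-in targets
x_unlabeled = torch.rand(256, 1, requires_grad=True)   # no labels needed here

lambda_phys = 1.0  # weight of the physics term (assumed hyperparameter)

def physics_residual(x):
    # Residual of an assumed governing equation, e.g. du/dx - f(x) = 0.
    # Any known physics (ODE/PDE, conservation law) could be substituted;
    # this particular equation is purely for illustration.
    u = model(x)
    du_dx = torch.autograd.grad(u.sum(), x, create_graph=True)[0]
    return du_dx - 2 * torch.pi * torch.cos(2 * torch.pi * x)

for step in range(1000):
    optimizer.zero_grad()
    loss_data = mse(model(x_labeled), y_labeled)             # uses few labels
    loss_phys = physics_residual(x_unlabeled).pow(2).mean()  # label-free
    loss = loss_data + lambda_phys * loss_phys
    loss.backward()
    optimizer.step()

The physics term supervises the model on inputs that have no labels at all, which is how such approaches can trade labeled data for known domain constraints.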
dc.description.abstractgeneral: Deep learning is a powerful technique that can help us solve complex problems, but it often requires a lot of data and resources. This research aims to make deep learning more efficient so it can be applied in more situations. We propose ways to make deep learning models require less data and less computing power. For example, we leverage physics rules as additional information for training neural networks so they can learn from less labeled data, and we use a technique called transfer learning to leverage knowledge from data drawn from other distributions. Transfer learning may allow us to further reduce the need for labeled data in scientific applications. We also look at ways to make deep learning models use fewer computational resources by effectively reducing their sizes via novel architectures or by pruning out redundant structures.
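As a rough illustration of the structured-pruning idea mentioned above, the sketch below uses PyTorch's built-in pruning utilities to zero out whole convolutional filters. The layer, pruning ratio, and criterion are assumptions for illustration and may differ from the methods developed in the dissertation.

# Structured (channel-level) pruning sketch with PyTorch's prune utilities.
import torch
import torch.nn as nn
import torch.nn.utils.prune as prune

conv = nn.Conv2d(in_channels=16, out_channels=32, kernel_size=3, padding=1)

# Zero out 50% of the output filters (dim=0) with the smallest L2 norm.
prune.ln_structured(conv, name="weight", amount=0.5, n=2, dim=0)

# Entire filters are masked to zero, so those channels could later be removed
# from the architecture to save parameters and compute.
zeroed = (conv.weight.abs().sum(dim=(1, 2, 3)) == 0).sum().item()
print(f"zeroed filters: {zeroed} / {conv.out_channels}")

# Fold the pruning mask into the weight tensor permanently.
prune.remove(conv, "weight")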
dc.description.degree: Doctor of Philosophy
dc.format.medium: ETD
dc.identifier.other: vt_gsexam:36567
dc.identifier.uri: http://hdl.handle.net/10919/114108
dc.language.iso: en
dc.publisher: Virginia Tech
dc.rights: Creative Commons Attribution-NonCommercial 4.0 International
dc.rights.uri: http://creativecommons.org/licenses/by-nc/4.0/
dc.subject: Machine Learning Efficiency
dc.subject: Physics-guided Machine Learning
dc.subject: Efficient Neural Architecture
dc.subject: Neural Network Pruning
dc.subject: Transfer Learning
dc.title: Achieving More with Less: Learning Generalizable Neural Networks With Less Labeled Data and Computational Overheads
dc.type: Dissertation
thesis.degree.discipline: Computer Science and Applications
thesis.degree.grantor: Virginia Polytechnic Institute and State University
thesis.degree.level: doctoral
thesis.degree.name: Doctor of Philosophy

Files

Original bundle
Name: Bu_J_D_2023.pdf
Size: 6.13 MB
Format: Adobe Portable Document Format

Name: Bu_J_D_2023_support_1.pdf
Size: 81.64 KB
Format: Adobe Portable Document Format
Description: Supporting documents