Browsing by Author "Albahar, Hadeel Ahmad"
Now showing 1 - 1 of 1
Results Per Page
Sort Options
- Optimizing Systems for Deep Learning ApplicationsAlbahar, Hadeel Ahmad (Virginia Tech, 2023-03-01)Modern systems for Machine Learning (ML) workloads support heterogeneous workloads and resources. However, existing resource managers in these systems do not differentiate between heterogeneous GPU resources. Moreover, users are usually unaware of the sufficient and appropriate type and amount of GPU resources to request for their ML jobs. In this thesis, we analyze the performance of ML training and inference jobs and identify ML model and GPU characteristics that impact this performance. We then propose ML-based prediction models to accurately determine appropriate and sufficient resource requirements to ensure improved job latency and GPU utilization in the cluster.