Compute Overlap Stall (COS): Predicting Performance of Power Management for Shared Memory Codes When Throttling Processors, Memory, and Thread Concurrency
| dc.contributor.author | McCoy, Alexandra Kirin | en |
| dc.contributor.committeechair | Cameron, Kirk W. | en |
| dc.contributor.committeemember | Ellis, Margaret O.'Neil | en |
| dc.contributor.committeemember | Back, Godmar Volker | en |
| dc.contributor.department | Computer Science and#38; Applications | en |
| dc.date.accessioned | 2026-06-03T08:00:25Z | en |
| dc.date.available | 2026-06-03T08:00:25Z | en |
| dc.date.issued | 2026-06-02 | en |
| dc.description.abstract | Maximizing performance under power constraints is a priority for highly parallel scientific applications. Modern systems offer control over operating modes, including processor speed (DVFS), memory speed (DMT), and concurrency level (DCT). Throttling speed and core usage reduces energy consumption at the cost of possible performance loss. Accurate execution time prediction mechanisms are useful for choosing system configurations that yield workload efficiency. The Compute Overlap Stall model predicts execution time of parallel applications across these operating modes. The key insight of the model is that pure compute time, pure stall time, and compute-memory overlap are discretely affected by these three operating modes. We validate and update the model with an emergent architecture and reduce the size of the training set with negligible loss in prediction accuracy. We extend the model to support performance prediction for heterogeneous multi-core processors. We employ the optimized COS model on three architectures for 14 application benchmarks. We observe a mean prediction error within 10% for the homogeneous model, and within 13% for the heterogeneous-aware model for most applications. | en |
| dc.description.abstractgeneral | Large-scale computers incur high monetary and environmental costs to power and use. Because of this, power itself has become the bottleneck in data-center level systems. Modern computers expose interfaces for system administrators to slow down the machine and therefore save power. These techniques must be used in intelligent coordination with the desired workload to avoid sacrificing a timely job completion in the pursuit of power savings. This document analyzes two analytical performance models that predict the impacts of throttling three machine characteristics on job completion time. The document first adapts an existing model to a contemporary machine, and then extends the model to support accurate completion time prediction on an emergent computer architecture. | en |
| dc.description.degree | Master of Science | en |
| dc.format.medium | ETD | en |
| dc.identifier.other | vt_gsexam:46701 | en |
| dc.identifier.uri | https://hdl.handle.net/10919/143232 | en |
| dc.language.iso | en | en |
| dc.publisher | Virginia Tech | en |
| dc.rights | In Copyright | en |
| dc.rights.uri | http://rightsstatements.org/vocab/InC/1.0/ | en |
| dc.subject | Dynamic Voltage and Frequency Scaling | en |
| dc.subject | Dynamic Memory Throttling | en |
| dc.subject | Dynamic Concurrency Throttling | en |
| dc.subject | Execution Time Prediction | en |
| dc.subject | Linear Regression | en |
| dc.subject | Performance Prediction | en |
| dc.subject | Asymmetric Multiprocessing | en |
| dc.title | Compute Overlap Stall (COS): Predicting Performance of Power Management for Shared Memory Codes When Throttling Processors, Memory, and Thread Concurrency | en |
| dc.type | Thesis | en |
| thesis.degree.discipline | Computer Science & Applications | en |
| thesis.degree.grantor | Virginia Polytechnic Institute and State University | en |
| thesis.degree.level | masters | en |
| thesis.degree.name | Master of Science | en |
Files
Original bundle
1 - 1 of 1