Cross-stack Improvement on Memory Efficiency
| dc.contributor.author | Li, Yuze | en |
| dc.contributor.committeechair | Butt, Ali | en |
| dc.contributor.committeemember | Nikolopoulos, Dimitrios S. | en |
| dc.contributor.committeemember | Nikolaev, Ruslan | en |
| dc.contributor.committeemember | Li, Huaicheng | en |
| dc.contributor.committeemember | Williams, Daniel John | en |
| dc.contributor.committeemember | Sundararajah, Kirshanthan | en |
| dc.contributor.department | Computer Science and#38; Applications | en |
| dc.date.accessioned | 2026-05-27T08:01:05Z | en |
| dc.date.available | 2026-05-27T08:01:05Z | en |
| dc.date.issued | 2026-05-26 | en |
| dc.description.abstract | Modern data centers increasingly host memory-intensive applications that demand vast memory capacity. However, memory performance has not scaled at the same pace as compute performance, and indiscriminately expanding memory resources incurs substantial infrastructure and maintenance costs, ultimately limiting scalability and efficiency. By co-designing solutions across the programming-language runtime, compiler, operating system, and hardware, this dissertation systematically improves both the space efficiency and performance of memory systems in modern data centers. The first contribution, MTP, integrates fine-grained object hotness tracking into the CPython virtual machine to enable per-application memory tiering for Python workloads, allowing applications to run on a small fast tier backed by larger, cheaper CXL memory and lowering the effective per-gigabyte cost of memory. MTP infers object access patterns from reference-count deltas and employs an eagerness-aware migration policy that adapts to runtime memory behavior, outperforming TPP and AutoNUMA on the majority of 33 evaluated configurations and matching MEMTIS on a substantial fraction. The second contribution, CPM, restores secure memory sharing in cloud environments by decoupling cache sharing from memory sharing. CPM places all hardware defense logic within the memory controller, using a table-free merging mechanism that maps groups of unique physical page numbers to shared physical pages via a mathematical formula. The first hardware prototype of reuse cache side-channel defense, built on an Intel server with an FPGA-based CXL memory controller, eliminates Flush+Reload attacks across containers and VMs while improving performance by 2.2% on average and increasing VM consolidation density by 2.6x on 16GB of CXL memory over the current practice of disabling sharing. The third contribution, SABER, addresses the memory-level parallelism (MLP) destroyed by branch mispredictions. Hardware predictors fail on data-dependent branches, and existing software if-conversion operates too late in the compilation pipeline---in the target-dependent backend, where program semantics needed to reason about memory safety have been lost. SABER lifts this analysis to the target-independent middle-end IR, synthesizing software memory predication that reaches branches no prior pass could touch, and governs its application through a three-tier cost framework: a static model reasoning from IR structure, an offline profile-guided optimization (PGO) tier empirically calibrated against measured memory behavior, and an online tier adapting to input-dependent branch predictability at runtime. SABER delivers 13.6% geomean speedup on microbenchmarks and 4.0% on SPEC CPU 2017 with zero regressions, while transforming fewer than 10% of structurally eligible branches. | en |
| dc.description.abstractgeneral | Today's data centers power applications we use every day, from web search to large language models. These applications consume enormous amounts of memory, and while computing power continues to grow rapidly, memory technology has not kept pace, creating a widening gap that drives up costs and limits performance. This dissertation makes memory systems more efficient by working at multiple layers of the computer system: the programming language, the compiler, the operating system, and the hardware. No single layer has all the information needed to make memory work well, so progress requires coordinated changes across the stack. First, we extend Python, one of the most popular cloud languages, so that frequently used data automatically moves to fast memory while rarely used data is offloaded to slower, cheaper memory, lowering memory cost without sacrificing speed. Second, we design a hardware mechanism that lets different users on the same server safely share identical copies of data in memory; today, the risk of one user spying on another through shared memory forces cloud providers to keep wasteful redundant copies, and our work removes that risk so the copies can be eliminated. Third, we rethink how compilers handle the unpredictable branches that prevent the processor from overlapping multiple memory accesses---the main way modern hardware hides slow memory---using a staged analysis to remove only the branches where doing so actually helps. Together, these improvements let computing systems handle more data at lower cost, with greater speed and stronger security. | en |
| dc.description.degree | Doctor of Philosophy | en |
| dc.format.medium | ETD | en |
| dc.identifier.other | vt_gsexam:46754 | en |
| dc.identifier.uri | https://hdl.handle.net/10919/143157 | en |
| dc.language.iso | en | en |
| dc.publisher | Virginia Tech | en |
| dc.rights | In Copyright | en |
| dc.rights.uri | http://rightsstatements.org/vocab/InC/1.0/ | en |
| dc.subject | Memory System | en |
| dc.subject | Tiered Memory | en |
| dc.subject | Programming Language Virtual Machine | en |
| dc.subject | Compiler | en |
| dc.subject | Branch Elimination | en |
| dc.subject | CPU Pipeline | en |
| dc.subject | Operating System | en |
| dc.subject | Side-channel Attack | en |
| dc.title | Cross-stack Improvement on Memory Efficiency | en |
| dc.type | Dissertation | en |
| thesis.degree.discipline | Computer Science & Applications | en |
| thesis.degree.grantor | Virginia Polytechnic Institute and State University | en |
| thesis.degree.level | doctoral | en |
| thesis.degree.name | Doctor of Philosophy | en |
Files
Original bundle
1 - 1 of 1