Search
Now showing items 1-9 of 9
GPU Power Prediction via Ensemble Machine Learning for DVFS Space Exploration
(Department of Computer Science, Virginia Polytechnic Institute & State University, 2018-02-02), TR-18-01
A software-based approach to achieve high performance within a power budget often involves dynamic voltage and frequency scaling (DVFS). Consequently, accurately predicting the power consumption of an application at different ...
AutoMatch: Automated Matching of Compute Kernels to Heterogeneous HPC Architectures
(Department of Computer Science, Virginia Polytechnic Institute & State University, 2016-12-13), TR-16-06
Abstract—HPC systems contain a wide variety of heterogeneous computing resources, ranging from general-purpose CPUs to specialized accelerators. Porting sequential applications to such systems for achieving high performance ...
A Composable Workflow for Productive FPGA Computing via Whole-Program Analysis and Transformation (with Code Excerpts)
(Department of Computer Science, Virginia Polytechnic Institute & State University, 2018-07-24), TR-18-05
We present a composable workflow to enable highly-productive heterogeneous computing on FPGAs. The workflow consists of a trio of static analysis and transformation tools: (1) a whole-program, source-to-source translator ...
SOLR supported search on an OpenStack metadata service
(Virginia Tech, 2017-08-26)
In cloud computing, the use of databases, particularly the MySQL database system, is a common practice. While the MySQL database system has advantages such as consistency and transaction support, some software architects ...
CommAnalyzer: Automated Estimation of Communication Cost on HPC Clusters Using Sequential Code
(Department of Computer Science, Virginia Polytechnic Institute & State University, 2017-08-14), TR-17-04
MPI+X is the de facto standard for programming applications on HPC clusters. The performance and scalability on such systems is limited by the communication cost on different number of processes and compute nodes. Therefore, ...
Compiler-Directed Failure Atomicity for Nonvolatile Memory
(Department of Computer Science, Virginia Polytechnic Institute & State University, 2019-07-15), TR-19-02
This paper presents iDO, a compiler-directed approach to failure atomicity with nonvolatile memory. Unlike most prior work, which instruments each store of persistent data for redo or undo logging, the iDO compiler identifies ...
ETH: A Framework for the Design-Space Exploration of Extreme-Scale Visualization
(Department of Computer Science, Virginia Polytechnic Institute & State University, 2017-09-29), TR-17-05
As high-performance computing (HPC) moves towards the exascale era, large-scale scientific simulations are generating enormous datasets. A variety of techniques (e.g., in-situ methods, data sampling, and compression) have ...
MOANA: Modeling and Analyzing I/O Variability in Parallel System Experimental Design
(Department of Computer Science, Virginia Polytechnic Institute & State University, 2018-04-19), TR-18-04
Exponential increases in complexity and scale make variability a growing threat to sustaining HPC performance at exascale. Performance variability in HPC I/O is common, acute, and formidable. We take the first step towards ...
Personal Reflections on 50 Years of Scientific Computing: 1967–2017
(Department of Computer Science, Virginia Polytechnic Institute & State University, 2017-08-10), TR-17-03
Computer hardware, software, numerical algorithms, and science and engineering applications are traced for a half century from the author's perspective.