Scalable Surrogates for Counts and Computer Experiments

dc.contributor.authorBarnett, Steven D.en
dc.contributor.committeechairGramacy, Robert B.en
dc.contributor.committeememberOsthus, Daviden
dc.contributor.committeememberHouse, Leanna L.en
dc.contributor.committeememberHigdon, Daviden
dc.contributor.departmentStatisticsen
dc.date.accessioned2026-02-27T09:00:09Zen
dc.date.available2026-02-27T09:00:09Zen
dc.date.issued2026-02-26en
dc.description.abstractData collected by the Interstellar Boundary Explorer (IBEX), recording counts of heliospheric energetic neutral atoms (ENAs), exhibit a phenomenon that has caused space scientists to revise hypotheses about the physical processes, and computer simulations under those models, that are in play at the boundary of our solar system. Providing estimates and associated uncertainty quantification (UQ) of the rate at which ENAs are generated is vital to theory development and validation. Gaussian processes (GPs) constitute an excellent nonparametric regression tool that can provide accurate out-of-sample prediction and UQ. But GPs are unconventional for modeling non-Gaussian observations, are inefficient on large training data, and struggle to model nonstationary response surfaces, all issues present in the IBEX application. To address this gap, I propose a fully Bayesian, Vecchia-approximated, Poisson deep GP surrogate model. I demonstrate its improved predictive capability over competitors through multiple simulated examples. Further, I develop a novel, fully Bayesian framework for solving Bayesian inverse problems, coupling a Poisson response with a Vecchia-approximated GP surrogate of an expensive simulator with high-dimensional output. I demonstrate the utility of this new framework via simulated scenarios in terms of recovering the "true" computer model parameters and enhancing prediction over models that rely exclusively on physical observations. I apply these new technologies to IBEX satellite data and associated computer models developed at Los Alamos National Laboratory.en
dc.description.abstractgeneralThe Interstellar Boundary Explorer (IBEX) orbits Earth and detects energetic neutral atoms (ENAs) originating from the edge of the heliosphere (the bubble-like structure that encompasses our solar system). Accurately estimating the rate at which ENAs are generated in different regions of the heliosphere is vital to the development and validation of theories concerning the behavior at the boundary of our solar system. Gaussian processes (GPs) are statistical models that are commonly used to predict such a response over a fine grid of unobserved locations. However, GPs have limitations that are exacerbated by the IBEX data: 1) GPs are not typically used to model count data; 2) GPs struggle to model responses that have notably contrasting behaviors in different regions of the input space; and 3) GPs are innately inadequate to model large-scale data in a reasonable amount of time. Therefore, I employ deep GPs (an extension of GPs that are better equipped for complex surfaces) and develop a sparse Poisson deep GP, making use of a GP approximation that overcomes the computational burden resulting from large-scale data. Further, I augment this modeling approach with output from an expensive computer simulation that attempts to represent the ENA rate throughout the heliosphere, based on a number of input parameters. The simulator output aids the model by filling in gaps typically present when observing a process such as the ENA rate. I demonstrate how both frameworks improve predictive capability over competitors through multiple simulated examples and apply it to data collected by the IBEX satellite and associated computer models developed at Los Alamos National Laboratory.en
dc.description.degreeDoctor of Philosophyen
dc.format.mediumETDen
dc.identifier.othervt_gsexam:45668en
dc.identifier.urihttps://hdl.handle.net/10919/141582en
dc.language.isoenen
dc.publisherVirginia Techen
dc.rightsCreative Commons Attribution 4.0 Internationalen
dc.rights.urihttp://creativecommons.org/licenses/by/4.0/en
dc.subjectBayesian inferenceen
dc.subjectGaussian processen
dc.subjectemulatoren
dc.subjectPoissonen
dc.subjectVecchia approximationen
dc.subjectelliptical slice samplingen
dc.subjectuncertainty quantificationen
dc.subjectmonotonicityen
dc.subjectheliospheric scienceen
dc.subjectIBEXen
dc.subjectcomputer model calibrationen
dc.titleScalable Surrogates for Counts and Computer Experimentsen
dc.typeDissertationen
thesis.degree.disciplineStatisticsen
thesis.degree.grantorVirginia Polytechnic Institute and State Universityen
thesis.degree.leveldoctoralen
thesis.degree.nameDoctor of Philosophyen

Files

Original bundle
Now showing 1 - 1 of 1
Name:
Barnett_SD_D_2026.pdf
Size:
25.55 MB
Format:
Adobe Portable Document Format