Hierarchical Gaussian Processes for Spatially Dependent Model Selection
dc.contributor.author | Fry, James Thomas | en |
dc.contributor.committeechair | Leman, Scotland C. | en |
dc.contributor.committeemember | Resler, Lynn M. | en |
dc.contributor.committeemember | Gramacy, Robert B. | en |
dc.contributor.committeemember | Smith, Eric P. | en |
dc.contributor.department | Statistics | en |
dc.date.accessioned | 2018-07-19T08:00:32Z | en |
dc.date.available | 2018-07-19T08:00:32Z | en |
dc.date.issued | 2018-07-18 | en |
dc.description.abstract | In this dissertation, we develop a model selection and estimation methodology for nonstationary spatial fields. Large, spatially correlated data often cover a vast geographical area. However, local spatial regions may have different mean and covariance structures. Our methodology accomplishes three goals: (1) cluster locations into small regions with distinct, stationary models, (2) perform Bayesian model selection within each cluster, and (3) correlate the model selection and estimation in nearby clusters. We utilize the Conditional Autoregressive (CAR) model and Ising distribution to provide intra-cluster correlation on the linear effects and model inclusion indicators, while modeling inter-cluster correlation with separate Gaussian processes. We apply our model selection methodology to a dataset involving the prediction of Brook trout presence in subwatersheds across Pennsylvania. We find that our methodology outperforms the stationary spatial model and that different regions in Pennsylvania are governed by separate Gaussian process regression models. | en |
dc.description.abstractgeneral | In this dissertation, we develop a statistical methodology for analyzing data where observations are related to each other due to spatial proximity. Our overall goal is to determine which attributes are important when predicting the response of interest. However, the effect and importance of an attribute may differ depending on the spatial location of the observation. Our methodology accomplishes three goals: (1) partition the observations into small spatial regions, (2) determine which attributes are important within each region, and (3) enforce that the importance of variables should be similar in regions that are near each other. We apply our technique to a dataset involving the prediction of Brook trout presence in subwatersheds across Pennsylvania. | en |
dc.description.degree | Ph. D. | en |
dc.format.medium | ETD | en |
dc.identifier.other | vt_gsexam:16674 | en |
dc.identifier.uri | http://hdl.handle.net/10919/84161 | en |
dc.publisher | Virginia Tech | en |
dc.rights | In Copyright | en |
dc.rights.uri | http://rightsstatements.org/vocab/InC/1.0/ | en |
dc.subject | spatial statistics | en |
dc.subject | Gaussian process | en |
dc.subject | model selection | en |
dc.subject | nonstationary process | en |
dc.subject | Ising distribution | en |
dc.subject | CAR model | en |
dc.title | Hierarchical Gaussian Processes for Spatially Dependent Model Selection | en |
dc.type | Dissertation | en |
thesis.degree.discipline | Statistics | en |
thesis.degree.grantor | Virginia Polytechnic Institute and State University | en |
thesis.degree.level | doctoral | en |
thesis.degree.name | Ph. D. | en |
Files
Original bundle
1 - 1 of 1