Bayesian Model Averaging and Variable Selection in Multivariate Ecological Models

Lipkovich, Ilya A.

Files

BMADisserFinal.pdf (1.97 MB)

Date

2002-04-09

Authors

Lipkovich, Ilya A.

Publisher

Virginia Tech

Abstract

Bayesian Model Averaging (BMA) is a new area in modern applied statistics that provides data analysts with an efficient tool for discovering promising models and obtaining estimates of their posterior probabilities via Markov chain Monte Carlo (MCMC). These probabilities can be further used as weights for model-averaged predictions and estimates of the parameters of interest. As a result, variance components due to model selection are estimated and accounted for, contrary to the practice of conventional data analysis (for example, stepwise model selection). In addition, variable activation probabilities can be obtained for each variable of interest.

This dissertation is aimed at connecting BMA and various ramifications of the multivariate technique called Reduced-Rank Regression (RRR). In particular, we are concerned with Canonical Correspondence Analysis (CCA) in ecological applications where the data are represented by a site-by-species abundance matrix with site-specific covariates. Our goal is to incorporate multivariate techniques, such as Redundancy Analysis and Canonical Correspondence Analysis, into the general machinery of BMA, taking into account such complicating phenomena as outliers and clustering of observations within a single data-analysis strategy.

Traditional implementations of model averaging are concerned with selection of variables. We extend the methodology of BMA to selection of subgroups of observations and implement several approaches to cluster and outlier analysis in the context of the multivariate regression model. The proposed algorithm of cluster analysis can accommodate restrictions on the resulting partition of observations when some of them form sub-clusters that have to be preserved when larger clusters are formed.

Keywords

Canonical Correspondence Analysis, Outlier Analysis, Bayesian Model Averaging, Cluster Analysis

Persistent link

http://hdl.handle.net/10919/11045

Collections

Doctoral Dissertations
