Designing and modeling high-throughput phenotyping data in quantitative genetics

Yu, Haipeng

dc.contributor.author	Yu, Haipeng	en
dc.contributor.committeechair	Morota, Gota	en
dc.contributor.committeemember	Notter, David R.	en
dc.contributor.committeemember	Hoeschele, Ina	en
dc.contributor.committeemember	Saghai-Maroof, Mohammad A.	en
dc.contributor.committeemember	Bradford, Heather L.	en
dc.contributor.department	Animal and Poultry Sciences	en
dc.date.accessioned	2020-04-10T08:00:42Z	en
dc.date.available	2020-04-10T08:00:42Z	en
dc.date.issued	2020-04-09	en
dc.description.abstract	Quantitative genetics aims to bridge the genome-to-phenome gap. The advent of high-throughput genotyping technologies has accelerated the progress of genome-to-phenome mapping, but a challenge remains in phenotyping. Various high-throughput phenotyping (HTP) platforms have been developed recently to obtain economically important phenotypes in an automated fashion with less human labor and reduced costs. However, effective ways of designing HTP have not been investigated thoroughly. In addition, high-dimensional HTP data pose a major challenge for statistical analysis by increasing computational demands. A new strategy for modeling high-dimensional HTP data and elucidating the interrelationships among these phenotypes is needed. Previous studies used pedigree-based connectedness statistics to study the design of phenotyping. The availability of genetic markers provides a new opportunity to evaluate connectedness based on genomic data, which can serve as a means to design HTP. This dissertation first discusses the utility of connectedness across three studies. In the first study, I introduced genomic connectedness and compared it with traditional pedigree-based connectedness. The relationship between genomic connectedness and prediction accuracy based on cross-validation was investigated in the second study. The third study introduced a user-friendly connectedness R package, which provides a suite of functions to evaluate the extent of connectedness. In the last study, I proposed a new statistical approach to model high-dimensional HTP data by combining confirmatory factor analysis and Bayesian networks. Collectively, the results from the first three studies suggested the potential usefulness of applying genomic connectedness to design HTP. The statistical approach I introduced in the last study provides a new avenue to model high-dimensional HTP data holistically to further help us understand the interrelationships among phenotypes derived from HTP.	en
dc.description.abstractgeneral	Quantitative genetics aims to bridge the genome-to-phenome gap. With the advent of genotyping technologies, the genomic information of individuals can be included in a quantitative genetic model. A new challenge is to obtain sufficient and accurate phenotypes in an automated fashion with less human labor and reduced costs. High-throughput phenotyping (HTP) technologies have emerged recently, opening a new opportunity to address this challenge. However, there is a paucity of research on phenotyping design and on modeling high-dimensional HTP data. The main themes of this dissertation are 1) genomic connectedness, which could potentially be used as a means to design a phenotyping experiment, and 2) a novel statistical approach that aims to handle high-dimensional HTP data. In the first three studies, I first compared genomic connectedness with pedigree-based connectedness. This was followed by an investigation of the relationship between genomic connectedness and prediction accuracy derived from cross-validation. Additionally, I developed a connectedness R package that implements a variety of connectedness measures. The fourth study investigated a novel statistical approach that combines dimension reduction and graphical models to understand the interrelationships among high-dimensional HTP data.	en
dc.description.degree	Doctor of Philosophy	en
dc.format.medium	ETD	en
dc.identifier.other	vt_gsexam:24611	en
dc.identifier.uri	http://hdl.handle.net/10919/97579	en
dc.publisher	Virginia Tech	en
dc.rights	In Copyright	en
dc.rights.uri	http://rightsstatements.org/vocab/InC/1.0/	en
dc.subject	bayesian network	en
dc.subject	factor analysis	en
dc.subject	genomic connectedness	en
dc.subject	genomic prediction	en
dc.subject	high-throughput phenotyping data	en
dc.title	Designing and modeling high-throughput phenotyping data in quantitative genetics	en
dc.type	Dissertation	en
thesis.degree.discipline	Animal and Poultry Sciences	en
thesis.degree.grantor	Virginia Polytechnic Institute and State University	en
thesis.degree.level	doctoral	en
thesis.degree.name	Doctor of Philosophy	en

Files

Original bundle

Name: Yu_H_D_2020.pdf
Size: 2.17 MB
Format: Adobe Portable Document Format

Collections

Doctoral Dissertations