Modeling multiple phenotypes in wheat using data-driven genomic exploratory factor analysis and Bayesian network learning

dc.contributor.authorMomen, Mehdien
dc.contributor.authorBhatta, Madhaven
dc.contributor.authorHussain, Waseemen
dc.contributor.authorYu, Haipengen
dc.contributor.authorMorota, Gotaen
dc.contributor.departmentAnimal and Poultry Sciencesen
dc.date.accessioned2021-06-08T15:40:57Zen
dc.date.available2021-06-08T15:40:57Zen
dc.date.issued2021-01en
dc.description.abstractInferring trait networks from a large volume of genetically correlated diverse phenotypes such as yield, architecture, and disease resistance can provide information on the manner in which complex phenotypes are interrelated. However, studies on statistical methods tailored to multidimensional phenotypes are limited, whereas numerous methods are available for evaluating the massive number of genetic markers. Factor analysis operates at the level of latent variables predicted to generate observed responses. The objectives of this study were to illustrate the manner in which data-driven exploratory factor analysis can map observed phenotypes into a smaller number of latent variables and infer a genomic latent factor network using 45 agro-morphological, disease, and grain mineral phenotypes measured in synthetic hexaploid wheat lines (Triticum aestivum L.). In total, eight latent factors including grain yield, architecture, flag leaf-related traits, grain minerals, yellow rust, two types of stem rust, and leaf rust were identified as common sources of the observed phenotypes. The genetic component of the factor scores for each latent variable was fed into a Bayesian network to obtain a trait structure reflecting the genetic interdependency among traits. Three directed paths were consistently identified by two Bayesian network algorithms. Flag leaf-related traits influenced leaf rust, and yellow rust and stem rust influenced grain yield. Additional paths that were identified included flag leaf-related traits to minerals and minerals to architecture. This study shows that data-driven exploratory factor analysis can reveal smaller dimensional common latent phenotypes that are likely to give rise to numerous observed field phenotypes without relying on prior biological knowledge. The inferred genomic latent factor structure from the Bayesian network provides insights for plant breeding to simultaneously improve multiple traits, as an intervention on one trait will affect the values of focal phenotypes in an interrelated complex trait system.en
dc.description.notesVirginia Polytechnic Institute and State Universityen
dc.description.sponsorshipVirginia Polytechnic Institute and State Universityen
dc.description.versionPublished versionen
dc.format.mimetypeapplication/pdfen
dc.identifier.doihttps://doi.org/10.1002/pld3.304en
dc.identifier.eissn2475-4455en
dc.identifier.issue1en
dc.identifier.othere00304en
dc.identifier.pmid33532691en
dc.identifier.urihttp://hdl.handle.net/10919/103700en
dc.identifier.volume5en
dc.language.isoenen
dc.rightsCreative Commons Attribution 4.0 Internationalen
dc.rights.urihttp://creativecommons.org/licenses/by/4.0/en
dc.subjectBayesian networken
dc.subjectconfirmatory factor analysisen
dc.subjectexploratory factor analysisen
dc.subjectmulti-traiten
dc.subjectwheaten
dc.titleModeling multiple phenotypes in wheat using data-driven genomic exploratory factor analysis and Bayesian network learningen
dc.title.serialPlant Directen
dc.typeArticle - Refereeden
dc.type.dcmitypeTexten
dc.type.dcmitypeStillImageen

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
pld3.304.pdf
Size:
3.4 MB
Format:
Adobe Portable Document Format
Description:
Published version