Utilizing genomics and historical data to optimize gene pools for new breeding programs: A case study in winter wheat

Ballen-Taborda, Carolina; Lyerly, Jeanette; Smith, Jared; Howell, Kimberly; Brown-Guedira, Gina; Babar, Md. Ali; Harrison, Stephen A.; Mason, Richard E.; Mergoum, Mohamed; Murphy, J. Paul; Sutton, Russell; Griffey, Carl A.; Boyles, Richard E.

dc.contributor.author	Ballen-Taborda, Carolina	en
dc.contributor.author	Lyerly, Jeanette	en
dc.contributor.author	Smith, Jared	en
dc.contributor.author	Howell, Kimberly	en
dc.contributor.author	Brown-Guedira, Gina	en
dc.contributor.author	Babar, Md. Ali	en
dc.contributor.author	Harrison, Stephen A.	en
dc.contributor.author	Mason, Richard E.	en
dc.contributor.author	Mergoum, Mohamed	en
dc.contributor.author	Murphy, J. Paul	en
dc.contributor.author	Sutton, Russell	en
dc.contributor.author	Griffey, Carl A.	en
dc.contributor.author	Boyles, Richard E.	en
dc.date.accessioned	2023-05-02T19:08:46Z	en
dc.date.available	2023-05-02T19:08:46Z	en
dc.date.issued	2022-10	en
dc.description.abstract	With the rapid generation and preservation of both genomic and phenotypic information for many genotypes within crops and across locations, emerging breeding programs have a valuable opportunity to leverage these resources to 1) establish the most appropriate genetic foundation at program inception and 2) implement robust genomic prediction platforms that can effectively select future breeding lines. Integrating genomics-enabled breeding into cultivar development can save costs and allow resources to be reallocated towards advanced (i.e., later) stages of field evaluation, which can facilitate an increased number of testing locations and replicates within locations. In this context, a reestablished winter wheat breeding program was used as a case study to understand best practices to leverage and tailor existing genomic and phenotypic resources to determine optimal genetics for a specific target population of environments. First, historical multi-environment phenotype data, representing 1,285 advanced breeding lines, were compiled from multi-institutional testing as part of the SunGrains cooperative and used to produce genotype main effect plus genotype-by-environment interaction (GGE) biplots and principal component analysis (PCA) for yield. Locations within the target population of environments were clustered into 22 subsets based on highly correlated line performance. For each subset, estimated marginal means (EMMs) and best linear unbiased predictions (BLUPs) were calculated using linear models with the 'lme4' R package. Second, for each subset, training populations (TPs) representative of the new South Carolina (SC) breeding lines were determined based on genetic relatedness using the 'STPGA' R package. Third, for each TP, phenotypic values and single-nucleotide polymorphism (SNP) data were incorporated into 'rrBLUP' mixed models to generate genomic estimated breeding values (GEBVs) for grain yield (YLD), test weight (TW), heading date (HD) and plant height (PH). Using a five-fold cross-validation strategy, an average accuracy of r = 0.42 was obtained for yield across all TPs. Validation with 58 SC elite breeding lines resulted in an accuracy of r = 0.62 when the TP included complete historical data. Lastly, quantitative trait locus (QTL)-by-environment interaction for 18 major effect genes across three geographic regions was examined. Lines harboring a major QTL could potentially underperform in the absence of disease (e.g., Fhb1 R-gene), whereas it is advantageous to express a major QTL under biotic pressure (e.g., stripe rust R-gene). This study highlights the importance of genomics-enabled breeding and multi-institutional partnerships to accelerate cultivar development.	en
dc.description.notes	This work was supported by the USDA NIFA AFRI Foundational project SC-2020-03599 awarded to REB (award no. 2021-67014-33941) and the Sun Grains cooperative breeding program.	en
dc.description.sponsorship	USDA NIFA AFRI Foundational project; Sun Grains cooperative breeding program [SC-2020-03599, 2021-67014-33941]	en
dc.description.version	Published version	en
dc.format.mimetype	application/pdf	en
dc.identifier.doi	https://doi.org/10.3389/fgene.2022.964684	en
dc.identifier.eissn	1664-8021	en
dc.identifier.other	964684	en
dc.identifier.pmid	36276956	en
dc.identifier.uri	http://hdl.handle.net/10919/114895	en
dc.identifier.volume	13	en
dc.language.iso	en	en
dc.publisher	Frontiers	en
dc.rights	Creative Commons Attribution 4.0 International	en
dc.rights.uri	http://creativecommons.org/licenses/by/4.0/	en
dc.subject	breeding	en
dc.subject	winter wheat (Triticum aestivum L.)	en
dc.subject	historical data	en
dc.subject	training populations	en
dc.subject	genomic selection	en
dc.subject	prediction accuracy	en
dc.subject	yield	en
dc.title	Utilizing genomics and historical data to optimize gene pools for new breeding programs: A case study in winter wheat	en
dc.title.serial	Frontiers in Genetics	en
dc.type	Article - Refereed	en
dc.type.dcmitype	Text	en
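
The genomic prediction step summarized in the abstract (marker-based mixed models evaluated by five-fold cross-validation) can be illustrated with a minimal sketch. This is not the authors' pipeline: the study used the 'rrBLUP' R package, which estimates variance components from the data, whereas the sketch below uses plain ridge regression with a fixed shrinkage parameter `lam` as a stand-in. The function names, the marker coding (-1/0/1), and all data are hypothetical and simulated purely for illustration.

```python
import numpy as np


def ridge_marker_effects(Z, y, lam):
    """Estimate marker effects u by ridge regression.

    Solves (Z'Z + lam * I) u = Z'(y - mean(y)). The shrinkage parameter
    lam is fixed here (an assumption); rrBLUP would estimate it.
    """
    mu = y.mean()
    n_markers = Z.shape[1]
    lhs = Z.T @ Z + lam * np.eye(n_markers)
    u = np.linalg.solve(lhs, Z.T @ (y - mu))
    return mu, u


def five_fold_accuracy(Z, y, lam=1.0, k=5, seed=0):
    """Mean Pearson r between predicted and observed phenotypes over k folds."""
    rng = np.random.default_rng(seed)
    folds = np.array_split(rng.permutation(len(y)), k)
    correlations = []
    for test_idx in folds:
        train_idx = np.setdiff1d(np.arange(len(y)), test_idx)
        mu, u = ridge_marker_effects(Z[train_idx], y[train_idx], lam)
        predicted = mu + Z[test_idx] @ u  # predicted genetic values
        correlations.append(np.corrcoef(predicted, y[test_idx])[0, 1])
    return float(np.mean(correlations))


# Simulated data only: 200 lines, 500 SNPs coded -1/0/1.
rng = np.random.default_rng(42)
n_lines, n_snps = 200, 500
Z = rng.choice([-1.0, 0.0, 1.0], size=(n_lines, n_snps))
true_effects = rng.normal(0.0, 0.1, n_snps)
y = Z @ true_effects + rng.normal(0.0, 1.0, n_lines)

accuracy = five_fold_accuracy(Z, y, lam=100.0)
print(f"mean five-fold prediction accuracy: r = {accuracy:.2f}")
```

The value printed depends entirely on the simulated data and the chosen `lam`; it is not comparable to the accuracies reported in the abstract.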

Files

Original bundle

Name: fgene-13-964684.pdf
Size: 2.01 MB
Format: Adobe Portable Document Format
Description: Published version

Collections

Scholarly Works, School of Plant and Environmental Sciences