Integrating genomic and infrared spectral data improves the prediction of milk protein composition in dairy cattle

dc.contributor.authorBaba, Toshimien
dc.contributor.authorPegolo, Saraen
dc.contributor.authorMota, Lucio Flavio Macedoen
dc.contributor.authorPeñagaricano, Franciscoen
dc.contributor.authorBittante, Giovannien
dc.contributor.authorCecchinato, Alessioen
dc.contributor.authorMorota, Gotaen
dc.date.accessioned2021-03-22T11:57:45Zen
dc.date.available2021-03-22T11:57:45Zen
dc.date.issued2021-03-16en
dc.date.updated2021-03-21T04:24:58Zen
dc.description.abstractAbstract Background Over the past decade, Fourier transform infrared (FTIR) spectroscopy has been used to predict novel milk protein phenotypes. Genomic data might help predict these phenotypes when integrated with milk FTIR spectra. The objective of this study was to investigate prediction accuracy for milk protein phenotypes when heterogeneous on-farm, genomic, and pedigree data were integrated with the spectra. To this end, we used the records of 966 Italian Brown Swiss cows with milk FTIR spectra, on-farm information, medium-density genetic markers, and pedigree data. True and total whey protein, and five casein, and two whey protein traits were analyzed. Multiple kernel learning constructed from spectral and genomic (pedigree) relationship matrices and multilayer BayesB assigning separate priors for FTIR and markers were benchmarked against a baseline partial least squares (PLS) regression. Seven combinations of covariates were considered, and their predictive abilities were evaluated by repeated random sub-sampling and herd cross-validations (CV). Results Addition of the on-farm effects such as herd, days in milk, and parity to spectral data improved predictions as compared to those obtained using the spectra alone. Integrating genomics and/or the top three markers with a large effect further enhanced the predictions. Pedigree data also improved prediction, but to a lesser extent than genomic data. Multiple kernel learning and multilayer BayesB increased predictive performance, whereas PLS did not. Overall, multilayer BayesB provided better predictions than multiple kernel learning, and lower prediction performance was observed in herd CV compared to repeated random sub-sampling CV. Conclusions Integration of genomic information with milk FTIR spectral can enhance milk protein trait predictions by 25% and 7% on average for repeated random sub-sampling and herd CV, respectively. Multiple kernel learning and multilayer BayesB outperformed PLS when used to integrate heterogeneous data for phenotypic predictions.en
dc.description.versionPublished versionen
dc.format.mimetypeapplication/pdfen
dc.identifier.citationGenetics Selection Evolution. 2021 Mar 16;53(1):29en
dc.identifier.doihttps://doi.org/10.1186/s12711-021-00620-7en
dc.identifier.urihttp://hdl.handle.net/10919/102753en
dc.language.isoenen
dc.rightsCreative Commons Attribution 4.0 Internationalen
dc.rights.holderThe Author(s)en
dc.rights.urihttp://creativecommons.org/licenses/by/4.0/en
dc.titleIntegrating genomic and infrared spectral data improves the prediction of milk protein composition in dairy cattleen
dc.title.serialGenetics Selection Evolutionen
dc.typeArticle - Refereeden
dc.type.dcmitypeTexten
dc.type.dcmitypeStillImageen
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
12711_2021_Article_620.pdf
Size:
3.26 MB
Format:
Adobe Portable Document Format
License bundle
Now showing 1 - 1 of 1
Name:
license.txt
Size:
0 B
Format:
Item-specific license agreed upon to submission
Description: