Comparing Regression and Classification Models to Estimate Leaf Spot Disease in Peanut (Arachis hypogaea L.) for Implementation in Breeding Selection
dc.contributor.author | Chapu, Ivan | en |
dc.contributor.author | Chandel, Abhilash | en |
dc.contributor.author | Sie, Emmanuel Kofi | en |
dc.contributor.author | Okello, David Kalule | en |
dc.contributor.author | Oteng-Frimpong, Richard | en |
dc.contributor.author | Okello, Robert Cyrus Ongom | en |
dc.contributor.author | Hoisington, David | en |
dc.contributor.author | Balota, Maria | en |
dc.date.accessioned | 2024-05-24T13:40:26Z | en |
dc.date.available | 2024-05-24T13:40:26Z | en |
dc.date.issued | 2024-04-30 | en |
dc.date.updated | 2024-05-24T13:04:36Z | en |
dc.description.abstract | Late leaf spot (LLS) is an important disease of peanut, causing global yield losses. Developing resistant varieties through breeding is crucial for yield stability, especially for smallholder farmers. However, traditional phenotyping methods used for resistance selection are laborious and subjective. Remote sensing offers an accurate, objective, and efficient alternative for phenotyping for resistance. The objectives of this study were to compare between regression and classification for breeding, and to identify the best models and indices to be used for selection. We evaluated 223 genotypes in three environments: Serere in 2020, and Nakabango and Nyankpala in 2021. Phenotypic data were collected using visual scores and two handheld sensors: a red–green–blue (RGB) camera and GreenSeeker. RGB indices derived from the images, along with the normalized difference vegetation index (NDVI), were used to model LLS resistance using statistical and machine learning methods. Both regression and classification methods were also evaluated for selection. Random Forest (RF), the artificial neural network (ANN), and k-nearest neighbors (KNNs) were the top-performing algorithms for both regression and classification. The ANN (R<sup>2</sup>: 0.81, RMSE: 22%) was the best regression algorithm, while the RF was the best classification algorithm for both binary (90%) and multiclass (78% and 73% accuracy) classification. The classification accuracy of the models decreased with the increase in classification classes. NDVI, crop senescence index (CSI), hue, and greenness index were strongly associated with LLS and useful for selection. Our study demonstrates that the integration of remote sensing and machine learning can enhance selection for LLS-resistant genotypes, aiding plant breeders in managing large populations effectively. | en |
dc.description.version | Published version | en |
dc.format.mimetype | application/pdf | en |
dc.identifier.citation | Chapu, I.; Chandel, A.; Sie, E.K.; Okello, D.K.; Oteng-Frimpong, R.; Okello, R.C.O.; Hoisington, D.; Balota, M. Comparing Regression and Classification Models to Estimate Leaf Spot Disease in Peanut (Arachis hypogaea L.) for Implementation in Breeding Selection. Agronomy 2024, 14, 947. | en |
dc.identifier.doi | https://doi.org/10.3390/agronomy14050947 | en |
dc.identifier.uri | https://hdl.handle.net/10919/119114 | en |
dc.language.iso | en | en |
dc.publisher | MDPI | en |
dc.rights | Creative Commons Attribution 4.0 International | en |
dc.rights.uri | http://creativecommons.org/licenses/by/4.0/ | en |
dc.subject | peanut breeding | en |
dc.subject | late leaf spot | en |
dc.subject | machine learning | en |
dc.subject | remote sensing | en |
dc.subject | genotype resistance classification | en |
dc.title | Comparing Regression and Classification Models to Estimate Leaf Spot Disease in Peanut (Arachis hypogaea L.) for Implementation in Breeding Selection | en |
dc.title.serial | Agronomy | en |
dc.type | Article - Refereed | en |
dc.type.dcmitype | Text | en |