Advancing Yield Predictions in Pinus taeda (L.): An Artificial Intelligence (AI) Approach Leveraging LiDAR-derived Individual Tree Crown (ITC) Metrics, Competition Indices (CI), and Satellite Remote Sensing Indices
| dc.contributor.author | Barua, Gunjan | en |
| dc.contributor.committeechair | Thomas, Valerie Anne | en |
| dc.contributor.committeechair | Carter, David Robert James | en |
| dc.contributor.committeemember | Pingel, Thomas | en |
| dc.contributor.committeemember | Sumnall, Matthew James | en |
| dc.contributor.committeemember | Green, Patrick Corey | en |
| dc.contributor.committeemember | Radtke, Philip J. | en |
| dc.contributor.department | Geography | en |
| dc.date.accessioned | 2026-05-09T08:00:23Z | en |
| dc.date.available | 2026-05-09T08:00:23Z | en |
| dc.date.issued | 2026-05-08 | en |
| dc.description.abstract | This study assessed high-resolution UAV-LiDAR and multi-sensor satellite time series (Sentinel-1 and Sentinel-2) utilizing non-parametric machine learning and deep learning architectures across diverse planting densities (618, 1,236, and 1,853 trees ha-1) and thinning regimes to: 1) evaluate the accuracy of individual tree yield predictions over a 4-year interval using LiDAR-derived structural metrics and competition indices (see Chapter 1); 2) assess the temporal transferability of a single-date LiDAR acquisition for forecasting annual yield over a 7-year horizon (see Chapter 2); and 3) evaluate the efficacy of fusing continuous optical and synthetic aperture radar (SAR) time-series data using deep learning sequence models for plot-level yield estimation (see Chapter 3). Chapter 1 results showed that machine learning models significantly outperformed traditional parametric methods for medium-term yield prediction. Support Vector Machine (SVM) achieved the highest individual tree-level accuracy (normalized root mean square error (nRMSE) of 9.59%, R2 of 0.59) and underestimated stand-level volume by only -1.50%, while Random Forest (RF) achieved an nRMSE of 10.86% (R2 of 0.48) and overestimated stand volume by 1.53%. Chapter 2 results demonstrated the temporal transferability of a single age-8 LiDAR acquisition to predict annual growth from age 9 to 15. The RF model maintained high stability across the 7-year horizon (R2 greater than or equal to 0.83). Permutation feature importance revealed a biological shift where early-year predictions relied on individual structural metrics (tree top height increased MSE by 161.84%), while later-year predictions were dominated by distance-dependent competition indices (up to a 200% increase in MSE). Chapter 3 results showed that deep learning sequence models (Gated Recurrent Unit (GRU) and Long Short-Term Memory (LSTM)) and RF successfully scaled predictions to the plot level (R2 of 0.49). GRU yielded the lowest overall error (RMSE of 60.38 m3ha-1, MAE of 34.28 m3ha-1). However, a distinct U-shaped error trend emerged across stand densities; the RF model achieved its lowest error in medium-density stands (618 to 1,236 trees ha-1; RMSE of 36.66 m3ha-1), while error rates sharply increased in high-density stands (greater than 1,237 trees ha-1; RMSE of 88.44 m3ha-1) due to signal saturation above 125 m3ha-1. Three primary conclusions come from this research: 1) non-parametric machine learning models utilizing individual tree crown metrics and competition indices accurately predict tree-level yield without violating the homoscedasticity assumptions that limit traditional linear models (see Chapter 1); 2) a single-date LiDAR acquisition is temporally transferable and captures fundamental ecological shifts from individual size-dependent growth to distance-dependent competition as canopy closure intensifies (see Chapter 2); and 3) deep learning architectures effectively fuse continuous SAR and optical satellite data for landscape-scale estimation, though sensor signal saturation remains a critical bottleneck in dense, mature plantations (see Chapter 3). From these findings, we hypothesize that deploying machine learning and deep learning models to integrate multi-scalar remote sensing data: 1) overcomes the spatial and logistical constraints of traditional plot-based field inventories, which 2) translates to highly accurate, continuous yield forecasting across varying silvicultural regimes, and 3) enables dynamic, data-driven precision forestry management, provided that sensor saturation thresholds in high-biomass stands are properly identified and mitigated. | en |
| dc.description.abstractgeneral | Loblolly pine plantations are vital to the economy and the environment, providing more than half of the timber supply in the United States. To manage these forests sustainably, managers need accurate predictions of how much wood (timber yield) the trees will produce over time. Traditionally, this requires sending crews into the forest to manually measure tree heights and diameters, which is a slow, expensive process that only covers a tiny fraction of the landscape. This dissertation research explored a modern, high-tech solution: combining 3D lasers mounted on drones (LiDAR), continuous satellite imagery, and Artificial Intelligence (AI) to automatically predict forest growth from the sky. First, the study proved that AI and machine learning algorithms are highly accurate at predicting how individual trees will grow over a four-year period, outperforming traditional linear models. Remarkably, the research showed that a single drone flight taking 3D scans of an 8-year-old forest captures enough detail to accurately forecast annual tree growth for the next seven years. The AI was even able to capture the natural biology of the forest, recognizing that young trees grow based on their individual size, but as the forest matures, growth is strictly limited by how fiercely trees must compete with their neighbors for space and sunlight. Next, the research scaled these predictions up to entire landscapes using free, continuously updating satellite sensors and radar. By using advanced neural networks, the computer successfully estimated timber volume across large forest plots. The technology worked exceptionally well in typical, actively managed forests. However, the study identified a specific technological limit: in extremely thick, dense forests, the satellite sensors become saturated. The canopy becomes so thick that the satellites can no longer see the additional wood accumulating underneath, leading the computer to underestimate the total volume of the biggest forests. Ultimately, this research demonstrates that combining drone and satellite technology with AI can revolutionize forest management. It offers a fast, cost-effective, and highly accurate alternative to manual fieldwork. By allowing foresters to continuously monitor and predict timber yield across vast areas, this data-driven approach supports smarter, more sustainable decisions for the future of commercial forestry. | en |
| dc.description.degree | Doctor of Philosophy | en |
| dc.format.medium | ETD | en |
| dc.identifier.other | vt_gsexam:45915 | en |
| dc.identifier.uri | https://hdl.handle.net/10919/143056 | en |
| dc.language.iso | en | en |
| dc.publisher | Virginia Tech | en |
| dc.rights | In Copyright | en |
| dc.rights.uri | http://rightsstatements.org/vocab/InC/1.0/ | en |
| dc.subject | Timber yield prediction | en |
| dc.subject | precision forestry | en |
| dc.subject | temporal transferability | en |
| dc.subject | neural networks | en |
| dc.subject | multispectral and SAR data | en |
| dc.title | Advancing Yield Predictions in Pinus taeda (L.): An Artificial Intelligence (AI) Approach Leveraging LiDAR-derived Individual Tree Crown (ITC) Metrics, Competition Indices (CI), and Satellite Remote Sensing Indices | en |
| dc.type | Dissertation | en |
| thesis.degree.discipline | Geospatial and Environmental Analysis | en |
| thesis.degree.grantor | Virginia Polytechnic Institute and State University | en |
| thesis.degree.level | doctoral | en |
| thesis.degree.name | Doctor of Philosophy | en |
Files
Original bundle
1 - 1 of 1