Kannan, RohitBayraksan, GuezinLuedtke, James R.2025-02-182025-02-182023-09-260025-5610https://hdl.handle.net/10919/124626We consider data-driven approaches that integrate a machine learning prediction model within distributionally robust optimization (DRO) given limited joint observations of uncertain parameters and covariates. Our framework is flexible in the sense that it can accommodate a variety of regression setups and DRO ambiguity sets. We investigate asymptotic and finite sample properties of solutions obtained using Wasserstein, sample robust optimization, and phi-divergence-based ambiguity sets within our DRO formulations, and explore cross-validation approaches for sizing these ambiguity sets. Through numerical experiments, we validate our theoretical results, study the effectiveness of our approaches for sizing ambiguity sets, and illustrate the benefits of our DRO formulations in the limited data regime even when the prediction model is misspecified.Pages 369-42557 page(s)application/pdfenIn CopyrightData-driven stochastic programmingDistributionally robust optimizationWasserstein distancePhi-divergencesCovariatesMachine learningConvergence rateLarge deviationsResiduals-based distributionally robust optimization with covariate informationArticle - RefereedMathematical Programminghttps://doi.org/10.1007/s10107-023-02014-72071-2Kannan, Rohit [0000-0002-7963-7682]1436-4646