Responsibly Emboldening Predictions via Boldness-Recalibration

dc.contributor.authorGuthrie, Adeline Pearlen
dc.contributor.committeechairFranck, Christopher Thomasen
dc.contributor.committeememberLattimer, Alan Martinen
dc.contributor.committeememberXing, Xinen
dc.contributor.committeememberVan Mullekom, Jennifer Huffmanen
dc.contributor.departmentStatisticsen
dc.date.accessioned2025-05-20T08:04:24Zen
dc.date.available2025-05-20T08:04:24Zen
dc.date.issued2025-05-19en
dc.description.abstractProbability predictions are essential to inform decision making across many fields. Ideally, probability predictions are (i) well calibrated, (ii) accurate, and (iii) bold, i.e., spread out enough to be informative for decision making. However, there is a fundamental tension between calibration and boldness, since calibration metrics can be high when predictions are overly cautious, i.e., non-bold. The purpose of this work is to develop a Bayesian model selection-based approach to assess calibration, and a strategy for boldness-recalibration that enables practitioners to responsibly embolden predictions subject to their required level of calibration. Specifically, we allow the user to pre-specify their desired posterior probability of calibration, then maximally embolden predictions subject to this constraint. We demonstrate the method with a case study on hockey home team win probabilities and then verify the performance of our procedures via simulation. Additionally, we introduce BRcal, an R package implementing Boldness-Recalibration and supporting methodology. We reformulate boldness-recalibration as a nonlinear optimization of boldness with a nonlinear constraint on calibration, and describe how this is implemented in BRcal. The BRcal package is demonstrated using a case study on foreclosure prediction. Lastly, we extend the methods to account for underlying spatial association in the data and demonstrate via a case study on moose presence.en
dc.description.abstractgeneralPredictions in the form of the probability an event will or will not happen are essential to decision making across many areas of application. When probabilities are well calibrated, they reliably reflect how often that event happens. Consider a sports analyst claiming there is a 50\% chance your favorite team wins their next game against an opponent they should beat. This does not provide much information about the outcome of the game. However, if your team typically wins 50\% of their games, this prediction is well calibrated. A more informative prediction would be 80\%, since it seems likely they will win. This prediction is more bold and more useful for decision making. We propose statistical methodology that allows users to responsibly spread predictions out, closer to extremes of 0\% or 100\%, while maintaining a reasonable level of calibration. We also provide a corresponding software package implementing this work. We demonstrate the methods and software using two real-world case studies, pertaining to hockey and housing foreclosure predictions, and simulated data. We also extend the methods for data with spatial association, which is demonstrated via a case study involving moose presence.en
dc.description.degreeDoctor of Philosophyen
dc.format.mediumETDen
dc.identifier.othervt_gsexam:43561en
dc.identifier.urihttps://hdl.handle.net/10919/133153en
dc.language.isoenen
dc.publisherVirginia Techen
dc.rightsIn Copyrighten
dc.rights.urihttp://rightsstatements.org/vocab/InC/1.0/en
dc.subjectCalibrationen
dc.subjectBoldnessen
dc.subjectEstimationen
dc.subjectOptimizationen
dc.subjectBayesian Model Selectionen
dc.titleResponsibly Emboldening Predictions via Boldness-Recalibrationen
dc.typeDissertationen
thesis.degree.disciplineStatisticsen
thesis.degree.grantorVirginia Polytechnic Institute and State Universityen
thesis.degree.leveldoctoralen
thesis.degree.nameDoctor of Philosophyen

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Guthrie_AP_D_2025.pdf
Size:
10.64 MB
Format:
Adobe Portable Document Format