FADS-Fusion: A Post-Flood Assessment Using Dempster-Shafer Fusion for Segmentation and Uncertainty Mapping
Files
TR Number
Date
Authors
Journal Title
Journal ISSN
Volume Title
Publisher
Abstract
Machine Learning (ML) modeling for disaster management is a growing field, but existing works focus more on mapping the extent of floods or broad categories of damage and they lack methods for explainability to help users understand model outputs. In this study, we propose Flood Assessment using Dempster–Shafer Fusion (FADS-Fusion), a tool for addressing post-flood damage assessment using Dempster–Shafer fusion to combine outputs from multiple deep learning models. FADS-Fusion is generalized to use any pretrained models, once outputs are post-processed for consistency, making it applicable for other disaster management or change detection applications. The novelty of our work comes from the application of Dempster–Shafer for multi-model fusion and uncertainty quantification on a flood dataset for segmenting both buildings and roads. We trained and evaluated models using the SpaceNet 8 challenge dataset and demonstrated that the fusion of the SpaceNet 8 Baseline (SN8) and Siamese Nested UNet (SNUNet) models has a modest overall improvement +1.93% to mAP, while a +12.3% increase for Precision and a −15.0% decrease in Recall are statistically significant compared to the baseline. FADS-Fusion also quantifies uncertainty by using the conflict of evidence, with a discount factor, with Dempster–Shafer fusion as both a quantitative and qualitative explainability method. While uncertainty correlates with a drop in performance, this relationship depends on values for class-weighted uncertainty and location. Mapping uncertainty back onto the original image allows for a visual inspection on fusion quality and indicates areas where a human will need to reassess. Our work demonstrates that FADS-Fusion improves post-flood segmentation performance and adds the benefit of uncertainty quantification for explainability, an aspect important for reliability and user decision-making but understudied in ML for disaster management in the literature.