Addressing uncertainty in LLM outputs for trust calibration through visualization and user interface design
| dc.contributor.author | Armstrong, Helen | en |
| dc.contributor.author | Anderson, Ashley Lynne | en |
| dc.contributor.author | Planchart, Rebecca | en |
| dc.contributor.author | Baidoo, Kweku | en |
| dc.contributor.author | Peterson, Matthew | en |
| dc.date.accessioned | 2026-01-29T14:52:39Z | en |
| dc.date.available | 2026-01-29T14:52:39Z | en |
| dc.date.issued | 2025-08-15 | en |
| dc.description.abstract | Large language models (LLMs) are becoming ubiquitous in knowledge work. However, the uncertainty inherent to LLM summary generation limits the efficacy of human-machine teaming, especially when users are unable to properly calibrate their trust in automation. Visual conventions for signifying uncertainty and interface design strategies for engaging users are needed to realize the full potential of LLMs. We report on an exploratory interdisciplinary project that resulted in four main contributions to explainable artificial intelligence in and beyond an intelligence analysis context. First, we provide and evaluate eight potential visual conventions for representing uncertainty in LLM summaries. Second, we describe a framework for uncertainty specific to LLM technology. Third, we specify 10 features for a proposed LLM validation system — the Multiple Agent Validation System (MAVS) — that utilizes the visual conventions, the framework, and three virtual agents to aid in language analysis. Fourth, we provide and describe four MAVS prototypes, one as an interactive simulation interface and the others as narrative interface videos. All four utilize a language analysis scenario to educate users on the potential of LLM technology in human-machine teams. To demonstrate applicability of the contributions beyond intelligence analysis, we also consider LLM-derived uncertainty in clinical decision-making in medicine and in climate forecasting. Ultimately, this investigation makes a case for the importance of visual and interface design in shaping the development of LLM technology. | en |
| dc.description.version | Published version | en |
| dc.format.extent | Pages 176-217 | en |
| dc.format.extent | 41 page(s) | en |
| dc.format.mimetype | application/pdf | en |
| dc.identifier.issue | 2 | en |
| dc.identifier.orcid | Anderson, Ashley [0000-0003-2361-7030] | en |
| dc.identifier.uri | https://hdl.handle.net/10919/141043 | en |
| dc.identifier.volume | 59 | en |
| dc.language.iso | en | en |
| dc.publisher | Visible Language Consortium | en |
| dc.rights | Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International | en |
| dc.rights.uri | http://creativecommons.org/licenses/by-nc-nd/4.0/ | en |
| dc.subject | explainable AI | en |
| dc.subject | human-machine teaming | en |
| dc.subject | intelligence analysis | en |
| dc.subject | large language models | en |
| dc.subject | trust calibration | en |
| dc.subject | uncertainty | en |
| dc.subject | user interface design | en |
| dc.subject | visual representation | en |
| dc.title | Addressing uncertainty in LLM outputs for trust calibration through visualization and user interface design | en |
| dc.title.serial | Visible Language | en |
| dc.type | Article - Refereed | en |
| dc.type.dcmitype | Text | en |
| dc.type.other | Article | en |
| pubs.organisational-group | Virginia Tech | en |
| pubs.organisational-group | Virginia Tech/Architecture, Arts, and Design | en |
| pubs.organisational-group | Virginia Tech/Architecture, Arts, and Design/School of Visual Arts | en |
| pubs.organisational-group | Virginia Tech/All T&R Faculty | en |
Files
Original bundle
1 - 1 of 1
Loading...
- Name:
- addressing-uncertainty-in-llm-outputs-for-trust-calibration-through-visualization-and-user-interface-design (2).pdf
- Size:
- 8.63 MB
- Format:
- Adobe Portable Document Format
- Description:
- Published version
License bundle
1 - 1 of 1