AI-Driven Interpretation of Nonverbal Communication in AR-Enhanced Real-Time Captions: Effects on Cognitive Load, Comprehension, and User Engagement
| dc.contributor.author | Ubur, Sunday | en |
| dc.date.accessioned | 2025-08-12T13:04:14Z | en |
| dc.date.available | 2025-08-12T13:04:14Z | en |
| dc.date.issued | 2025-06-23 | en |
| dc.date.updated | 2025-08-01T07:49:44Z | en |
| dc.description.abstract | Current real-time captioning systems focus on transcribing speech, often overlooking facial expressions, body language, and vocal prosody that convey essential communicative cues. We present an AI-driven augmented reality (AR) captioning system that interprets non-verbal signals in real time and renders them as dynamic visual cues within the user’s view. Grounded in Cognitive Load Theory, cross-modal plasticity, and computational creativity, our approach supports Deaf and Hard of Hearing (DHH) and neurodiverse learners by transforming captions into creative, expressive media. We explore: (RQ1) how non-verbal cues affect comprehension, engagement, and creative interpretation; (RQ2) how cultural differences influence cue perception; and (RQ3) what AI and design strategies enable low-latency, customizable AR captions without increasing cognitive load. A user study shows 45% comprehension gains and 25% reduction in mental demand with emotional indicators in captions. Future work includes building a cross-cultural cue corpus, an open-source AR captioning pipeline, and design guidelines for inclusive STEM education, advancing accessibility and fostering creativity-driven communication. | en |
| dc.description.version | Published version | en |
| dc.format.mimetype | application/pdf | en |
| dc.identifier.doi | https://doi.org/10.1145/3698061.3734421 | en |
| dc.identifier.uri | https://hdl.handle.net/10919/137455 | en |
| dc.language.iso | en | en |
| dc.publisher | ACM | en |
| dc.rights | In Copyright (InC) | en |
| dc.rights.holder | The author(s) | en |
| dc.rights.uri | http://rightsstatements.org/vocab/InC/1.0/ | en |
| dc.title | AI-Driven Interpretation of Nonverbal Communication in AR-Enhanced Real-Time Captions: Effects on Cognitive Load, Comprehension, and User Engagement | en |
| dc.type | Article - Refereed | en |
| dc.type.dcmitype | Text | en |