Interpretive Caption: Real-Time Vocal Emotion Cues for DHH Users

dc.contributor.author: Ubur, Sunday [en]
dc.contributor.author: Adewale, Sikiru [en]
dc.contributor.author: Chandrashekar, Nikitha [en]
dc.contributor.author: Akli, Enoch [en]
dc.contributor.author: Gracanin, Denis [en]
dc.date.accessioned: 2025-11-04T13:28:45Z [en]
dc.date.available: 2025-11-04T13:28:45Z [en]
dc.date.issued: 2025-10-26 [en]
dc.date.updated: 2025-11-01T07:45:53Z [en]
dc.description.abstract: Deaf and Hard-of-Hearing (DHH) individuals increasingly rely on real-time captioning to access spoken content in educational and professional settings. However, traditional captions omit vocal emotional cues, such as intonation and affect, which can hinder comprehension and engagement. This work introduces Interpretive Caption, a machine-learning prototype that augments captions with emotion-aware annotations derived from vocal tone. Using letter-coded tags with hover-based tooltips, the system conveys emotional context on demand, balancing clarity with cognitive accessibility. We conducted a qualitative study with eight DHH participants who interacted with the prototype and shared feedback on usability, emotional clarity, and layout design. Findings highlight the value of hover-based emotional cues, customization features, and segmentation aligned with cognitive load principles. Participants appreciated the non-intrusive emotional insights, while also identifying areas for improvement, including accent-inclusive emotion recognition and better mobile accessibility. Our contributions include a real-time captioning prototype integrating speech emotion recognition, a user-controllable emotion display interface, and design insights for affective accessibility in educational contexts. This work offers a foundation for inclusive, expressive captioning and informs future multimodal caption systems that prioritize interpretability, cultural sensitivity, and user agency. [en]
dc.description.version: Published version [en]
dc.format.mimetype: application/pdf [en]
dc.identifier.doi: https://doi.org/10.1145/3663547.3759697 [en]
dc.identifier.uri: https://hdl.handle.net/10919/138850 [en]
dc.language.iso: en [en]
dc.publisher: ACM [en]
dc.rights: In Copyright (InC) [en]
dc.rights.holder: The author(s) [en]
dc.rights.uri: http://rightsstatements.org/vocab/InC/1.0/ [en]
dc.title: Interpretive Caption: Real-Time Vocal Emotion Cues for DHH Users [en]
dc.type: Article - Refereed [en]
dc.type.dcmitype: Text [en]

Files

Original bundle
Name: 3663547.3759697.pdf
Size: 814.68 KB
Format: Adobe Portable Document Format
Description: Published version