VTechWorks staff will be away for the winter holidays until January 5, 2026, and will respond to requests at that time.
 

Beyond the Checkbox: Leveraging AI Chatbots for Inclusive Demographic Data Collection

dc.contributor.authorChekili, Amelen
dc.contributor.committeechairHernandez, Jorge Ivanen
dc.contributor.committeememberDiana, Rachel A.en
dc.contributor.committeememberHickman, Louisen
dc.contributor.committeememberHsu, Ningen
dc.contributor.departmentPsychologyen
dc.date.accessioned2025-09-20T08:01:05Zen
dc.date.available2025-09-20T08:01:05Zen
dc.date.issued2025-09-19en
dc.description.abstractTraditional demographic surveys compress rich identities into rigid checkboxes. This dissertation asks whether a conversational chatbot, powered by GPT-4o, can restore that nuance. In a within-subjects experiment, 230 participants completed both a chatbot conversation and the standard Office of Management and Budget (OMB) form. Exploratory analyses showed that participants' open-ended narratives frequently moved beyond the OMB labels. By encoding these responses with the INSTRUCTOR embedding model, and organizing them via hierarchical clustering, the categorization can be "cut" at multiple levels of granularity, producing solutions that can satisfy regulatory reporting and finer leaves that reveal national, regional, and mixed-heritage detail. Hypothesis-driven tests of user experience reinforced these advantages. On the User Experience Questionnaire, the chatbot outscored the demographic checklist on hedonic qualities, novelty, and stimulation, while the checklist retained pragmatic strengths such as dependability. Perceived group inclusivity also rose when data were collected through the chatbot, regardless of how closely respondents' identities aligned with OMB categories. Overall, the findings indicate that a carefully engineered chatbot, paired with advanced natural-language-processing analyses, can enhance race and ethnicity data collection by producing richer information and fostering a more inclusive, engaging respondent experience.en
dc.description.abstractgeneralMost surveys that ask about race or ethnicity limit respondents to a handful of checkboxes. These boxes make record-keeping simple, yet they flatten the richness of personal heritage. This dissertation investigates whether a conversational artificial-intelligence assistant can restore that nuance. A sample of 230 adults first completed the standard Office of Management and Budget race and ethnicity form, and then engaged in a short dialogue with a GPT-4o powered chatbot that encouraged open self-description. The conversation yielded responses that named specific countries, regions, and blended lineages that never appear on the official list. Natural-language software grouped the free-text answers into hierarchies. At the broadest level, the groupings still satisfied regulatory reporting. At the granulated levels, they revealed detailed threads of identity such as national origin and mixed heritage. Participants judged the chatbot to be more engaging, enjoyable, and welcoming than the traditional checklist, though the checklist remained slightly easier to finish. Feelings of inclusion also rose after interacting with the chatbot, regardless of how well respondents' identities aligned with government categories. The results demonstrate that a thoughtfully engineered chatbot can meet formal data requirements while allowing people to express who they truly are. This approach makes demographic information richer, more accurate, and more respectful of individual identity.en
dc.description.degreeDoctor of Philosophyen
dc.format.mediumETDen
dc.identifier.othervt_gsexam:44388en
dc.identifier.urihttps://hdl.handle.net/10919/137810en
dc.language.isoenen
dc.publisherVirginia Techen
dc.rightsCreative Commons Attribution-NonCommercial-NoDerivatives 4.0 Internationalen
dc.rights.urihttp://creativecommons.org/licenses/by-nc-nd/4.0/en
dc.subjectChatbotsen
dc.subjectDemographic data collectionen
dc.subjectRaceen
dc.subjectEthnicityen
dc.subjectNatural language processingen
dc.subjectInclusivityen
dc.titleBeyond the Checkbox: Leveraging AI Chatbots for Inclusive Demographic Data Collectionen
dc.typeDissertationen
thesis.degree.disciplinePsychologyen
thesis.degree.grantorVirginia Polytechnic Institute and State Universityen
thesis.degree.leveldoctoralen
thesis.degree.nameDoctor of Philosophyen

Files

Original bundle
Now showing 1 - 1 of 1
Name:
Chekili_A_D_2025.pdf
Size:
5.17 MB
Format:
Adobe Portable Document Format