Transforming SDOH Screening: Towards a General Framework for Transformer-based Prediction of Social Determinants of Health
dc.contributor.author | King III, Kenneth Hale | en |
dc.contributor.committeechair | Gracanin, Denis | en |
dc.contributor.committeemember | Luther, Kurt | en |
dc.contributor.committeemember | Azab, Mohamed Mahmoud Mahmoud | en |
dc.contributor.department | Computer Science and#38; Applications | en |
dc.date.accessioned | 2024-09-10T08:00:26Z | en |
dc.date.available | 2024-09-10T08:00:26Z | en |
dc.date.issued | 2024-09-09 | en |
dc.description.abstract | Social Determinants of Health (SDOH) play a crucial role in healthcare outcomes, yet identifying them from unstructured patient data remains a challenge. This research explores the potential of Large Language Models (LLMs) for automated SDOH identification from patient notes. We propose a general framework for SDOH screening that is simple and straightforward. We leverage existing SDOH datasets, adapting and combining them to create a more comprehensive benchmark for this task, addressing the research gap of limited datasets. Using the benchmark and proposed framework, we conclude by conducting several preliminary experiments exploring and comparing promising LLM system implementations. Our findings highlight the potential of LLMs for automated SDOH screening while emphasizing the need for more robust datasets and evaluation frameworks. | en |
dc.description.abstractgeneral | Social Determinants of Health (SDOH) have been shown to significantly impact health outcomes and are seen as a major contributor to global health inequities. However, their use within the healthcare industry is still significantly under emphasized, largely due to the difficulty of manually identifying SDOH factors. While previous works have explored automated approaches for SDOH identification, they lack standardization, data transparency and robustness, and are largely outdated compared to the latest Artificial Intelligence (AI) approaches. Therefore, in this work we propose a holistic framework for automated SDOH identification. We also present a higher quality SDOH benchmark, merging existing publicly available datasets, standardizing them, and cleaning them for errors. With this benchmark, we then conducted experiments to gain greater insights into the best performance across different state-of-the-art AI approaches. Through this work, we contribute a better way to think about automated SDOH screening systems, the first publicly accessible multi-clinic and multi-annotator benchmark, as well as greater insights into the latest AI approaches for state-of-the-art results. | en |
dc.description.degree | Master of Science | en |
dc.format.medium | ETD | en |
dc.identifier.other | vt_gsexam:41355 | en |
dc.identifier.uri | https://hdl.handle.net/10919/121097 | en |
dc.language.iso | en | en |
dc.publisher | Virginia Tech | en |
dc.rights | Creative Commons Attribution 4.0 International | en |
dc.rights.uri | http://creativecommons.org/licenses/by/4.0/ | en |
dc.subject | Generative AI | en |
dc.subject | Large Language Models | en |
dc.subject | Social Determinants of Health | en |
dc.title | Transforming SDOH Screening: Towards a General Framework for Transformer-based Prediction of Social Determinants of Health | en |
dc.type | Thesis | en |
thesis.degree.discipline | Computer Science & Applications | en |
thesis.degree.grantor | Virginia Polytechnic Institute and State University | en |
thesis.degree.level | masters | en |
thesis.degree.name | Master of Science | en |
Files
Original bundle
1 - 1 of 1