Evaluating Human-LLM Alignment in ETD Subject Classification
Abstract
Author-assigned subject labels in Electronic Theses and Dissertations (ETDs) are often inconsistent, overly broad, or misaligned with the research focus. This hampers discovery, aggregation, and analysis, especially for interdisciplinary research. Large language models (LLMs) offer a scalable alternative for automated classification, but their labeling rationale is opaque and can introduce systematic biases. This study compares subject labels generated by LLMs with human-assigned labels for over 9,000 ETDs across 21 academic categories to quantify their disagreement. We evaluate multiple prompt-based and fine-tuned LLM configurations and analyze areas of agreement and disagreement to identify patterns of misclassification. LLMs achieve competitive overall performance but frequently misclassify theoretical or interdisciplinary texts, often because they overweight lexical cues and disregard context. We show that such errors are not random but reflect structured semantic divergences from human interpretation. These findings suggest the need for hybrid frameworks that combine LLM scalability with human contextual judgment to improve subject labeling in academic repositories.
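The core comparison the abstract describes — measuring agreement between human- and LLM-assigned category labels and tallying where they diverge — can be sketched as follows. This is a minimal illustration, not the study's actual pipeline; the labels, category names, and the `agreement_report` helper are hypothetical.

```python
from collections import Counter

def agreement_report(human, llm):
    """Compare human- and LLM-assigned subject labels for a set of ETDs.

    Returns the overall agreement rate and a Counter of
    (human_label, llm_label) pairs for every disagreement,
    which exposes structured (non-random) confusion patterns.
    """
    assert len(human) == len(llm)
    matches = sum(h == m for h, m in zip(human, llm))
    disagreements = Counter(
        (h, m) for h, m in zip(human, llm) if h != m
    )
    return matches / len(human), disagreements

# Hypothetical labels for five ETDs (illustrative only).
human = ["Physics", "CS", "CS", "Biology", "Math"]
llm   = ["Physics", "CS", "Math", "Biology", "CS"]

rate, pairs = agreement_report(human, llm)
print(round(rate, 2))      # 0.6
print(pairs.most_common())
```

Ranking the disagreement pairs by frequency (via `most_common`) is one simple way to surface the systematic misclassifications the study reports, such as interdisciplinary work collapsing into a single adjacent category.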