T-LARA: Leveraging Reasoning LLMs to Enhance the Adversarial Robustness of Tabular Security Classifiers
| dc.contributor.author | Detter, Brianna Jennelle | en |
| dc.contributor.committeechair | Viswanath, Bimal | en |
| dc.contributor.committeemember | Gao, Peng | en |
| dc.contributor.committeemember | Hoang, Thang | en |
| dc.contributor.department | Computer Science and Applications | en |
| dc.date.accessioned | 2026-06-04T08:00:46Z | en |
| dc.date.available | 2026-06-04T08:00:46Z | en |
| dc.date.issued | 2026-06-03 | en |
| dc.description.abstract | Security classifiers in many critical network and application tasks rely on tabular data, yet evaluating their adversarial robustness is complicated by the unique constraints of the tabular domain. Unlike homogeneous image or text data, tabular features possess distinct data types, bounds, intricate interdependencies, and require a highly specific perturbation cost model. Traditionally, developing these feature-level cost models requires extensive manual effort and domain expertise, while reliance on gradient-based computation limits existing methods to specific, differentiable classifier families. In this thesis, we examine whether reasoning-based Large Language Models (LLMs) can circumvent these limitations to produce a classifier-agnostic method for creating realistic, low-cost adversarial examples that adhere to domain constraints. We propose T-LARA, an agentic framework designed to evaluate the adversarial robustness of tabular classifiers. T-LARA automatically constructs a feature-level cost model via LLM reasoning augmented with web-based knowledge acquisition. Driven by this cost model, a Generator agent crafts constraint-preserving adversarial samples, while a Critic agent provides iterative, reasoning-based feedback to refine their effectiveness and quality. Ultimately, this framework demonstrates the potential of reasoning LLMs to drastically reduce the human supervision and domain expertise traditionally required to audit the security of tabular classification systems. | en |
| dc.description.abstractgeneral | This thesis presents T-LARA, an automated system for testing how vulnerable machine learning models are to carefully designed inputs. The system focuses on tabular data, which is commonly used in real-world applications such as fraud detection and social media analysis. T-LARA uses large language models (LLMs) to guide the process of finding weaknesses in these systems without requiring extensive manual effort. It does this by generating and refining input changes while ensuring they remain realistic and consistent with real-world constraints. The framework is evaluated on multiple classification models to measure how effectively it can cause misclassifications while still producing valid and plausible data. Overall, this work explores how LLM-based systems can be used to better understand and test the security of machine learning models. | en |
| dc.description.degree | Master of Science | en |
| dc.format.medium | ETD | en |
| dc.identifier.other | vt_gsexam:46766 | en |
| dc.identifier.uri | https://hdl.handle.net/10919/143244 | en |
| dc.language.iso | en | en |
| dc.publisher | Virginia Tech | en |
| dc.rights | In Copyright | en |
| dc.rights.uri | http://rightsstatements.org/vocab/InC/1.0/ | en |
| dc.subject | Agents | en |
| dc.subject | Tabular Data | en |
| dc.subject | AI | en |
| dc.subject | Security | en |
| dc.title | T-LARA: Leveraging Reasoning LLMs to Enhance the Adversarial Robustness of Tabular Security Classifiers | en |
| dc.type | Thesis | en |
| thesis.degree.discipline | Computer Science & Applications | en |
| thesis.degree.grantor | Virginia Polytechnic Institute and State University | en |
| thesis.degree.level | masters | en |
| thesis.degree.name | Master of Science | en |
Files
Original bundle
1 - 1 of 1