Transforming Free-Form Sentences into Sequence of Unambiguous Sentences with Large Language Model

dc.contributor.authorYeole, Nikita Kiranen
dc.contributor.committeechairHuang, Lifuen
dc.contributor.committeechairHsiao, Michael S.en
dc.contributor.committeememberViswanath, Bimalen
dc.contributor.departmentComputer Science and#38; Applicationsen
dc.date.accessioned2024-12-18T09:00:11Zen
dc.date.available2024-12-18T09:00:11Zen
dc.date.issued2024-12-17en
dc.description.abstractIn the realm of natural language programming, translating free-form sentences in natural language into a functional, machine-executable program remains difficult due to the following 4 challenges. First, the inherent ambiguity of natural languages. Second, the high-level verbose nature in user descriptions. Third, the complexity in the sentences and Fourth, the invalid or semantically unclear sentences. Our first solution is a Large Language Model (LLM) based Artificial Intelligence driven assistant to process free-form sentences and decompose them into sequences of simplified, unambiguous sentences that abide by a set of rules, thereby stripping away the complexities embedded within the original sentences. These resulting sentences are then used to generate the code. We applied the proposed approach to a set of free-form sentences written by middle-school students for describing the logic behind video games. More than 60% of the free-form sentences containing these problems were sufficiently converted to sequences of simple unambiguous object-oriented sentences by our approach. Next, the thesis also presents "IntentGuide," a neuro-symbolic integration framework to enhance the clarity and executability of human intentions expressed in freeform sentences. IntentGuide effectively integrates the rule-based error detection capabilities of symbolic AI with the powerful adaptive learning abilities of Large Language Model to convert ambiguous or complex sentences into clear, machine-understandable instructions. The empirical evaluation of IntentGuide performed on natural language sentences written by middle school students for designing video games, reveals a significant improvement in error correction and code generation abilities compared to previous approach, attaining an accuracy rate of 90%.en
dc.description.abstractgeneralImagine if you could talk to machines in everyday language and they could understand exactly what you meant, turning your words into programs that do exactly what you describe. That's the goal of the thesis. We've developed a system that helps machines make sense of the kind of free-form language that people, especially students, use when they describe what they want a computer to do. Understanding and converting everyday language into computer code is a complex challenge, primarily because the way we naturally speak can be vague, overly detailed, or just complex. This thesis presents a new tool using artificial intelligence that helps break down and simplify these sentences. By transforming them into clearer, rulefollowing instructions, this tool makes it easier for machines to understand and execute the tasks we describe. The technology was tested using descriptions from middle-school students on how video games should work. Over 60% of these complex or unclear descriptions were sufficiently converted into straightforward instructions that a machine could use. Additionally, a new system called "IntentGuide" was introduced, combining traditional AI methods with advanced language models to improve how effectively machines can interpret and act on human instructions. This improved system showed a 90% accuracy in understanding and correcting errors in the students' game descriptions, marking a significant step forward in helping computers better understand us.en
dc.description.degreeMaster of Scienceen
dc.format.mediumETDen
dc.identifier.othervt_gsexam:42030en
dc.identifier.urihttps://hdl.handle.net/10919/123829en
dc.language.isoenen
dc.publisherVirginia Techen
dc.rightsIn Copyrighten
dc.rights.urihttp://rightsstatements.org/vocab/InC/1.0/en
dc.subjectNatural language programmingen
dc.subjectdecompositionen
dc.subjectNatural language processingen
dc.subjectneuro-symbolicen
dc.titleTransforming Free-Form Sentences into Sequence of Unambiguous Sentences with Large Language Modelen
dc.typeThesisen
thesis.degree.disciplineComputer Science & Applicationsen
thesis.degree.grantorVirginia Polytechnic Institute and State Universityen
thesis.degree.levelmastersen
thesis.degree.nameMaster of Scienceen

Files

Original bundle
Now showing 1 - 3 of 3
Loading...
Thumbnail Image
Name:
Yeole_NK_T_2024.pdf
Size:
672.92 KB
Format:
Adobe Portable Document Format
Loading...
Thumbnail Image
Name:
Yeole_NK_T_2024_support_4.pdf
Size:
105.57 KB
Format:
Adobe Portable Document Format
Description:
Supporting documents
Loading...
Thumbnail Image
Name:
Yeole_NK_T_2024_support_5.pdf
Size:
127.79 KB
Format:
Adobe Portable Document Format
Description:
Supporting documents

Collections