Performance Evaluation of Large Language Models for High-Performance Code Generation: A Multi-Agent Approach (MARCO)

dc.contributor.authorRahman, Asifen
dc.contributor.authorCvetkovic, Veljkoen
dc.contributor.authorReece, Kathleenen
dc.contributor.authorWalters, Aidanen
dc.contributor.authorHassan, Yasiren
dc.contributor.authorTummeti, Aneeshen
dc.contributor.authorTorres, Brianen
dc.contributor.authorCooney, Deniseen
dc.contributor.authorEllis, Margareten
dc.contributor.authorNikolopoulos, Dimitriosen
dc.date.accessioned2025-05-07T17:22:43Zen
dc.date.available2025-05-07T17:22:43Zen
dc.date.issued2025-05-07en
dc.description.abstractLarge language models (LLMs) have transformed software development through code generation capabilities, yet their effectiveness for high-performance computing (HPC) remains limited. HPC code requires specialized optimizations for parallelism, memory efficiency, and architecture-specific considerations that general-purpose LLMs often overlook. We present MARCO (Multi-Agent Reactive Code Optimizer), a novel framework that enhances LLM-generated code for HPC through a specialized multi-agent architecture. MARCO employs separate agents for code generation and performance evaluation, connected by a feedback loop that progressively refines optimizations. A key innovation is MARCO's web-search component that retrieves real-time optimization techniques from recent conference proceedings and research publications, bridging the knowledge gap in pre-trained LLMs. Our extensive evaluation on the LeetCode 75 problem set demonstrates that MARCO achieves a 14.6% average runtime reduction compared to Claude 3.5 Sonnet alone, while the integration of the web-search component yields a 30.9% performance improvement over the base MARCO system. These results highlight the potential of multi-agent systems to address the specialized requirements of high-performance code generation, offering a cost-effective alternative to domain-specific model fine-tuning.en
dc.description.versionSubmitted versionen
dc.format.extent9 page(s)en
dc.format.mimetypeapplication/pdfen
dc.identifier.orcidNikolopoulos, Dimitrios [0000-0003-0217-8307]en
dc.identifier.urihttps://hdl.handle.net/10919/129385en
dc.language.isoenen
dc.relation.ispartofPerformance Evaluation of Large Language Models for High-Performance Code Generation: A Multi-Agent Approach (MARCO)en
dc.relation.urihttps://www.cs.vt.edu/~dsnen
dc.relation.urihttp://arxiv.org/en
dc.rightsIn Copyrighten
dc.rights.urihttp://rightsstatements.org/vocab/InC/1.0/en
dc.titlePerformance Evaluation of Large Language Models for High-Performance Code Generation: A Multi-Agent Approach (MARCO)en
dc.typeArticleen
dc.type.dcmitypeTexten
pubs.organisational-groupVirginia Techen
pubs.organisational-groupVirginia Tech/Engineeringen
pubs.organisational-groupVirginia Tech/Engineering/Computer Scienceen
pubs.organisational-groupVirginia Tech/All T&R Facultyen
pubs.organisational-groupVirginia Tech/Engineering/COE T&R Facultyen

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
BURGS_LLM_HPC.pdf
Size:
482.19 KB
Format:
Adobe Portable Document Format
Description:
Submitted version
License bundle
Now showing 1 - 1 of 1
Name:
license.txt
Size:
1.5 KB
Format:
Plain Text
Description: