Assessing ChatGPT's Code Generation Capabilities with Short vs Long Context Programming Problems

Shuvo, Uddip Acharjee; Dip, Sajib Acharjee; Vaskar, Nirvar Roy; Al Islam, A. B. M. Alim

Assessing ChatGPT's Code Generation Capabilities with Short vs Long Context Programming Problems

dc.contributor.author	Shuvo, Uddip Acharjee	en
dc.contributor.author	Dip, Sajib Acharjee	en
dc.contributor.author	Vaskar, Nirvar Roy	en
dc.contributor.author	Al Islam, A. B. M. Alim	en
dc.date.accessioned	2025-02-06T18:35:39Z	en
dc.date.available	2025-02-06T18:35:39Z	en
dc.date.issued	2024-12-19	en
dc.date.updated	2025-02-01T09:07:07Z	en
dc.description.abstract	This study assesses the code generation capabilities of ChatGPT using competitive programming problems from platforms such as LeetCode, HackerRank, and UVa Online Judge. In a novel approach, we contrast ChatGPT’s performance on concise problems from LeetCode against more complex, narrative-driven problems from Codeforces. Our results reveal significant challenges in addressing the intricate narrative structures of Codeforces, with difficulties in problem recognition and strategic planning in extended contexts. While initial code accuracy for LeetCode problems stands at 72%, it drops to 31% for complex Codeforces problems using Python. Additionally, we explore the impact of targeted instructions aimed at enhancing performance, which increased LeetCode accuracy to 73.53% but saw a decrease in Codeforces performance to 29%. Our analysis further extends across multiple programming languages, examining if iterative prompting and specific feedback can enhance code precision and efficiency. We also delve into ChatGPT’s performance on challenging problems and those released post its training period. This research provides insights into the strengths and weaknesses of AI in code generation and lays groundwork for future developments in AI-driven coding tools.	en
dc.description.version	Published version	en
dc.format.mimetype	application/pdf	en
dc.identifier.doi	https://doi.org/10.1145/3704522.3704535	en
dc.identifier.uri	https://hdl.handle.net/10919/124515	en
dc.language.iso	en	en
dc.publisher	ACM	en
dc.relation.ispartof	NSysS '24: Proceedings of the 11th International Conference on Networking, Systems, and Security	en
dc.rights	Creative Commons Attribution 4.0 International	en
dc.rights.holder	The author(s)	en
dc.rights.uri	http://creativecommons.org/licenses/by/4.0/	en
dc.title	Assessing ChatGPT's Code Generation Capabilities with Short vs Long Context Programming Problems	en
dc.type	Article - Refereed	en
dc.type.dcmitype	Text	en

Files

Original bundle

Now showing 1 - 1 of 1

Name:: 3704522.3704535.pdf
Size:: 1.36 MB
Format:: Adobe Portable Document Format
Description:: Published version

Download

License bundle

Now showing 1 - 1 of 1

Name:: license.txt
Size:: 1.5 KB
Format:: Item-specific license agreed upon to submission
Description:

Download

Collections

Journal Articles, Association for Computing Machinery (ACM)
Scholarly Works, Computer Science