Evaluating Large Language Models for Real World Vulnerability Repair in C/C++ Code

Lan Zhang; Qingtian Zou; Anoop Singhal; Xiaoyan Sun; Peng Liu

Official websites use .gov
A .gov website belongs to an official government organization in the United States.

Secure .gov websites use HTTPS
A lock ( ) or https:// means you’ve safely connected to the .gov website. Share sensitive information only on official, secure websites.

PUBLICATIONS

Evaluating Large Language Models for Real World Vulnerability Repair in C/C++ Code

Published

June 19, 2024

Author(s)

Lan Zhang, Qingtian Zou, Anoop Singhal, Xiaoyan Sun, Peng Liu

Abstract

The advent of Large Language Models (LLMs) has enabled advancement in automated code generation, translation, and summarization. Despite their promise, evaluating the use of LLMs in repairing real-world code vulnerabilities remains underexplored. In this study, we address this gap by evaluating the capability of advanced LLMs, such as ChatGPT-4 and Claude, in fixing memory corruption vulnerabilities in real-world C/C++ code. We meticulously curated 223 real-world C/C++ code snippets encompassing a spectrum of memory corruption vulnerabilities, ranging from straightforward memory leaks to intricate buffer errors. Our findings demonstrate the proficiency of LLMs in rectifying simple memory errors like leaks, where fixes are confined to localized code segments. However, their effectiveness diminishes when addressing complicated vulnerabilities necessitating reasoning about cross-cutting concerns and deeper program semantics. Furthermore, we explore techniques for augmenting LLM performance by incorporating additional knowledge. Our results shed light on both the strengths and limitations of LLMs in automated program repair on genuine code, underscoring the need for advancements in reasoning abilities for handling complex code repair tasks.

Proceedings Title

IWSPA 2024: Proceedings of the 10th ACM International Workshop on Security and Privacy Analytics

Conference Dates

June 21, 2024

Conference Location

PORTO, PT

Conference Title

CODASPY 2024: Fourteenth ACM Conference on Data and Application Security and Privacy

Pub Type

Conferences

Download Paper

https://doi.org/10.1145/3643651.3659892

Local Download

Keywords

Large Language Models. Program Repair, Deep Learning

Information technology

Citation

Zhang, L. , Zou, Q. , Singhal, A. , Sun, X. and Liu, P. (2024), Evaluating Large Language Models for Real World Vulnerability Repair in C/C++ Code, IWSPA 2024: Proceedings of the 10th ACM International Workshop on Security and Privacy Analytics, PORTO, PT, [online], https://doi.org/10.1145/3643651.3659892, https://tsapps.nist.gov/publication/get_pdf.cfm?pub_id=957853 (Accessed April 26, 2025)

Issues

If you have any questions about this publication or are having problems accessing it, please contact reflib@nist.gov.

Created June 19, 2024, Updated August 7, 2024

Evaluating Large Language Models for Real World Vulnerability Repair in C/C++ Code

Author(s)

Abstract

Download Paper

Keywords

Citation

Additional citation formats

Issues