Computer Science > Software Engineering

arXiv:2311.04448 (cs)

[Submitted on 8 Nov 2023 (v1), last revised 2 Jul 2024 (this version, v3)]

Title:Inferring Resource-Oriented Intentions using LLMs for Static Resource Leak Detection

Authors:Chong Wang, Jianan Liu, Xin Peng, Yang Liu, Yiling Lou

Abstract:Resource leaks, caused by resources not being released after acquisition, often lead to performance issues and system crashes. Existing static detection techniques rely on mechanical matching of predefined resource acquisition/release APIs and null-checking conditions to find unreleased resources, suffering from both (1) false negatives caused by the incompleteness of predefined resource acquisition/release APIs and (2) false positives caused by the incompleteness of resource reachability validation identification.
To overcome these challenges, we propose InferROI, a novel approach that leverages the exceptional code comprehension capability of large language models (LLMs) to directly infer resource-oriented intentions (acquisition, release, and reachability validation) in code. InferROI first prompts the LLM to infer involved intentions for a given code snippet, and then incorporates a two-stage static analysis approach to check control-flow paths for resource leak detection based on the inferred intentions. We evaluate the effectiveness of InferROI in both resource-oriented intention inference and resource leak detection. Experimental results on the DroidLeaks and JLeaks datasets demonstrate InferROI achieves promising bug detection rate (59.3% and 64.8%) and false alarm rate (18.6% and 24.0%). Compared to three industrial static detectors, InferROI detects 14~45 and 167~503 more bugs in DroidLeaks and JLeaks, respectively. When applied to real-world open-source projects, InferROI identifies 26 unknown resource leak bugs, with 7 of them being confirmed by developers. Finally, manual annotation indicated that InferROI achieved a precision of 74.6% and a recall of 81.8% in intention inference, covering more than 60% resource types involved in the datasets. The results of an ablation study underscores the importance of combining LLM-based inference with static analysis.

Subjects:	Software Engineering (cs.SE)
Cite as:	arXiv:2311.04448 [cs.SE]
	(or arXiv:2311.04448v3 [cs.SE] for this version)
	https://doi.org/10.48550/arXiv.2311.04448

Submission history

From: Chong Wang [view email]
[v1] Wed, 8 Nov 2023 04:19:28 UTC (4,010 KB)
[v2] Fri, 22 Dec 2023 02:33:58 UTC (3,770 KB)
[v3] Tue, 2 Jul 2024 14:52:40 UTC (3,841 KB)

Computer Science > Software Engineering

Title:Inferring Resource-Oriented Intentions using LLMs for Static Resource Leak Detection

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Software Engineering

Title:Inferring Resource-Oriented Intentions using LLMs for Static Resource Leak Detection

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators