Computer Science > Cryptography and Security

arXiv:2311.07553 (cs)

[Submitted on 13 Nov 2023 (v1), last revised 23 Nov 2023 (this version, v2)]

Title: An Extensive Study on Adversarial Attack against Pre-trained Models of Code

Authors: Xiaohu Du, Ming Wen, Zichao Wei, Shangwen Wang, Hai Jin

Abstract: Transformer-based pre-trained models of code (PTMC) have been widely utilized and have achieved state-of-the-art performance in many mission-critical applications. However, they can be vulnerable to adversarial attacks through identifier substitution or coding style transformation, which can significantly degrade accuracy and may further incur security concerns. Although several approaches have been proposed to generate adversarial examples for PTMC, the effectiveness and efficiency of such approaches, especially on different code intelligence tasks, have not been well understood. To bridge this gap, this study systematically analyzes five state-of-the-art adversarial attack approaches from three perspectives: effectiveness, efficiency, and the quality of generated examples. The results show that none of the five approaches balances all these perspectives. In particular, approaches with a high attack success rate tend to be time-consuming, and the adversarial code they generate often lacks naturalness, and vice versa. To address this limitation, we explore the impact of perturbing identifiers under different contexts and find that identifier substitution within for and if statements is the most effective. Based on these findings, we propose a new approach that prioritizes different types of statements for various tasks and further utilizes beam search to generate adversarial examples. Evaluation results show that it outperforms the state-of-the-art ALERT in terms of both effectiveness and efficiency while preserving the naturalness of the generated adversarial examples.
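
The idea of ranking identifiers by statement context and then searching substitutions with a beam can be illustrated with a minimal Python sketch. This is not the authors' implementation: the function names, the nesting-count priority rule, the regex-based renaming (which ignores strings and comments), and the `toy_score` stand-in for a victim model are all assumptions made for illustration only.

```python
import ast
import re
from typing import Callable, Dict, List, Tuple


def rank_identifiers(code: str) -> List[str]:
    """Order locally declared names so those used in for/if statements come first."""
    tree = ast.parse(code)
    priority: Dict[str, int] = {}
    # Only rename names the snippet itself declares (arguments and assigned variables).
    for node in ast.walk(tree):
        if isinstance(node, ast.arg):
            priority.setdefault(node.arg, 0)
        elif isinstance(node, ast.Name) and isinstance(node.ctx, ast.Store):
            priority.setdefault(node.id, 0)
    # Count occurrences inside for/if statements; nested statements count again.
    for node in ast.walk(tree):
        if isinstance(node, (ast.For, ast.If)):
            for child in ast.walk(node):
                if isinstance(child, ast.Name) and child.id in priority:
                    priority[child.id] += 1
    return sorted(priority, key=lambda name: (-priority[name], name))


def rename(code: str, old: str, new: str) -> str:
    """Whole-word textual rename (simplistic: also touches strings and comments)."""
    return re.sub(rf"\b{re.escape(old)}\b", new, code)


def beam_search_attack(
    code: str,
    candidates: List[str],
    score: Callable[[str], float],
    beam_width: int = 2,
) -> str:
    """Return the variant with the lowest score found by a beam search.

    `score` is a hypothetical callback: lower means the victim model is
    less confident in the correct label.
    """
    beam: List[Tuple[float, str]] = [(score(code), code)]
    for name in rank_identifiers(code):
        expanded = list(beam)
        for _, variant in beam:
            for new_name in candidates:
                # Skip substitutes that would collide with an existing name.
                if re.search(rf"\b{re.escape(new_name)}\b", variant):
                    continue
                renamed = rename(variant, name, new_name)
                expanded.append((score(renamed), renamed))
        expanded.sort(key=lambda pair: pair[0])
        beam = expanded[:beam_width]
    return beam[0][1]


SAMPLE = """def total_positive(values):
    total = 0
    for v in values:
        if v > 0:
            total += v
    return total
"""


def toy_score(code: str) -> float:
    """Stand-in for a victim model: counts original names still present."""
    names = ("v", "total", "values")
    return float(sum(bool(re.search(rf"\b{n}\b", code)) for n in names))


adversarial = beam_search_attack(SAMPLE, ["a", "b", "c"], toy_score)
```

In a real setting, `score` would query the model under attack and the candidate names would come from a substitute generator chosen to keep the code natural; both are outside the scope of this sketch.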

Comments:	Accepted to ESEC/FSE 2023
Subjects:	Cryptography and Security (cs.CR); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2311.07553 [cs.CR]
	(or arXiv:2311.07553v2 [cs.CR] for this version)
	https://doi.org/10.48550/arXiv.2311.07553

Submission history

From: Xiaohu Du
[v1] Mon, 13 Nov 2023 18:48:54 UTC (4,415 KB)
[v2] Thu, 23 Nov 2023 11:20:39 UTC (4,415 KB)