Robust Smart Contract Vulnerability Detection via Contrastive Learning-Enhanced Granular-ball Training

arXiv cs.LG / 3/31/2026

💬 OpinionIdeas & Deep AnalysisModels & Research

共有:

Key Points

The paper addresses a key gap in smart contract vulnerability detection research: label noise in training data caused by reliance on open-source labeling tools.

Abstract

Deep neural networks (DNNs) have emerged as a prominent approach for detecting smart contract vulnerabilities, driven by the growing contract datasets and advanced deep learning techniques. However, DNNs typically require large-scale labeled datasets to model the relationships between contract features and vulnerability labels. In practice, the labeling process often depends on existing open-sourced tools, whose accuracy cannot be guaranteed. Consequently, label noise poses a significant challenge for the accuracy and robustness of the smart contract, which is rarely explored in the literature. To this end, we propose Contrastive learning-enhanced Granular-Ball smart Contracts training, CGBC, to enhance the robustness of contract vulnerability detection. Specifically, CGBC first introduces a Granular-ball computing layer between the encoder layer and the classifier layer, to group similar contracts into Granular-Balls (GBs) and generate new coarse-grained representations (i.e., the center and the label of GBs) for them, which can correct noisy labels based on the most correct samples. An inter-GB compactness loss and an intra-GB looseness loss are combined to enhance the effectiveness of clustering. Then, to improve the accuracy of GBs, we pretrain the model through unsupervised contrastive learning supported by our novel semantic-consistent smart contract augmentation method. This procedure can discriminate contracts with different labels by dragging the representation of similar contracts closer, assisting CGBC in clustering. Subsequently, we leverage the symmetric cross-entropy loss function to measure the model quality, which can combat the label noise in gradient computations. Finally, extensive experiments show that the proposed CGBC can significantly improve the robustness and effectiveness of the smart contract vulnerability detection when contrasted with baselines.

Anthropic's Accidental Release of Claude Code's Source Code: Irretrievable and Publicly Accessible

Dev.to

Claude Code's Compaction Engine: What the Source Code Actually Reveals

Dev.to

Part 1 - Why I Picked LangChain4j Over Spring AI

Dev.to

Big Tech firms are accelerating AI investments and integration, while regulators and companies focus on safety and responsible adoption.

Dev.to

A Vague Rumor Found Real 0-Days in Vim and Emacs. Here's Why It Worked.

Dev.to

Robust Smart Contract Vulnerability Detection via Contrastive Learning-Enhanced Granular-ball Training

Key Points

Abstract

Related Articles

Anthropic's Accidental Release of Claude Code's Source Code: Irretrievable and Publicly Accessible

Claude Code's Compaction Engine: What the Source Code Actually Reveals

Part 1 - Why I Picked LangChain4j Over Spring AI

Big Tech firms are accelerating AI investments and integration, while regulators and companies focus on safety and responsible adoption.

A Vague Rumor Found Real 0-Days in Vim and Emacs. Here's Why It Worked.

関連おすすめサービス

Notta搭載AI議事録イヤホン ZENCHORD1

AI搭載ボイスレコーダー Plaud

画像高画質化AIツール Aiarty Image Enhancer