AI Navigate

Early Rug Pull Warning for BSC Meme Tokens via Multi-Granularity Wash-Trading Pattern Profiling

arXiv cs.AI / 3/17/2026

📰 NewsSignals & Early TrendsIdeas & Deep AnalysisModels & Research

Key Points

  • The paper proposes an end-to-end rug-pull warning framework for BSC meme tokens to address scarce anomalies, incomplete labels, and limited interpretability in DeFi risk detection.
  • It constructs 12 token-level features from three wash-trading patterns (Self, Matched, Circular) by integrating transaction-, address-, and flow-level signals into risk vectors for risk assessment.
  • In experiments on 7 tokens and 33,242 records, Random Forest outperformed Logistic Regression with AUC 0.9098, PR-AUC 0.9185, and F1 0.7429, with trade-level features driving performance and address-level features offering stable gains.
  • The framework demonstrates actionable early-warning potential with a mean lead time of 3.8133 hours and an error profile (FP=1, FN=8), characterizing it as a high-precision screener rather than a high-recall alarm engine, plus executable pipeline and validation contributions.

Abstract

The high-frequency issuance and short-cycle speculation of meme tokens in decentralized finance (DeFi) have significantly amplified rug-pull risk. Existing approaches still struggle to provide stable early warning under scarce anomalies, incomplete labels, and limited interpretability. To address this issue, an end-to-end warning framework is proposed for BSC meme tokens, consisting of four stages: dataset construction and labeling, wash-trading pattern feature modeling, risk prediction, and error analysis. Methodologically, 12 token-level behavioral features are constructed based on three wash-trading patterns (Self, Matched, and Circular), unifying transaction-, address-, and flow-level signals into risk vectors. Supervised models are then employed to output warning scores and alert decisions. Under the current setting (7 tokens, 33,242 records), Random Forest outperforms Logistic Regression on core metrics, achieving AUC=0.9098, PR-AUC=0.9185, and F1=0.7429. Ablation results show that trade-level features are the primary performance driver (Delta PR-AUC=-0.1843 when removed), while address-level features provide stable complementary gain (Delta PR-AUC=-0.0573). The model also demonstrates actionable early-warning potential for a subset of samples, with a mean Lead Time (v1) of 3.8133 hours. The error profile (FP=1, FN=8) indicates that the current system is better positioned as a high-precision screener rather than a high-recall automatic alarm engine. The main contributions are threefold: an executable and reproducible rug-pull warning pipeline, empirical validation of multi-granularity wash-trading features under weak supervision, and deployment-oriented evidence through lead-time and error-bound analysis.