Synthetic Tabular Generators Fail to Preserve Behavioral Fraud Patterns: A Benchmark on Temporal, Velocity, and Multi-Account Signals
arXiv cs.LG / 4/16/2026
📰 NewsIdeas & Deep AnalysisTools & Practical UsageModels & Research
Key Points
- The paper proposes “behavioral fidelity” as a new evaluation dimension for synthetic tabular data, focusing on whether generators preserve temporal, sequential, and structural fraud signals used in real detection systems.
- It defines four behavioral fraud pattern types (P1–P4) including inter-event timing, burst structure, multi-account graph motifs, and velocity-rule trigger rates, along with a degradation-ratio metric calibrated to a real-data noise floor.
- The authors prove that row-independent synthetic generators cannot reproduce multi-account graph motifs (P3) and yield non-positive within-entity inter-event-time autocorrelation, implying core burst/fraud fingerprints are unattainable regardless of model architecture or data size.
- Benchmarks on IEEE-CIS Fraud Detection and the Amazon Fraud Dataset show multiple popular generators (CTGAN, TVAE, GaussianCopula, TabularARGN) fail badly, with degradation ratios up to ~39x on IEEE-CIS and 81.6–99.7x for row-independent methods on Amazon, while TabularARGN performs better (17.2x) but still degrades substantially.
- The work releases an open-source evaluation framework and claims the P1–P4 behavioral-pattern framework generalizes to other domains with entity-level sequential tabular data (e.g., healthcare and network security).
Related Articles

Black Hat USA
AI Business

Black Hat Asia
AI Business
The AI Hype Cycle Is Lying to You About What to Learn
Dev.to
Big Tech firms are accelerating AI investments and integration, while regulators and companies focus on safety and responsible adoption.
Dev.to
OpenAI Codex April 2026 Update Review: Computer Use, Memory & 90+ Plugins — Is the Hype Real?
Dev.to