Synthetic data in cryptocurrencies using generative models

arXiv cs.LG / 4/20/2026

📰 NewsIdeas & Deep AnalysisModels & Research

Key Points

  • The paper proposes generating synthetic cryptocurrency price time-series data using deep learning to avoid privacy risks and access restrictions associated with real financial data.
  • It uses a Conditional Generative Adversarial Network (CGAN) with an LSTM-style recurrent generator and an MLP discriminator to create synthetic data that is statistically consistent.
  • Experiments across multiple crypto assets show the method can reproduce meaningful temporal patterns, including market trends and underlying dynamics.
  • The authors argue that GAN-based synthetic series can serve as an efficient alternative for simulating financial data, supporting tasks like market behavior analysis and anomaly detection while using less computational cost than more complex generative approaches.

Abstract

Data plays a fundamental role in consolidating markets, services, and products in the digital financial ecosystem. However, the use of real data, especially in the financial context, can lead to privacy risks and access restrictions, affecting institutions, research, and modeling processes. Although not all financial datasets present such limitations, this work proposes the use of deep learning techniques for generating synthetic data applied to cryptocurrency price time series. The approach is based on Conditional Generative Adversarial Networks (CGANs), combining an LSTM-type recurrent generator and an MLP discriminator to produce statistically consistent synthetic data. The experiments consider different crypto-assets and demonstrate that the model is capable of reproducing relevant temporal patterns, preserving market trends and dynamics. The generation of synthetic series through GANs is an efficient alternative for simulating financial data, showing potential for applications such as market behavior analysis and anomaly detection, with lower computational cost compared to more complex generative approaches.

Synthetic data in cryptocurrencies using generative models | AI Navigate