Fun-TSG: A Function-Driven Multivariate Time Series Generator with Variable-Level Anomaly Labeling

arXiv cs.AI / 4/17/2026

📰 NewsDeveloper Stack & InfrastructureSignals & Early TrendsModels & Research

Key Points

  • The paper introduces Fun-TSG, a customizable multivariate time series generator aimed at improving reliable evaluation of anomaly detection methods.
  • Existing benchmarks are criticized for lacking fine-grained (variable- and time-level) anomaly annotations and for not exposing explicit inter-variable and temporal dependencies or the data’s generative mechanisms.
  • Fun-TSG enables anomaly dataset creation in two ways: fully automated generation using randomly sampled dependency structures and anomaly types, and manual generation via user-specified equations and anomaly configurations.
  • The generator provides transparent data-generation control and returns ground-truth anomaly labels at both the variable and timestamp levels, supporting interpretable and variable-specific benchmarking.
  • The tool is positioned to produce diverse, interpretable, and reproducible benchmarking scenarios to support fine-grained performance comparisons of both classical and modern anomaly detection models.

Abstract

Reliable evaluation of anomaly detection methods in multivariate time series remains an open challenge, largely due to the limitations of existing benchmark datasets. Current resources often lack fine-grained anomaly annotations, do not provide explicit intervariable and temporal dependencies, and offer little insight into the underlying generative mechanisms. These shortcomings hinder the development and rigorous comparison of detection models, especially those targeting interpretable and variable-specific outputs. To address this gap, we introduce Fun-TSG, a fully customizable time series generator designed to support high-quality evaluation of anomaly detection systems. Our tool enables both fully automated generation, based on randomly sampled dependency structures and anomaly types, and manual generation through user-defined equations and anomaly configurations. In both cases, it provides full transparency over the data generation process, including access to ground-truth anomaly labels at the variable and timestamp levels. Fun-TSG supports the creation of diverse, interpretable, and reproducible benchmarking scenarios, enabling fine-grained performance analysis for both classical and modern anomaly detection models.