AI Navigate

CINDI: Conditional Imputation and Noisy Data Integrity with Flows in Power Grid Data

arXiv cs.AI / 3/13/2026

📰 NewsIdeas & Deep AnalysisModels & Research

Key Points

  • CINDI is an unsupervised probabilistic framework that restores data integrity in noisy, multivariate time-series data, emphasizing critical infrastructure such as electrical power grids.
  • It combines anomaly detection and imputation into one end-to-end system built on conditional normalizing flows, enabling identification of low-probability segments and iterative, statistically consistent replacements.
  • By modeling the exact conditional likelihood, CINDI preserves underlying physical and statistical properties while reusing learned information to avoid discarding useful context.
  • The framework was evaluated on real-world grid loss data from a Norwegian power distribution operator, showing robust performance versus competitive baselines and scalability to noisy environments.
  • Although demonstrated on grid data, CINDI is designed to generalize to any multivariate time series domain.

Abstract

Real-world multivariate time series, particularly in critical infrastructure such as electrical power grids, are often corrupted by noise and anomalies that degrade the performance of downstream tasks. Standard data cleaning approaches often rely on disjoint strategies, which involve detecting errors with one model and imputing them with another. Such approaches can fail to capture the full joint distribution of the data and ignore prediction uncertainty. This work introduces Conditional Imputation and Noisy Data Integrity (CINDI), an unsupervised probabilistic framework designed to restore data integrity in complex time series. Unlike fragmented approaches, CINDI unifies anomaly detection and imputation into a single end-to-end system built on conditional normalizing flows. By modeling the exact conditional likelihood of the data, the framework identifies low-probability segments and iteratively samples statistically consistent replacements. This allows CINDI to efficiently reuse learned information while preserving the underlying physical and statistical properties of the system. We evaluate the framework using real-world grid loss data from a Norwegian power distribution operator, though the methodology is designed to generalize to any multivariate time series domain. The results demonstrate that CINDI yields robust performance compared to competitive baselines, offering a scalable solution for maintaining reliability in noisy environments.