Embedded Variational Neural Stochastic Differential Equations for Learning Heterogeneous Dynamics

arXiv cs.LG / 4/2/2026

💬 OpinionIdeas & Deep AnalysisModels & Research

Key Points

  • The paper proposes a Variational Neural Stochastic Differential Equation (V-NSDE) framework to model socioeconomic time-series data with both trends and stochastic fluctuations across multiple districts.
  • It combines a VAE-style encoder/decoder with a Neural SDE latent dynamics core, where an encoder produces an initial latent Gaussian distribution conditioned on district embeddings and observations.
  • The Neural SDE learns continuous-time drift and diffusion functions using neural networks that take time, latent state, and district embeddings as inputs, enabling heterogeneous dynamics per district.
  • Observations are reconstructed through a probabilistic decoder that outputs Gaussian likelihood parameters at each time step, trained via ELBO with an added KL-divergence regularization term.
  • Experimental results (on Odisha district data) indicate improved ability to capture complex temporal patterns and generate realistic trajectories showing both clear trends and random variations.

Abstract

This study examines the challenges of modeling complex and noisy data related to socioeconomic factors over time, with a focus on data from various districts in Odisha, India. Traditional time-series models struggle to capture both trends and variations together in this type of data. To tackle this, a Variational Neural Stochastic Differential Equation (V-NSDE) model is designed that combines the expressive dynamics of Neural SDEs with the generative capabilities of Variational Autoencoders (VAEs). This model uses an encoder and a decoder. The encoder takes the initial observations and district embeddings and translates them into a Gaussian distribution, which determines the mean and log-variance of the first latent state. Then the obtained latent state initiates the Neural SDE, which utilize neural networks to determine the drift and diffusion functions that govern continuous-time latent dynamics. These governing functions depend on the time index, latent state, and district embedding, which help the model learn the unique characteristics specific to each district. After that, using a probabilistic decoder, the observations are reconstructed from the latent trajectory. The decoder outputs a mean and log-variance for each time step, which follows the Gaussian likelihood. The Evidence Lower Bound (ELBO) training loss improves by adding a KL-divergence regularization term to the negative log-likelihood (nll). The obtained results demonstrate the effective learning of V-NSDE in recognizing complex patterns over time, yielding realistic outcomes that include clear trends and random fluctuations across different areas.