AI Navigate

DSS-GAN: Directional State Space GAN with Mamba backbone for Class-Conditional Image Synthesis

arXiv cs.LG / 3/19/2026

📰 NewsModels & Research

Key Points

  • The paper introduces DSS-GAN, a GAN that uses a Mamba hierarchical generator backbone and a novel Directional Latent Routing (DLR) mechanism for noise-to-image synthesis.
  • DLR decomposes the latent vector into direction-specific subvectors, each jointly projected with a class embedding to produce feature-wise affine modulation across the Mamba backbone.
  • Unlike conventional global conditioning, DLR couples class identity and latent structure along distinct spatial axes of the feature map, consistently across all generative scales.
  • DSS-GAN reports improved FID, KID, and precision-recall scores versus StyleGAN2-ADA across multiple datasets, with latent-space analysis showing directionally specialized, structured image changes.

Abstract

We present DSS-GAN, the first generative adversarial network to employ Mamba as a hierarchical generator backbone for noise-to-image synthesis. The central contribution is Directional Latent Routing (DLR), a novel conditioning mechanism that decomposes the latent vector into direction-specific subvectors, each jointly projected with a class embedding to produce a feature-wise affine modulation of the corresponding Mamba scan. Unlike conventional class conditioning that injects a global signal, DLR couples class identity and latent structure along distinct spatial axes of the feature map, applied consistently across all generative scales. DSS-GAN achieves improved FID, KID, and precision-recall scores compared to StyleGAN2-ADA across multiple tested datasets. Analysis of the latent space reveals that directional subvectors exhibit measurable specialization: perturbations along individual components produce structured, direction-correlated changes in the synthesized image.