Bi-Lipschitz Autoencoder With Injectivity Guarantee

arXiv cs.LG / 4/9/2026

💬 OpinionSignals & Early TrendsIdeas & Deep AnalysisModels & Research

Key Points

  • The paper argues that encoder non-injectivity is a primary bottleneck in regularized autoencoders, causing poor convergence and distorted latent representations.
  • It formalizes “admissible regularization” and provides sufficient conditions to make regularization robust across varying data distributions.
  • The proposed Bi-Lipschitz Autoencoder (BLAE) adds an injective regularization scheme using a separation criterion to avoid pathological local minima.
  • BLAE also uses a bi-Lipschitz relaxation to better preserve manifold geometry and improve robustness under distribution drift.
  • Experiments across multiple datasets show BLAE outperforms prior methods in maintaining manifold structure, including under sampling sparsity and distribution shifts, with accompanying code released on GitHub.

Abstract

Autoencoders are widely used for dimensionality reduction, based on the assumption that high-dimensional data lies on low-dimensional manifolds. Regularized autoencoders aim to preserve manifold geometry during dimensionality reduction, but existing approaches often suffer from non-injective mappings and overly rigid constraints that limit their effectiveness and robustness. In this work, we identify encoder non-injectivity as a core bottleneck that leads to poor convergence and distorted latent representations. To ensure robustness across data distributions, we formalize the concept of admissible regularization and provide sufficient conditions for its satisfaction. In this work, we propose the Bi-Lipschitz Autoencoder (BLAE), which introduces two key innovations: (1) an injective regularization scheme based on a separation criterion to eliminate pathological local minima, and (2) a bi-Lipschitz relaxation that preserves geometry and exhibits robustness to data distribution drift. Empirical results on diverse datasets show that BLAE consistently outperforms existing methods in preserving manifold structure while remaining resilient to sampling sparsity and distribution shifts. Code is available at https://github.com/qipengz/BLAE.