Diffusion Operator Geometry of Feedforward Representations

arXiv cs.LG / 5/5/2026

Key Points

  • The paper proposes a smooth, operator-theoretic framework to analyze the geometry of feedforward neural network representations, avoiding reliance on discrete neighborhood-graph curvature.
  • It models each feature cloud as inducing a Gaussian-kernel diffusion Markov operator and derives multiple observables (transport, spectral properties, label-boundary behavior, and local scales) using Bakry–Émery Γ-calculus.
  • In a balanced Gaussian class-conditional snapshot model with shared covariance, the population operator yields closed-form quantities such as class affinities, leakage, and coarse spectral structure, governed by pairwise regularized Mahalanobis separations.
  • The authors show these operator-based observables change smoothly under feature perturbations, whereas traditional neighborhood-graph diagnostics can shift discontinuously.
  • Experiments on synthetic Gaussian data and on learned MNIST representations demonstrate that the derived operator observables track training dynamics, network width, and perturbation stability.
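To make the central object concrete, here is a minimal sketch of how a feature cloud induces a Gaussian-kernel diffusion Markov operator: form the Gaussian kernel matrix over pairwise feature distances and row-normalize it into a stochastic matrix. This is a generic diffusion-maps-style construction, not the paper's exact implementation; the bandwidth `eps` and the synthetic data are illustrative assumptions.

```python
import numpy as np

def diffusion_operator(X, eps):
    """Row-stochastic Gaussian-kernel diffusion operator for a feature cloud X (n, d)."""
    # Pairwise squared Euclidean distances between feature vectors.
    sq = np.sum((X[:, None, :] - X[None, :, :]) ** 2, axis=-1)
    K = np.exp(-sq / eps)                      # Gaussian kernel matrix
    P = K / K.sum(axis=1, keepdims=True)       # normalize rows to get a Markov operator
    return P

# Illustrative feature cloud (not from the paper's experiments).
rng = np.random.default_rng(0)
X = rng.normal(size=(200, 5))
P = diffusion_operator(X, eps=1.0)

# Spectral observables come from the eigenvalues of P; since P is row-stochastic,
# its leading eigenvalue is 1 and the spectral gap measures mixing/contraction.
evals = np.sort(np.linalg.eigvals(P).real)[::-1]
```

Transport, spectral, and label-boundary observables in the paper are all read off from this single operator; the snippet shows only the spectral side.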

Abstract

Neural networks transform data through learned representations whose geometry affects separation, contraction, and generalization. Recent work studies this geometry using discrete curvature on neighborhood graphs, suggesting Ricci-flow-like behavior across layers. We develop a smooth operator-theoretic alternative for feedforward representation snapshots. Each feature cloud induces a Gaussian-kernel diffusion Markov operator, and transport, spectral, label-boundary, and local-scale observables are derived from this single object via Bakry–Émery Γ-calculus. In a balanced Gaussian class-conditional snapshot model with shared covariance, the population operator has closed-form class affinities, leakage, and coarse spectra, all controlled by pairwise regularized Mahalanobis separations c_ε^{(a,b)}. We also prove that the resulting operator observables vary smoothly under feature perturbations, while hard neighborhood-graph diagnostics can change discontinuously. Synthetic experiments validate the closed-form Gaussian bridge, while learned MNIST experiments show that the same operator observables track training, width, and perturbation stability. Together, these results give a stable operator-geometric framework for analyzing feedforward representation geometry.
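The separations c_ε^{(a,b)} that govern the closed-form Gaussian results are, per the abstract, regularized Mahalanobis distances between class means under the shared covariance. The paper's exact definition is not reproduced here; the sketch below shows one plausible form, where the kernel bandwidth ε regularizes the covariance before inverting it. The means, covariance, and the role of `eps` are assumptions for illustration.

```python
import numpy as np

def regularized_mahalanobis(mu_a, mu_b, Sigma, eps):
    """Squared Mahalanobis separation of two class means under a
    bandwidth-regularized shared covariance (assumed form, not the paper's)."""
    d = mu_a - mu_b
    M = Sigma + eps * np.eye(len(d))   # regularize the shared covariance
    return float(d @ np.linalg.solve(M, d))

mu_a = np.array([0.0, 0.0])
mu_b = np.array([2.0, 0.0])
Sigma = np.eye(2)
# With Sigma = I and eps = 1, M = 2I, so the separation is ||d||^2 / 2 = 4 / 2 = 2.
sep = regularized_mahalanobis(mu_a, mu_b, Sigma, eps=1.0)
```

As the bandwidth ε grows, the regularizer dominates and the separation shrinks toward zero, which matches the intuition that a wide diffusion kernel blurs class structure and increases leakage between classes.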