No More DeLuLu: Physics-Inspired Kernel Networks for Geometrically-Grounded Neural Computation

arXiv cs.LG / 3/16/2026

Key Points

  • The paper introduces the yat-product, a physics-inspired kernel operator that combines quadratic alignment with inverse-square proximity and is proven to be a Mercer kernel with a unique RKHS embedding (see the sketch after this list).
  • Neural Matter Networks use the yat-product as the sole non-linearity, shifting normalization into the kernel and replacing conventional activation-normalization blocks with a geometrically grounded operation.
  • Empirically, NMN-based classifiers match linear baselines on MNIST while showing bounded prototype evolution and superposition robustness; Aether-GPT2, built from yat-based attention and MLP blocks, achieves lower validation loss than GPT-2 at a comparable parameter budget.
  • The framework is positioned as unifying kernel learning, gradient stability, and information geometry, establishing NMNs as a principled alternative to conventional neural architectures.
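
Reading the abstract's description literally, the yat-product plausibly pairs a squared dot product (quadratic alignment) with an inverse squared distance (inverse-square proximity). The NumPy sketch below illustrates that reading; it is not the paper's reference implementation, and the function name `yat_product` and the `eps` stabilizer are our assumptions.

```python
import numpy as np

def yat_product(w, x, eps=1e-6):
    """Hypothetical yat-product sketch: quadratic alignment <w, x>^2
    scaled by inverse-square proximity 1 / (||w - x||^2 + eps).

    The exact definition is in the paper; this follows the abstract's
    description, with eps as an assumed numerical stabilizer."""
    alignment = np.dot(w, x) ** 2           # quadratic alignment term
    proximity = np.sum((w - x) ** 2) + eps  # squared distance + stabilizer
    return alignment / proximity

# Toy check: the kernel peaks when x is both aligned with and close to w.
w = np.array([1.0, 0.5])
print(yat_product(w, w + 1e-3))  # large: near-coincident, aligned
print(yat_product(w, -w))        # smaller: aligned direction but distant
```

Note how the distance term in the denominator does the work usually delegated to a normalization layer: as inputs drift far from the prototype, the response decays on its own.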

Abstract

We introduce the yat-product, a kernel operator combining quadratic alignment with inverse-square proximity. We prove it is a Mercer kernel, analytic, Lipschitz on bounded domains, and self-regularizing, admitting a unique RKHS embedding. Neural Matter Networks (NMNs) use the yat-product as the sole non-linearity, replacing conventional linear-activation-normalization blocks with a single geometrically grounded operation. This architectural simplification preserves universal approximation while shifting normalization into the kernel itself via the denominator, rather than relying on separate normalization layers. Empirically, NMN-based classifiers match linear baselines on MNIST while exhibiting bounded prototype evolution and superposition robustness. In language modeling, Aether-GPT2 achieves lower validation loss than GPT-2 with a comparable parameter budget while using yat-based attention and MLP blocks. Our framework unifies kernel learning, gradient stability, and information geometry, establishing NMNs as a principled alternative to conventional neural architectures.
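
To make the "sole non-linearity" claim concrete, here is a hedged sketch of what an NMN-style layer could look like under the same assumed kernel form: each output unit scores the input against one prototype row of the weight matrix, and no activation or normalization layer follows, since the distance denominator self-regularizes. The layer shape, names, and batched distance computation are our assumptions, not the paper's architecture.

```python
import numpy as np

def yat_layer(X, W, eps=1e-6):
    """Sketch of an NMN-style layer: each output unit is the yat-product
    between the input and one prototype (a row of W). No activation or
    normalization follows; the ||w - x||^2 denominator plays that role.

    X: (batch, d) inputs, W: (units, d) prototypes -> (batch, units)."""
    align = (X @ W.T) ** 2  # quadratic alignment <w, x>^2 per unit
    # Pairwise squared distances via ||w||^2 - 2<w, x> + ||x||^2
    sq_dist = (
        (W ** 2).sum(axis=1)[None, :]
        - 2.0 * (X @ W.T)
        + (X ** 2).sum(axis=1)[:, None]
    )
    return align / (sq_dist + eps)

rng = np.random.default_rng(0)
X = rng.normal(size=(4, 8))   # batch of 4 inputs
W = rng.normal(size=(16, 8))  # 16 prototype units
print(yat_layer(X, W).shape)  # (4, 16)
```

In a conventional stack this block would be followed by an activation and a normalization layer; dropping both is the "architectural simplification" the abstract refers to.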