Photonic AI: A Hybrid Diffractive Holographic Neural System for Passive Optical Real-Time Image Classification

arXiv cs.LG / 4/20/2026

📰 NewsDeveloper Stack & InfrastructureModels & Research

Key Points

  • The paper argues that optical edge computing can reduce data-movement energy and latency by performing linear transformations through wave propagation, diffraction, and interference rather than clocked arithmetic.
  • It proposes a hybrid diffractive holographic neural architecture that combines a Diffractive Optical Neural Network (DONN) with a Holographic Interference-Based Learning (HIBL) operator to translate digitally optimized phase patterns into fabrication-compatible optical elements.
  • The authors formalize the inference pipeline as a composition of encoding, phase modulation, free-space propagation, and intensity measurement operators, clarifying which components are learned versus fixed and where nonlinearity arises from photodetection.
  • In physics-informed simulations on MNIST, a three-layer system using ~25,000 phase elements reportedly reaches 91.2% test accuracy with propagation-limited nanosecond-scale latency.
  • The main contribution is presented as a rigorous operator-theoretic framework bridging the gap between learning an optical transformation and realizing it in passive, physically fabricated optics.

Abstract

Edge intelligence is constrained by the energy and latency costs of shuttling data through electronic memory hierarchies. Optical systems offer a fundamentally different computational regime: once an input wavefront is launched into a structured medium, propagation, diffraction, and interference jointly enact a linear transformation whose cost is determined by wave physics rather than by clocked arithmetic. This paper develops a rigorous systems-level treatment of that regime and introduces a hybrid diffractive holographic architecture for image classification. The proposed model couples a Diffractive Optical Neural Network (DONN) with a Holographic Interference-Based Learning (HIBL) operator a formal map from digitally optimized phase distributions to physically realizable, fabrication-compatible interference patterns embeddable in passive optical elements. We express the full inference pipeline as a composition of encoding, phase modulation, free-space propagation, and intensity measurement operators, making explicit which quantities are learned, which are fixed by design, and where nonlinearity enters through photodetection. This operator-theoretic view resolves a persistent gap in the optical-ML literature between learning a transformation and physically realizing it. In physics-informed simulation on MNIST, a three-layer system with approximately 25,000 phase elements achieves 91.2% test accuracy with propagation-limited nanosecond-scale latency. The primary contribution is not a performance claim but a precise computational framework: learned representations can be physically embedded into structured optical media so that inference is executed by wavefront transformation through a passive, fabricated object rather than by sequential electronic multiply accumulate operations.