Application of a Mixture of Experts-based Foundation Model to the GlueX DIRC Detector

arXiv cs.LG / 4/29/2026

📰 NewsDeveloper Stack & InfrastructureModels & Research

Key Points

  • The paper reports a Mixture-of-Experts-based foundation model for the GlueX DIRC Cherenkov detector at Jefferson Lab, used for fast simulation, particle identification, and hit-level noise filtering.
  • It uses one shared transformer backbone across tasks, replacing fragmented task-specific pipelines while achieving competitive and sometimes superior performance versus established approaches.
  • The model generates hit-by-hit outputs autoregressively from low-level detector inputs, using split spatial/temporal vocabularies and continuous kinematic conditioning.
  • The Mixture-of-Experts design enables class-conditional generation of pions and kaons, and the method is benchmarked across the full GlueX DIRC kinematic range without architectural changes.
  • The authors argue this foundation-model framework could serve as a practical, scalable alternative to the current collection of task-specific models proposed for GlueX DIRC analyses.

Abstract

We present a Mixture-of-Experts-based foundation model applied to the GlueX DIRC detector at Jefferson Lab, demonstrating its utility as a unified framework for fast simulation, particle identification, and hit-level noise filtering of Cherenkov photons. By leveraging a single shared transformer backbone across all tasks, the approach eliminates the fragmentation of task-specific pipelines while maintaining competitive-and in several cases superior-performance relative to established methods. The model operates directly on low-level detector inputs, performing hit-by-hit autoregressive generation over split spatial and temporal vocabularies with continuous kinematic conditioning, and supports class-conditional generation of pions and kaons through its Mixture-of-Experts architecture. We benchmark against the standard geometrical reconstruction and prior deep learning methods across the full kinematic phase space of the GlueX DIRC, demonstrating that the foundation model framework transfers effectively to this detector without architectural modification. This work positions the foundation model as a practical and scalable alternative to the suite of task-specific models currently proposed for GlueX DIRC analysis.