Beyond a Single Signal: SPECTREG2, A Unified MultiExpert Anomaly Detector for Unknown Unknowns

arXiv cs.LG / 3/24/2026

💬 OpinionSignals & Early TrendsIdeas & Deep AnalysisModels & Research

Key Points

  • The paper introduces SPECTRE-G2, a unified multi-signal anomaly detector designed to identify “unknown unknowns” by recognizing when ML systems operate beyond what they know.
  • Instead of relying on a single uncertainty or density metric, SPECTRE-G2 uses a dual-backbone neural network to generate eight complementary signals spanning density, geometry, uncertainty, discriminative, and causal information.
  • The method normalizes each signal with validation statistics and calibrates them using synthetic out-of-distribution data to improve robustness and reliability.
  • An adaptive top-k fusion mechanism selects the most informative subset of signals and averages their anomaly scores for final detection.
  • Experiments across synthetic, Adult, CIFAR-10, and Gridworld benchmarks show strong AUROC/AUPR/FPR95 results and stability across random seeds, especially for detecting new variables and confounders.

Abstract

Epistemic intelligence requires machine learning systems to recognise the limits of their own knowledge and act safely under uncertainty, especially when faced with unknown unknowns. Existing uncertainty quantification methods rely on a single signal such as confidence or density and fail to detect diverse structural anomalies. We introduce SPECTRE-G2, a multi-signal anomaly detector that combines eight complementary signals from a dual-backbone neural network. The architecture includes a spectral normalised Gaussianization encoder, a plain MLP preserving feature geometry, and an ensemble of five models. These produce density, geometry, uncertainty, discriminative, and causal signals. Each signal is normalised using validation statistics and calibrated with synthetic out-of-distribution data. An adaptive top-k fusion selects the most informative signals and averages their scores. Experiments on synthetic, Adult, CIFAR-10, and Gridworld datasets show strong performance across diverse anomaly types, outperforming multiple baselines on AUROC, AUPR, and FPR95. The model is stable across seeds and particularly effective for detecting new variables and confounders. SPECTRE-G2 provides a practical approach for detecting unknown unknowns in open-world settings.