Minimax Optimality and Spectral Routing for Majority-Vote Ensembles under Markov Dependence

arXiv cs.AI / 4/16/2026

💬 Opinion · Ideas & Deep Analysis · Models & Research

Key Points

  • The paper studies how majority-vote ensemble variance reduction breaks down when training data is not i.i.d. but instead follows discrete Markov dependence.
  • It provides an information-theoretic minimax lower bound for stationary, reversible, geometrically ergodic Markov chains in fixed dimension, showing that excess classification risk cannot be driven below \(\Omega(\sqrt{t_{\mathrm{mix}}/n})\).
  • For an AR(1) “witness” subclass, it proves that dependence-agnostic uniform bagging is provably suboptimal, with excess risk bounded below by \(\Omega(t_{\mathrm{mix}}/\sqrt{n})\), implying a \(\sqrt{t_{\mathrm{mix}}}\) algorithmic gap.
  • The authors introduce adaptive spectral routing, which partitions training data using the empirical Fiedler eigenvector of a dependency graph to achieve the minimax rate \(\mathcal{O}(\sqrt{t_{\mathrm{mix}}/n})\) on a graph-regular subclass without requiring knowledge of \(t_{\mathrm{mix}}\).
  • Experiments across synthetic Markov chains, spatial grids, UCR time-series datasets, and Atari DQN ensembles empirically support the theoretical rates, and the appendix discusses implications for deep RL variance reduction, scalable Nyström approximations, and bounded non-stationarity.
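The routing step described in the fourth bullet can be sketched in a few lines: build a similarity graph over the samples, take the eigenvector attached to the second-smallest Laplacian eigenvalue (the Fiedler vector), and split the data by its sign. This is a minimal illustration, not the paper's algorithm: the Gaussian-kernel dependency graph, the bandwidth parameter, and the sign-based two-way routing rule are all assumptions standing in for the paper's construction.

```python
import numpy as np

def spectral_route(X, bandwidth=1.0):
    """Partition samples into two blocks via the Fiedler eigenvector of a
    dependency graph. Sketch only: a dense Gaussian-kernel affinity is an
    assumed stand-in for the paper's dependency graph."""
    # Pairwise squared distances and kernel affinities.
    d2 = ((X[:, None, :] - X[None, :, :]) ** 2).sum(-1)
    W = np.exp(-d2 / (2.0 * bandwidth ** 2))
    np.fill_diagonal(W, 0.0)
    # Unnormalized graph Laplacian L = D - W.
    L = np.diag(W.sum(axis=1)) - W
    # eigh returns eigenvalues in ascending order; column 1 is the
    # eigenvector of the second-smallest eigenvalue (the Fiedler vector).
    _, vecs = np.linalg.eigh(L)
    fiedler = vecs[:, 1]
    # Route by sign: a relaxation of the minimum ratio cut of the graph.
    return fiedler >= 0
```

Each block would then be handed to a separate base learner, so that the strongest dependencies stay within a block rather than correlating ensemble members across blocks.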

Abstract

Majority-vote ensembles achieve variance reduction by averaging over diverse, approximately independent base learners. When training data exhibits Markov dependence, as in time-series forecasting, reinforcement learning (RL) replay buffers, and spatial grids, this classical guarantee degrades in ways that existing theory does not fully quantify. We provide a minimax characterization of this phenomenon for discrete classification in a fixed-dimensional Markov setting, together with an adaptive algorithm that matches the rate on a graph-regular subclass. We first establish an information-theoretic lower bound for stationary, reversible, geometrically ergodic chains in fixed ambient dimension, showing that no measurable estimator can achieve excess classification risk better than \(\Omega(\sqrt{t_{\mathrm{mix}}/n})\). We then prove that, on the AR(1) witness subclass underlying the lower-bound construction, dependence-agnostic uniform bagging is provably suboptimal, with excess risk bounded below by \(\Omega(t_{\mathrm{mix}}/\sqrt{n})\), exhibiting a \(\sqrt{t_{\mathrm{mix}}}\) algorithmic gap. Finally, we propose *adaptive spectral routing*, which partitions the training data via the empirical Fiedler eigenvector of a dependency graph and achieves the minimax rate \(\mathcal{O}(\sqrt{t_{\mathrm{mix}}/n})\), up to a lower-order geometric cut term, on a graph-regular subclass, without knowledge of \(t_{\mathrm{mix}}\). Experiments on synthetic Markov chains, 2D spatial grids, the 128-dataset UCR archive, and Atari DQN ensembles validate the theoretical predictions. Consequences for deep RL target variance, scalability via Nyström approximation, and bounded non-stationarity are developed as supporting material in the appendix.
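The role of the mixing time in the rates above comes down to effective sample size: \(n\) correlated samples carry roughly \(n/t_{\mathrm{mix}}\) samples' worth of information. A small Monte Carlo on an AR(1) chain (an assumption mirroring the paper's witness subclass, not its exact construction) makes this concrete: the variance of the sample mean inflates by roughly \((1+\rho)/(1-\rho)\) relative to the i.i.d. case, which is the degradation that dependence-agnostic averaging fails to account for.

```python
import numpy as np

def ar1_sample_mean_var(rho, n, reps=2000, seed=0):
    """Monte Carlo variance of the sample mean of a stationary AR(1) chain
    x_t = rho * x_{t-1} + sqrt(1 - rho^2) * eps_t  (unit marginal variance).
    Illustrative only: the AR(1) chain is an assumed stand-in for the
    paper's witness subclass."""
    rng = np.random.default_rng(seed)
    c = np.sqrt(1.0 - rho ** 2)
    x = rng.normal(size=reps)        # stationary start: unit variance
    eps = rng.normal(size=(reps, n))
    total = np.zeros(reps)
    for t in range(n):               # advance all replicas one step
        x = rho * x + c * eps[:, t]
        total += x
    return (total / n).var()

# For rho = 0.9 the asymptotic inflation factor is (1 + rho) / (1 - rho) = 19,
# so n = 400 dependent samples behave like roughly 400 / 19 ≈ 21 i.i.d. ones.
```

With `rho = 0.9` and `n = 400`, the measured ratio of `ar1_sample_mean_var(0.9, 400)` to `ar1_sample_mean_var(0.0, 400)` lands near the asymptotic factor of 19, the effective-sample-size shrinkage that the \(\sqrt{t_{\mathrm{mix}}/n}\) rate formalizes.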