When Alpha Breaks: Two-Level Uncertainty for Safe Deployment of Cross-Sectional Stock Rankers
arXiv cs.AI / 3/17/2026
💬 OpinionSignals & Early TrendsIdeas & Deep AnalysisIndustry & Market MovesModels & Research
Key Points
- Under non-stationarity, cross-sectional rankers can fail during regime shifts, motivating deployment decisions beyond mere point predictions.
- The AI Stock Forecaster uses a LightGBM ranker that performs well at a 20-day horizon, but the 2024 holdout coincided with an AI thematic rally and sector rotation that weakened signals at longer horizons.
- The authors adapt Direct Epistemic Uncertainty Prediction (DEUP) to ranking by predicting rank displacement and defining an epistemic uncertainty signal ehat relative to a PIT-safe baseline.
- They find that ehat is structurally coupled with signal strength (median correlation about 0.6 across 1,865 dates), so inverse-uncertainty sizing can dampen strong signals and degrade performance.
- A two-level deployment policy is proposed: a strategy-level regime-trust gate G(t) deciding whether to trade (AUROC around 0.72 overall, ~0.75 in FINAL) and a position-level epistemic tail-risk cap that reduces exposure for the most uncertain predictions; with operational rules (trade only when G(t) ≥ 0.2, apply volatility sizing on active dates, cap the top epistemic tail), this improves risk-adjusted performance and suggests DEUP mainly serves as a tail-risk guard.
Related Articles
I Was Wrong About AI Coding Assistants. Here's What Changed My Mind (and What I Built About It).
Dev.to

Interesting loop
Reddit r/LocalLLaMA
Qwen3.5-122B-A10B Uncensored (Aggressive) — GGUF Release + new K_P Quants
Reddit r/LocalLLaMA
A supervisor or "manager" Al agent is the wrong way to control Al
Reddit r/artificial
FeatherOps: Fast fp8 matmul on RDNA3 without native fp8
Reddit r/LocalLLaMA