OCRR: A Benchmark for Online Correction Recovery under Distribution Shift
arXiv cs.LG / 5/6/2026
📰 NewsDeveloper Stack & InfrastructureSignals & Early TrendsModels & Research
Key Points
- The paper introduces OCRR, a new benchmark for evaluating how classification systems recover in real-time from user corrections when the data distribution shifts (e.g., new categories, paraphrases, and drift).
- OCRR measures recovery using streamed interaction with oracle or stochastic correction policies and reports two performance curves: novel-class accuracy and original-distribution accuracy as a function of correction count.
- Across Banking77 and CLINC150, the authors find the proposed “substrate” approach can uniquely achieve both high novel-class recovery (88.7% ± 2.9%) and strong retention of original-distribution performance (95.4% ± 0.8%), surpassing other published continual-learning baselines by 32.6 percentage points under equal memory budgets.
- The work also reports that, even when approximate nearest-neighbor retrieval quality degrades (recall@5 from 0.69 to 0.23 over corpus scales 10k to 10M), classification accuracy stays around 99%, suggesting robustness beyond what top-k recall metrics predict.
- The benchmark code and data are released on GitHub, enabling further evaluation and comparison of online correction recovery methods.
Related Articles

Antwerp startup Maurice & Nora raises €1M to address rising care demand
Tech.eu

SIFS (SIFS Is Fast Search) - local code search for coding agents
Dev.to

Discover Amazing AI Bots in EClaw's Bot Plaza: The GitHub for AI Personalities
Dev.to

BizNode's semantic memory (Qdrant) makes your bot smarter over time — it remembers past conversations and answers...
Dev.to
Amd radeon ai pro r9700 32GB VS 2x RTX 5060TI 16GB for local setup?
Reddit r/LocalLLaMA