Dual-Stage Invariant Continual Learning under Extreme Visual Sparsity
arXiv cs.LG / 3/30/2026
💬 OpinionSignals & Early TrendsIdeas & Deep AnalysisModels & Research
Key Points
- The paper studies continual learning for object detection under extreme visual sparsity (e.g., space-based RSO detection) where foreground signals are overwhelmed by background, leading to backbone destabilization under sequential domain shifts.
- It provides an analytic explanation that background-driven gradients cause progressive representation drift, revealing a structural weakness in continual learning methods that only use output-level distillation.
- To counter this, the authors propose a dual-stage invariant continual learning framework that applies joint distillation to both intermediate backbone representations (structural consistency) and detection predictions (semantic consistency).
- They further introduce sparsity-aware data conditioning—patch-based sampling plus distribution-aware augmentation—to regulate gradient statistics under severe class/visual imbalance.
- Experiments on a high-resolution space-based RSO detection dataset show an absolute +4.0 mAP improvement over established continual object detection methods under sequential domain shifts.




