DINO-Explorer: Active Underwater Discovery via Ego-Motion Compensated Semantic Predictive Coding
arXiv cs.RO / 4/15/2026
📰 NewsSignals & Early TrendsIdeas & Deep AnalysisModels & Research
Key Points
- DINO-Explorer is presented as an underwater active perception framework that prioritizes scientifically relevant, transient events instead of passively logging exhaustive video for offline review.
- The method generates a continuous semantic “surprise” signal inside the latent space of a frozen DINOv3 foundation model, using a lightweight action-conditioned recurrent predictor to forecast short-horizon semantic changes.
- An efference-copy-inspired module compensates for ego-motion by using globally pooled optical flow to suppress self-induced visual variation while preserving true environmental novelty.
- Evaluations on asynchronous event triage under constrained telemetry show bandwidth-efficient attention, retaining 78.8% of consensus events with a 56.8% trigger confirmation rate.
- Ego-motion conditioning reduces false positives by 45.5% versus an uncompensated surprise baseline, and a replay-side study reports 48.2% telemetry bandwidth reduction while maintaining a 62.2% peak F1 score near the selected operating point.




