Interactive Tracking: A Human-in-the-Loop Paradigm with Memory-Augmented Adaptation
arXiv cs.CV / 4/3/2026
📰 NewsSignals & Early TrendsIdeas & Deep AnalysisModels & Research
Key Points
- The paper argues that most visual tracking systems are “fire-and-forget” and proposes Interactive Tracking, where users can steer a tracker at any time using natural-language commands for real human-in-the-loop use cases.
- It introduces InteractTrack, a new large-scale benchmark with 150 videos, densely annotated bounding boxes, and timestamped language instructions to support research on interactive tracking.
- The authors provide a dedicated evaluation protocol and show that 25 representative state-of-the-art trackers perform poorly in interactive scenarios, indicating that gains on conventional benchmarks do not reliably transfer.
- They propose IMAT (Interactive Memory-Augmented Tracking), a baseline that uses dynamic memory to learn from user feedback and update tracking behavior over time.
- The benchmark, evaluation assets, and results are published to serve as a foundation for building more adaptive, collaborative tracking systems.
Related Articles

Black Hat Asia
AI Business

Mistral raises $830M, 9fin hits unicorn status, and new Tech.eu Summit speakers unveiled
Tech.eu

ChatGPT costs $20/month. I built an alternative for $2.99.
Dev.to

OpenAI shifts to usage-based pricing for Codex in ChatGPT business plans
THE DECODER

Why I built an AI assistant that doesn't know who you are
Dev.to