VQ-Jarvis: Retrieval-Augmented Video Restoration Agent with Sharp Vision and Fast Thought
arXiv cs.CV / 3/25/2026
📰 NewsSignals & Early TrendsIdeas & Deep AnalysisModels & Research
Key Points
- The paper introduces VQ-Jarvis, a retrieval-augmented video restoration agent aimed at handling heterogeneous real-world degradations better than fixed pipelines.
- It proposes “sharp vision” via VSR-Compare, a large-scale paired video enhancement dataset (20K comparison pairs) spanning 7 degradation types and 11 enhancement operators.
- VQ-Jarvis uses trained judge and degradation-perception models to distinguish subtle quality differences among candidate restorations and to guide agent decisions.
- For “fast thought,” it combines one-step retrieval for easier videos with hierarchical step-by-step greedy search for more difficult cases to balance efficiency and accuracy.
- Experiments reported in the article indicate that VQ-Jarvis outperforms existing video restoration approaches on complex degraded videos.
Related Articles

Black Hat Asia
AI Business

"The Agent Didn't Decide Wrong. The Instructions Were Conflicting — and Nobody Noticed."
Dev.to
Top 5 LLM Gateway Alternatives After the LiteLLM Supply Chain Attack
Dev.to

Stop Counting Prompts — Start Reflecting on AI Fluency
Dev.to

Reliable Function Calling in Deeply Recursive Union Types: Fixing Qwen Models' Double-Stringify Bug
Dev.to