VSAS-BENCH: Real-Time Evaluation of Visual Streaming Assistant Models
arXiv cs.CV / 4/10/2026
📰 NewsSignals & Early TrendsIdeas & Deep AnalysisModels & Research
Key Points
- VSAS-BENCH is introduced as a new benchmark framework specifically for real-time visual streaming assistant (streaming VLM) evaluation, focusing on metrics beyond offline video understanding.
- The benchmark includes temporally dense annotations (18,000+), diverse domains and task types, and provides standardized synchronous and asynchronous evaluation protocols.
- It introduces metrics to separately measure proactiveness (response timeliness) and consistency (robustness of responses over time), enabling clearer analysis of streaming behavior.
- Large-scale experiments evaluate accuracy–latency trade-offs across factors like memory buffer length, memory access policy, and input resolution, producing practical design insights.
- The study shows conventional VLMs can be adapted to streaming without additional training and that adapted models (e.g., Qwen3-VL-4B) outperform prior streaming VLMs on VSAS-BENCH.
Related Articles

Black Hat Asia
AI Business
v0.20.5
Ollama Releases

Inside Anthropic's Project Glasswing: The AI Model That Found Zero-Days in Every Major OS
Dev.to
Gemma 4 26B fabricated an entire code audit. I have the forensic evidence from the database.
Reddit r/LocalLLaMA
SoloEngine: Low-Code Agentic AI Development Platform with Native Support for Multi-Agent Collaboration, MCP, and Skill System
Dev.to