Swim2Real: VLM-Guided System Identification for Sim-to-Real Transfer
arXiv cs.RO / 3/24/2026
📰 NewsSignals & Early TrendsIdeas & Deep AnalysisModels & Research
Key Points
- Swim2Real is a video-to-simulator calibration pipeline that uses vision-language model (VLM) feedback to tune a 16-parameter robotic fish simulator without hand-designed search stages.
- It addresses hard sim-to-real issues in aquatic robotics—chaotic parameter landscapes, persistent sim model error, and limited reproducible experiments—by comparing simulated and real swim videos and iteratively updating parameters.
- A backtracking line search validates VLM-proposed step sizes, boosting acceptance rate from 14% to 42% by correcting cases where the update direction is right but the magnitude is too large.
- The calibrated simulator closely matches real fish velocities across motor frequencies (MAE 7.4 mm/s, 43% lower than the next-best method) and maintains robustness with zero outlier seeds across five runs.
- With the tuned simulator, motor commands transfer to a physical fish at 50 Hz, and downstream RL policies achieve improved performance versus policies trained on simulators calibrated with BayesOpt or CMA-ES.
Related Articles
The Security Gap in MCP Tool Servers (And What I Built to Fix It)
Dev.to
Big Tech firms are accelerating AI investments and integration, while regulators and companies focus on safety and responsible adoption.
Dev.to
I made a new programming language to get better coding with less tokens.
Dev.to
RSA Conference 2026: The Week Vibe Coding Security Became Impossible to Ignore
Dev.to

Adversarial AI framework reveals mechanisms behind impaired consciousness and a potential therapy
Reddit r/artificial