CoFL: Continuous Flow Fields for Language-Conditioned Navigation
arXiv cs.RO / 4/30/2026
💬 OpinionDeveloper Stack & InfrastructureSignals & Early TrendsModels & Research
Key Points
- The paper introduces CoFL, an end-to-end policy for language-conditioned navigation that outputs a continuous flow field from BEV observations and a language instruction.
- Instead of predicting trajectories from a single start point, CoFL learns local motion vectors at arbitrary BEV locations, using each scene-instruction annotation as dense spatial supervision.
- The approach generates trajectories from any starting position by numerically integrating the predicted flow field, supporting simple real-time rollouts and closed-loop recovery.
- To scale training and evaluation, the authors build a dataset of 500k+ BEV image–instruction pairs with procedurally generated flow fields and trajectories derived from semantic maps from Matterport3D and ScanNet.
- Experiments on strictly unseen scenes show CoFL outperforms modular vision-language planners and trajectory-generation policies in both precision and safety, and it also performs zero-shot in real-world tests with feasible closed-loop control.
💡 Insights using this article
This article is featured in our daily AI news digest — key takeaways and action items at a glance.
Related Articles

Chinese firms face pressure on AI investments as US peers’ spending keeps soaring
SCMP Tech
The Prompt Caching Mistake That's Costing You 70% More Than You Need to Pay
Dev.to
We Built a DNS-Based Discovery Protocol for AI Agents — Here's How It Works
Dev.to
Your first business opportunity in 3 commands: /register_directory in @biznode_bot, wait for matches, then /my_pulse to view...
Dev.to
Building AI Evaluation Pipelines: Automating LLM Testing from Dataset to CI/CD
Dev.to