Phone2Act: A Low-Cost, Hardware-Agnostic Teleoperation System for Scalable VLA Data Collection
arXiv cs.RO / 5/5/2026
Key Points
- Phone2Act is a low-cost, hardware-agnostic teleoperation system designed to collect high-quality Vision-Language-Action (VLA) manipulation data without requiring specialized robot setups.
- It turns a commodity smartphone into a 6-DoF robot controller using Google ARCore, and uses a modular ROS 2 design with interchangeable bridge nodes to support many robot platforms without code changes.
- A Universal Recorder synchronizes multi-camera RGB streams with robot state feedback and exports demonstrations directly in the LeRobot dataset format to reduce post-processing.
- The authors validated the framework by fine-tuning GR00T-N1.5 on 130 collected episodes; the resulting policy reached a 90% success rate on a multi-stage pick-and-place task run on a physical Dobot CR5.
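
To make the recorder's synchronization step concrete, here is a minimal sketch of one common way to align several camera streams with robot state: nearest-timestamp matching with a skew limit. This is an illustration only, not Phone2Act's actual Universal Recorder. The names `Stamped`, `nearest`, `synchronize`, and the `max_skew` threshold are hypothetical, and the LeRobot export step is not shown.

```python
from bisect import bisect_left
from dataclasses import dataclass
from typing import Any, Dict, List, Sequence


@dataclass
class Stamped:
    """A timestamped sample (hypothetical container, not from the paper)."""
    t: float   # timestamp in seconds
    data: Any  # payload, e.g. an image handle or a joint-state vector


def nearest(samples: Sequence[Stamped], t: float) -> Stamped:
    """Return the sample closest in time to t.

    Assumes `samples` is non-empty and sorted by timestamp.
    """
    times = [s.t for s in samples]
    i = bisect_left(times, t)
    # Only the neighbours on either side of the insertion point can be closest.
    candidates = samples[max(i - 1, 0): i + 1]
    return min(candidates, key=lambda s: abs(s.t - t))


def synchronize(
    states: Sequence[Stamped],
    cameras: Dict[str, Sequence[Stamped]],
    max_skew: float = 0.02,  # assumed tolerance in seconds
) -> List[dict]:
    """Pair each robot state with the nearest frame from every camera.

    A step is dropped if any camera's nearest frame is more than
    `max_skew` seconds away from the state timestamp.
    """
    steps = []
    for state in states:
        frames = {name: nearest(stream, state.t) for name, stream in cameras.items()}
        if all(abs(f.t - state.t) <= max_skew for f in frames.values()):
            steps.append({
                "t": state.t,
                "state": state.data,
                "frames": {name: f.data for name, f in frames.items()},
            })
    return steps
```

Using the robot state as the reference clock keeps one record per control step; a real recorder might instead resample everything to a fixed rate, which is a design choice this sketch does not cover.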