Real-Time Branch-to-Tool Distance Estimation for Autonomous UAV Pruning: Benchmarking Five DEFOM-Stereo Variants from Simulation to Jetson Deployment
arXiv cs.CV / 3/30/2026
💬 OpinionSignals & Early TrendsModels & Research
Key Points
- The paper targets safety-critical UAV tree pruning by estimating the metric distance from the cutting tool to thin branches in real time using a stereo depth approach.
- It trains five DEFOM-Stereo variants on a task-specific Unreal Engine 5 synthetic dataset (5,520 stereo pairs across 115 tree instances) and deploys the resulting checkpoints to an NVIDIA Jetson Orin Super 16 GB.
- While DEFOM-Stereo ViT-S achieves the best depth accuracy on the synthetic test set, it runs at only ~2.2 FPS on the Jetson, which is too slow for responsive closed-loop tool control.
- The newly introduced DEFOM-PrunePlus (~21M parameters) improves the accuracy-latency trade-off, reaching ~3.3 FPS with deployable performance deemed sufficient for real-time guidance at the 2m operating range.
- Faster lightweight variants (DEFOM-PruneStereo and DEFOM-PruneNano) meet higher frame rates but show substantially worse depth accuracy, and the authors report zero-shot results on real photos to support sim-to-real transfer for the full-capacity models.
Related Articles

Black Hat Asia
AI Business
Mr. Chatterbox is a (weak) Victorian-era ethically trained model you can run on your own computer
Simon Willison's Blog
Beyond the Chatbot: Engineering Multi-Agent Ecosystems in 2026
Dev.to
I missed the "fun" part in software development
Dev.to
Hermes Agent: A Self-Improving AI Agent That Runs Anywhere
Dev.to