Pickalo: Leveraging 6D Pose Estimation for Low-Cost Industrial Bin Picking
arXiv cs.RO / 4/7/2026
📰 NewsSignals & Early TrendsModels & Research
Key Points
- Pickalo is a modular bin-picking pipeline that targets real industrial environments with heavy clutter and occlusions using only low-cost sensing hardware (a wrist-mounted RGB-D camera and stereo depth processing).
- The system refines raw stereo streams with BridgeDepth, segments objects with a Mask-RCNN model trained solely on photorealistic synthetic data, and estimates 6D pose via the zero-shot SAM-6D approach.
- A pose buffer module fuses multi-view observations over time to reduce pose noise while accounting for object symmetries, improving stability during continuous operation.
- For grasping, Pickalo precomputes large antipodal grasp candidate sets offline and selects grasps online using utility-based ranking with fast collision checking.
- Experiments on a UR5e with a parallel-jaw gripper and Intel RealSense D435i report up to 600 mean picks per hour with 96–99% grasp success, including robust performance over 30-minute runs, with ablations confirming the value of improved depth estimation and the pose buffer.
Related Articles

Black Hat Asia
AI Business

Whats'App Ai Assistant
Dev.to
AI Agents Now Have Credit Cards
Dev.to

I Built a $70K Security Bounty Pipeline with AI — Here's the Exact Workflow
Dev.to

It's insane how lobotomized Opus 4.6 is right now. Even Gemma 4 31B UD IQ3 XXS beat it on the carwash test on my 5070 TI.
Reddit r/LocalLLaMA