Learning Dexterous Grasping from Sparse Taxonomy Guidance

arXiv cs.RO / 4/7/2026


Key Points

  • The paper introduces GRIT, a two-stage framework for dexterous grasping that uses sparse taxonomy guidance rather than dense grasp/contact supervision.
  • GRIT first predicts a taxonomy-based grasp specification from scene and task context, then generates continuous multi-finger motions conditioned on that sparse grasp structure.
  • The authors find that different grasp taxonomies work better for different object geometries, and they leverage this relationship to improve generalization.
  • In benchmark experiments, GRIT achieves an overall success rate of 87.9% and outperforms baseline methods on novel objects.
  • Real-world tests indicate the approach is controllable, allowing grasp strategies to be adjusted via high-level taxonomy selection aligned with object geometry and task intent.

Abstract

Dexterous manipulation requires planning a grasp configuration suited to the object and task, which is then executed through coordinated multi-finger control. However, specifying grasp plans with dense pose or contact targets for every object and task is impractical. Meanwhile, end-to-end reinforcement learning from task rewards alone lacks controllability, making it difficult for users to intervene when failures occur. To address this, we present GRIT, a two-stage framework that learns dexterous control from sparse taxonomy guidance. GRIT first predicts a taxonomy-based grasp specification from the scene and task context. Conditioned on this sparse command, a policy generates continuous finger motions that accomplish the task while preserving the intended grasp structure. Our results show that certain grasp taxonomies are more effective for specific object geometries. By leveraging this relationship, GRIT improves generalization to novel objects over baselines and achieves an overall success rate of 87.9%. Moreover, real-world experiments demonstrate controllability, enabling grasp strategies to be adjusted through high-level taxonomy selection based on object geometry and task intent.
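
To make the two-stage structure concrete, here is a minimal Python sketch of the control flow the abstract describes: a taxonomy predictor maps scene and task context to a sparse grasp specification, and a low-level policy generates finger motion conditioned on that specification. All names (`GraspType`, `GraspSpec`, `run_grit`) and the taxonomy labels are hypothetical stand-ins, not the authors' actual interfaces or taxonomy classes.

```python
from dataclasses import dataclass
from enum import Enum, auto
from typing import Callable, Sequence

class GraspType(Enum):
    # Illustrative taxonomy labels only; the paper's actual grasp
    # taxonomy classes are not enumerated in this summary.
    POWER = auto()
    PRECISION = auto()
    LATERAL = auto()

@dataclass
class GraspSpec:
    """Sparse, taxonomy-level grasp command: no dense pose or contact targets."""
    grasp_type: GraspType

def run_grit(
    scene_obs: Sequence[float],
    task_context: str,
    predict_taxonomy: Callable[[Sequence[float], str], GraspSpec],
    policy: Callable[[Sequence[float], GraspSpec], Sequence[float]],
) -> Sequence[float]:
    # Stage 1: predict a taxonomy-based grasp specification from
    # the scene and task context.
    spec = predict_taxonomy(scene_obs, task_context)
    # Stage 2: generate continuous multi-finger motion conditioned on
    # the sparse command, preserving the intended grasp structure.
    return policy(scene_obs, spec)

if __name__ == "__main__":
    # Toy stand-ins so the sketch runs end to end.
    predictor = lambda obs, ctx: GraspSpec(GraspType.PRECISION)
    policy = lambda obs, spec: [0.0] * 16  # e.g. 16 finger-joint targets
    print(run_grit([0.1, 0.2], "pick up the mug", predictor, policy))
```

Note how controllability falls out of this decomposition: because the policy consumes `GraspSpec` as an explicit input, a user could override Stage 1 and select a grasp taxonomy by hand, which matches the real-world behavior the paper reports.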