Delay-Aware Diffusion Policy: Bridging the Observation-Execution Gap in Dynamic Tasks

arXiv cs.RO / 3/25/2026


Key Points

  • The paper addresses how robot control suffers when inference delay causes a mismatch between the observed state and the state at action execution (tens to hundreds of milliseconds).
  • It proposes Delay-Aware Diffusion Policy (DA-DP), which trains and runs policies by incorporating measured delay rather than assuming zero delay.
  • DA-DP corrects zero-delay trajectories into delay-compensated versions and adds delay conditioning so the policy can adapt to different latencies.
  • Experiments across multiple tasks, robots, and delay settings show DA-DP achieves higher and more robust success rates than delay-unaware baselines.
  • The approach is architecture-agnostic and also motivates evaluation protocols that report performance versus measured latency, not only task difficulty.
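The delay-conditioning idea in the bullets above can be sketched minimally: feed the policy the measured inference latency as an extra input feature, so a single network can adapt its output to different delays. This is a hedged illustration, not the paper's implementation; the function name, normalization scale, and feature layout are all assumptions.

```python
import numpy as np

def make_policy_input(obs, delay_s, delay_scale=0.2):
    """Append a normalized measured delay to the observation vector.

    obs: 1-D observation features.
    delay_s: measured inference delay in seconds.
    delay_scale: assumed maximum delay used for normalization (hypothetical).
    Returns the delay-conditioned input the policy would consume.
    """
    # Normalize the delay to [0, 1] so it lives on a scale similar to
    # typical normalized observation features.
    d = np.clip(delay_s / delay_scale, 0.0, 1.0)
    return np.concatenate([np.asarray(obs, dtype=np.float32), [d]])
```

At training time the same conditioning value would be drawn from the delays measured (or simulated) for each trajectory, so the policy sees a distribution of latencies rather than a single zero-delay assumption.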

Abstract

As a robot senses and selects actions, the world keeps changing. This inference delay creates a gap of tens to hundreds of milliseconds between the observed state and the state at execution. In this work, we take the natural generalization from zero delay to measured delay during training and inference. We introduce Delay-Aware Diffusion Policy (DA-DP), a framework for explicitly incorporating inference delays into policy learning. DA-DP corrects zero-delay trajectories to their delay-compensated counterparts, and augments the policy with delay conditioning. We empirically validate DA-DP on a variety of tasks, robots, and delays and find its success rate more robust to delay than that of delay-unaware methods. DA-DP is architecture-agnostic and transfers beyond diffusion policies, offering a general pattern for delay-aware imitation learning. More broadly, DA-DP encourages evaluation protocols that report performance as a function of measured latency, not just task difficulty.
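The abstract's "corrects zero-delay trajectories to their delay-compensated counterparts" can be illustrated with a simple relabeling: for each observation at time t, the training target becomes the demonstrated action at time t + delay, i.e. the action that should actually be executing once inference finishes. The sketch below uses linear interpolation for this shift; the helper name and interpolation choice are assumptions, not the paper's exact procedure.

```python
import numpy as np

def delay_compensated_targets(actions, timestamps, delay):
    """Relabel a zero-delay action trajectory for a known inference delay.

    actions: (T, D) array of demonstrated actions.
    timestamps: (T,) array of times for each action, in seconds.
    delay: measured inference delay in seconds.
    Returns a (T, D) array where row i is the (interpolated) action at
    timestamps[i] + delay, clamped to the trajectory's end.
    """
    shifted_t = timestamps + delay
    out = np.empty_like(actions)
    for d in range(actions.shape[1]):
        # np.interp clamps queries beyond the last timestamp to the
        # final action, so the end of the trajectory stays well-defined.
        out[:, d] = np.interp(shifted_t, timestamps, actions[:, d])
    return out
```

Training on these shifted targets (together with delay conditioning) is what lets the policy output actions appropriate for the state at execution time rather than at observation time.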