QuietWalk: Physics-Informed Reinforcement Learning for Ground Reaction Force-Aware Humanoid Locomotion Under Diverse Footwear

arXiv cs.RO / 4/28/2026

📰 NewsDeveloper Stack & InfrastructureModels & Research

共有:

Key Points

QuietWalk is a physics-informed reinforcement learning framework that targets humanoid locomotion by explicitly accounting for ground-reaction forces (GRFs) affected by different footwear.
It uses an inverse-dynamics-constrained PINN to estimate per-foot vertical GRFs from proprioceptive signals and then feeds a frozen predictor into the RL loop to penalize predicted impact forces without needing force sensors at deployment.
On a held-out real-robot dataset, inverse-dynamics consistency enforcement cuts vertical GRF prediction errors by 82%–86% versus a purely supervised predictor and boosts prediction quality (coefficient of determination up to 0.99).
Hardware tests at 1.2 m/s show quieter walking: mean A-weighted noise drops by 7.17 dB and peak noise drops by 4.98 dB under a consistent recording setup.
Cross-footwear experiments (barefoot, skate shoes, sneakers, high heels) across multiple surfaces demonstrate strong policy generalization to footwear-induced contact dynamics changes.

Abstract

Humanoid robots operating in human-centered environments (e.g., homes, hospitals, and offices) must mitigate foot--ground impact transients, as impact-induced vibration and noise degrade user experience and repeated impacts accelerate hardware wear. However, existing low-noise locomotion training often relies on kinematic proxy objectives or fragile force sensors, and footwear-induced changes in contact dynamics introduce distribution shifts that hinder policy generalization.We present QuietWalk, a physics-informed reinforcement learning framework for ground-reaction-force-aware humanoid locomotion under diverse footwear conditions. QuietWalk employs an inverse-dynamics-constrained physics-informed neural network (PINN) to estimate per-foot vertical ground reaction forces (GRFs) from proprioceptive signals, and integrates the frozen predictor into the RL training loop to penalize predicted impact forces without requiring force sensors at deployment.On a held-out real-robot dataset, enforcing inverse-dynamics consistency reduces vertical GRF prediction errors by 82%-86% compared with a purely supervised predictor and improves the coefficient of determination from 0.39/0.67 to 0.99/0.99 for the left/right feet. On hardware at 1.2 m/s (barefoot; averaged over four floor materials), QuietWalk reduces mean A-weighted noise level by 7.17 dB and peak noise level by 4.98 dB under a consistent recording setup. Cross-footwear experiments (barefoot, skate shoes, athletic sneakers, and high heels) across multiple surfaces further demonstrate robust adaptation to footwear-induced contact variations.