RAM: Recover Any 3D Human Motion in-the-Wild

arXiv cs.CV / 3/23/2026

Key Points

RAM introduces a motion-aware semantic tracker that uses adaptive Kalman filtering to improve identity association under severe occlusions and dynamic interactions.
It adds a memory-augmented Temporal HMR module that injects spatio-temporal priors for more consistent and smooth 3D motion estimation.
A lightweight Predictor module forecasts future poses to maintain reconstruction continuity, complementing the tracker for robust performance.
The approach reports state-of-the-art results on PoseTrack and 3DPW, with improved zero-shot tracking stability and 3D accuracy, and offers a generalizable paradigm for markerless 3D human motion capture in the wild.
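To make the tracking idea concrete, the following is a minimal sketch of a Kalman filter whose measurement noise adapts to detection confidence. It is not RAM's tracker: the summary does not say how the adaptation is done, so the confidence-scaled noise rule, the 1D constant-velocity state, and every name here (`AdaptiveKalman1D`, `base_r`, `q`) are assumptions chosen for illustration.

```python
from dataclasses import dataclass, field


@dataclass
class AdaptiveKalman1D:
    """Constant-velocity Kalman filter on one coordinate (hypothetical sketch)."""
    pos: float = 0.0
    vel: float = 0.0
    # 2x2 state covariance for (position, velocity)
    P: list = field(default_factory=lambda: [[1.0, 0.0], [0.0, 1.0]])
    q: float = 0.01      # process noise added per step (assumed value)
    base_r: float = 0.1  # measurement noise at full confidence (assumed value)

    def predict(self) -> float:
        """Advance one frame with F = [[1, 1], [0, 1]]: P <- F P F^T + q*I."""
        (p00, p01), (p10, p11) = self.P
        self.pos += self.vel
        self.P = [[p00 + p10 + p01 + p11 + self.q, p01 + p11],
                  [p10 + p11, p11 + self.q]]
        return self.pos

    def update(self, z: float, confidence: float) -> float:
        """Fuse a position measurement z; low confidence inflates the noise R."""
        # Assumed adaptation rule: R grows as detection confidence drops,
        # so an occluded detection pulls the track less than a clean one.
        r = self.base_r / max(confidence, 1e-3)
        (p00, p01), (p10, p11) = self.P
        s = p00 + r                    # innovation variance
        k0, k1 = p00 / s, p10 / s      # Kalman gain for position and velocity
        y = z - self.pos               # innovation
        self.pos += k0 * y
        self.vel += k1 * y
        self.P = [[(1 - k0) * p00, (1 - k0) * p01],
                  [p10 - k1 * p00, p11 - k1 * p01]]
        return k0
```

With this rule, a confident detection moves the track almost all the way to the measurement, while a low-confidence (for example, partly occluded) detection leaves the track closer to its motion-based prediction. A tracker could then use the predicted position when matching identities across frames.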

Abstract

RAM incorporates a motion-aware semantic tracker with adaptive Kalman filtering to achieve robust identity association under severe occlusions and dynamic interactions. A memory-augmented Temporal HMR module further enhances human motion reconstruction by injecting spatio-temporal priors for consistent and smooth motion estimation. Moreover, a lightweight Predictor module forecasts future poses to maintain reconstruction continuity, while a gated combiner adaptively fuses reconstructed and predicted features to ensure coherence and robustness. Experiments on in-the-wild multi-person benchmarks such as PoseTrack and 3DPW demonstrate that RAM substantially outperforms previous state-of-the-art methods in both zero-shot tracking stability and 3D accuracy, offering a generalizable paradigm for markerless 3D human motion capture in the wild.
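The gated fusion mentioned in the abstract can be illustrated with a small sketch. The abstract does not give the combiner's form, so this assumes a common design: a per-feature sigmoid gate that blends the reconstructed and predicted features. The function name `gated_combine` and the parameters `w_recon`, `w_pred`, and `bias` are hypothetical stand-ins for weights that would be learned in practice.

```python
import math


def sigmoid(x: float) -> float:
    return 1.0 / (1.0 + math.exp(-x))


def gated_combine(recon, pred, w_recon, w_pred, bias):
    """Blend two feature vectors with a per-feature gate (assumed design).

    gate  = sigmoid(w_recon * recon + w_pred * pred + bias)
    fused = gate * recon + (1 - gate) * pred
    """
    fused = []
    for r, p, wr, wp, b in zip(recon, pred, w_recon, w_pred, bias):
        gate = sigmoid(wr * r + wp * p + b)
        fused.append(gate * r + (1.0 - gate) * p)
    return fused


# Usage: zero weights and zero bias give a gate of 0.5, i.e. a plain average.
recon = [1.0, 2.0]
pred = [3.0, 6.0]
zeros = [0.0, 0.0]
average = gated_combine(recon, pred, zeros, zeros, zeros)
```

A gate near 1 keeps the reconstructed feature and a gate near 0 falls back to the predicted one, which is one plausible way for a model to lean on forecasts when the current frame is unreliable.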
