A Gait Foundation Model Predicts Multi-System Health Phenotypes from 3D Skeletal Motion

arXiv cs.AI / 3/27/2026

💬 OpinionSignals & Early TrendsIdeas & Deep AnalysisModels & Research

Key Points

  • The study proposes a gait foundation model that learns from 3D skeletal motion captured by a depth camera across five motor tasks in 3,414 deeply phenotyped adults, aiming to treat gait as a systemic biomarker rather than a single-disease symptom.
  • Learned gait embeddings outperform engineered features, strongly predicting age (r = 0.69), BMI (r = 0.90), and visceral adipose tissue area (r = 0.82).
  • The model’s embeddings significantly predict 1,980 of 3,210 phenotypic targets, and after adjusting for age, BMI, VAT, and height, gait provides independent predictive gains across nearly all evaluated body systems (18 systems in males, 17 in females).
  • Anatomical ablation suggests body-region specialization: the legs drive metabolic and frailty-related predictions, while the torso carries information associated with sleep and lifestyle phenotypes.
  • The authors position gait as a scalable, passive vital sign and emphasize translation toward consumer-grade video and integration into broader health monitoring workflows.

Abstract

Gait is increasingly recognized as a vital sign, yet current approaches treat it as a symptom of specific pathologies rather than a systemic biomarker. We developed a gait foundation model for 3D skeletal motion from 3,414 deeply phenotyped adults, recorded via a depth camera during five motor tasks. Learned embeddings outperformed engineered features, predicting age (Pearson r = 0.69), BMI (r = 0.90), and visceral adipose tissue area (r = 0.82). Embeddings significantly predicted 1,980 of 3,210 phenotypic targets; after adjustment for age, BMI, VAT, and height, gait provided independent gains in all 18 body systems in males and 17 of 18 in females, and improved prediction of clinical diagnoses and medication use. Anatomical ablation revealed that legs dominated metabolic and frailty predictions while torso encoded sleep and lifestyle phenotypes. These findings establish gait as an independent multi-system biosignal, motivating translation to consumer-grade video and its integration as a scalable, passive vital sign.
広告