A Multi-Modal Dataset for Ground Reaction Force Estimation Using Consumer Wearable Sensors

arXiv cs.AI / 4/1/2026

📰 NewsSignals & Early TrendsModels & Research

Key Points

  • The paper introduces a fully open, multi-modal dataset for estimating vertical ground reaction force (vGRF) using consumer Apple Watch IMU sensors with laboratory force-plate ground truth.
  • The dataset covers five activities (walking, jogging, running, heel drops, step drops) from 10 adults, providing 492 validated, time-aligned trials with IMU recordings (~100 Hz) and force plate measurements (~1000 Hz).
  • It includes both raw and processed time series, trial-level metadata, quality-control flags, and machine-readable data dictionaries, along with trial matching manifests for cross-modality alignment.
  • Quality and reliability are assessed via a multi-phase cross-sensor plausibility/consistency framework, repeatability analysis of peak vGRF (ICC ~0.871–0.990), and robustness testing using Monte Carlo timing perturbations.
  • A subset of 395 trials includes wrist, waist, and force-plate data (triad-complete), enabling sensor-placement studies and reproducible benchmarking for machine-learning vGRF estimation, released under CC BY 4.0 with archived analysis scripts on GitHub.

Abstract

This Data Descriptor presents a fully open, multi-modal dataset for estimating vertical ground reaction force (vGRF) from consumer-grade Apple Watch sensors with laboratory force plate ground truth. Ten healthy adults aged 26--41 years performed five activities: walking, jogging, running, heel drops, and step drops, while wearing two Apple Watches positioned at the left wrist and waist. The dataset contains 492 validated trials with time-aligned inertial measurement unit (IMU) recordings (approximately 100 Hz) and force plate vGRF (Force\_Z, 1000 Hz). The release includes raw and processed time series, trial-level metadata, quality-control flags, and machine-readable data dictionaries. Trial-level matching manifests link recordings across modalities using stable identifiers. Of the 492 validated trials, 395 are triad-complete, containing wrist, waist, and force plate data, enabling cross-sensor analyses and reproducible model evaluation. Dataset quality is characterised through a three-phase cross-sensor plausibility and consistency framework, repeatability analysis of peak vGRF (intraclass correlation coefficient 0.871--0.990), and systematic checks of force ranges and trial completeness. Monte Carlo sensitivity analysis showed that correlation-based validation metrics were robust to single-sample timing perturbations at the IMU sampling resolution. All data are released under CC BY 4.0, with analysis scripts archived alongside the dataset and mirrored on GitHub. This resource supports reproducible research in wearable biomechanics, benchmarking of machine learning models for vGRF estimation, and investigation of sensor placement effects using widely available consumer wearables.

A Multi-Modal Dataset for Ground Reaction Force Estimation Using Consumer Wearable Sensors | AI Navigate