Shared Representation for 3D Pose Estimation, Action Classification, and Progress Prediction from Tactile Signals
arXiv cs.CV / 3/30/2026
📰 NewsSignals & Early TrendsIdeas & Deep AnalysisModels & Research
Key Points
- The paper addresses how to estimate 3D human pose, classify actions, and predict action completion progress using tactile signals to avoid occlusion and privacy issues common in vision-based methods.
- It proposes SCOTTI, a Shared Convolutional Transformer for Tactile Inference, which learns a shared representation to perform all three tasks jointly via multi-task learning.
- The work claims novelty in exploring action progress prediction specifically from foot tactile signals using custom wireless insole sensors.
- Experiments report that SCOTTI outperforms prior approaches on all three tasks compared with separate single-task learning.
- The authors introduce a new tactile dataset collected from 15 participants (7 hours total) performing eight activities, supporting training and evaluation of the proposed approach.




