Patient4D: Temporally Consistent Patient Body Mesh Recovery from Monocular Operating Room Video
arXiv cs.CV / 3/19/2026
Key Points
- Patient4D is a reconstruction pipeline that recovers dense 3D body meshes from monocular operating-room video, using a stationarity prior (the assumption that the patient stays essentially still) to handle occlusion and changing viewpoints.
- The method combines image-level foundation models for perception with lightweight geometric components, notably Pose Locking and Rigid Fallback, to enforce temporal consistency while remaining compatible with off-the-shelf human mesh recovery (HMR) models.
- Evaluation on 4,680 synthetic surgical sequences and three public HMR benchmarks shows a mean IoU of 0.75 under drape occlusion and a drop in failure frames from 30.5% for the best baseline to 1.3%.
- The results indicate that leveraging stationarity priors can substantially improve monocular 3D reconstruction in clinical AR scenarios and may broaden the applicability of HMR in surgical settings.
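For readers unfamiliar with the two metrics quoted above, the following is a minimal sketch of how a mean IoU and a failure-frame rate could be computed from per-frame binary masks. It is not the paper's evaluation code: the function names `mask_iou` and `summarize`, the use of binary masks as inputs, and the `fail_threshold` value are all assumptions made for illustration, since the summary does not state the exact protocol or failure criterion.

```python
import numpy as np

def mask_iou(pred, target):
    """Intersection over union of two binary masks of the same shape."""
    pred = np.asarray(pred, dtype=bool)
    target = np.asarray(target, dtype=bool)
    union = np.logical_or(pred, target).sum()
    if union == 0:
        # Both masks empty: counted as perfect agreement (a convention, not from the paper).
        return 1.0
    return float(np.logical_and(pred, target).sum() / union)

def summarize(preds, targets, fail_threshold=0.5):
    """Return (mean IoU, fraction of frames whose IoU is below fail_threshold).

    fail_threshold is a hypothetical failure criterion chosen for this sketch.
    """
    ious = np.array([mask_iou(p, t) for p, t in zip(preds, targets)])
    return float(ious.mean()), float((ious < fail_threshold).mean())
```

Under these assumptions, a frame counts as a failure only when its overlap with the reference mask falls below the chosen threshold, so the failure rate is sensitive to that choice while the mean IoU is not.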