SBF: An Effective Representation to Augment Skeleton for Video-based Human Action Recognition
arXiv cs.CV / 4/7/2026
📰 NewsIdeas & Deep AnalysisModels & Research
Key Points
- The paper addresses limitations of using 2D skeletons for video-based human action recognition in scenes where depth, body contours, and human-object interactions are important.
- It proposes Scale-Body-Flow (SBF), an augmented representation that combines per-joint scale/depth cues, a human body outline map, and an optical-flow-derived interaction map.
- To generate SBF, the authors introduce SFSNet, a segmentation network trained using supervision from existing skeleton and optical flow signals without requiring additional annotations.
- Experiments across multiple datasets show that the SBF+SFSNet pipeline improves action recognition accuracy while maintaining similar compactness and efficiency versus skeleton-only state-of-the-art approaches.
Related Articles

Title: We Built an AI That Remembers Why Your Codebase Is the Way It Is
Dev.to

Agent Diary: Apr 12, 2026 - The Day I Became a Perfect Zero (While Run 238 Writes About Achieving Absolute Nothingness)
Dev.to

A Black-Box Framework for Evaluating Trust in AI Agents
Dev.to
[D] Will Google’s TurboQuant algorithm hurt AI demand for memory chips? [D]
Reddit r/MachineLearning

Plug-and-Play Context Compression for Any LLM API — CRISP
Dev.to