HOI-aware Adaptive Network for Weakly-supervised Action Segmentation
arXiv cs.CV / 4/30/2026
📰 NewsModels & Research
Key Points
- The paper introduces AdaAct, an HOI-aware adaptive network for weakly-supervised action segmentation that uses human-object interactions (HOI) as video-level prior knowledge.
- Instead of relying on a fixed model across all videos, AdaAct dynamically adapts its temporal encoding at test time based on the HOI sequence to reduce ambiguity between similar actions.
- The method first builds a video HOI encoder to extract, select, and integrate the most representative HOI signals over the full video.
- It then uses a two-branch HyperNetwork to generate/adjust temporal encoder parameters on the fly from HOI information, enabling per-video adaptation.
- Experiments on Breakfast and 50Salads show that the approach improves performance across multiple evaluation metrics.
Related Articles
We Built a DNS-Based Discovery Protocol for AI Agents — Here's How It Works
Dev.to
Building AI Evaluation Pipelines: Automating LLM Testing from Dataset to CI/CD
Dev.to

Function Calling Harness 2: CoT Compliance from 9.91% to 100%
Dev.to
What Anthropic's April 23 Postmortem Reveals About Your Agent Harness
Dev.to
Fine-tuning YOLOv11 to detect stamps and signatures on banking documents - a practical walkthrough
Dev.to