AI Navigate

InterventionLens: A Multi-Agent Framework for Detecting ASD Intervention Strategies in Parent-Child Shared Reading

arXiv cs.AI / 3/17/2026

📰 NewsIdeas & Deep AnalysisModels & Research

Key Points

  • InterventionLens is an end-to-end multi-agent system that automatically detects and temporally segments caregiver intervention strategies from home-based ASD shared reading videos.
  • On the ASD-HI dataset, InterventionLens achieves an overall F1 score of 79.44%, outperforming the baseline by 19.72%.
  • The approach integrates multimodal interaction content via a collaborative multi-agent architecture and does not require task-specific model training or fine-tuning.
  • The project notes that additional resources will be released on the project page, suggesting potential for broader use in analyzing caregiver strategies.

Abstract

Home-based interventions like parent-child shared reading provide a cost-effective approach for supporting children with autism spectrum disorder (ASD). However, analyzing caregiver intervention strategies in naturalistic home interactions typically relies on expert annotation, which is costly, time-intensive, and difficult to scale. To address this challenge, we propose InterventionLens, an end-to-end multi-agent system for automatically detecting and temporally segmenting caregiver intervention strategies from shared reading videos. Without task-specific model training or fine-tuning, InterventionLens uses a collaborative multi-agent architecture to integrate multimodal interaction content and perform fine-grained strategy analysis. Experiments on the ASD-HI dataset show that InterventionLens achieves an overall F1 score of 79.44\%, outperforming the baseline by 19.72\%. These results suggest that InterventionLens is a promising system for analyzing caregiver intervention strategies in home-based ASD shared reading settings. Additional resources will be released on the project page.