Seeing Eye to Eye: Enabling Cognitive Alignment Through Shared First-Person Perspective in Human-AI Collaboration
arXiv cs.AI / 3/16/2026
Key Points
- Eye2Eye proposes using first-person perspective as a channel to align human and AI cognition in multimodal collaboration, addressing the communication and understanding gulfs.
- It decomposes the approach into joint attention coordination, revisable memory, and reflective feedback to maintain evolving common ground and clarify AI understanding.
- The authors implement an AR prototype and evaluate it through a user study and a post-hoc pipeline evaluation, finding reduced task completion time and lower interaction load.
- The results also indicate increased trust in the AI, suggesting that shared-perspective interaction can improve collaboration efficiency and user confidence.
- This work outlines a design path for future AI interfaces that leverage first-person perspective to facilitate natural human-AI collaboration.
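
To make the three-part decomposition in the key points more concrete, here is a minimal Python sketch of how joint attention, revisable memory, and reflective feedback could fit together in a single shared state object. It is an illustration only and is not taken from the Eye2Eye prototype: the names `CommonGround`, `MemoryEntry`, `attend`, `remember`, `revise`, and `reflect` are all hypothetical, and the real system works on first-person AR input rather than plain strings.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional


@dataclass
class MemoryEntry:
    """One remembered fact (hypothetical structure, not from the paper)."""
    text: str
    revisions: List[str] = field(default_factory=list)  # earlier versions of `text`


@dataclass
class CommonGround:
    """Toy shared state covering the three components named in the key points."""
    focus: Optional[str] = None                                   # joint attention target
    memory: Dict[str, MemoryEntry] = field(default_factory=dict)  # revisable memory

    def attend(self, target: str) -> None:
        # Joint attention: record what both parties are currently looking at.
        self.focus = target

    def remember(self, key: str, text: str) -> None:
        # Store a new fact under a key, replacing any existing entry.
        self.memory[key] = MemoryEntry(text)

    def revise(self, key: str, corrected: str) -> None:
        # Revisable memory: keep the old value in the history, then overwrite it.
        entry = self.memory[key]
        entry.revisions.append(entry.text)
        entry.text = corrected

    def reflect(self) -> str:
        # Reflective feedback: state the AI's current understanding in words
        # so the user can spot and correct mistakes.
        facts = "; ".join(f"{k}: {e.text}" for k, e in self.memory.items())
        return (
            f"Looking at {self.focus or 'nothing'}. "
            f"I currently believe: {facts or 'nothing yet'}."
        )


if __name__ == "__main__":
    cg = CommonGround()
    cg.attend("red valve")
    cg.remember("valve_state", "open")
    cg.revise("valve_state", "closed")  # the user corrects the AI
    print(cg.reflect())
```

In this sketch the user's correction overwrites the stored fact while the earlier value stays in `revisions`, so the summary returned by `reflect` always shows the latest shared understanding.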




