ART: Adaptive Relational Transformer for Pedestrian Trajectory Prediction with Temporal-Aware Relations

arXiv cs.CV / 4/7/2026

📰 NewsSignals & Early TrendsIdeas & Deep AnalysisModels & Research

Key Points

  • The paper introduces an Adaptive Relational Transformer (ART) aimed at improving real-world pedestrian trajectory prediction for robotics and related applications.
  • ART uses a Temporal-Aware Relation Graph (TARG) to explicitly model how pairwise human interactions change over time.
  • It adds an Adaptive Interaction Pruning (AIP) mechanism to remove redundant interactions, reducing computational overhead compared with prior graph- or transformer-based approaches.
  • Experiments on ETH/UCY and NBA benchmarks report state-of-the-art accuracy while maintaining high computational efficiency.

Abstract

Accurate prediction of real-world pedestrian trajectories is crucial for a wide range of robot-related applications. Recent approaches typically adopt graph-based or transformer-based frameworks to model interactions. Despite their effectiveness, these methods either introduce unnecessary computational overhead or struggle to represent the diverse and time-varying characteristics of human interactions. In this work, we present an Adaptive Relational Transformer (ART), which introduces a Temporal-Aware Relation Graph (TARG) to explicitly capture the evolution of pairwise interactions and an Adaptive Interaction Pruning (AIP) mechanism to reduce redundant computations efficiently. Extensive evaluations on ETH/UCY and NBA benchmarks show that ART delivers state-of-the-art accuracy with high computational efficiency.