Decoding Defensive Coverage Responsibilities in American Football Using Factorized Attention Based Transformer Models

arXiv cs.AI / 3/30/2026

💬 OpinionIdeas & Deep AnalysisModels & Research

Key Points

  • The paper introduces a factorized attention-based transformer model to predict NFL defensive coverage assignments, receiver–defender matchups, and the targeted defender on each passing play.
  • It improves over team-level post-hoc classification by producing frame-by-frame predictions that track how individual responsibilities evolve from pre-snap to pass arrival.
  • The model uses a factorized attention mechanism that separately models temporal movement patterns and inter-player (agent-to-agent) relationships, improving interpretability of coverage dynamics.
  • Trained on randomly truncated trajectory data, the approach reports roughly 89%+ accuracy across tasks, with the authors noting potential for higher true accuracy due to ambiguous labeling.
  • The outputs support new derivative metrics—such as disguise rate and double coverage rate—intended for both richer TV broadcast storytelling and actionable strategy/player evaluation insights.

Abstract

Defensive coverage schemes in the National Football League (NFL) represent complex tactical patterns requiring coordinated assignments among defenders who must react dynamically to the offense's passing concept. This paper presents a factorized attention-based transformer model applied to NFL multi-agent play tracking data to predict individual coverage assignments, receiver-defender matchups, and the targeted defender on every pass play. Unlike previous approaches that focus on post-hoc coverage classification at the team level, our model enables predictive modeling of individual player assignments and matchup dynamics throughout the play. The factorized attention mechanism separates temporal and agent dimensions, allowing independent modeling of player movement patterns and inter-player relationships. Trained on randomly truncated trajectories, the model generates frame-by-frame predictions that capture how defensive responsibilities evolve from pre-snap through pass arrival. Our models achieve approximately 89\%+ accuracy for all tasks, with true accuracy potentially higher given annotation ambiguity in the ground truth labels. These outputs also enable novel derivative metrics, including disguise rate and double coverage rate, which enable enhanced storytelling in TV broadcasts as well as provide actionable insights for team strategy development and player evaluation.