Modeling Matches as Language: A Generative Transformer Approach for Counterfactual Player Valuation in Football
arXiv cs.AI / 3/17/2026
📰 NewsIdeas & Deep AnalysisModels & Research
Key Points
- ScoutGPT is a generative Transformer model that treats football match events as sequential tokens within a language-modeling framework to enable counterfactual evaluation of hypothetical lineups.
- It uses a NanoGPT-based architecture trained on next-token prediction to learn the dynamics of match-event sequences and forecast future events.
- The model employs Monte Carlo sampling to perform counterfactual simulations, allowing assessment of unobserved scenarios such as alternative player transfers.
- Experiments on K League data show that simulated transfers lead to measurable changes in offensive progression and goal probabilities, indicating ScoutGPT captures player-specific impact beyond static metrics and can outperform existing baselines.
Related Articles

Interactive Web Visualization of GPT-2
Reddit r/artificial
Stop Treating AI Interview Fraud Like a Proctoring Problem
Dev.to
[R] Causal self-attention as a probabilistic model over embeddings
Reddit r/MachineLearning
The 5 software development trends that actually matter in 2026 (and what they mean for your startup)
Dev.to
InVideo AI Review: Fast Finished
Dev.to