DatedGPT: Preventing Lookahead Bias in Large Language Models with Time-Aware Pretraining

arXiv cs.CL / 3/13/2026


Key Points

DatedGPT introduces twelve 1.3B-parameter language models trained from scratch on temporally partitioned data with strict annual cutoffs from 2013 to 2024 to prevent lookahead bias in financial backtesting.
Each model is instruction fine-tuned on general-domain and finance-specific datasets curated to respect the same temporal cutoff as its pretraining data.
Perplexity-based probing confirms that each model's knowledge is effectively bounded by its cutoff year, reducing leakage of future information.
Evaluation on standard benchmarks shows competitive performance with existing models of similar scale despite the time-aware training.
An interactive web demo allows users to query and compare responses from models across different cutoff years, illustrating practical time-aware forecasting workflows.
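
The temporal partitioning described in the points above can be pictured with a small sketch. This is only an illustration, not the authors' pipeline: the summary does not say how documents are dated or whether each model's corpus is cumulative, so the cumulative, end-of-year cutoff rule, the `partition_by_cutoff` helper, and the `date` field are all assumptions made here.

```python
from datetime import date

def partition_by_cutoff(docs, cutoff_years):
    """Map each cutoff year to the documents dated on or before Dec 31 of that year.

    Assumption: partitions are cumulative (the 2019 corpus also contains 2013 data).
    """
    return {
        year: [d for d in docs if d["date"] <= date(year, 12, 31)]
        for year in cutoff_years
    }

# Hypothetical toy corpus; real pretraining data would need reliable timestamps.
docs = [
    {"text": "a", "date": date(2013, 5, 1)},
    {"text": "b", "date": date(2019, 1, 15)},
    {"text": "c", "date": date(2024, 12, 31)},
]

# Twelve annual cutoffs, 2013 through 2024, matching the twelve models.
splits = partition_by_cutoff(docs, range(2013, 2025))
```

Under this rule, a backtest for a given year would only query the model whose cutoff precedes the period being forecast, so no document from the forecast window can have been seen in training.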

Abstract

In financial backtesting, large language models pretrained on internet-scale data risk introducing lookahead bias that undermines their forecasting validity, as they may have already seen the true outcome during training. To address this, we present DatedGPT, a family of twelve 1.3B-parameter language models, each trained from scratch on approximately 100 billion tokens of temporally partitioned data with strict annual cutoffs spanning 2013 to 2024. We further enhance each model with instruction fine-tuning on both general-domain and finance-specific datasets curated to respect the same temporal boundaries. Perplexity-based probing confirms that each model's knowledge is effectively bounded by its data cutoff year, while evaluation on standard benchmarks shows competitive performance with existing models of similar scale. We provide an interactive web demo that allows users to query and compare responses from models across different cutoff years.
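
As a rough illustration of what perplexity-based probing measures, the sketch below computes perplexity from per-token log-probabilities and compares text dated before and after a cutoff. The abstract does not describe the probing protocol, so the `post_cutoff_gap` helper and its year-keyed input are hypothetical; only the perplexity formula itself (the exponential of the negative mean log-probability per token) is standard.

```python
import math

def perplexity(token_logprobs):
    """Perplexity = exp of the negative mean natural-log probability per token."""
    if not token_logprobs:
        raise ValueError("need at least one token log-probability")
    return math.exp(-sum(token_logprobs) / len(token_logprobs))

def post_cutoff_gap(ppl_by_year, cutoff_year):
    """Mean perplexity on text dated after the cutoff minus mean on text up to it.

    Hypothetical helper: a clearly positive gap would be consistent with the
    model not having seen post-cutoff information.
    """
    before = [p for y, p in ppl_by_year.items() if y <= cutoff_year]
    after = [p for y, p in ppl_by_year.items() if y > cutoff_year]
    if not before or not after:
        raise ValueError("need text from both sides of the cutoff")
    return sum(after) / len(after) - sum(before) / len(before)

# A model that assigns probability 0.5 to every token has perplexity of about 2.
uniform_half = perplexity([math.log(0.5)] * 4)
```

In practice the log-probabilities would come from scoring year-stamped text with each of the twelve models; the helper takes precomputed perplexities so the sketch stays independent of any particular model library.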

