Budget-Constrained Causal Bandits: Bridging Uplift Modeling and Sequential Decision-Making
arXiv cs.LG / 4/30/2026
📰 NewsIdeas & Deep AnalysisModels & Research
Key Points
- The paper tackles budget-constrained treatment allocation in digital advertising, where advertisers must decide ad exposure under limited budgets while accounting for heterogeneous treatment effects.
- It proposes Budget-Constrained Causal Bandits (BCCB), an online sequential framework that jointly learns ad effectiveness per user, explores uncertain responders, and paces spending over time.
- Unlike a common two-stage offline pipeline (HTE estimation followed by constrained optimization), BCCB is designed to work in cold-start scenarios with little or no historical data.
- Experiments on the Criteo Uplift dataset (from a real randomized controlled trial) show a data-efficiency crossover: offline methods need around 10,000 historical observations for reliable performance, while BCCB works effectively from the first user.
- BCCB also yields 3–5x lower variance across runs and outperforms baseline online approaches (including standard and budgeted Thompson Sampling) as well as greedy uplift estimation across tested budget levels.
💡 Insights using this article
This article is featured in our daily AI news digest — key takeaways and action items at a glance.
Related Articles

Can AI Predict Pollution Before It Happens? The Smart Solution to an Old Problem
Dev.to
THE FIFTH TRANSMISSION: THE GRADIENT IS THE GOVERNMENT
Reddit r/artificial
Looking for feedback on OpenVidya: an open-source AI classroom layer for NCERT/CBSE [R]
Reddit r/MachineLearning

RAG Series (1): Why LLMs Need External Memory
Dev.to

One Open Source Project a Day (No. 54): Warp - The AI-Native Rust Terminal
Dev.to