Cold-Start Forecasting of New Product Life-Cycles via Conditional Diffusion Models

arXiv cs.LG / 4/23/2026

📰 NewsIdeas & Deep AnalysisModels & Research

Key Points

  • The paper tackles the cold-start problem of forecasting a newly launched product’s life-cycle trajectory when historical demand data is missing or limited in the pre- and early post-launch phases.
  • It introduces the Conditional Diffusion Life-cycle Forecaster (CDLF), a conditional generative model that fuses static product descriptors, reference trajectories from similar products, and newly observed signals as they arrive.
  • CDLF is designed to update forecasts adaptively over time without retraining, producing flexible multi-modal predictive distributions even under extreme data scarcity.
  • The authors report that CDLF achieves more accurate point forecasts and better probabilistic forecasts than classical diffusion models, Bayesian updating methods, and other strong machine-learning baselines.
  • Experiments include Intel microprocessor SKU life cycles and scenarios involving platform-mediated adoption of open large language model repositories, demonstrating the approach across distinct domains.

Abstract

Forecasting the life-cycle trajectory of a newly launched product is important for launch planning, resource allocation, and early risk assessment. This task is especially difficult in the pre-launch and early post-launch phases, when product-specific outcome history is limited or unavailable, creating a cold-start problem. In these phases, firms must make decisions before demand patterns become reliably observable, while early signals are often sparse, noisy, and unstable We propose the Conditional Diffusion Life-cycle Forecaster (CDLF), a conditional generative framework for forecasting new-product life-cycle trajectories under cold start. CDLF combines three sources of information: static descriptors, reference trajectories from similar products, and newly arriving observations when available. Here, static descriptors refer to structured pre-launch characteristics of the product, such as category, price tier, brand or organization identity, scale, and access conditions. This structure allows the model to condition forecasts on relevant product context and to update them adaptively over time without retraining, yielding flexible multi-modal predictive distributions under extreme data scarcity. The method satisfies consistency with a horizon-uniform distributional error bound for recursive generation. Across studies on Intel microprocessor stock keeping unit (SKU) life cycles and the platform-mediated adoption of open large language model repositories, CDLF delivers more accurate point forecasts and higher-quality probabilistic forecasts than classical diffusion models, Bayesian updating approaches, and other state-of-the-art machine-learning baselines.