CountsDiff: A Diffusion Model on the Natural Numbers for Generation and Imputation of Count-Based Data
arXiv cs.LG / 4/7/2026
💬 OpinionSignals & Early TrendsIdeas & Deep AnalysisModels & Research
Key Points
- The paper introduces CountsDiff, a diffusion-model framework that natively generates and imputes data defined over the natural numbers (count-based/ordinal discrete distributions).
- CountsDiff builds on the Blackout diffusion framework by reformulating it with a survival-probability schedule and explicit loss weighting, yielding design parameters with clear analogues to existing diffusion approaches.
- The method incorporates modern diffusion features that were previously less explored in count domains, including continuous-time training, classifier-free guidance, and churn/remasking reverse dynamics that permit non-monotone reverse trajectories.
- Experiments show that an initial instantiation of CountsDiff performs competitively on natural image datasets (CIFAR-10, CelebA) and demonstrates strong results on single-cell RNA-seq count imputation tasks (fetal cell and heart cell atlases).
- The authors report that the simple version can match or exceed state-of-the-art discrete generative modeling and leading RNA-seq imputation methods, suggesting further performance gains are likely with better parameter optimization.
Related Articles

Black Hat Asia
AI Business

OpenAI's pricing is about to change — here's why local AI matters more than ever
Dev.to

Google AI Tells Users to Put Glue on Their Pizza!
Dev.to

Big Tech firms are accelerating AI investments and integration, while regulators and companies focus on safety and responsible adoption.
Dev.to

Could it be that this take is not too far fetched?
Reddit r/LocalLLaMA