FRIGID: Scaling Diffusion-Based Molecular Generation from Mass Spectra at Training and Inference Time
arXiv cs.LG / 4/21/2026
📰 NewsDeveloper Stack & InfrastructureSignals & Early TrendsModels & Research
Key Points
- The paper introduces FRIGID, a diffusion-based molecular generation framework that produces molecular structures from mass spectra using intermediate fingerprint representations and known chemical formulas.
- FRIGID is trained on large-scale data comprising hundreds of millions of unlabeled molecular structures, leveraging a novel diffusion language model design.
- The authors propose an inference-time scaling method using forward fragmentation models to detect spectrum-inconsistent fragments and then refine them via targeted remasking and denoising.
- Reported results show FRIGID achieves over 18% Top-1 accuracy on the MassSpecGym benchmark and triples Top-1 accuracy on NPLIB1 compared with leading methods, with performance scaling approximately log-linearly as inference-time compute increases.
- The authors release the FRIGID code publicly, supporting reproducibility and further research into compute-scaled de novo structural elucidation.
Related Articles

Capsule Security Emerges From Stealth With $7 Million in Funding
Dev.to

Rethinking Coding Education for the AI Era
Dev.to

Agent Package Manager (APM): A DevOps Guide to Reproducible AI Agents
Dev.to

3 Things I Learned Benchmarking Claude, GPT-4o, and Gemini on Real Dev Work
Dev.to

Dify Now Supports IRIS as a Vector Store — Setup Guide
Dev.to