Permutation-Symmetrized Diffusion for Unconditional Molecular Generation

arXiv cs.LG / 3/25/2026

📰 NewsSignals & Early TrendsIdeas & Deep AnalysisModels & Research

Key Points

  • The paper proposes a diffusion model for unconditional 3D molecular generation that is defined directly on the permutation quotient manifold x tilde{X}=R^{dN}/S_N, rather than enforcing permutation symmetry indirectly through permutation-equivariant networks on ordered inputs.
  • It derives an explicit form of the heat kernel on the quotient as a sum of Euclidean heat kernels over all atom permutations, providing insight into how “quotient diffusion” differs from standard ordered-particle diffusion.
  • Training is formulated around a permutation-symmetrized score that involves an intractable sum over S_N, which the authors rewrite as an expectation over a posterior on permutations and approximate via MCMC in permutation space.
  • Experiments on QM9 under the EQGAT-Diff protocol (with a SemlaFlow-style continuous backbone) show that the quotient-based permutation symmetrization is practical and achieves competitive molecule generation quality with improved efficiency.

Abstract

Permutation invariance is fundamental in molecular point-cloud generation, yet most diffusion models enforce it indirectly via permutation-equivariant networks on an ordered space. We propose to model diffusion directly on the quotient manifold \tilde{\calX}=\sR^{d\times N}/S_N, where all atom permutations are identified. We show that the heat kernel on \tilde{\calX} admits an explicit expression as a sum of Euclidean heat kernels over permutations, which clarifies how diffusion on the quotient differs from ordered-particle diffusion. Training requires a permutation-symmetrized score involving an intractable sum over S_N; we derive an expectation form over a posterior on permutations and approximate it using MCMC in permutation space. We evaluate on unconditional 3D molecule generation on QM9 under the EQGAT-Diff protocol, using SemlaFlow-style backbone and treating all variables continuously. The results demonstrate that quotient-based permutation symmetrization is practical and yields competitive generation quality with improved efficiency.