Crystalite: A Lightweight Transformer for Efficient Crystal Modeling

arXiv cs.LG / 4/3/2026

📰 NewsSignals & Early TrendsIdeas & Deep AnalysisModels & Research

Key Points

  • Crystalite is introduced as a lightweight diffusion Transformer designed to model crystalline materials more efficiently than costly equivariant graph neural networks.
  • The model’s Subatomic Tokenization uses a compact chemically structured atom representation to better support continuous diffusion than traditional high-dimensional one-hot encodings.
  • Its Geometry Enhancement Module (GEM) injects periodic minimum-image pair geometry directly into attention via additive geometric biases to align the Transformer with crystallographic structure.
  • Experiments report state-of-the-art performance on crystal structure prediction benchmarks and strong de novo generation, including the best S.U.N. discovery score among compared baselines, with substantially faster sampling than geometry-heavy alternatives.

Abstract

Generative models for crystalline materials often rely on equivariant graph neural networks, which capture geometric structure well but are costly to train and slow to sample. We present Crystalite, a lightweight diffusion Transformer for crystal modeling built around two simple inductive biases. The first is Subatomic Tokenization, a compact chemically structured atom representation that replaces high-dimensional one-hot encodings and is better suited to continuous diffusion. The second is the Geometry Enhancement Module (GEM), which injects periodic minimum-image pair geometry directly into attention through additive geometric biases. Together, these components preserve the simplicity and efficiency of a standard Transformer while making it better matched to the structure of crystalline materials. Crystalite achieves state-of-the-art results on crystal structure prediction benchmarks, and de novo generation performance, attaining the best S.U.N. discovery score among the evaluated baselines while sampling substantially faster than geometry-heavy alternatives.