Beyond Structure: Revolutionising Materials Discovery via AI-Driven Synthesis Protocol-Property Relationships

arXiv cs.AI / 5/4/2026

💬 OpinionDeveloper Stack & InfrastructureSignals & Early TrendsIdeas & Deep AnalysisModels & Research

Key Points

  • The paper argues that today’s structure-first AI materials discovery is hindered by a “synthesizability gap,” where predicted atomic structures cannot be reliably made in practice.
  • It proposes a synthesis-first paradigm that treats executable synthesis protocols (reaction pathways and recipes) as the primary design variables rather than only configurations.
  • The roadmap includes making synthesis procedures machine-readable, using generative and inverse-design models to generate actionable synthesis routes, and applying closed-loop optimization to align with experimental and sustainability constraints.
  • The work frames the approach via a causal chain (protocol P → structure X → properties y) and discusses methodological building blocks, standards, and strategies for integrating with self-driving laboratories (SDLs) to improve reproducibility.
  • Overall, the article is presented as a research direction aimed at accelerating “data-first” and reproducible materials discovery by incorporating real-world synthesis feasibility into the AI loop.

Abstract

The current structure-centric paradigm in artificial intelligence (AI)-driven materials discovery, despite delivering thousands of candidate structures, is stalling at a critical barrier: the synthesizability gap. We argue that closing this gap demands a pivot to a synthesis-first paradigm in which executable synthesis protocols, not just atomic configurations, are treated as primary design variables. We outline a roadmap built on three pillars: (i) representing synthesis procedures as machine-readable protocols, (ii) deploying generative and inverse-design models to propose actionable reaction pathways and recipes, and (iii) integrating closed-loop optimisation to refine protocols against experimental realities and sustainability constraints. Framed in terms of the causal backbone P->X->y from protocol P to structure X and properties y, this perspective sets out methodological building blocks, standards needs and self-driving laboratory (SDL) integration strategies to accelerate reproducible, data-first materials discovery.