AQVolt26: High-Temperature r$^2$SCAN Halide Dataset for Universal ML Potentials and Solid-State Batteries

arXiv cs.LG / 2026/4/6

💬 オピニオンSignals & Early TrendsIdeas & Deep AnalysisModels & Research

要点

  • The paper introduces AQVolt26, a large dataset containing 322,656 r$^2$SCAN single-point calculations for lithium halides generated via high-temperature configurational sampling over ~5K structures to better study ion-transport conditions in solid-state batteries.
  • It finds that foundational datasets and universal ML interatomic potentials can transfer local forces well for stable halide chemistries, but absolute energy predictions degrade under highly distorted, elevated-temperature regimes relevant to ion transport.
  • Co-training with the new AQVolt26 dataset addresses the identified high-temperature “blind spot,” improving model reliability in dynamic, soft halide environments.
  • Adding Materials Project relaxation data improves near-equilibrium performance but can worsen robustness under extreme strain, and it does not necessarily improve force accuracy at high temperatures.
  • The authors conclude that domain-specific high-temperature configurational sampling is essential for trustworthy dynamic screening, and that near-equilibrium relaxation data is best used as a complementary, task-specific augmentation rather than a universally helpful one.

Abstract

The demand for safe, high-energy-density batteries has spotlighted halide solid-state electrolytes, which offer the potential for enhanced ionic mobility, electrochemical stability, and interfacial deformability. Accelerating their discovery requires extensive molecular dynamics, which has been increasingly enabled by universal machine learning interatomic potentials trained on foundational datasets. However, the dynamic softness of halides poses a stringent test of whether general-purpose models can reliably replace first-principles calculations under the highly distorted, elevated-temperature regimes necessary to probe ion transport. Here, we present AQVolt26, a dataset of 322,656 r^2SCAN single-point calculations for lithium halides, generated via high-temperature configurational sampling across \sim5K structures. We demonstrate that foundational datasets provide a strong baseline for stable halide chemistries and transfer local forces well, however absolute energy predictions degrade in distorted higher-temperature regimes. Co-training with AQVolt26 resolves this blind spot. Furthermore, incorporating Materials Project relaxation data improves near-equilibrium performance but degrades extreme-strain robustness without enhancing high-temperature force accuracy. These results demonstrate that domain-specific configurational sampling is essential for the reliable dynamic screening of halide electrolytes. Furthermore, our findings suggest that while foundational models provide a robust base, they are most effective for dynamically soft solid-state chemistries when augmented with targeted, high-temperature data. Finally, we show that near-equilibrium relaxation data serves as a task-specific complement rather than a universally beneficial addition.