AQCat25: Unlocking spin-aware, high-fidelity machine learning potentials for heterogeneous catalysis

Allam, Omar; Wander, Brook; Singh, Aayush R.

Abstract: Large-scale datasets have enabled highly accurate machine learning interatomic potentials (MLIPs) for general-purpose heterogeneous catalysis modeling. However, gaps in the underlying training data limit what these potentials can treat. To extend these capabilities, we introduce AQCat25, a complementary dataset of 13.5 million density functional theory (DFT) single point calculations designed to improve the treatment of systems where spin polarization and/or higher fidelity are critical. We also investigate methodologies for integrating new datasets, such as AQCat25, with the broader Open Catalyst 2020 (OC20) dataset to create spin-aware models without sacrificing generalizability. We find that directly tuning a general model on AQCat25 leads to catastrophic forgetting of the original dataset's knowledge. Conversely, joint training strategies prove effective for improving accuracy on the new data without sacrificing general performance. This joint approach introduces a challenge, as the model must learn from a dataset that mixes both fidelity levels and physics (spin-polarized vs. unpolarized calculations). We show that explicitly conditioning the model on this system-specific metadata, for example by using Feature-wise Linear Modulation (FiLM), successfully addresses this challenge and further enhances model accuracy. Ultimately, our work establishes an effective protocol for bridging DFT fidelity domains to advance the predictive power of foundational models in catalysis.
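To illustrate the metadata conditioning mentioned in the abstract, here is a minimal NumPy sketch of the generic FiLM operation: a per-channel scale and shift computed from a conditioning vector. It is not the authors' architecture. The two-flag metadata encoding (`[is_spin_polarized, is_high_fidelity]`), the array shapes, the `film` helper, and the random weights are all assumptions made for illustration.

```python
import numpy as np

def film(features, condition, w_gamma, b_gamma, w_beta, b_beta):
    """Generic FiLM: out = gamma(c) * h + beta(c), applied per feature channel.

    features:  (n_systems, n_atoms, n_channels) per-atom features
    condition: (n_systems, n_meta) per-system metadata vector
    """
    gamma = condition @ w_gamma + b_gamma  # (n_systems, n_channels) scale
    beta = condition @ w_beta + b_beta     # (n_systems, n_channels) shift
    # Broadcast the per-system scale and shift over the atom axis.
    return gamma[:, None, :] * features + beta[:, None, :]

rng = np.random.default_rng(0)
n_systems, n_atoms, n_channels, n_meta = 2, 3, 4, 2

features = rng.normal(size=(n_systems, n_atoms, n_channels))

# Hypothetical metadata encoding: [is_spin_polarized, is_high_fidelity].
condition = np.array([[0.0, 0.0],   # unpolarized, baseline fidelity
                      [1.0, 1.0]])  # spin-polarized, higher fidelity

# Illustrative weights; in a real model these would be learned.
w_gamma = 0.1 * rng.normal(size=(n_meta, n_channels))
b_gamma = np.ones(n_channels)   # scale defaults to 1 (identity)
w_beta = 0.1 * rng.normal(size=(n_meta, n_channels))
b_beta = np.zeros(n_channels)   # shift defaults to 0 (identity)

modulated = film(features, condition, w_gamma, b_gamma, w_beta, b_beta)
```

With these initial biases, a system whose metadata flags are all zero passes through unchanged, while a system with non-zero flags gets its own channel-wise scale and shift. This is one simple way for a single model to treat mixed-fidelity, mixed-physics data differently per system.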

Comments:	32 pages, 17 figures
Subjects:	Materials Science (cond-mat.mtrl-sci); Machine Learning (cs.LG)
Cite as:	arXiv:2510.22938 [cond-mat.mtrl-sci]
	(or arXiv:2510.22938v1 [cond-mat.mtrl-sci] for this version)
	https://doi.org/10.48550/arXiv.2510.22938
