-
Autonomous Small-Angle Scattering for Accelerated Soft Material Formulation Optimization
Authors:
Tyler B. Martin,
Duncan R. Sutherland,
Austin McDannald,
A. Gilad Kusne,
Peter A. Beaucage
Abstract:
The pace of soft material formulation (re)development and design is rapidly increasing as both consumers and new legislation demand products that do less harm to the environment while maintaining high standards of performance. To meet this need, we have developed the Autonomous Formulation Lab (AFL), a platform that can automatically prepare and measure the microstructure of liquid formulations us…
▽ More
The pace of soft material formulation (re)development and design is rapidly increasing as both consumers and new legislation demand products that do less harm to the environment while maintaining high standards of performance. To meet this need, we have developed the Autonomous Formulation Lab (AFL), a platform that can automatically prepare and measure the microstructure of liquid formulations using small-angle neutron and X-ray scattering and, soon, a variety of other techniques. Here, we describe the design, philosophy, tuning, and validation of our active learning agent that guides the course of AFL experiments. We show how our extensive in silico tuning results in an efficient agent that is robust to both the number of measurements and signal to noise variation. Finally, we experimentally validate our virtually tuned agent by addressing a model formulation problem: replacing a petroleum-derived component with a natural analog. We show that the agent efficiently maps both formulations and how post hoc analysis of the measured data reveals the opportunity for further specialization of the agent. With the tuned and proven active learning agent, our autonomously guided AFL platform will accelerate the pace of discovery of liquid formulations and help speed us towards a greener future.
△ Less
Submitted 14 March, 2025;
originally announced March 2025.
-
Real-time experiment-theory closed-loop interaction for autonomous materials science
Authors:
Haotong Liang,
Chuangye Wang,
Heshan Yu,
Dylan Kirsch,
Rohit Pant,
Austin McDannald,
A. Gilad Kusne,
Ji-Cheng Zhao,
Ichiro Takeuchi
Abstract:
Iterative cycles of theoretical prediction and experimental validation are the cornerstone of the modern scientific method. However, the proverbial "closing of the loop" in experiment-theory cycles in practice are usually ad hoc, often inherently difficult, or impractical to repeat on a systematic basis, beset by the scale or the time constraint of computation or the phenomena under study. Here, w…
▽ More
Iterative cycles of theoretical prediction and experimental validation are the cornerstone of the modern scientific method. However, the proverbial "closing of the loop" in experiment-theory cycles in practice are usually ad hoc, often inherently difficult, or impractical to repeat on a systematic basis, beset by the scale or the time constraint of computation or the phenomena under study. Here, we demonstrate Autonomous MAterials Search Engine (AMASE), where we enlist robot science to perform self-driving continuous cyclical interaction of experiments and computational predictions for materials exploration. In particular, we have applied the AMASE formalism to the rapid mapping of a temperature-composition phase diagram, a fundamental task for the search and discovery of new materials. Thermal processing and experimental determination of compositional phase boundaries in thin films are autonomously interspersed with real-time updating of the phase diagram prediction through the minimization of Gibbs free energies. AMASE was able to accurately determine the eutectic phase diagram of the Sn-Bi binary thin-film system on the fly from a self-guided campaign covering just a small fraction of the entire composition - temperature phase space, translating to a 6-fold reduction in the number of necessary experiments. This study demonstrates for the first time the possibility of real-time, autonomous, and iterative interactions of experiments and theory carried out without any human intervention.
△ Less
Submitted 22 October, 2024;
originally announced October 2024.
-
Learning material synthesis-process-structure-property relationship by data fusion: Bayesian Coregionalization N-Dimensional Piecewise Function Learning
Authors:
A. Gilad Kusne,
Austin McDannald,
Brian DeCost
Abstract:
Autonomous materials research labs require the ability to combine and learn from diverse data streams. This is especially true for learning material synthesis-process-structure-property relationships, key to accelerating materials optimization and discovery as well as accelerating mechanistic understanding. We present the Synthesis-process-structure-property relAtionship coreGionalized lEarner (SA…
▽ More
Autonomous materials research labs require the ability to combine and learn from diverse data streams. This is especially true for learning material synthesis-process-structure-property relationships, key to accelerating materials optimization and discovery as well as accelerating mechanistic understanding. We present the Synthesis-process-structure-property relAtionship coreGionalized lEarner (SAGE) algorithm. A fully Bayesian algorithm that uses multimodal coregionalization to merge knowledge across data sources to learn synthesis-process-structure-property relationships. SAGE outputs a probabilistic posterior for the relationships including the most likely relationships given the data.
△ Less
Submitted 20 August, 2024; v1 submitted 10 November, 2023;
originally announced November 2023.
-
Human-In-the-Loop for Bayesian Autonomous Materials Phase Mapping
Authors:
Felix Adams,
Austin McDannald,
Ichiro Takeuchi,
A. Gilad Kusne
Abstract:
Autonomous experimentation (AE) combines machine learning and research hardware automation in a closed loop, guiding subsequent experiments toward user goals. As applied to materials research, AE can accelerate materials exploration, reducing time and cost compared to traditional Edisonian studies. Additionally, integrating knowledge from diverse sources including theory, simulations, literature,…
▽ More
Autonomous experimentation (AE) combines machine learning and research hardware automation in a closed loop, guiding subsequent experiments toward user goals. As applied to materials research, AE can accelerate materials exploration, reducing time and cost compared to traditional Edisonian studies. Additionally, integrating knowledge from diverse sources including theory, simulations, literature, and domain experts can boost AE performance. Domain experts may provide unique knowledge addressing tasks that are difficult to automate. Here, we present a set of methods for integrating human input into an autonomous materials exploration campaign for composition-structure phase mapping. The methods are demonstrated on x-ray diffraction data collected from a thin film ternary combinatorial library. At any point during the campaign, the user can choose to provide input by indicating regions-of-interest, likely phase regions, and likely phase boundaries based on their prior knowledge (e.g., knowledge of the phase map of a similar material system), along with quantifying their certainty. The human input is integrated by defining a set of probabilistic priors over the phase map. Algorithm output is a probabilistic distribution over potential phase maps, given the data, model, and human input. We demonstrate a significant improvement in phase mapping performance given appropriate human input.
△ Less
Submitted 17 June, 2023;
originally announced June 2023.
-
Scalable Multi-Agent Lab Framework for Lab Optimization
Authors:
A. Gilad Kusne,
Austin McDannald
Abstract:
Autonomous materials research systems allow scientists to fail smarter, learn faster, and spend less resources in their studies. As these systems grow in number, capability, and complexity, a new challenge arises - how will they work together across large facilities? We explore one solution to this question - a multi-agent laboratory control frame-work. We demonstrate this framework with an autono…
▽ More
Autonomous materials research systems allow scientists to fail smarter, learn faster, and spend less resources in their studies. As these systems grow in number, capability, and complexity, a new challenge arises - how will they work together across large facilities? We explore one solution to this question - a multi-agent laboratory control frame-work. We demonstrate this framework with an autonomous material science lab in mind - where information from diverse research campaigns can be combined to ad-dress the scientific question at hand. This framework can 1) account for realistic resource limits such as equipment use, 2) allow for machine learning agents with diverse learning capabilities and goals capable of running re-search campaigns, and 3) facilitate multi-agent collaborations and teams. The framework is dubbed the MULTI-agent auTonomous fAcilities - a Scalable frameworK aka MULTITASK. MULTITASK makes possible facility-wide simulations, including agent-instrument and agent-agent interactions. Through MULTITASK's modularity, real-world facilities can come on-line in phases, with simulated instruments gradually replaced by real-world instruments. We hope MULTITASK opens new areas of study in large-scale autonomous and semi-autonomous research campaigns and facilities.
△ Less
Submitted 20 March, 2023; v1 submitted 18 August, 2022;
originally announced August 2022.
-
aflow++: a C++ framework for autonomous materials design
Authors:
C. Oses,
M. Esters,
D. Hicks,
S. Divilov,
H. Eckert,
R. Friedrich,
M. J. Mehl,
A. Smolyanyuk,
X. Campilongo,
A. van de Walle,
J Schroers,
A. G. Kusne,
I. Takeuchi,
E. Zurek,
M. Buongiorno Nardelli,
M. Fornari,
Y. Lederer,
O. Levy,
C. Toher,
S. Curtarolo
Abstract:
The realization of novel technological opportunities given by computational and autonomous materials design requires efficient and effective frameworks. For more than two decades, aflow++ (Automatic-Flow Framework for Materials Discovery) has provided an interconnected collection of algorithms and workflows to address this challenge. This article contains an overview of the software and some of it…
▽ More
The realization of novel technological opportunities given by computational and autonomous materials design requires efficient and effective frameworks. For more than two decades, aflow++ (Automatic-Flow Framework for Materials Discovery) has provided an interconnected collection of algorithms and workflows to address this challenge. This article contains an overview of the software and some of its most heavily-used functionalities, including algorithmic details, standards, and examples. Key thrusts are highlighted: the calculation of structural, electronic, thermodynamic, and thermomechanical properties in addition to the modeling of complex materials, such as high-entropy ceramics and bulk metallic glasses. The aflow++ software prioritizes interoperability, minimizing the number of independent parameters and tolerances. It ensures consistency of results across property sets - facilitating machine learning studies. The software also features various validation schemes, offering real-time quality assurance for data generated in a high-throughput fashion. Altogether, these considerations contribute to the development of large and reliable materials databases that can ultimately deliver future materials systems
△ Less
Submitted 5 August, 2022;
originally announced August 2022.
-
Reproducible Sorbent Materials Foundry for Carbon Capture at Scale
Authors:
Austin McDannald,
Howie Joress,
Brian DeCost,
Avery E. Baumann,
A. Gilad Kusne,
Kamal Choudhary,
Taner Yildirim,
Daniel W. Siderius,
Winnie Wong-Ng,
Andrew J. Allen,
Christopher M. Stafford,
Diana Ortiz-Montalvo
Abstract:
We envision an autonomous sorbent materials foundry (SMF) for rapidly evaluating materials for direct air capture of carbon dioxide (CO2), specifically targeting novel metal organic framework materials. Our proposed SMF is hierarchical, simultaneously addressing the most critical gaps in the inter-related space of sorbent material synthesis, processing, properties, and performance. The ability to…
▽ More
We envision an autonomous sorbent materials foundry (SMF) for rapidly evaluating materials for direct air capture of carbon dioxide (CO2), specifically targeting novel metal organic framework materials. Our proposed SMF is hierarchical, simultaneously addressing the most critical gaps in the inter-related space of sorbent material synthesis, processing, properties, and performance. The ability to collect these critical data streams in an agile, coordinated, and automated fashion will enable efficient end-to-end sorbent materials design through machine learning driven research framework.
△ Less
Submitted 25 July, 2022;
originally announced July 2022.
-
Benchmarking Active Learning Strategies for Materials Optimization and Discovery
Authors:
Alex Wang,
Haotong Liang,
Austin McDannald,
Ichiro Takeuchi,
A. Gilad Kusne
Abstract:
Autonomous physical science is revolutionizing materials science. In these systems, machine learning controls experiment design, execution, and analysis in a closed loop. Active learning, the machine learning field of optimal experiment design, selects each subsequent experiment to maximize knowledge toward the user goal. Autonomous system performance can be further improved with implementation of…
▽ More
Autonomous physical science is revolutionizing materials science. In these systems, machine learning controls experiment design, execution, and analysis in a closed loop. Active learning, the machine learning field of optimal experiment design, selects each subsequent experiment to maximize knowledge toward the user goal. Autonomous system performance can be further improved with implementation of scientific machine learning, also known as inductive bias-engineered artificial intelligence, which folds prior knowledge of physical laws (e.g., Gibbs phase rule) into the algorithm. As the number, diversity, and uses for active learning strategies grow, there is an associated growing necessity for real-world reference datasets to benchmark strategies. We present a reference dataset and demonstrate its use to benchmark active learning strategies in the form of various acquisition functions. Active learning strategies are used to rapidly identify materials with optimal physical properties within a ternary materials system. The data is from an actual Fe-Co-Ni thin-film library and includes previously acquired experimental data for materials compositions, X-ray diffraction patterns, and two functional properties of magnetic coercivity and the Kerr rotation. Popular active learning methods along with a recent scientific active learning method are benchmarked for their materials optimization performance. We discuss the relationship between algorithm performance, materials search space complexity, and the incorporation of prior knowledge.
△ Less
Submitted 12 April, 2022;
originally announced April 2022.
-
A Low-Cost Robot Science Kit for Education with Symbolic Regression for Hypothesis Discovery and Validation
Authors:
Logan Saar,
Haotong Liang,
Alex Wang,
Austin McDannald,
Efrain Rodriguez,
Ichiro Takeuchi,
A. Gilad Kusne
Abstract:
The next generation of physical science involves robot scientists - autonomous physical science systems capable of experimental design, execution, and analysis in a closed loop. Such systems have shown real-world success for scientific exploration and discovery, including the first discovery of a best-in-class material. To build and use these systems, the next generation workforce requires experti…
▽ More
The next generation of physical science involves robot scientists - autonomous physical science systems capable of experimental design, execution, and analysis in a closed loop. Such systems have shown real-world success for scientific exploration and discovery, including the first discovery of a best-in-class material. To build and use these systems, the next generation workforce requires expertise in diverse areas including ML, control systems, measurement science, materials synthesis, decision theory, among others. However, education is lagging. Educators need a low-cost, easy-to-use platform to teach the required skills. Industry can also use such a platform for developing and evaluating autonomous physical science methodologies. We present the next generation in science education, a kit for building a low-cost autonomous scientist. The kit was used during two courses at the University of Maryland to teach undergraduate and graduate students autonomous physical science. We discuss its use in the course and its greater capability to teach the dual tasks of autonomous model exploration, optimization, and determination, with an example of autonomous experimental "discovery" of the Henderson-Hasselbalch equation.
△ Less
Submitted 13 June, 2022; v1 submitted 8 April, 2022;
originally announced April 2022.
-
Graph Neural Network Predictions of Metal Organic Framework CO2 Adsorption Properties
Authors:
Kamal Choudhary,
Taner Yildirim,
Daniel Siderius,
Aaron Gilad Kusne,
Austin McDannald,
Diana Ortiz-Montalvo
Abstract:
The increasing CO2 level is a critical concern and suitable materials are needed to capture such gases from the environment. While experimental and conventional computational methods are useful in finding such materials, they are usually slow and there is a need to expedite such processes. We use Atomistic Line Graph Neural Network (ALIGNN) method to predict CO2 adsorption in metal organic framewo…
▽ More
The increasing CO2 level is a critical concern and suitable materials are needed to capture such gases from the environment. While experimental and conventional computational methods are useful in finding such materials, they are usually slow and there is a need to expedite such processes. We use Atomistic Line Graph Neural Network (ALIGNN) method to predict CO2 adsorption in metal organic frameworks (MOF), which are known for their high functional tunability. We train ALIGNN models for hypothetical MOF (hMOF) database with 137953 MOFs with grand canonical Monte Carlo (GCMC) based CO2 adsorption isotherms. We develop high accuracy and fast models for pre-screening applications. We apply the trained model on CoREMOF database and computationally rank them for experimental synthesis. In addition to the CO2 adsorption isotherm, we also train models for electronic bandgaps, surface area, void fraction, lowest cavity diameter, and pore limiting diameter, and illustrate the strength and limitation of such graph neural network models. For a few candidate MOFs we carry out GCMC calculations to evaluate the deep-learning (DL) predictions.
△ Less
Submitted 19 December, 2021;
originally announced December 2021.
-
Physics in the Machine: Integrating Physical Knowledge in Autonomous Phase-Mapping
Authors:
A. Gilad Kusne,
Austin McDannald,
Brian DeCost,
Corey Oses,
Cormac Toher,
Stefano Curtarolo,
Apurva Mehta,
Ichiro Takeuchi
Abstract:
Application of artificial intelligence (AI), and more specifically machine learning, to the physical sciences has expanded significantly over the past decades. In particular, science-informed AI, also known as scientific AI or inductive bias AI, has grown from a focus on data analysis to now controlling experiment design, simulation, execution and analysis in closed-loop autonomous systems. The CA…
▽ More
Application of artificial intelligence (AI), and more specifically machine learning, to the physical sciences has expanded significantly over the past decades. In particular, science-informed AI, also known as scientific AI or inductive bias AI, has grown from a focus on data analysis to now controlling experiment design, simulation, execution and analysis in closed-loop autonomous systems. The CAMEO (closed-loop autonomous materials exploration and optimization) algorithm employs scientific AI to address two tasks: learning a material system's composition-structure relationship and identifying materials compositions with optimal functional properties. By integrating these, accelerated materials screening across compositional phase diagrams was demonstrated, resulting in the discovery of a best-in-class phase change memory material. Key to this success is the ability to guide subsequent measurements to maximize knowledge of the composition-structure relationship, or phase map. In this work we investigate the benefits of incorporating varying levels of prior physical knowledge into CAMEO's autonomous phase-mapping. This includes the use of ab-initio phase boundary data from the AFLOW repositories, which has been shown to optimize CAMEO's search when used as a prior.
△ Less
Submitted 16 February, 2022; v1 submitted 14 November, 2021;
originally announced November 2021.
-
A Semi-Supervised Approach for Automatic Crystal Structure Classification
Authors:
Satvik Lolla,
Haotong Liang,
A. Gilad Kusne,
Ichiro Takeuchi,
William Ratcliff
Abstract:
The structural solution problem can be a daunting and time consuming task. Especially in the presence of impurity phases, current methods such as indexing become more unstable. In this work, we apply the novel approach of semi-supervised learning towards the problem of identifying the Bravais lattice and the space group of inorganic crystals. Our semi-supervised generative deep learning model can…
▽ More
The structural solution problem can be a daunting and time consuming task. Especially in the presence of impurity phases, current methods such as indexing become more unstable. In this work, we apply the novel approach of semi-supervised learning towards the problem of identifying the Bravais lattice and the space group of inorganic crystals. Our semi-supervised generative deep learning model can train on both labeled data -- diffraction patterns with the associated crystal structure -- and unlabeled data, diffraction patterns that lack this information. This approach allows our models to take advantage of the troves of unlabeled data that current supervised learning approaches cannot, which should result in models that can more accurately generalize to real data. In this work, we classify powder diffraction patterns into all 14 Bravais lattices and 144 space groups (we limit the number due to sparse coverage in crystal structure databases), which covers more crystal classes than other studies. Our models also drastically outperform current deep learning approaches for both space group and Bravais Lattice classification using less training data.
△ Less
Submitted 1 November, 2021;
originally announced November 2021.
-
On-the-fly Autonomous Control of Neutron Diffraction via Physics-Informed Bayesian Active Learning
Authors:
Austin McDannald,
Matthias Frontzek,
Andrei T. Savici,
Mathieu Doucet,
Efrain E. Rodriguez,
Kate Meuse,
Jessica Opsahl-Ong,
Daniel Samarov,
Ichiro Takeuchi,
A. Gilad Kusne,
William Ratcliff
Abstract:
Neutron scattering is a unique and versatile characterization technique for probing the magnetic structure and dynamics of materials. However, instruments at neutron scattering facilities in the world is limited, and instruments at such facilities are perennially oversubscribed. We demonstrate a significant reduction in experimental time required for neutron diffraction experiments by implementati…
▽ More
Neutron scattering is a unique and versatile characterization technique for probing the magnetic structure and dynamics of materials. However, instruments at neutron scattering facilities in the world is limited, and instruments at such facilities are perennially oversubscribed. We demonstrate a significant reduction in experimental time required for neutron diffraction experiments by implementation of autonomous navigation of measurement parameter space through machine learning. Prior scientific knowledge and Bayesian active learning are used to dynamically steer the sequence of measurements. We developed the autonomous neutron diffraction explorer (ANDiE) and used it to determine the magnetic order of MnO and Fe1.09Te. ANDiE can determine the Neel temperature of the materials with 5-fold enhancement in efficiency and correctly identify the transition dynamics via physics-informed Bayesian inference. ANDiE's active learning approach is broadly applicable to a variety of neutron-based experiments and can open the door for neutron scattering as a tool of accelerated materials discovery.
△ Less
Submitted 7 March, 2022; v1 submitted 19 August, 2021;
originally announced August 2021.
-
The Joint Automated Repository for Various Integrated Simulations (JARVIS) for data-driven materials design
Authors:
Kamal Choudhary,
Kevin F. Garrity,
Andrew C. E. Reid,
Brian DeCost,
Adam J. Biacchi,
Angela R. Hight Walker,
Zachary Trautt,
Jason Hattrick-Simpers,
A. Gilad Kusne,
Andrea Centrone,
Albert Davydov,
Jie Jiang,
Ruth Pachter,
Gowoon Cheon,
Evan Reed,
Ankit Agrawal,
Xiaofeng Qian,
Vinit Sharma,
Houlong Zhuang,
Sergei V. Kalinin,
Bobby G. Sumpter,
Ghanshyam Pilania,
Pinar Acar,
Subhasish Mandal,
Kristjan Haule
, et al. (3 additional authors not shown)
Abstract:
The Joint Automated Repository for Various Integrated Simulations (JARVIS) is an integrated infrastructure to accelerate materials discovery and design using density functional theory (DFT), classical force-fields (FF), and machine learning (ML) techniques. JARVIS is motivated by the Materials Genome Initiative (MGI) principles of developing open-access databases and tools to reduce the cost and d…
▽ More
The Joint Automated Repository for Various Integrated Simulations (JARVIS) is an integrated infrastructure to accelerate materials discovery and design using density functional theory (DFT), classical force-fields (FF), and machine learning (ML) techniques. JARVIS is motivated by the Materials Genome Initiative (MGI) principles of developing open-access databases and tools to reduce the cost and development time of materials discovery, optimization, and deployment. The major features of JARVIS are: JARVIS-DFT, JARVIS-FF, JARVIS-ML, and JARVIS-Tools. To date, JARVIS consists of 40,000 materials and 1 million calculated properties in JARVIS-DFT, 1,500 materials and 110 force-fields in JARVIS-FF, and 25 ML models for material-property predictions in JARVIS-ML, all of which are continuously expanding. JARVIS-Tools provides scripts and workflows for running and analyzing various simulations. We compare our computational data to experiments or high-fidelity computational methods wherever applicable to evaluate error/uncertainty in predictions. In addition to the existing workflows, the infrastructure can support a wide variety of other technologically important applications as part of the data-driven materials design paradigm. The JARVIS datasets and tools are publicly available at the website: https://jarvis.nist.gov .
△ Less
Submitted 11 July, 2021; v1 submitted 3 July, 2020;
originally announced July 2020.
-
On-the-fly Closed-loop Autonomous Materials Discovery via Bayesian Active Learning
Authors:
A. Gilad Kusne,
Heshan Yu,
Changming Wu,
Huairuo Zhang,
Jason Hattrick-Simpers,
Brian DeCost,
Suchismita Sarker,
Corey Oses,
Cormac Toher,
Stefano Curtarolo,
Albert V. Davydov,
Ritesh Agarwal,
Leonid A. Bendersky,
Mo Li,
Apurva Mehta,
Ichiro Takeuchi
Abstract:
Active learning - the field of machine learning (ML) dedicated to optimal experiment design, has played a part in science as far back as the 18th century when Laplace used it to guide his discovery of celestial mechanics [1]. In this work we focus a closed-loop, active learning-driven autonomous system on another major challenge, the discovery of advanced materials against the exceedingly complex…
▽ More
Active learning - the field of machine learning (ML) dedicated to optimal experiment design, has played a part in science as far back as the 18th century when Laplace used it to guide his discovery of celestial mechanics [1]. In this work we focus a closed-loop, active learning-driven autonomous system on another major challenge, the discovery of advanced materials against the exceedingly complex synthesis-processes-structure-property landscape. We demonstrate autonomous research methodology (i.e. autonomous hypothesis definition and evaluation) that can place complex, advanced materials in reach, allowing scientists to fail smarter, learn faster, and spend less resources in their studies, while simultaneously improving trust in scientific results and machine learning tools. Additionally, this robot science enables science-over-the-network, reducing the economic impact of scientists being physically separated from their labs. We used the real-time closed-loop, autonomous system for materials exploration and optimization (CAMEO) at the synchrotron beamline to accelerate the fundamentally interconnected tasks of rapid phase mapping and property optimization, with each cycle taking seconds to minutes, resulting in the discovery of a novel epitaxial nanocomposite phase-change memory material.
△ Less
Submitted 10 November, 2020; v1 submitted 10 June, 2020;
originally announced June 2020.
-
CRYSPNet: Crystal Structure Predictions via Neural Network
Authors:
Haotong Liang,
Valentin Stanev,
A. Gilad Kusne,
Ichiro Takeuchi
Abstract:
Structure is the most basic and important property of crystalline solids; it determines directly or indirectly most materials characteristics. However, predicting crystal structure of solids remains a formidable and not fully solved problem. Standard theoretical tools for this task are computationally expensive and at times inaccurate. Here we present an alternative approach utilizing machine lear…
▽ More
Structure is the most basic and important property of crystalline solids; it determines directly or indirectly most materials characteristics. However, predicting crystal structure of solids remains a formidable and not fully solved problem. Standard theoretical tools for this task are computationally expensive and at times inaccurate. Here we present an alternative approach utilizing machine learning for crystal structure prediction. We developed a tool called Crystal Structure Prediction Network (CRYSPNet) that can predict the Bravais lattice, space group, and lattice parameters of an inorganic material based only on its chemical composition. CRYSPNet consists of a series of neural network models, using as inputs predictors aggregating the properties of the elements constituting the compound. It was trained and validated on more than 100,000 entries from the Inorganic Crystal Structure Database. The tool demonstrates robust predictive capability and outperforms alternative strategies by a large margin. Made available to the public (at https://github.com/AuroraLHT/cryspnet), it can be used both as an independent prediction engine or as a method to generate candidate structures for further computational and/or experimental validation.
△ Less
Submitted 31 March, 2020;
originally announced March 2020.
-
Accelerating Photovoltaic Materials Development via High-Throughput Experiments and Machine-Learning-Assisted Diagnosis
Authors:
Shijing Sun,
Noor T. P. Hartono,
Zekun D. Ren,
Felipe Oviedo,
Antonio M. Buscemi,
Mariya Layurova,
De Xin Chen,
Tofunmi Ogunfunmi,
Janak Thapa,
Savitha Ramasamy,
Charles Settens,
Brian L. DeCost,
Aaron Gilad Kusne,
Zhe Liu,
Siyu I. P. Tian,
I. Marius Peters,
Juan-Pablo Correa-Baena,
Tonio Buonassisi
Abstract:
Accelerating the experimental cycle for new materials development is vital for addressing the grand energy challenges of the 21st century. We fabricate and characterize 75 unique halide perovskite-inspired solution-based thin-film materials within a two-month period, with 87% exhibiting band gaps between 1.2 eV and 2.4 eV that are of interest for energy-harvesting applications. This increased thro…
▽ More
Accelerating the experimental cycle for new materials development is vital for addressing the grand energy challenges of the 21st century. We fabricate and characterize 75 unique halide perovskite-inspired solution-based thin-film materials within a two-month period, with 87% exhibiting band gaps between 1.2 eV and 2.4 eV that are of interest for energy-harvesting applications. This increased throughput is enabled by streamlining experimental workflows, developing a set of precursors amenable to high-throughput synthesis, and developing machine-learning assisted diagnosis. We utilize a deep neural network to classify compounds based on experimental X-ray diffraction data into 0D, 2D, and 3D structures more than 10 times faster than human analysis and with 90% accuracy. We validate our methods using lead-halide perovskites and extend the application to novel lead-free compositions. The wider synthesis window and faster cycle of learning enables three noteworthy scientific findings: (1) we realize four inorganic layered perovskites, A3B2Br9 (A = Cs, Rb; B = Bi, Sb) in thin-film form via one-step liquid deposition; (2) we report a multi-site lead-free alloy series that was not previously described in literature, Cs3(Bi1-xSbx)2(I1-xBrx)9; and (3) we reveal the effect on bandgap (reduction to <2 eV) and structure upon simultaneous alloying on the B-site and X-site of Cs3Bi2I9 with Sb and Br. This study demonstrates that combining an accelerated experimental cycle of learning and machine-learning based diagnosis represents an important step toward realizing fully-automated laboratories for materials discovery and development.
△ Less
Submitted 25 November, 2018;
originally announced December 2018.
-
Fast and interpretable classification of small X-ray diffraction datasets using data augmentation and deep neural networks
Authors:
Felipe Oviedo,
Zekun Ren,
Shijing Sun,
Charlie Settens,
Zhe Liu,
Noor Titan Putri Hartono,
Ramasamy Savitha,
Brian L. DeCost,
Siyu I. P. Tian,
Giuseppe Romano,
Aaron Gilad Kusne,
Tonio Buonassisi
Abstract:
X-ray diffraction (XRD) data acquisition and analysis is among the most time-consuming steps in the development cycle of novel thin-film materials. We propose a machine-learning-enabled approach to predict crystallographic dimensionality and space group from a limited number of thin-film XRD patterns. We overcome the scarce-data problem intrinsic to novel materials development by coupling a superv…
▽ More
X-ray diffraction (XRD) data acquisition and analysis is among the most time-consuming steps in the development cycle of novel thin-film materials. We propose a machine-learning-enabled approach to predict crystallographic dimensionality and space group from a limited number of thin-film XRD patterns. We overcome the scarce-data problem intrinsic to novel materials development by coupling a supervised machine learning approach with a model agnostic, physics-informed data augmentation strategy using simulated data from the Inorganic Crystal Structure Database (ICSD) and experimental data. As a test case, 115 thin-film metal halides spanning 3 dimensionalities and 7 space-groups are synthesized and classified. After testing various algorithms, we develop and implement an all convolutional neural network, with cross validated accuracies for dimensionality and space-group classification of 93% and 89%, respectively. We propose average class activation maps, computed from a global average pooling layer, to allow high model interpretability by human experimentalists, elucidating the root causes of misclassification. Finally, we systematically evaluate the maximum XRD pattern step size (data acquisition rate) before loss of predictive accuracy occurs, and determine it to be 0.16°, which enables an XRD pattern to be obtained and classified in 5.5 minutes or less.
△ Less
Submitted 23 April, 2019; v1 submitted 20 November, 2018;
originally announced November 2018.
-
Machine-learning guided discovery of a high-performance spin-driven thermoelectric material
Authors:
Yuma Iwasaki,
Ichiro Takeuchi,
Valentin Stanev,
Aaron Gilad Kusne,
Masahiko Ishida,
Akihiro Kirihara,
Kazuki Ihara,
Ryohto Sawada,
Koichi Terashima,
Hiroko Someya,
Ken-ichi Uchida,
Shinichi Yorozu,
Eiji Saitoh
Abstract:
Thermoelectric conversion using Seebeck effect for generation of electricity is becoming an indispensable technology for energy harvesting and smart thermal management. Recently, the spin-driven thermoelectric effects (STEs), which employ emerging phenomena such as the spin-Seebeck effect (SSE) and the anomalous Nernst effect (ANE), have garnered much attention as a promising path towards low cost…
▽ More
Thermoelectric conversion using Seebeck effect for generation of electricity is becoming an indispensable technology for energy harvesting and smart thermal management. Recently, the spin-driven thermoelectric effects (STEs), which employ emerging phenomena such as the spin-Seebeck effect (SSE) and the anomalous Nernst effect (ANE), have garnered much attention as a promising path towards low cost and versatile thermoelectric technology with easily scalable manufacturing. However, progress in development of STE devices is hindered by the lack of understanding of the mechanism and materials parameters that govern the STEs. To address this problem, we enlist machine learning modeling to establish the key physical parameters controlling SSE. Guided by these models, we have carried out a high-throughput experiment which led to the identification of a novel STE material with a thermopower an order of magnitude larger than that of the current generation STE devices.
△ Less
Submitted 6 May, 2018;
originally announced May 2018.
-
Unsupervised Phase Mapping of X-ray Diffraction Data by Nonnegative Matrix Factorization Integrated with Custom Clustering
Authors:
Valentin Stanev,
Velimir V. Vesselinov,
A. Gilad Kusne,
Graham Antoszewski,
Ichiro Takeuchi,
Boian S. Alexandrov
Abstract:
Analyzing large X-ray diffraction (XRD) datasets is a key step in high-throughput mapping of the compositional phase diagrams of combinatorial materials libraries. Optimizing and automating this task can help accelerate the process of discovery of materials with novel and desirable properties. Here, we report a new method for pattern analysis and phase extraction of XRD datasets. The method expand…
▽ More
Analyzing large X-ray diffraction (XRD) datasets is a key step in high-throughput mapping of the compositional phase diagrams of combinatorial materials libraries. Optimizing and automating this task can help accelerate the process of discovery of materials with novel and desirable properties. Here, we report a new method for pattern analysis and phase extraction of XRD datasets. The method expands the Nonnegative Matrix Factorization method, which has been used previously to analyze such datasets, by combining it with custom clustering and cross-correlation algorithms. This new method is capable of robust determination of the number of basis patterns present in the data which, in turn, enables straightforward identification of any possible peak-shifted patterns. Peak-shifting arises due to continuous change in the lattice constants as a function of composition, and is ubiquitous in XRD datasets from composition spread libraries. Successful identification of the peak-shifted patterns allows proper quantification and classification of the basis XRD patterns, which is necessary in order to decipher the contribution of each unique single-phase structure to the multi-phase regions. The process can be utilized to determine accurately the compositional phase diagram of a system under study. The presented method is applied to one synthetic and one experimental dataset, and demonstrates robust accuracy and identification abilities.
△ Less
Submitted 20 February, 2018;
originally announced February 2018.
-
Machine learning modeling of superconducting critical temperature
Authors:
Valentin Stanev,
Corey Oses,
A. Gilad Kusne,
Efrain Rodriguez,
Johnpierre Paglione,
Stefano Curtarolo,
Ichiro Takeuchi
Abstract:
Superconductivity has been the focus of enormous research effort since its discovery more than a century ago. Yet, some features of this unique phenomenon remain poorly understood; prime among these is the connection between superconductivity and chemical/structural properties of materials. To bridge the gap, several machine learning schemes are developed herein to model the critical temperatures…
▽ More
Superconductivity has been the focus of enormous research effort since its discovery more than a century ago. Yet, some features of this unique phenomenon remain poorly understood; prime among these is the connection between superconductivity and chemical/structural properties of materials. To bridge the gap, several machine learning schemes are developed herein to model the critical temperatures ($T_{\mathrm{c}}$) of the 12,000+ known superconductors available via the SuperCon database. Materials are first divided into two classes based on their $T_{\mathrm{c}}$ values, above and below 10 K, and a classification model predicting this label is trained. The model uses coarse-grained features based only on the chemical compositions. It shows strong predictive power, with out-of-sample accuracy of about 92%. Separate regression models are developed to predict the values of $T_{\mathrm{c}}$ for cuprate, iron-based, and "low-$T_{\mathrm{c}}$" compounds. These models also demonstrate good performance, with learned predictors offering potential insights into the mechanisms behind superconductivity in different families of materials. To improve the accuracy and interpretability of these models, new features are incorporated using materials data from the AFLOW Online Repositories. Finally, the classification and regression models are combined into a single integrated pipeline and employed to search the entire Inorganic Crystallographic Structure Database (ICSD) for potential new superconductors. We identify more than 30 non-cuprate and non-iron-based oxides as candidate materials.
△ Less
Submitted 6 October, 2017; v1 submitted 8 September, 2017;
originally announced September 2017.