-
14 Examples of How LLMs Can Transform Materials Science and Chemistry: A Reflection on a Large Language Model Hackathon
Authors:
Kevin Maik Jablonka,
Qianxiang Ai,
Alexander Al-Feghali,
Shruti Badhwar,
Joshua D. Bocarsly,
Andres M Bran,
Stefan Bringuier,
L. Catherine Brinson,
Kamal Choudhary,
Defne Circi,
Sam Cox,
Wibe A. de Jong,
Matthew L. Evans,
Nicolas Gastellu,
Jerome Genzling,
María Victoria Gil,
Ankur K. Gupta,
Zhi Hong,
Alishba Imran,
Sabine Kruschwitz,
Anne Labarre,
Jakub Lála,
Tao Liu,
Steven Ma,
Sauradeep Majumdar
, et al. (28 additional authors not shown)
Abstract:
Large-language models (LLMs) such as GPT-4 caught the interest of many scientists. Recent studies suggested that these models could be useful in chemistry and materials science. To explore these possibilities, we organized a hackathon.
This article chronicles the projects built as part of this hackathon. Participants employed LLMs for various applications, including predicting properties of mole…
▽ More
Large-language models (LLMs) such as GPT-4 caught the interest of many scientists. Recent studies suggested that these models could be useful in chemistry and materials science. To explore these possibilities, we organized a hackathon.
This article chronicles the projects built as part of this hackathon. Participants employed LLMs for various applications, including predicting properties of molecules and materials, designing novel interfaces for tools, extracting knowledge from unstructured data, and developing new educational applications.
The diverse topics and the fact that working prototypes could be generated in less than two days highlight that LLMs will profoundly impact the future of our fields. The rich collection of ideas and projects also indicates that the applications of LLMs are not limited to materials science and chemistry but offer potential benefits to a wide range of scientific disciplines.
△ Less
Submitted 14 July, 2023; v1 submitted 9 June, 2023;
originally announced June 2023.
-
The role of steric effects on hydrogen atom transfer reactions
Authors:
Yi Sun,
Jacob N. Sanders,
K. N. Houk
Abstract:
We explored how steric effects influence the rate of hydrogen atom transfer (HAT) reactions between oxyradicals and alkanes. Quantum chemical computations of transition states show that activation barriers and reaction enthalpies are both influenced by bulky substituents on the radical, but less so by substituents on the alkane. The activation barriers correlate with reaction enthalpies via the Ev…
▽ More
We explored how steric effects influence the rate of hydrogen atom transfer (HAT) reactions between oxyradicals and alkanes. Quantum chemical computations of transition states show that activation barriers and reaction enthalpies are both influenced by bulky substituents on the radical, but less so by substituents on the alkane. The activation barriers correlate with reaction enthalpies via the Evans-Polanyi relationship, even when steric effects are important. Dispersion effects can additionally stabilize transition states in some cases.
△ Less
Submitted 13 December, 2022; v1 submitted 22 November, 2022;
originally announced November 2022.
-
Benchmarking Compressed Sensing, Super-Resolution, and Filter Diagonalization
Authors:
Thomas Markovich,
Samuel M. Blau,
Jacob N. Sanders,
Alan Aspuru-Guzik
Abstract:
Signal processing techniques have been developed that use different strategies to bypass the Nyquist sampling theorem in order to recover more information than a traditional discrete Fourier transform. Here we examine three such methods: filter diagonalization, compressed sensing, and super-resolution. We apply them to a broad range of signal forms commonly found in science and engineering in orde…
▽ More
Signal processing techniques have been developed that use different strategies to bypass the Nyquist sampling theorem in order to recover more information than a traditional discrete Fourier transform. Here we examine three such methods: filter diagonalization, compressed sensing, and super-resolution. We apply them to a broad range of signal forms commonly found in science and engineering in order to discover when and how each method can be used most profitably. We find that filter diagonalization provides the best results for Lorentzian signals, while compressed sensing and super-resolution perform better for arbitrary signals.
△ Less
Submitted 17 February, 2015;
originally announced February 2015.
-
A sparse-sampling approach for the fast computation of matrices: application to molecular vibrations
Authors:
Jacob N. Sanders,
Xavier Andrade,
Alán Aspuru-Guzik
Abstract:
This article presents a new method to compute matrices from numerical simulations based on the ideas of sparse sampling and compressed sensing. The method is useful for problems where the determination of the entries of a matrix constitutes the computational bottleneck. We apply this new method to an important problem in computational chemistry: the determination of molecular vibrations from elect…
▽ More
This article presents a new method to compute matrices from numerical simulations based on the ideas of sparse sampling and compressed sensing. The method is useful for problems where the determination of the entries of a matrix constitutes the computational bottleneck. We apply this new method to an important problem in computational chemistry: the determination of molecular vibrations from electronic structure calculations, where our results show that the overall scaling of the procedure can be improved in some cases. Moreover, our method provides a general framework for bootstrapping cheap low-accuracy calculations in order to reduce the required number of expensive high-accuracy calculations, resulting in a significant 3x speed-up in actual calculations.
△ Less
Submitted 17 October, 2014;
originally announced October 2014.
-
More accurate and efficient bath spectral densities from super-resolution
Authors:
Thomas Markovich,
Samuel M. Blau,
John Parkhill,
Christoph Kreisbeck,
Jacob N. Sanders,
Xavier Andrade,
Alán Aspuru-Guzik
Abstract:
Quantum transport and other phenomena are typically modeled by coupling the system of interest to an environment, or bath, held at thermal equilibrium. Realistic bath models are at least as challenging to construct as models for the quantum systems themselves, since they must incorporate many degrees of freedom that interact with the system on a wide range of timescales. Owing to computational lim…
▽ More
Quantum transport and other phenomena are typically modeled by coupling the system of interest to an environment, or bath, held at thermal equilibrium. Realistic bath models are at least as challenging to construct as models for the quantum systems themselves, since they must incorporate many degrees of freedom that interact with the system on a wide range of timescales. Owing to computational limitations, the environment is often modeled with simple functional forms, with a few parameters fit to experiment to yield semi-quantitative results. Growing computational resources have enabled the construction of more realistic bath models from molecular dynamics (MD) simulations. In this paper, we develop a numerical technique to construct these atomistic bath models with better accuracy and decreased cost. We apply a novel signal processing technique, known as super-resolution, combined with a dictionary of physically-motivated bath modes to derive spectral densities from MD simulations. Our approach reduces the required simulation time and provides a more accurate spectral density than can be obtained via standard Fourier transform methods. Moreover, the spectral density is provided as a convenient closed-form expression which yields an analytic time-dependent bath kernel. Exciton dynamics of the Fenna-Matthews-Olsen light-harvesting complex are simulated with a second order time-convolutionless master equation, and spectral densities constructed via super-resolution are shown to reproduce the dynamics using only a quarter of the amount of MD data.
△ Less
Submitted 16 July, 2013;
originally announced July 2013.
-
Compressed sensing for multidimensional electronic spectroscopy experiments
Authors:
J. N. Sanders,
S. Mostame,
S. K. Saikin,
X. Andrade,
J. R. Widom,
A. H. Marcus,
A. Aspuru-Guzik
Abstract:
Compressed sensing is a processing method that significantly reduces the number of measurements needed to accurately resolve signals in many fields of science and engineering. We develop a two-dimensional (2D) variant of compressed sensing for multidimensional electronic spectroscopy and apply it to experimental data. For the model system of atomic rubidium vapor, we find that compressed sensing p…
▽ More
Compressed sensing is a processing method that significantly reduces the number of measurements needed to accurately resolve signals in many fields of science and engineering. We develop a two-dimensional (2D) variant of compressed sensing for multidimensional electronic spectroscopy and apply it to experimental data. For the model system of atomic rubidium vapor, we find that compressed sensing provides significantly better resolution of 2D spectra than a conventional discrete Fourier transform from the same experimental data. We believe that by combining powerful resolution with ease of use, compressed sensing can be a powerful tool for the analysis and interpretation of ultrafast spectroscopy data.
△ Less
Submitted 16 July, 2012;
originally announced July 2012.
-
Application of compressed sensing to the simulation of atomic systems
Authors:
X. Andrade,
J. N. Sanders,
A. Aspuru-Guzik
Abstract:
Compressed sensing is a method that allows a significant reduction in the number of samples required for accurate measurements in many applications in experimental sciences and engineering. In this work, we show that compressed sensing can also be used to speed up numerical simulations. We apply compressed sensing to extract information from the real-time simulation of atomic and molecular systems…
▽ More
Compressed sensing is a method that allows a significant reduction in the number of samples required for accurate measurements in many applications in experimental sciences and engineering. In this work, we show that compressed sensing can also be used to speed up numerical simulations. We apply compressed sensing to extract information from the real-time simulation of atomic and molecular systems, including electronic and nuclear dynamics. We find that for the calculation of vibrational and optical spectra the total propagation time, and hence the computational cost, can be reduced by approximately a factor of five.
△ Less
Submitted 29 May, 2012;
originally announced May 2012.