-
SELFIES and the future of molecular string representations
Authors:
Mario Krenn,
Qianxiang Ai,
Senja Barthel,
Nessa Carson,
Angelo Frei,
Nathan C. Frey,
Pascal Friederich,
Théophile Gaudin,
Alberto Alexander Gayle,
Kevin Maik Jablonka,
Rafael F. Lameiro,
Dominik Lemm,
Alston Lo,
Seyed Mohamad Moosavi,
José Manuel Nápoles-Duarte,
AkshatKumar Nigam,
Robert Pollice,
Kohulan Rajan,
Ulrich Schatzschneider,
Philippe Schwaller,
Marta Skreta,
Berend Smit,
Felix Strieth-Kalthoff,
Chong Sun,
Gary Tom
, et al. (6 additional authors not shown)
Abstract:
Artificial intelligence (AI) and machine learning (ML) are expanding in popularity for broad applications to challenging tasks in chemistry and materials science. Examples include the prediction of properties, the discovery of new reaction pathways, or the design of new molecules. The machine needs to read and write fluently in a chemical language for each of these tasks. Strings are a common tool to represent molecular graphs, and the most popular molecular string representation, SMILES, has powered cheminformatics since the late 1980s. However, in the context of AI and ML in chemistry, SMILES has several shortcomings. Most pertinently, the majority of symbol combinations lead to invalid results with no valid chemical interpretation. To overcome this issue, a new language for molecules was introduced in 2020 that guarantees 100% robustness: SELFIES (SELF-referencIng Embedded Strings). SELFIES has since simplified and enabled numerous new applications in chemistry. In this manuscript, we look to the future and discuss molecular string representations, along with their respective opportunities and challenges. We propose 16 concrete Future Projects for robust molecular representations. These involve the extension toward new chemical domains, exciting questions at the interface of AI and robust languages, and interpretability for both humans and machines. We hope that these proposals will inspire several follow-up works exploiting the full potential of molecular string representations for the future of AI in chemistry and materials science.
Submitted 31 March, 2022;
originally announced April 2022.
-
Consistent description of UCN transport properties
Authors:
S. Wlokka,
P. Fierlinger,
A. Frei,
P. Geltenbort,
S. Paul,
T. Pöschl,
F. Schmid,
W. Schreyer,
D. Steffen
Abstract:
We have investigated the diffuse reflection probabilities of Replica guides for ultra-cold neutrons (UCN) using the so-called helium method. For the first time, we could establish a consistent description of the diffuse reflection mechanism for different lengths of the guide system. The transmission of the guides was measured as a function of the helium pressure inside the guides. A series of simulations was performed to reproduce the experimental data. These simulations showed that a diffuse reflection probability of $d = (3.0 \pm 0.5) \cdot 10^{-2}$ sufficiently describes the experimental data.
Submitted 25 January, 2017;
originally announced January 2017.
-
A magnetically shielded room with ultra low residual field and gradient
Authors:
I. Altarev,
E. Babcock,
D. Beck,
M. Burghoff,
S. Chesnevskaya,
T. Chupp,
S. Degenkolb,
I. Fan,
P. Fierlinger,
A. Frei,
E. Gutsmiedl,
S. Knappe-Grüneberg,
F. Kuchler,
T. Lauer,
P. Link,
T. Lins,
M. Marino,
J. McAndrew,
B. Niessen,
S. Paul,
G. Petzoldt,
U. Schläpfer,
A. Schnabel,
S. Sharma,
J. Singh
, et al. (7 additional authors not shown)
Abstract:
A versatile and portable magnetically shielded room with a field of $(700 \pm 200)$ pT within a central volume of 1 m x 1 m x 1 m and a field gradient less than 300 pT/m is described. This performance represents more than a hundred-fold improvement of the state of the art for a two-layer magnetic shield and provides an environment suitable for a next generation of precision experiments in fundamental physics at low energies; in particular, searches for electric dipole moments of fundamental systems and tests of Lorentz-invariance based on spin-precession experiments. Studies of the residual fields and their sources enable improved design of future ultra-low gradient environments and experimental apparatus.
Submitted 24 March, 2014;
originally announced March 2014.