Predicting antimicrobial activity of conjugated oligoelectrolyte molecules via machine learning
Authors:
Armi Tiihonen,
Sarah J. Cox-Vazquez,
Qiaohao Liang,
Mohamed Ragab,
Zekun Ren,
Noor Titan Putri Hartono,
Zhe Liu,
Shijing Sun,
Cheng Zhou,
Nathan C. Incandela,
Jakkarin Limwongyut,
Alex S. Moreland,
Senthilnath Jayavelu,
Guillermo C. Bazan,
Tonio Buonassisi
Abstract:
New antibiotics are needed to battle growing antibiotic resistance, but the development process from hit, to lead, and ultimately to a useful drug, takes decades. Although progress in molecular property prediction using machine-learning methods has opened up new pathways for aiding the antibiotics development process, many existing solutions rely on large datasets and finding structural similariti…
▽ More
New antibiotics are needed to battle growing antibiotic resistance, but the development process from hit, to lead, and ultimately to a useful drug, takes decades. Although progress in molecular property prediction using machine-learning methods has opened up new pathways for aiding the antibiotics development process, many existing solutions rely on large datasets and finding structural similarities to existing antibiotics. Challenges remain in modelling of unconventional antibiotics classes that are drawing increasing research attention. In response, we developed an antimicrobial activity prediction model for conjugated oligoelectrolyte molecules, a new class of antibiotics that lacks extensive prior structure-activity relationship studies. Our approach enables us to predict minimum inhibitory concentration for E. coli K12, with 21 molecular descriptors selected by recursive elimination from a set of 5,305 descriptors. This predictive model achieves an R2 of 0.65 with no prior knowledge of the underlying mechanism. We find the molecular representation optimum for the domain is the key to good predictions of antimicrobial activity. In the case of conjugated oligoelectrolytes, a representation reflecting the 3-dimensional shape of the molecules is most critical. Although it is demonstrated with a specific example of conjugated oligoelectrolytes, our proposed approach for creating the predictive model can be readily adapted to other novel antibiotic candidate domains.
△ Less
Submitted 30 November, 2021; v1 submitted 21 May, 2021;
originally announced May 2021.
An invertible crystallographic representation for general inverse design of inorganic crystals with targeted properties
Authors:
Zekun Ren,
Siyu Isaac Parker Tian,
Juhwan Noh,
Felipe Oviedo,
Guangzong Xing,
Jiali Li,
Qiaohao Liang,
Ruiming Zhu,
Armin G. Aberle,
Shijing Sun,
Xiaonan Wang,
Yi Liu,
Qianxiao Li,
Senthilnath Jayavelu,
Kedar Hippalgaonkar,
Yousung Jung,
Tonio Buonassisi
Abstract:
Realizing general inverse design could greatly accelerate the discovery of new materials with user-defined properties. However, state-of-the-art generative models tend to be limited to a specific composition or crystal structure. Herein, we present a framework capable of general inverse design (not limited to a given set of elements or crystal structures), featuring a generalized invertible repres…
▽ More
Realizing general inverse design could greatly accelerate the discovery of new materials with user-defined properties. However, state-of-the-art generative models tend to be limited to a specific composition or crystal structure. Herein, we present a framework capable of general inverse design (not limited to a given set of elements or crystal structures), featuring a generalized invertible representation that encodes crystals in both real and reciprocal space, and a property-structured latent space from a variational autoencoder (VAE). In three design cases, the framework generates 142 new crystals with user-defined formation energies, bandgap, thermoelectric (TE) power factor, and combinations thereof. These generated crystals, absent in the training database, are validated by first-principles calculations. The success rates (number of first-principles-validated target-satisfying crystals/number of designed crystals) ranges between 7.1% and 38.9%. These results represent a significant step toward property-driven general inverse design using generative models, although practical challenges remain when coupled with experimental synthesis.
△ Less
Submitted 15 December, 2021; v1 submitted 15 May, 2020;
originally announced May 2020.