-
Understanding active learning of molecular docking and its applications
Authors:
Jeonghyeon Kim,
Juno Nam,
Seongok Ryu
Abstract:
With the advancing capabilities of computational methodologies and resources, ultra-large-scale virtual screening via molecular docking has emerged as a prominent strategy for in silico hit discovery. Given the exhaustive nature of ultra-large-scale virtual screening, active learning methodologies have garnered attention as a means to mitigate computational cost through iterative small-scale docking and machine learning model training. While the efficacy of active learning methodologies has been empirically validated in the literature, it remains unclear how surrogate models can predict docking scores without considering three-dimensional structural features, such as receptor conformation and binding poses. In this paper, we therefore investigate how active learning methodologies effectively predict docking scores using only 2D structures, and under what circumstances they work particularly well, through benchmark studies encompassing six receptor targets. Our findings suggest that surrogate models tend to memorize structural patterns prevalent in compounds with high docking scores obtained during acquisition steps. Despite this tendency, surrogate models demonstrate utility in virtual screening, as exemplified by the identification of actives from the DUD-E dataset and of compounds with high docking scores from the EnamineReal library, a significantly larger set than the initial screening pool. Our comprehensive analysis underscores the reliability and potential applicability of active learning methodologies in virtual screening campaigns.
Submitted 14 June, 2024;
originally announced June 2024.
-
Accurate, reliable and interpretable solubility prediction of druglike molecules with attention pooling and Bayesian learning
Authors:
Seongok Ryu,
Sumin Lee
Abstract:
In drug discovery, aqueous solubility is an important pharmacokinetic property that affects the absorption and assay availability of a drug. Thus, in silico prediction of solubility has been studied for its utility in virtual screening and lead optimization. Recently, machine learning (ML) methods using experimental data have become popular because physics-based methods such as quantum mechanics and molecular dynamics are not suitable for high-throughput tasks due to their computational cost. However, ML methods can overfit in data-deficient conditions, which is the case for most chemical property datasets. In addition, ML methods are regarded as black-box functions in that it is difficult to interpret the contribution of hidden features to outputs, which hinders the analysis and modification of structure-activity relationships. To deal with these issues, we developed Bayesian graph neural networks (GNNs) with a self-attention readout layer. Unlike most GNNs, which use self-attention in node updates, applying self-attention at the readout layer enabled the model to improve prediction performance as well as to identify atom-wise importance, which can help lead optimization, as exemplified for three FDA-approved drugs. Also, Bayesian inference enables us to separate more accurate from less accurate results according to the uncertainty in the solubility prediction task. We expect that our accurate, reliable and interpretable model can be used for more careful decision-making and various applications in drug development.
Submitted 29 September, 2022;
originally announced October 2022.
-
Multi-dimensional structure of C. elegans thermal learning
Authors:
Ahmed Roman,
Konstantine Palanski,
Ilya Nemenman,
William S Ryu
Abstract:
Quantitative models of associative learning that explain the behavior of real animals with high precision have turned out to be very difficult to construct. We do this in the context of the dynamics of the thermal preference of C. elegans. For this, we quantify C. elegans thermotaxis in response to various conditioning parameters, genetic perturbations, and operant behavior using a fast, high-throughput microfluidic droplet assay. We then model this data comprehensively, within a new, biologically interpretable, multi-modal framework. We discover that the dynamics of thermal preference are described by two independent contributions and require a model with at least four dynamical variables. One pathway positively associates with the experienced temperature independently of food, and the other negatively associates with the temperature when food is absent.
Submitted 1 June, 2022;
originally announced June 2022.
-
Hit and Lead Discovery with Explorative RL and Fragment-based Molecule Generation
Authors:
Soojung Yang,
Doyeong Hwang,
Seul Lee,
Seongok Ryu,
Sung Ju Hwang
Abstract:
Recently, utilizing reinforcement learning (RL) to generate molecules with desired properties has been highlighted as a promising strategy for drug design. A molecular docking program - a physical simulation that estimates protein-small molecule binding affinity - can be an ideal reward scoring function for RL, as it is a straightforward proxy of the therapeutic potential. Still, two imminent challenges exist for this task. First, the models often fail to generate chemically realistic and pharmacochemically acceptable molecules. Second, the docking score optimization is a difficult exploration problem that involves many local optima and less smooth surfaces with respect to molecular structure. To tackle these challenges, we propose a novel RL framework that generates pharmacochemically acceptable molecules with large docking scores. Our method - Fragment-based generative RL with Explorative Experience replay for Drug design (FREED) - constrains the generated molecules to a realistic and qualified chemical space and effectively explores the space to find drugs by coupling our fragment-based generation method and a novel error-prioritized experience replay (PER). We also show that our model performs well on both de novo and scaffold-based schemes. Our model produces molecules of higher quality compared to existing methods while achieving state-of-the-art performance on two of three targets in terms of the docking scores of the generated molecules. We further show with ablation studies that our method, predictive error-PER (FREED(PE)), significantly improves the model performance.
Submitted 26 October, 2021; v1 submitted 4 October, 2021;
originally announced October 2021.
-
Age dependence of fitness and body mass index in Korean adults
Authors:
Nam Lyong Kang,
Su Chak Ryu
Abstract:
The aim of this study was to investigate the age dependence of fitness and body mass index (BMI) in Korean adults and to find an effective exercise to restore the degradation of fitness due to aging. The age dependence of fitness and BMI was calculated using their lump mean values (LMVs) and a linear regression method. The fitness sensitivity percentage to age (FSPA) and fitness sensitivity percentage to BMI (FSPB) were introduced as indicators for the effective improvement of fitness. The results showed that the degradation of fitness due to aging, especially the degradation of cardiorespiratory endurance and muscular endurance, could be improved effectively by controlling the 20-m multi-stage shuttle run and sit-up scores for both males and females. The results also showed that BMI could be effectively controlled by enhancing the 10-m shuttle run and standing long jump scores for both males and females. It is expected that the LMV, FSPA, and FSPB could be used to improve fitness effectively and to establish personal exercise aims.
Submitted 10 February, 2020;
originally announced February 2020.
-
Automated, predictive, and interpretable inference of C. elegans escape dynamics
Authors:
Bryan C. Daniels,
William S. Ryu,
Ilya Nemenman
Abstract:
The roundworm C. elegans exhibits robust escape behavior in response to rapidly rising temperature. The behavior lasts for a few seconds, shows history dependence, involves both sensory and motor systems, and is too complicated to model mechanistically using currently available knowledge. Instead we model the process phenomenologically, and we use the Sir Isaac dynamical inference platform to infer the model in a fully automated fashion directly from experimental data. The inferred model requires incorporation of an unobserved dynamical variable, and is biologically interpretable. The model makes accurate predictions about the dynamics of the worm behavior, and it can be used to characterize the functional logic of the dynamical system underlying the escape response. This work illustrates the power of modern artificial intelligence to aid in discovery of accurate and interpretable models of complex natural systems.
Submitted 25 September, 2018;
originally announced September 2018.
-
Making brain-machine interfaces robust to future neural variability
Authors:
David Sussillo,
Sergey D. Stavisky,
Jonathan C. Kao,
Stephen I. Ryu,
Krishna V. Shenoy
Abstract:
A major hurdle to clinical translation of brain-machine interfaces (BMIs) is that current decoders, which are trained from a small quantity of recent data, become ineffective when neural recording conditions subsequently change. We tested whether a decoder could be made more robust to future neural variability by training it to handle a variety of recording conditions sampled from months of previously collected data as well as synthetic training data perturbations. We developed a new multiplicative recurrent neural network BMI decoder that successfully learned a large variety of neural-to-kinematic mappings and became more robust with larger training datasets. When tested with a non-human primate preclinical BMI model, this decoder was robust under conditions that disabled a state-of-the-art Kalman filter based decoder. These results validate a new BMI strategy in which accumulated data history is effectively harnessed, and may facilitate reliable daily BMI use by reducing decoder retraining downtime.
Submitted 19 October, 2016;
originally announced October 2016.
-
Resolving coiled shapes reveals new reorientation behaviors in C. elegans
Authors:
Onno D Broekmans,
Jarlath B Rodgers,
William S Ryu,
Greg J Stephens
Abstract:
We exploit the reduced space of C. elegans postures to develop a novel tracking algorithm which captures both simple shapes and also self-occluding coils, an important, yet unexplored, component of worm behavior. We apply our algorithm to show that visually complex, coiled sequences are a superposition of two simpler patterns: the body wave dynamics and a head-curvature pulse. We demonstrate the precise coiled dynamics of an escape response and uncover new behaviors in spontaneous, large amplitude coils; deep reorientations occur through classical Omega-shaped postures and also through larger, new postural excitations which we label here as delta-turns. We find that omega and delta turns occur independently, the serpentine analog of a random left-right step, suggesting a distinct triggering mechanism. We also show that omega and delta turns display approximately equal rates and adapt to food-free conditions on a similar timescale, a simple strategy to avoid navigational bias.
Submitted 13 March, 2016;
originally announced March 2016.
-
Stereotypical escape behavior in Caenorhabditis elegans allows quantification of nociceptive stimuli levels
Authors:
Kawai Leung,
Aylia Mohammadi,
William S. Ryu,
Ilya Nemenman
Abstract:
Experiments on pain with human subjects are difficult, subjective, and ethically constrained. Since the molecular mechanisms of pain transduction are reasonably conserved among different species, these problems are partially solved by the use of animal models. However, animals cannot easily communicate their pain levels to us. Thus progress depends crucially on our ability to quantitatively and objectively infer the perceived level of noxious stimuli from the behavior of animals. Here we develop a quantitative model to infer the perceived level of thermal nociception from the stereotyped nociceptive response of individual Caenorhabditis elegans nematodes stimulated by an IR laser. The model provides a method for quantification of analgesic effects of chemical stimuli or genetic mutations in C. elegans. We test the nociception of ibuprofen-treated worms and a TRPV (transient receptor potential) mutant, and we show that the perception of thermal nociception for the ibuprofen-treated worms is lower than that for the wild type. At the same time, our model shows that the mutant changes the worm's behavior beyond affecting nociception. Finally, we determine the stimulus level that best distinguishes the analgesic effects and the minimum number of worms that allow for a statistically significant identification of these effects.
Submitted 18 January, 2016;
originally announced January 2016.
-
The emergence of stereotyped behaviors in C. elegans
Authors:
Greg J. Stephens,
William S. Ryu,
William Bialek
Abstract:
Animal behaviors are sometimes decomposable into discrete, stereotyped elements. In one model, such behaviors are triggered by specific commands; in the extreme case, the discreteness of behavior is traced to the discreteness of action potentials in the individual command neurons. We use the crawling behavior of the nematode C. elegans to explore the opposite extreme, in which discreteness and stereotypy emerge from the dynamics of the entire behavior. A simple stochastic model for the worm's continuously changing body shape during crawling has attractors corresponding to forward and backward motion; noise-driven transitions between these attractors correspond to abrupt reversals. We show that, with no free parameters, this model generates reversals at a rate within error bars of that observed experimentally, and the relatively stereotyped trajectories in the neighborhood of the reversal are also predicted correctly.
Submitted 28 December, 2009;
originally announced December 2009.
-
From modes to movement in the behavior of C. elegans
Authors:
Greg J Stephens,
Bethany Johnson-Kerner,
William Bialek,
William S Ryu
Abstract:
Organisms move through the world by changing their shape, and here we explore the mapping from shape space to movements in the nematode C. elegans as it crawls on a planar agar surface. We characterize the statistics of the trajectories through the correlation functions of the orientation angular velocity, orientation angle and the mean-squared displacement, and we find that the loss of orientational memory has significant contributions from both abrupt, large amplitude turning events and the continuous dynamics between these events. Further, we demonstrate long-time persistence of orientational memory in the intervals between abrupt turns. Building on recent work demonstrating that C. elegans movements are restricted to a low-dimensional shape space, we construct a map from the dynamics in this shape space to the trajectory of the worm along the agar. We use this connection to illustrate that changes in the continuous dynamics reveal subtle differences in movement strategy that occur among mutants defective in two classes of dopamine receptors.
Submitted 23 December, 2009;
originally announced December 2009.
-
Dimensionality and dynamics in the behavior of C. elegans
Authors:
Greg J Stephens,
Bethany Johnson-Kerner,
William Bialek,
William S Ryu
Abstract:
A major challenge in analyzing animal behavior is to discover some underlying simplicity in complex motor actions. Here we show that the space of shapes adopted by the nematode C. elegans is surprisingly low dimensional, with just four dimensions accounting for 95% of the shape variance, and we partially reconstruct "equations of motion" for the dynamics in this space. These dynamics have multiple attractors, and we find that the worm visits these in a rapid and almost completely deterministic response to weak thermal stimuli. Stimulus-dependent correlations among the different modes suggest that one can generate more reliable behaviors by synchronizing stimuli to the state of the worm in shape space. We confirm this prediction, effectively "steering" the worm in real time.
Submitted 16 May, 2007; v1 submitted 11 May, 2007;
originally announced May 2007.