Showing 1–29 of 29 results for author: Pande, V S

  1. arXiv:2303.08993  [pdf]

    q-bio.BM physics.chem-ph

    Folding@home: achievements from over twenty years of citizen science herald the exascale era

    Authors: Vincent A. Voelz, Vijay S. Pande, Gregory R. Bowman

    Abstract: Simulations of biomolecules have enormous potential to inform our understanding of biology but require extremely demanding calculations. For over twenty years, the Folding@home distributed computing project has pioneered a massively parallel approach to biomolecular simulation, harnessing the resources of citizen scientists across the globe. Here, we summarize the scientific and technical advances…

    Submitted 15 March, 2023; originally announced March 2023.

    Comments: 24 pages, 6 figures

  2. arXiv:1907.03041  [pdf]

    q-bio.GN

    Predicting Gene Expression Between Species with Neural Networks

    Authors: Peter Eastman, Vijay S. Pande

    Abstract: We train a neural network to predict human gene expression levels based on experimental data for rat cells. The network is trained with paired human/rat samples from the Open TG-GATES database, where paired samples were treated with the same compound at the same dose. When evaluated on a test set of held out compounds, the network successfully predicts human expression levels. On the majority of t…

    Submitted 5 July, 2019; originally announced July 2019.

    Comments: 12 pages, 5 figures

  3. arXiv:1902.00060  [pdf]

    q-bio.GN

    Predicting Toxicity from Gene Expression with Neural Networks

    Authors: Peter Eastman, Vijay S. Pande

    Abstract: We train a neural network to predict chemical toxicity based on gene expression data. The input to the network is a full expression profile collected either in vitro from cultured cells or in vivo from live animals. The output is a set of fine grained predictions for the presence of a variety of pathological effects in treated animals. When trained on the Open TG-GATEs database it produces good re…

    Submitted 31 January, 2019; originally announced February 2019.

    Comments: 12 pages, 2 figures, 4 tables

  4. arXiv:1804.08206  [pdf]

    q-bio.BM q-bio.QM

    Binding Pathway of Opiates to $μ$ Opioid Receptors Revealed by Unsupervised Machine Learning

    Authors: Amir Barati Farimani, Evan N. Feinberg, Vijay S. Pande

    Abstract: Many important analgesics relieve pain by binding to the $μ$-Opioid Receptor ($μ$OR), which makes the $μ$OR among the most clinically relevant proteins of the G Protein Coupled Receptor (GPCR) family. Despite previous studies on the activation pathways of the GPCRs, the mechanism of opiate binding and the selectivity of $μ$OR are largely unknown. We performed extensive molecular dynamics (MD) simu…

    Submitted 22 April, 2018; originally announced April 2018.

    Comments: 25 pages, 8 figures

  5. arXiv:1803.04479  [pdf]

    q-bio.BM stat.ML

    Machine Learning Harnesses Molecular Dynamics to Discover New $μ$ Opioid Chemotypes

    Authors: Evan N. Feinberg, Amir Barati Farimani, Rajendra Uprety, Amanda Hunkele, Gavril W. Pasternak, Susruta Majumdar, Vijay S. Pande

    Abstract: Computational chemists typically assay drug candidates by virtually screening compounds against crystal structures of a protein despite the fact that some targets, like the $μ$ Opioid Receptor and other members of the GPCR family, traverse many non-crystallographic states. We discover new conformational states of $μ$OR with molecular dynamics simulation and then machine learn ligand-structure rela…

    Submitted 12 March, 2018; originally announced March 2018.

    Comments: 28 pages, machine learning, computational biology, GPCRs, molecular dynamics, molecular docking, molecular simulation

  6. arXiv:1803.03146  [pdf]

    q-bio.QM cs.AI stat.ML

    SentRNA: Improving computational RNA design by incorporating a prior of human design strategies

    Authors: Jade Shi, Rhiju Das, Vijay S. Pande

    Abstract: Solving the RNA inverse folding problem is a critical prerequisite to RNA design, an emerging field in bioengineering with a broad range of applications from reaction catalysis to cancer therapy. Although significant progress has been made in developing machine-based inverse RNA folding algorithms, current approaches still have difficulty designing sequences for large or complex targets. On the ot…

    Submitted 5 March, 2019; v1 submitted 8 March, 2018; originally announced March 2018.

    Comments: 27 pages (not including Supplementary Information), 9 figures, 7 tables

  7. arXiv:1802.10548  [pdf, other]

    cs.CV cs.LG q-bio.QM

    Using Deep Learning for Segmentation and Counting within Microscopy Data

    Authors: Carlos X. Hernández, Mohammad M. Sultan, Vijay S. Pande

    Abstract: Cell counting is a ubiquitous, yet tedious task that would greatly benefit from automation. From basic biological questions to clinical trials, cell counts provide key quantitative feedback that drives research. Unfortunately, cell counting is most commonly a manual task and can be time-intensive. The task is made even more difficult due to overlapping cells, existence of multiple focal planes, and…

    Submitted 28 February, 2018; originally announced February 2018.

  8. arXiv:1802.10510  [pdf]

    stat.ML cs.CE q-bio.BM

    Automated design of collective variables using supervised machine learning

    Authors: Mohammad M. Sultan, Vijay S. Pande

    Abstract: Selection of appropriate collective variables for enhancing sampling of molecular simulations remains an unsolved problem in computational biophysics. In particular, picking initial collective variables (CVs) is especially challenging in higher dimensions. Which atomic coordinates or transforms thereof from a list of thousands should one pick for enhanced sampling runs? How does a modeler even…

    Submitted 13 May, 2018; v1 submitted 28 February, 2018; originally announced February 2018.

    Comments: 26 pages, 11 figures

  9. arXiv:1801.00636  [pdf]

    stat.ML q-bio.BM

    Transferable neural networks for enhanced sampling of protein dynamics

    Authors: Mohammad M. Sultan, Hannah K. Wayment-Steele, Vijay S. Pande

    Abstract: Variational auto-encoder frameworks have demonstrated success in reducing complex nonlinear dynamics in molecular simulation to a single non-linear embedding. In this work, we illustrate how this non-linear latent embedding can be used as a collective variable for enhanced sampling, and present a simple modification that allows us to rapidly perform sampling in multiple related systems. We first d…

    Submitted 2 January, 2018; originally announced January 2018.

    Comments: 20 pages, 10 figures

  10. arXiv:1712.07704  [pdf, other]

    physics.bio-ph q-bio.BM q-bio.QM stat.ML

    Unsupervised learning of dynamical and molecular similarity using variance minimization

    Authors: Brooke E. Husic, Vijay S. Pande

    Abstract: In this report, we present an unsupervised machine learning method for determining groups of molecular systems according to similarity in their dynamics or structures using Ward's minimum variance objective function. We first apply the minimum variance clustering to a set of simulated tripeptides using the information theoretic Jensen-Shannon divergence between Markovian transition matrices in ord…

    Submitted 20 December, 2017; originally announced December 2017.

    Comments: NIPS 2017 Workshop on Machine Learning for Molecules and Materials

  11. arXiv:1711.08576  [pdf, other]

    stat.ML physics.bio-ph physics.chem-ph physics.comp-ph q-bio.BM

    Variational Encoding of Complex Dynamics

    Authors: Carlos X. Hernández, Hannah K. Wayment-Steele, Mohammad M. Sultan, Brooke E. Husic, Vijay S. Pande

    Abstract: Often the analysis of time-dependent chemical and biophysical systems produces high-dimensional time-series data for which it can be difficult to interpret which individual features are most salient. While recent work from our group and others has demonstrated the utility of time-lagged co-variate models to study such systems, linearity assumptions can limit the compression of inherently nonlinear…

    Submitted 1 December, 2017; v1 submitted 23 November, 2017; originally announced November 2017.

    Comments: Fixed typos and added references

    Journal ref: Phys. Rev. E 97, 062412 (2018)

  12. arXiv:1708.08120  [pdf, other]

    q-bio.BM physics.bio-ph physics.chem-ph

    MSM lag time cannot be used for variational model selection

    Authors: Brooke E. Husic, Vijay S. Pande

    Abstract: The variational principle for conformational dynamics has enabled the systematic construction of Markov state models through the optimization of hyperparameters by approximating the transfer operator. In this note we discuss why lag time of the operator being approximated must be held constant in the variational approach.

    Submitted 27 August, 2017; originally announced August 2017.

    Journal ref: J. Chem. Phys. 2017, 147, 176101

  13. arXiv:1708.03011  [pdf]

    q-bio.BM physics.bio-ph physics.comp-ph

    Theoretical restrictions on longest implicit timescales in Markov state models of biomolecular dynamics

    Authors: Anton V. Sinitskiy, Vijay S. Pande

    Abstract: Markov state models (MSMs) have been widely used to analyze computer simulations of various biomolecular systems. They can capture conformational transitions much slower than an average or maximal length of a single molecular dynamics (MD) trajectory from the set of trajectories used to build the MSM. A rule of thumb claiming that the slowest implicit timescale captured by an MSM should be compara…

    Submitted 9 August, 2017; originally announced August 2017.

  14. arXiv:1612.06319  [pdf]

    q-bio.BM

    Computationally Discovered Potentiating Role of Glycans on NMDA Receptors

    Authors: Anton V. Sinitskiy, Nathaniel H. Stanley, David H. Hackos, Jesse E. Hanson, Benjamin D. Sellers, Vijay S. Pande

    Abstract: N-methyl-D-aspartate receptors (NMDARs) are glycoproteins in the brain central to learning and memory. The effects of glycosylation on the structure and dynamics of NMDARs are largely unknown. In this work, we use extensive molecular dynamics simulations of GluN1 and GluN2B ligand binding domains (LBDs) of NMDARs to investigate these effects. Our simulations predict that intra-domain interactions…

    Submitted 19 December, 2016; originally announced December 2016.

  15. arXiv:1602.08776  [pdf, other]

    cond-mat.stat-mech q-bio.BM q-bio.QM

    Identification of simple reaction coordinates from complex dynamics

    Authors: Robert T. McGibbon, Brooke E. Husic, Vijay S. Pande

    Abstract: Reaction coordinates are widely used throughout chemical physics to model and understand complex chemical transformations. We introduce a definition of the natural reaction coordinate, suitable for condensed phase and biomolecular systems, as a maximally predictive one-dimensional projection. We then show this criterion is uniquely satisfied by a dominant eigenfunction of an integral operator asso…

    Submitted 6 January, 2017; v1 submitted 28 February, 2016; originally announced February 2016.

    Comments: 18 pages, 10 figures

  16. arXiv:1504.01804  [pdf, other]

    physics.data-an physics.chem-ph q-bio.BM

    Efficient maximum likelihood parameterization of continuous-time Markov processes

    Authors: Robert T. McGibbon, Vijay S. Pande

    Abstract: Continuous-time Markov processes over finite state-spaces are widely used to model dynamical processes in many fields of natural and social science. Here, we introduce a maximum likelihood estimator for constructing such models from data observed at a finite time interval. This estimator is dramatically more efficient than prior approaches, enables the calculation of deterministic confidence inte…

    Submitted 30 June, 2015; v1 submitted 7 April, 2015; originally announced April 2015.

  17. arXiv:1408.5446  [pdf, ps, other]

    physics.chem-ph physics.bio-ph q-bio.BM

    Perspective: Markov Models for Long-Timescale Biomolecular Dynamics

    Authors: Christian R. Schwantes, Robert T. McGibbon, Vijay S. Pande

    Abstract: Molecular dynamics simulations have the potential to provide atomic-level detail and insight to important questions in chemical physics that cannot be observed in typical experiments. However, simply generating a long trajectory is insufficient, as researchers must be able to transform the data in a simulation trajectory into specific scientific insights. Although this analysis step has often been…

    Submitted 22 August, 2014; originally announced August 2014.

    Comments: 7 pages

  18. arXiv:1408.0255  [pdf, ps, other]

    physics.bio-ph q-bio.BM

    Efficient inference of protein structural ensembles

    Authors: Thomas J. Lane, Christian R. Schwantes, Kyle A. Beauchamp, Vijay S. Pande

    Abstract: It is becoming clear that traditional, single-structure models of proteins are insufficient for understanding their biological function. Here, we outline one method for inferring, from experiments, not only the most common structure a protein adopts (native state), but the entire ensemble of conformations the system can adopt. Such ensemble models are necessary to understand intrinsically disord…

    Submitted 1 August, 2014; originally announced August 2014.

  19. arXiv:1407.8083  [pdf, other]

    q-bio.BM math.ST physics.bio-ph physics.chem-ph

    Variational cross-validation of slow dynamical modes in molecular kinetics

    Authors: Robert T. McGibbon, Vijay S. Pande

    Abstract: Markov state models (MSMs) are a widely used method for approximating the eigenspectrum of the molecular dynamics propagator, yielding insight into the long-timescale statistical kinetics and slow dynamical modes of biomolecular systems. However, the lack of a unified theoretical framework for choosing between alternative models has hampered progress, especially for non-experts applying these meth…

    Submitted 27 March, 2015; v1 submitted 30 July, 2014; originally announced July 2014.

    Journal ref: J. Chem. Phys. 142, 124105 (2015)

  20. arXiv:1405.1444  [pdf, other]

    q-bio.BM stat.AP stat.ML

    Understanding Protein Dynamics with L1-Regularized Reversible Hidden Markov Models

    Authors: Robert T. McGibbon, Bharath Ramsundar, Mohammad M. Sultan, Gert Kiss, Vijay S. Pande

    Abstract: We present a machine learning framework for modeling protein dynamics. Our approach uses L1-regularized, reversible hidden Markov models to understand large protein datasets generated via molecular dynamics simulations. Our model is motivated by three design principles: (1) the requirement of massive scalability; (2) the need to adhere to relevant physical law; and (3) the necessity of providing a…

    Submitted 6 May, 2014; originally announced May 2014.

    Journal ref: Proceedings of the 31st International Conference on Machine Learning, Beijing, China, 2014

  21. arXiv:1305.0963  [pdf, other]

    physics.bio-ph q-bio.BM

    Probing the Origins of Two-State Folding

    Authors: Thomas J. Lane, Christian R. Schwantes, Kyle A. Beauchamp, Vijay S. Pande

    Abstract: Many protein systems fold in a two-state manner. Random models, however, rarely display two-state kinetics and thus such behavior should not be accepted as a default. To date, many theories for the prevalence of two-state kinetics have been presented, but none sufficiently explain the breadth of experimental observations. A model, making a minimum of assumptions, is introduced that suggests two-st…

    Submitted 4 May, 2013; originally announced May 2013.

  22. arXiv:1108.2304  [pdf, other]

    cond-mat.stat-mech q-bio.BM

    A robust approach to estimating rates from time-correlation functions

    Authors: John D. Chodera, Phillip J. Elms, William C. Swope, Jan-Hendrik Prinz, Susan Marqusee, Carlos Bustamante, Frank Noé, Vijay S. Pande

    Abstract: While seemingly straightforward in principle, the reliable estimation of rate constants is seldom easy in practice. Numerous issues, such as the complication of poor reaction coordinates, cause obvious approaches to yield unreliable estimates. When a reliable order parameter is available, the reactive flux theory of Chandler allows the rate constant to be extracted from the plateau region of an ap…

    Submitted 10 August, 2011; originally announced August 2011.

  23. arXiv:1105.0710  [pdf, other]

    cond-mat.stat-mech q-bio.BM

    Splitting probabilities as a test of reaction coordinate choice in single-molecule experiments

    Authors: John D. Chodera, Vijay S. Pande

    Abstract: To explain the observed dynamics in equilibrium single-molecule measurements of biomolecules, the experimental observable is often chosen as a putative reaction coordinate along which kinetic behavior is presumed to be governed by diffusive dynamics. Here, we invoke the splitting probability as a test of the suitability of such a proposed reaction coordinate. Comparison of the observed splitting p…

    Submitted 13 July, 2011; v1 submitted 3 May, 2011; originally announced May 2011.

    Journal ref: Phys. Rev. Lett., 107:098102 (2011)

  24. arXiv:1007.0315  [pdf, ps, other]

    physics.bio-ph physics.chem-ph q-bio.BM

    A simple theory of protein folding kinetics

    Authors: Vijay S. Pande

    Abstract: We present a simple model of protein folding dynamics that captures key qualitative elements recently seen in all-atom simulations. The goals of this theory are to serve as a simple formalism for gaining deeper insight into the physical properties seen in detailed simulations as well as to serve as a model to easily compare why these simulations suggest a different kinetic mechanism than previous…

    Submitted 2 July, 2010; originally announced July 2010.

  25. arXiv:0901.0866  [pdf]

    physics.bio-ph physics.comp-ph q-bio.QM

    Folding@Home and Genome@Home: Using distributed computing to tackle previously intractable problems in computational biology

    Authors: Stefan M. Larson, Christopher D. Snow, Michael Shirts, Vijay S. Pande

    Abstract: For decades, researchers have been applying computer simulation to address problems in biology. However, many of these "grand challenges" in computational biology, such as simulating how proteins fold, remained unsolved due to their great complexity. Indeed, even to simulate the fastest folding protein would require decades on the fastest modern CPUs. Here, we review novel methods to fundamental…

    Submitted 7 January, 2009; originally announced January 2009.

  26. Potential for modulation of the hydrophobic effect inside chaperonins

    Authors: Jeremy L. England, Vijay S. Pande

    Abstract: Despite the spontaneity of some in vitro protein folding reactions, native folding in vivo often requires the participation of barrel-shaped multimeric complexes known as chaperonins. Although it has long been known that chaperonin substrates fold upon sequestration inside the chaperonin barrel, the precise mechanism by which confinement within this space facilitates folding remains unknown. In…

    Submitted 4 February, 2008; originally announced February 2008.

  27. Freezing Transition of Compact Polyampholytes

    Authors: Vijay S. Pande, Alexander Yu. Grosberg, Chris Joerg, Mehran Kardar, Toyoichi Tanaka

    Abstract: Polyampholytes (PAs) are heteropolymers with long range Coulomb interactions. Unlike polymers with short range forces, PA energy levels have non-vanishing correlations and are thus very different from the Random Energy Model (REM). Nevertheless, if charges in the PA globule are screened as in a regular plasma, PAs freeze in REM fashion. Our results shed light on the potential role of Coulomb int…

    Submitted 5 September, 1996; originally announced September 1996.

    Comments: 4 pages, 3 eps figures

  28. Is Heteropolymer Freezing Well Described by the Random Energy Model?

    Authors: Vijay S. Pande, Alexander Yu. Grosberg, Chris Joerg, Toyoichi Tanaka

    Abstract: It is widely held that the Random Energy Model (REM) describes the freezing transition of a variety of types of heteropolymers. We demonstrate that the hallmark property of REM, statistical independence of the energies of states over disorder, is violated in different ways for models commonly employed in heteropolymer freezing studies. The implications for proteins are also discussed.

    Submitted 23 April, 1996; originally announced April 1996.

    Comments: 4 pages, 3 eps figures. To appear in Physical Review Letters, May 1996

  29. arXiv:cond-mat/9510123  [pdf, ps, other]

    cond-mat physics.chem-ph q-bio

    How Accurate Must Potentials Be for Successful Modeling of Protein Folding?

    Authors: Vijay S. Pande, Alexander Yu. Grosberg, Toyoichi Tanaka

    Abstract: Protein sequences are believed to have been selected to provide the stability of, and reliable renaturation to, an encoded unique spatial fold. In recently proposed theoretical schemes, this selection is modeled as "minimal frustration," or "optimal energy" of the desirable target conformation over all possible sequences, such that the "design" of the sequence is governed by the interactio…

    Submitted 20 October, 1995; originally announced October 1995.

    Comments: 28 pages, 3 postscript figures; tared, compressed, uuencoded