Search | arXiv e-print repository

Implicit Delta Learning of High Fidelity Neural Network Potentials

Authors: Stephan Thaler, Cristian Gabellini, Nikhil Shenoy, Prudencio Tossou

Abstract: Neural network potentials (NNPs) offer a fast and accurate alternative to ab-initio methods for molecular dynamics (MD) simulations but are hindered by the high cost of training data from high-fidelity Quantum Mechanics (QM) methods. Our work introduces the Implicit Delta Learning (IDLe) method, which reduces the need for high-fidelity QM data by leveraging cheaper semi-empirical QM computations w… ▽ More Neural network potentials (NNPs) offer a fast and accurate alternative to ab-initio methods for molecular dynamics (MD) simulations but are hindered by the high cost of training data from high-fidelity Quantum Mechanics (QM) methods. Our work introduces the Implicit Delta Learning (IDLe) method, which reduces the need for high-fidelity QM data by leveraging cheaper semi-empirical QM computations without compromising NNP accuracy or inference cost. IDLe employs an end-to-end multi-task architecture with fidelity-specific heads that decode energies based on a shared latent representation of the input atomistic system. In various settings, IDLe achieves the same accuracy as single high-fidelity baselines while using up to 50x less high-fidelity data. This result could significantly reduce data generation cost and consequently enhance accuracy and generalization, and expand chemical coverage for NNPs, advancing MD simulations for material science and drug discovery. Additionally, we provide a novel set of 11 million semi-empirical QM calculations to support future multi-fidelity NNP modeling. △ Less

Submitted 8 December, 2024; originally announced December 2024.

arXiv:2411.19629 [pdf, other]

OpenQDC: Open Quantum Data Commons

Authors: Cristian Gabellini, Nikhil Shenoy, Stephan Thaler, Semih Canturk, Daniel McNeela, Dominique Beaini, Michael Bronstein, Prudencio Tossou

Abstract: Machine Learning Interatomic Potentials (MLIPs) are a highly promising alternative to force-fields for molecular dynamics (MD) simulations, offering precise and rapid energy and force calculations. However, Quantum-Mechanical (QM) datasets, crucial for MLIPs, are fragmented across various repositories, hindering accessibility and model development. We introduce the openQDC package, consolidating 3… ▽ More Machine Learning Interatomic Potentials (MLIPs) are a highly promising alternative to force-fields for molecular dynamics (MD) simulations, offering precise and rapid energy and force calculations. However, Quantum-Mechanical (QM) datasets, crucial for MLIPs, are fragmented across various repositories, hindering accessibility and model development. We introduce the openQDC package, consolidating 37 QM datasets from over 250 quantum methods and 400 million geometries into a single, accessible resource. These datasets are meticulously preprocessed, and standardized for MLIP training, covering a wide range of chemical elements and interactions relevant in organic chemistry. OpenQDC includes tools for normalization and integration, easily accessible via Python. Experiments with well-known architectures like SchNet, TorchMD-Net, and DimeNet reveal challenges for those architectures and constitute a leaderboard to accelerate benchmarking and guide novel algorithms development. Continuously adding datasets to OpenQDC will democratize QM dataset access, foster more collaboration and innovation, enhance MLIP development, and support their adoption in the MD field. △ Less

Submitted 29 November, 2024; originally announced November 2024.

arXiv:2410.22388 [pdf, other]

ET-Flow: Equivariant Flow-Matching for Molecular Conformer Generation

Authors: Majdi Hassan, Nikhil Shenoy, Jungyoon Lee, Hannes Stark, Stephan Thaler, Dominique Beaini

Abstract: Predicting low-energy molecular conformations given a molecular graph is an important but challenging task in computational drug discovery. Existing state-of-the-art approaches either resort to large scale transformer-based models that diffuse over conformer fields, or use computationally expensive methods to generate initial structures and diffuse over torsion angles. In this work, we introduce E… ▽ More Predicting low-energy molecular conformations given a molecular graph is an important but challenging task in computational drug discovery. Existing state-of-the-art approaches either resort to large scale transformer-based models that diffuse over conformer fields, or use computationally expensive methods to generate initial structures and diffuse over torsion angles. In this work, we introduce Equivariant Transformer Flow (ET-Flow). We showcase that a well-designed flow matching approach with equivariance and harmonic prior alleviates the need for complex internal geometry calculations and large architectures, contrary to the prevailing methods in the field. Our approach results in a straightforward and scalable method that directly operates on all-atom coordinates with minimal assumptions. With the advantages of equivariance and flow matching, ET-Flow significantly increases the precision and physical validity of the generated conformers, while being a lighter model and faster at inference. Code is available https://github.com/shenoynikhil/ETFlow. △ Less

Submitted 29 October, 2024; originally announced October 2024.

Comments: NeurIPS 2024

arXiv:2311.00862 [pdf, other]

Role of Structural and Conformational Diversity for Machine Learning Potentials

Authors: Nikhil Shenoy, Prudencio Tossou, Emmanuel Noutahi, Hadrien Mary, Dominique Beaini, Jiarui Ding

Abstract: In the field of Machine Learning Interatomic Potentials (MLIPs), understanding the intricate relationship between data biases, specifically conformational and structural diversity, and model generalization is critical in improving the quality of Quantum Mechanics (QM) data generation efforts. We investigate these dynamics through two distinct experiments: a fixed budget one, where the dataset size… ▽ More In the field of Machine Learning Interatomic Potentials (MLIPs), understanding the intricate relationship between data biases, specifically conformational and structural diversity, and model generalization is critical in improving the quality of Quantum Mechanics (QM) data generation efforts. We investigate these dynamics through two distinct experiments: a fixed budget one, where the dataset size remains constant, and a fixed molecular set one, which focuses on fixed structural diversity while varying conformational diversity. Our results reveal nuanced patterns in generalization metrics. Notably, for optimal structural and conformational generalization, a careful balance between structural and conformational diversity is required, but existing QM datasets do not meet that trade-off. Additionally, our results highlight the limitation of the MLIP models at generalizing beyond their training distribution, emphasizing the importance of defining applicability domain during model deployment. These findings provide valuable insights and guidelines for QM data generation efforts. △ Less

Submitted 30 October, 2023; originally announced November 2023.

Comments: Accepted at NeurIPS 2023 AI4D3 and AI4S workshops

arXiv:2208.06359 [pdf]

A Case for Rejection in Low Resource ML Deployment

Authors: Jerome White, Pulkit Madaan, Nikhil Shenoy, Apoorv Agnihotri, Makkunda Sharma, Jigar Doshi

Abstract: Building reliable AI decision support systems requires a robust set of data on which to train models; both with respect to quantity and diversity. Obtaining such datasets can be difficult in resource limited settings, or for applications in early stages of deployment. Sample rejection is one way to work around this challenge, however much of the existing work in this area is ill-suited for such sc… ▽ More Building reliable AI decision support systems requires a robust set of data on which to train models; both with respect to quantity and diversity. Obtaining such datasets can be difficult in resource limited settings, or for applications in early stages of deployment. Sample rejection is one way to work around this challenge, however much of the existing work in this area is ill-suited for such scenarios. This paper substantiates that position and proposes a simple solution as a proof of concept baseline. △ Less

Submitted 15 August, 2022; v1 submitted 12 August, 2022; originally announced August 2022.

Journal ref: NeurIPS 2022 workshop on Challenges In Deploying And Monitoring Machine Learning Systems

arXiv:2106.03851 [pdf, other]

Impact of data-splits on generalization: Identifying COVID-19 from cough and context

Authors: Makkunda Sharma, Nikhil Shenoy, Jigar Doshi, Piyush Bagad, Aman Dalmia, Parag Bhamare, Amrita Mahale, Saurabh Rane, Neeraj Agrawal, Rahul Panicker

Abstract: Rapidly scaling screening, testing and quarantine has shown to be an effective strategy to combat the COVID-19 pandemic. We consider the application of deep learning techniques to distinguish individuals with COVID from non-COVID by using data acquirable from a phone. Using cough and context (symptoms and meta-data) represent such a promising approach. Several independent works in this direction h… ▽ More Rapidly scaling screening, testing and quarantine has shown to be an effective strategy to combat the COVID-19 pandemic. We consider the application of deep learning techniques to distinguish individuals with COVID from non-COVID by using data acquirable from a phone. Using cough and context (symptoms and meta-data) represent such a promising approach. Several independent works in this direction have shown promising results. However, none of them report performance across clinically relevant data splits. Specifically, the performance where the development and test sets are split in time (retrospective validation) and across sites (broad validation). Although there is meaningful generalization across these splits the performance significantly varies (up to 0.1 AUC score). In addition, we study the performance of symptomatic and asymptomatic individuals across these three splits. Finally, we show that our model focuses on meaningful features of the input, cough bouts for cough and relevant symptoms for context. The code and checkpoints are available at https://github.com/WadhwaniAI/cough-against-covid △ Less

Submitted 5 June, 2021; originally announced June 2021.

Comments: Published as a workshop paper at ICLR 2021 AI for Public Health Workshop and ICLR 20201 Machine Learning for Preventing and Combating Pandemics Workshop

arXiv:1912.05934 [pdf]

Lion Algorithm- Optimized Long Short-Term Memory Network for Groundwater Level Forecasting in Udupi District, India

Authors: Supreetha B. S, Narayan Shenoy, Prabhakar Nayak

Abstract: Groundwater is a precious natural resource. Groundwater level (GWL) forecasting is crucial in the field of water resource management. Measurement of GWL from observation-wells is the principle source of information about the aquifer and is critical to its evaluation. Most part of the Udupi district of Karnataka State in India consists of geological formations: lateritic terrain and gneissic comple… ▽ More Groundwater is a precious natural resource. Groundwater level (GWL) forecasting is crucial in the field of water resource management. Measurement of GWL from observation-wells is the principle source of information about the aquifer and is critical to its evaluation. Most part of the Udupi district of Karnataka State in India consists of geological formations: lateritic terrain and gneissic complex. Due to the topographical ruggedness and inconsistency in rainfall, the GWL in Udupi region is declining continually and most of the open wells are drying-up during the summer. Hence, the current research aimed at developing a groundwater level forecasting model by using hybrid Long Short-term Memory-Lion Algorithm (LSTM-LA). The historical GWL and rainfall data from an observation well from Udupi district, located in Karnataka state, India, were used to develop the model. The prediction accuracy of the hybrid LSTM-LA model was better than that of the Feedforward Neural network (FFNN) and the isolated LSTM models. The hybrid LSTM-LA based forecasting model is promising for a larger dataset. △ Less

Submitted 5 December, 2019; originally announced December 2019.

arXiv:1807.04888 [pdf]

Utilizing Smartphone-Based Machine Learning in Medical Monitor Data Collection: Seven Segment Digit Recognition

Authors: Varun N. Shenoy, Oliver O. Aalami

Abstract: Biometric measurements captured from medical devices, such as blood pressure gauges, glucose monitors, and weighing scales, are essential to tracking a patient's health. Trends in these measurements can accurately track diabetes, cardiovascular issues, and assist medication management for patients. Currently, patients record their results and data of measurement in a physical notebook. It may be w… ▽ More Biometric measurements captured from medical devices, such as blood pressure gauges, glucose monitors, and weighing scales, are essential to tracking a patient's health. Trends in these measurements can accurately track diabetes, cardiovascular issues, and assist medication management for patients. Currently, patients record their results and data of measurement in a physical notebook. It may be weeks before a doctor sees a patient's records and can assess the health of the patient. With a predicted 6.8 billion smartphones in the world by 2022, health monitoring platforms, such as Apple's HealthKit, can be leveraged to provide the right care at the right time. This research presents a mobile application that enables users to capture medical monitor data and send it to their doctor swiftly. A key contribution of this paper is a robust engine that can recognize digits from medical monitors with an accuracy of 98.2%. △ Less

Submitted 12 July, 2018; originally announced July 2018.

Comments: Accepted for publication in AMIA 2017 Annual Symposium

Journal ref: Shenoy VN, Aalami OO. Utilizing Smartphone-Based Machine Learning in Medical Monitor Data Collection: Seven Segment Digit Recognition. AMIA Annual Symposium Proceedings. 2017;2017:1564-1570

arXiv:1111.4886 [pdf, other]

Prediction Of Arrival Of Nodes In A Scale Free Network

Authors: S. M. Vijay Mahantesh, Sudarshan Iyengar, M. Vijesh, Shruthi Nayak, Nikitha Shenoy

Abstract: Most of the networks observed in real life obey power-law degree distribution. It is hypothesized that the emergence of such a degree distribution is due to preferential attachment of the nodes. Barabasi-Albert model is a generative procedure that uses preferential attachment based on degree and one can use this model to generate networks with power-law degree distribution. In this model, the netw… ▽ More Most of the networks observed in real life obey power-law degree distribution. It is hypothesized that the emergence of such a degree distribution is due to preferential attachment of the nodes. Barabasi-Albert model is a generative procedure that uses preferential attachment based on degree and one can use this model to generate networks with power-law degree distribution. In this model, the network is assumed to grow one node every time step. After the evolution of such a network, it is impossible for one to predict the exact order of node arrivals. We present in this article, a novel strategy to partially predict the order of node arrivals in such an evolved network. We show that our proposed method outperforms other centrality measure based approaches. We bin the nodes and predict the order of node arrivals between the bins with an accuracy of above 80%. △ Less

Submitted 24 November, 2011; v1 submitted 21 November, 2011; originally announced November 2011.

Showing 1–9 of 9 results for author: Shenoy, N