Showing 1–10 of 10 results for author: Jantre, S

  1. arXiv:2505.17773

    cs.LG

    C-LoRA: Contextual Low-Rank Adaptation for Uncertainty Estimation in Large Language Models

    Authors: Amir Hossein Rahmati, Sanket Jantre, Weifeng Zhang, Yucheng Wang, Byung-Jun Yoon, Nathan M. Urban, Xiaoning Qian

    Abstract: Low-Rank Adaptation (LoRA) offers a cost-effective solution for fine-tuning large language models (LLMs), but it often produces overconfident predictions in data-scarce few-shot settings. To address this issue, several classical statistical learning approaches have been repurposed for scalable uncertainty-aware LoRA fine-tuning. However, these approaches neglect how input characteristics affect th…

    Submitted 28 May, 2025; v1 submitted 23 May, 2025; originally announced May 2025.

  2. arXiv:2502.06173

    cs.LG cs.AI cs.CL stat.AP stat.ML

    Uncertainty-Aware Adaptation of Large Language Models for Protein-Protein Interaction Analysis

    Authors: Sanket Jantre, Tianle Wang, Gilchan Park, Kriti Chopra, Nicholas Jeon, Xiaoning Qian, Nathan M. Urban, Byung-Jun Yoon

    Abstract: Identification of protein-protein interactions (PPIs) helps derive cellular mechanistic understanding, particularly in the context of complex conditions such as neurodegenerative disorders, metabolic syndromes, and cancer. Large Language Models (LLMs) have demonstrated remarkable potential in predicting protein structures and interactions via automated mining of vast biomedical literature; yet the…

    Submitted 10 February, 2025; originally announced February 2025.

  3. arXiv:2405.20573

    cs.LG q-bio.BM q-bio.QM stat.ML

    Enhancing Generative Molecular Design via Uncertainty-guided Fine-tuning of Variational Autoencoders

    Authors: A N M Nafiz Abeer, Sanket Jantre, Nathan M Urban, Byung-Jun Yoon

    Abstract: In recent years, deep generative models have been successfully adopted for various molecular design tasks, particularly in the life and material sciences. A critical challenge for pre-trained generative molecular design (GMD) models is to fine-tune them to be better suited for downstream design tasks aimed at optimizing specific molecular properties. However, redesigning and training an existing e…

    Submitted 30 May, 2024; originally announced May 2024.

  4. arXiv:2405.00202

    cs.LG q-bio.QM stat.ML

    Leveraging Active Subspaces to Capture Epistemic Model Uncertainty in Deep Generative Models for Molecular Design

    Authors: A N M Nafiz Abeer, Sanket Jantre, Nathan M Urban, Byung-Jun Yoon

    Abstract: Deep generative models have been accelerating the inverse design process in material and drug design. Unlike their counterpart property predictors in typical molecular design frameworks, generative molecular design models have seen fewer efforts on uncertainty quantification (UQ) due to computational challenges in Bayesian inference posed by their large number of parameters. In this work, we focus…

    Submitted 15 August, 2024; v1 submitted 30 April, 2024; originally announced May 2024.

  5. arXiv:2309.03061

    stat.ML cs.LG stat.ME

    Learning Active Subspaces for Effective and Scalable Uncertainty Quantification in Deep Neural Networks

    Authors: Sanket Jantre, Nathan M. Urban, Xiaoning Qian, Byung-Jun Yoon

    Abstract: Bayesian inference for neural networks, or Bayesian deep learning, has the potential to provide well-calibrated predictions with quantified uncertainty and robustness. However, the main hurdle for Bayesian deep learning is its computational complexity due to the high dimensionality of the parameter space. In this work, we propose a novel scheme that addresses this limitation by constructing a low-…

    Submitted 6 September, 2023; originally announced September 2023.

  6. arXiv:2308.09104

    stat.ML cs.LG stat.ME

    Spike-and-slab shrinkage priors for structurally sparse Bayesian neural networks

    Authors: Sanket Jantre, Shrijita Bhattacharya, Tapabrata Maiti

    Abstract: Network complexity and computational efficiency have become increasingly significant aspects of deep learning. Sparse deep learning addresses these challenges by recovering a sparse representation of the underlying target function by reducing heavily over-parameterized deep neural networks. Specifically, deep neural architectures compressed via structured sparsity (e.g. node sparsity) provide low…

    Submitted 21 August, 2024; v1 submitted 17 August, 2023; originally announced August 2023.

  7. arXiv:2306.08754

    cs.LG physics.ao-ph

    ClimSim-Online: A Large Multi-scale Dataset and Framework for Hybrid ML-physics Climate Emulation

    Authors: Sungduk Yu, Zeyuan Hu, Akshay Subramaniam, Walter Hannah, Liran Peng, Jerry Lin, Mohamed Aziz Bhouri, Ritwik Gupta, Björn Lütjens, Justus C. Will, Gunnar Behrens, Julius J. M. Busecke, Nora Loose, Charles I. Stern, Tom Beucler, Bryce Harrop, Helge Heuer, Benjamin R. Hillman, Andrea Jenney, Nana Liu, Alistair White, Tian Zheng, Zhiming Kuang, Fiaz Ahmed, Elizabeth Barnes, et al. (22 additional authors not shown)

    Abstract: Modern climate projections lack adequate spatial and temporal resolution due to computational constraints, leading to inaccuracies in representing critical processes like thunderstorms that occur on the sub-resolution scale. Hybrid methods combining physics with machine learning (ML) offer faster, higher fidelity climate simulations by outsourcing compute-hungry, high-resolution simulations to ML…

    Submitted 8 July, 2024; v1 submitted 14 June, 2023; originally announced June 2023.

    Comments: This manuscript is an expanded version of our paper that received the Outstanding Paper Award at the NeurIPS 2023 conference

  8. arXiv:2210.04083

    cs.LG cs.AI

    Unified Probabilistic Neural Architecture and Weight Ensembling Improves Model Robustness

    Authors: Sumegha Premchandar, Sandeep Madireddy, Sanket Jantre, Prasanna Balaprakash

    Abstract: Robust machine learning models with accurately calibrated uncertainties are crucial for safety-critical applications. Probabilistic machine learning and especially the Bayesian formalism provide a systematic framework to incorporate robustness through the distributional estimates and reason about uncertainty. Recent works have shown that approximate inference approaches that take the weight space…

    Submitted 8 October, 2022; originally announced October 2022.

  9. arXiv:2206.00794

    stat.ML cs.LG math.ST

    Sequential Bayesian Neural Subnetwork Ensembles

    Authors: Sanket Jantre, Shrijita Bhattacharya, Nathan M. Urban, Byung-Jun Yoon, Tapabrata Maiti, Prasanna Balaprakash, Sandeep Madireddy

    Abstract: Deep ensembles have emerged as a powerful technique for improving predictive performance and enhancing model robustness across various applications by leveraging model diversity. However, traditional deep ensemble methods are often computationally expensive and rely on deterministic models, which may limit their flexibility. Additionally, while sparse subnetworks of dense models have shown promise…

    Submitted 19 August, 2024; v1 submitted 1 June, 2022; originally announced June 2022.

  10. arXiv:2108.11000

    stat.ML cs.LG

    Layer Adaptive Node Selection in Bayesian Neural Networks: Statistical Guarantees and Implementation Details

    Authors: Sanket Jantre, Shrijita Bhattacharya, Tapabrata Maiti

    Abstract: Sparse deep neural networks have proven to be efficient for predictive model building in large-scale studies. Although several works have studied theoretical and numerical properties of sparse neural architectures, they have primarily focused on the edge selection. Sparsity through edge selection might be intuitively appealing; however, it does not necessarily reduce the structural complexity of a…

    Submitted 8 July, 2022; v1 submitted 24 August, 2021; originally announced August 2021.