Skip to main content

Showing 1–2 of 2 results for author: Clark, J D

Searching in archive q-bio. Search in all archives.
.
  1. arXiv:2503.21017  [pdf, other

    q-bio.BM q-bio.QM

    Two for the Price of One: Integrating Large Language Models to Learn Biophysical Interactions

    Authors: Joseph D. Clark, Tanner J. Dean, Diwakar Shukla

    Abstract: Deep learning models have become fundamental tools in drug design. In particular, large language models trained on biochemical sequences learn feature vectors that guide drug discovery through virtual screening. However, such models do not capture the molecular interactions important for binding affinity and specificity. Therefore, there is a need to 'compose' representations from distinct biologi… ▽ More

    Submitted 26 March, 2025; originally announced March 2025.

    Comments: 27 pages, 5 Figures

  2. arXiv:2402.15181  [pdf, other

    q-bio.QM q-bio.BM

    Substrate Prediction for RiPP Biosynthetic Enzymes via Masked Language Modeling and Transfer Learning

    Authors: Joseph D. Clark, Xuenan Mi, Douglas A. Mitchell, Diwakar Shukla

    Abstract: Ribosomally synthesized and post-translationally modified peptide (RiPP) biosynthetic enzymes often exhibit promiscuous substrate preferences that cannot be reduced to simple rules. Large language models are promising tools for predicting such peptide fitness landscapes. However, state-of-the-art protein language models are trained on relatively few peptide sequences. A previous study comprehensiv… ▽ More

    Submitted 23 February, 2024; originally announced February 2024.