Showing 1–6 of 6 results for author: Burke, M D

  1. arXiv:2505.12565

    cs.AI cs.CL cs.LG q-bio.QM

    mCLM: A Function-Infused and Synthesis-Friendly Modular Chemical Language Model

    Authors: Carl Edwards, Chi Han, Gawon Lee, Thao Nguyen, Bowen Jin, Chetan Kumar Prasad, Sara Szymkuć, Bartosz A. Grzybowski, Ying Diao, Jiawei Han, Ge Liu, Hao Peng, Martin D. Burke, Heng Ji

    Abstract: Despite their ability to understand chemical knowledge and accurately generate sequential representations, large language models (LLMs) remain limited in their capacity to propose novel molecules with drug-like properties. In addition, the molecules that LLMs propose can often be challenging to make in the lab. To more effectively enable the discovery of functional small molecules, LLMs need to le…

    Submitted 18 May, 2025; originally announced May 2025.

  2. arXiv:2410.02082

    cs.LG q-bio.QM

    FARM: Functional Group-Aware Representations for Small Molecules

    Authors: Thao Nguyen, Kuan-Hao Huang, Ge Liu, Martin D. Burke, Ying Diao, Heng Ji

    Abstract: We introduce Functional Group-Aware Representations for Small Molecules (FARM), a novel foundation model designed to bridge the gap between SMILES, natural language, and molecular graphs. The key innovation of FARM lies in its functional group-aware tokenization, which directly incorporates functional group information into the representations. This strategic reduction in tokenization granularity…

    Submitted 6 October, 2024; v1 submitted 2 October, 2024; originally announced October 2024.

    Comments: Preprint

  3. arXiv:2311.17189

    hep-ph cond-mat.other physics.comp-ph

    TorchAmi: Generalized CPU/GPU Implementation of Algorithmic Matsubara Integration

    Authors: M. D. Burke, J. P. F. LeBlanc

    Abstract: We present torchami, an advanced implementation of algorithmic Matsubara integration (AMI) that utilizes pytorch as a backend to provide easy parallelization and GPU support. AMI is a tool for analytically resolving the sequence of nested Matsubara integrals that arise in virtually all Feynman perturbative expansions. In this implementation we present a new AMI algorithm that creates a more natura…

    Submitted 28 November, 2023; originally announced November 2023.

    Comments: 23 pages, 5 figures. Code reference included

  4. arXiv:2305.13650

    cs.LG cs.AI

    Robust Model-Based Optimization for Challenging Fitness Landscapes

    Authors: Saba Ghaffari, Ehsan Saleh, Alexander G. Schwing, Yu-Xiong Wang, Martin D. Burke, Saurabh Sinha

    Abstract: Protein design, a grand challenge of the day, involves optimization on a fitness landscape, and leading methods adopt a model-based approach where a model is trained on a training set (protein sequences and fitness) and proposes candidates to explore next. These methods are challenged by sparsity of high-fitness samples in the training set, a problem that has been in the literature. A less recogni…

    Submitted 27 June, 2024; v1 submitted 22 May, 2023; originally announced May 2023.

  5. Renormalized Perturbation Theory for Fast Evaluation of Feynman Diagrams on the Real Frequency Axis

    Authors: M. D. Burke, Maxence Grandadam, J. P. F. LeBlanc

    Abstract: We present a method to accelerate the numerical evaluation of spatial integrals of Feynman diagrams when expressed on the real frequency axis. This can be realized through use of a renormalized perturbation expansion with a constant but complex renormalization shift. The complex shift acts as a regularization parameter for the numerical integration of otherwise sharp functions. This results in an…

    Submitted 4 November, 2022; originally announced November 2022.

  6. arXiv:2109.09888

    cs.LG physics.chem-ph q-bio.QM

    Chemical-Reaction-Aware Molecule Representation Learning

    Authors: Hongwei Wang, Weijiang Li, Xiaomeng Jin, Kyunghyun Cho, Heng Ji, Jiawei Han, Martin D. Burke

    Abstract: Molecule representation learning (MRL) methods aim to embed molecules into a real vector space. However, existing SMILES-based (Simplified Molecular-Input Line-Entry System) or GNN-based (Graph Neural Networks) MRL methods either take SMILES strings as input that have difficulty in encoding molecule structure information, or over-emphasize the importance of GNN architectures but neglect their gene…

    Submitted 22 September, 2021; v1 submitted 20 September, 2021; originally announced September 2021.