Skip to main content

Showing 1–6 of 6 results for author: Burke, M

Searching in archive q-bio. Search in all archives.
.
  1. arXiv:2505.12565  [pdf, ps, other

    cs.AI cs.CL cs.LG q-bio.QM

    mCLM: A Function-Infused and Synthesis-Friendly Modular Chemical Language Model

    Authors: Carl Edwards, Chi Han, Gawon Lee, Thao Nguyen, Bowen Jin, Chetan Kumar Prasad, Sara Szymkuć, Bartosz A. Grzybowski, Ying Diao, Jiawei Han, Ge Liu, Hao Peng, Martin D. Burke, Heng Ji

    Abstract: Despite their ability to understand chemical knowledge and accurately generate sequential representations, large language models (LLMs) remain limited in their capacity to propose novel molecules with drug-like properties. In addition, the molecules that LLMs propose can often be challenging to make in the lab. To more effectively enable the discovery of functional small molecules, LLMs need to le… ▽ More

    Submitted 18 May, 2025; originally announced May 2025.

  2. arXiv:2410.02082  [pdf, other

    cs.LG q-bio.QM

    FARM: Functional Group-Aware Representations for Small Molecules

    Authors: Thao Nguyen, Kuan-Hao Huang, Ge Liu, Martin D. Burke, Ying Diao, Heng Ji

    Abstract: We introduce Functional Group-Aware Representations for Small Molecules (FARM), a novel foundation model designed to bridge the gap between SMILES, natural language, and molecular graphs. The key innovation of FARM lies in its functional group-aware tokenization, which directly incorporates functional group information into the representations. This strategic reduction in tokenization granularity… ▽ More

    Submitted 6 October, 2024; v1 submitted 2 October, 2024; originally announced October 2024.

    Comments: Preprint

  3. arXiv:2307.11694  [pdf, other

    cs.AI cs.LG q-bio.BM q-bio.MN

    SynerGPT: In-Context Learning for Personalized Drug Synergy Prediction and Drug Design

    Authors: Carl Edwards, Aakanksha Naik, Tushar Khot, Martin Burke, Heng Ji, Tom Hope

    Abstract: Predicting synergistic drug combinations can help accelerate discovery of cancer treatments, particularly therapies personalized to a patient's specific tumor via biopsied cells. In this paper, we propose a novel setting and models for in-context drug synergy learning. We are given a small "personalized dataset" of 10-20 drug synergy relationships in the context of specific cancer cell targets. Ou… ▽ More

    Submitted 24 October, 2023; v1 submitted 19 June, 2023; originally announced July 2023.

  4. FAIR Data Pipeline: provenance-driven data management for traceable scientific workflows

    Authors: Sonia Natalie Mitchell, Andrew Lahiff, Nathan Cummings, Jonathan Hollocombe, Bram Boskamp, Ryan Field, Dennis Reddyhoff, Kristian Zarebski, Antony Wilson, Bruno Viola, Martin Burke, Blair Archibald, Paul Bessell, Richard Blackwell, Lisa A Boden, Alys Brett, Sam Brett, Ruth Dundas, Jessica Enright, Alejandra N. Gonzalez-Beltran, Claire Harris, Ian Hinder, Christopher David Hughes, Martin Knight, Vino Mano , et al. (13 additional authors not shown)

    Abstract: Modern epidemiological analyses to understand and combat the spread of disease depend critically on access to, and use of, data. Rapidly evolving data, such as data streams changing during a disease outbreak, are particularly challenging. Data management is further complicated by data being imprecisely identified when used. Public trust in policy decisions resulting from such analyses is easily da… ▽ More

    Submitted 4 May, 2022; v1 submitted 13 October, 2021; originally announced October 2021.

  5. arXiv:2109.09888  [pdf, other

    cs.LG physics.chem-ph q-bio.QM

    Chemical-Reaction-Aware Molecule Representation Learning

    Authors: Hongwei Wang, Weijiang Li, Xiaomeng Jin, Kyunghyun Cho, Heng Ji, Jiawei Han, Martin D. Burke

    Abstract: Molecule representation learning (MRL) methods aim to embed molecules into a real vector space. However, existing SMILES-based (Simplified Molecular-Input Line-Entry System) or GNN-based (Graph Neural Networks) MRL methods either take SMILES strings as input that have difficulty in encoding molecule structure information, or over-emphasize the importance of GNN architectures but neglect their gene… ▽ More

    Submitted 22 September, 2021; v1 submitted 20 September, 2021; originally announced September 2021.

  6. Differential hydrophobicity drives self-assembly in Huntington's disease

    Authors: Martin G. Burke, Rudiger Woscholski, Sophia N. Yaliraki

    Abstract: Identifying the driving forces and the mechanism of association of huntingtin-exon1, a close marker for the progress of Huntington's disease, is an important prerequisite towards finding potential drug targets, and ultimately a cure. We introduce here a modelling framework based on a key analogy of the physico-chemical properties of the exon1 fragment to block copolymers. We use a systematic mes… ▽ More

    Submitted 26 September, 2003; originally announced September 2003.