Showing 1–9 of 9 results for author: Deane, C M

Searching in archive cs.
  1. arXiv:2505.07735  [pdf, ps, other]

    cs.LG

    Assessing the Chemical Intelligence of Large Language Models

    Authors: Nicholas T. Runcie, Charlotte M. Deane, Fergus Imrie

    Abstract: Large Language Models are versatile, general-purpose tools with a wide range of applications. Recently, the advent of "reasoning models" has led to substantial improvements in their abilities in advanced problem-solving domains such as mathematics and software engineering. In this work, we assessed the ability of reasoning models to perform chemistry tasks directly, without any assistance from ext…

    Submitted 10 July, 2025; v1 submitted 12 May, 2025; originally announced May 2025.

  2. arXiv:2502.01533  [pdf, other]

    cs.LG cs.AI q-bio.BM

    Transformers trained on proteins can learn to attend to Euclidean distance

    Authors: Isaac Ellmen, Constantin Schneider, Matthew I. J. Raybould, Charlotte M. Deane

    Abstract: While conventional Transformers generally operate on sequence data, they can be used in conjunction with structure models, typically SE(3)-invariant or equivariant graph neural networks (GNNs), for 3D applications such as protein structure modelling. These hybrids typically involve either (1) preprocessing/tokenizing structural features as input for Transformers or (2) taking Transformer embedding…

    Submitted 3 February, 2025; originally announced February 2025.

  3. arXiv:2407.11942  [pdf, other]

    q-bio.BM cs.LG stat.ML

    Context-Guided Diffusion for Out-of-Distribution Molecular and Protein Design

    Authors: Leo Klarner, Tim G. J. Rudner, Garrett M. Morris, Charlotte M. Deane, Yee Whye Teh

    Abstract: Generative models have the potential to accelerate key steps in the discovery of novel molecular therapeutics and materials. Diffusion models have recently emerged as a powerful approach, excelling at unconditional sample generation and, with data-driven guidance, conditional generation within their training domain. Reliably sampling from high-value regions beyond the training data, however, remai…

    Submitted 16 July, 2024; originally announced July 2024.

    Comments: Published in the Proceedings of the 41st International Conference on Machine Learning (ICML 2024)

  4. arXiv:2405.20863  [pdf, other]

    q-bio.BM cs.AI

    ABodyBuilder3: Improved and scalable antibody structure predictions

    Authors: Henry Kenlay, Frédéric A. Dreyer, Daniel Cutting, Daniel Nissley, Charlotte M. Deane

    Abstract: Accurate prediction of antibody structure is a central task in the design and development of monoclonal antibodies, notably to understand both their developability and their binding properties. In this article, we introduce ABodyBuilder3, an improved and scalable antibody structure prediction model based on ImmuneBuilder. We achieve a new state-of-the-art accuracy in the modelling of CDR loops by…

    Submitted 31 May, 2024; originally announced May 2024.

    Comments: 8 pages, 3 figures, 3 tables, code available at https://github.com/Exscientia/ABodyBuilder3, weights and data available at https://zenodo.org/records/11354577

  5. arXiv:2405.07622  [pdf, other]

    q-bio.BM cs.LG

    De novo antibody design with SE(3) diffusion

    Authors: Daniel Cutting, Frédéric A. Dreyer, David Errington, Constantin Schneider, Charlotte M. Deane

    Abstract: We introduce IgDiff, an antibody variable domain diffusion model based on a general protein backbone diffusion framework which was extended to handle multiple chains. Assessing the designability and novelty of the structures generated with our model, we find that IgDiff produces highly designable antibodies that can contain novel binding regions. The backbone dihedral angles of sampled structures…

    Submitted 13 May, 2024; originally announced May 2024.

    Comments: 20 pages, 11 figures, 4 tables, model weights and samples available at https://zenodo.org/records/11184374

  6. arXiv:2403.17889  [pdf, other]

    q-bio.BM cs.LG

    Large scale paired antibody language models

    Authors: Henry Kenlay, Frédéric A. Dreyer, Aleksandr Kovaltsuk, Dom Miketa, Douglas Pires, Charlotte M. Deane

    Abstract: Antibodies are proteins produced by the immune system that can identify and neutralise a wide variety of antigens with high specificity and affinity, and constitute the most successful class of biotherapeutics. With the advent of next-generation sequencing, billions of antibody sequences have been collected in recent years, though their application in the design of better therapeutics has been con…

    Submitted 26 March, 2024; originally announced March 2024.

    Comments: 14 pages, 2 figures, 6 tables, model weights available at https://zenodo.org/doi/10.5281/zenodo.10876908

  7. arXiv:2310.19513  [pdf, other]

    q-bio.BM cs.AI

    Inverse folding for antibody sequence design using deep learning

    Authors: Frédéric A. Dreyer, Daniel Cutting, Constantin Schneider, Henry Kenlay, Charlotte M. Deane

    Abstract: We consider the problem of antibody sequence design given 3D structural information. Building on previous work, we propose a fine-tuned inverse folding model that is specifically optimised for antibody structures and outperforms generic protein models on sequence recovery and structure robustness when applied on antibodies, with notable improvement on the hypervariable CDR-H3 loop. We study the ca…

    Submitted 30 October, 2023; originally announced October 2023.

    Comments: 2023 ICML Workshop on Computational Biology, model weights available at https://zenodo.org/record/8164693

  8. arXiv:2203.09281  [pdf, other]

    q-bio.NC cs.LG cs.SI stat.AP stat.ML

    Ranking of Communities in Multiplex Spatiotemporal Models of Brain Dynamics

    Authors: James Wilsenach, Katie Warnaby, Charlotte M. Deane, Gesine Reinert

    Abstract: As a relatively new field, network neuroscience has tended to focus on aggregate behaviours of the brain averaged over many successive experiments or over long recordings in order to construct robust brain models. These models are limited in their ability to explain dynamic state changes in the brain which occurs spontaneously as a result of normal brain function. Hidden Markov Models (HMMs) train…

    Submitted 17 May, 2022; v1 submitted 17 March, 2022; originally announced March 2022.

    Comments: Part of the Special Issue on Community Structure in Networks 2021 (35 Pages, first 22 for main text)

    ACM Class: I.2.1; I.6.3; I.6.4; I.6.5; I.6.6

    Journal ref: Applied Network Science (2022) 7-15

  9. arXiv:1704.00387  [pdf, other]

    stat.ML cs.SI physics.soc-ph

    Identifying networks with common organizational principles

    Authors: Anatol E. Wegner, Luis Ospina-Forero, Robert E. Gaunt, Charlotte M. Deane, Gesine Reinert

    Abstract: Many complex systems can be represented as networks, and the problem of network comparison is becoming increasingly relevant. There are many techniques for network comparison, from simply comparing network summary statistics to sophisticated but computationally costly alignment-based approaches. Yet it remains challenging to accurately cluster networks that are of a different size and density, but…

    Submitted 2 April, 2017; originally announced April 2017.

    Comments: 26 pages, 7 figures