
Showing 1–20 of 20 results for author: Hammerla, N

  1. arXiv:2412.01388  [pdf, other]

    cs.LG

    Harnessing Preference Optimisation in Protein LMs for Hit Maturation in Cell Therapy

    Authors: Katarzyna Janocha, Annabel Ling, Alice Godson, Yulia Lampi, Simon Bornschein, Nils Y. Hammerla

    Abstract: Cell and immunotherapy offer transformative potential for treating diseases like cancer and autoimmune disorders by modulating the immune system. The development of these therapies is resource-intensive, with the majority of drug candidates failing to progress beyond laboratory testing. While recent advances in machine learning have revolutionised areas such as protein engineering, applications in…

    Submitted 3 December, 2024; v1 submitted 2 December, 2024; originally announced December 2024.

  2. arXiv:2210.10946  [pdf, other]

    cs.LG

    Causally-guided Regularization of Graph Attention Improves Generalizability

    Authors: Alexander P. Wu, Thomas Markovich, Bonnie Berger, Nils Hammerla, Rohit Singh

    Abstract: Graph attention networks estimate the relational importance of node neighbors to aggregate relevant information over local neighborhoods for a prediction task. However, the inferred attentions are vulnerable to spurious correlations and connectivity in the training data, hampering the generalizability of the model. We introduce CAR, a general-purpose regularization framework for graph attention ne…

    Submitted 28 February, 2023; v1 submitted 19 October, 2022; originally announced October 2022.

  3. arXiv:2209.15486  [pdf, other]

    cs.LG cs.IR

    Graph Neural Networks for Link Prediction with Subgraph Sketching

    Authors: Benjamin Paul Chamberlain, Sergey Shirobokov, Emanuele Rossi, Fabrizio Frasca, Thomas Markovich, Nils Hammerla, Michael M. Bronstein, Max Hansmire

    Abstract: Many Graph Neural Networks (GNNs) perform poorly compared to simple heuristics on Link Prediction (LP) tasks. This is due to limitations in expressive power such as the inability to count triangles (the backbone of most LP heuristics) and because they cannot distinguish automorphic nodes (those having identical structural roles). Both expressiveness issues can be alleviated by learning link (rath…

    Submitted 2 May, 2023; v1 submitted 30 September, 2022; originally announced September 2022.

    Comments: 29 pages, 19 figures, 6 appendices

    Journal ref: The Eleventh International Conference on Learning Representations 2023 (oral - top 5%)

  4. arXiv:2112.12672  [pdf, other]

    cs.CL

    Towards more patient friendly clinical notes through language models and ontologies

    Authors: Francesco Moramarco, Damir Juric, Aleksandar Savkov, Jack Flann, Maria Lehl, Kristian Boda, Tessa Grafen, Vitalii Zhelezniak, Sunir Gohil, Alex Papadopoulos Korfiatis, Nils Hammerla

    Abstract: Clinical notes are an efficient way to record patient information but are notoriously hard to decipher for non-experts. Automatically simplifying medical text can empower patients with valuable information about their health, while saving clinicians time. We present a novel approach to automated simplification of medical text based on word frequencies and language modelling, grounded on medical on…

    Submitted 23 December, 2021; originally announced December 2021.

    Report number: 35308976

  5. arXiv:2010.16218  [pdf, other]

    cs.CL cs.AI cs.IR cs.LG

    Biomedical Concept Relatedness -- A large EHR-based benchmark

    Authors: Claudia Schulz, Josh Levy-Kramer, Camille Van Assel, Miklos Kepes, Nils Hammerla

    Abstract: A promising application of AI to healthcare is the retrieval of information from electronic health records (EHRs), e.g. to aid clinicians in finding relevant information for a consultation or to recruit suitable patients for a study. This requires search capabilities far beyond simple string matching, including the retrieval of concepts (diagnoses, symptoms, medications, etc.) related to the one i…

    Submitted 30 October, 2020; originally announced October 2020.

    Comments: Accepted for publication at the 28th International Conference on Computational Linguistics (COLING 2020)

  6. arXiv:2007.13794  [pdf, other]

    cs.LG stat.ML

    Neural Temporal Point Processes For Modelling Electronic Health Records

    Authors: Joseph Enguehard, Dan Busbridge, Adam Bozson, Claire Woodcock, Nils Y. Hammerla

    Abstract: The modelling of Electronic Health Records (EHRs) has the potential to drive more efficient allocation of healthcare resources, enabling early intervention strategies and advancing personalised healthcare. However, EHRs are challenging to model due to their realisation as noisy, multi-modal data occurring at irregular time intervals. To address their temporal nature, we treat EHRs as samples gener…

    Submitted 7 December, 2020; v1 submitted 27 July, 2020; originally announced July 2020.

    Comments: Version accepted to Machine Learning for Health (ML4H) workshop at NeurIPS 2020. 10 pages, 5 figures, 3 tables. Code, pre-trained models and datasets available at https://github.com/babylonhealth/neuralTPPs

  7. arXiv:1910.03492  [pdf, other]

    cs.CL cs.LG cs.NE stat.ML

    Neural Language Priors

    Authors: Joseph Enguehard, Dan Busbridge, Vitalii Zhelezniak, Nils Hammerla

    Abstract: The choice of sentence encoder architecture reflects assumptions about how a sentence's meaning is composed from its constituent words. We examine the contribution of these architectures by holding them randomly initialised and fixed, effectively treating them as hand-crafted language priors, and evaluating the resulting sentence encoders on downstream language tasks. We find that even when enc…

    Submitted 4 October, 2019; originally announced October 2019.

    Comments: 4 pages, 1 figure, 1 table

  8. arXiv:1910.02902  [pdf, other]

    cs.CL cs.LG

    Correlations between Word Vector Sets

    Authors: Vitalii Zhelezniak, April Shen, Daniel Busbridge, Aleksandar Savkov, Nils Hammerla

    Abstract: Similarity measures based purely on word embeddings are comfortably competing with much more sophisticated deep learning and expert-engineered systems on unsupervised semantic textual similarity (STS) tasks. In contrast to commonly used geometric approaches, we treat a single word embedding as e.g. 300 observations from a scalar random variable. Using this paradigm, we first illustrate that simila…

    Submitted 7 October, 2019; originally announced October 2019.

    Comments: Accepted as a long paper at EMNLP-IJCNLP 2019

  9. arXiv:1906.10671  [pdf, other]

    cs.LG stat.ML

    Explaining Deep Learning Models with Constrained Adversarial Examples

    Authors: Jonathan Moore, Nils Hammerla, Chris Watkins

    Abstract: Machine learning algorithms generally suffer from a problem of explainability. Given a classification result from a model, it is typically hard to determine what caused the decision to be made, and to give an informative explanation. We explore a new method of generating counterfactual explanations, which, instead of explaining why a particular classification was made, explain how a different outcom…

    Submitted 25 June, 2019; originally announced June 2019.

  10. arXiv:1905.07790  [pdf, other]

    cs.CL cs.LG stat.ML

    Correlation Coefficients and Semantic Textual Similarity

    Authors: Vitalii Zhelezniak, Aleksandar Savkov, April Shen, Nils Y. Hammerla

    Abstract: A large body of research into semantic textual similarity has focused on constructing state-of-the-art embeddings using sophisticated modelling, careful choice of learning signals and many clever tricks. By contrast, little attention has been devoted to similarity measures between these embeddings, with cosine similarity being used unquestionably in the majority of cases. In this work, we illustra…

    Submitted 19 May, 2019; originally announced May 2019.

    Comments: Accepted as a long paper at NAACL-HLT 2019

  11. arXiv:1905.05547  [pdf, other]

    cs.LG cs.CL stat.ML

    Multilingual Factor Analysis

    Authors: Francisco Vargas, Kamen Brestnichki, Alex Papadopoulos-Korfiatis, Nils Hammerla

    Abstract: In this work we approach the task of learning multilingual word representations in an offline manner by fitting a generative latent variable model to a multilingual dictionary. We model equivalent words in different languages as different views of the same word generated by a common latent variable representing their latent lexical meaning. We explore the task of alignment by querying the fitted m…

    Submitted 23 October, 2019; v1 submitted 14 May, 2019; originally announced May 2019.

    Comments: Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics

  12. arXiv:1904.13323  [pdf, other]

    cs.LG cs.CL stat.ML

    Model Comparison for Semantic Grouping

    Authors: Francisco Vargas, Kamen Brestnichki, Nils Hammerla

    Abstract: We introduce a probabilistic framework for quantifying the semantic similarity between two groups of embeddings. We formulate the task of semantic similarity as a model comparison task in which we contrast a generative model which jointly models two sentences versus one that does not. We illustrate how this framework can be used for the Semantic Textual Similarity tasks using clear assumptions abo…

    Submitted 1 May, 2019; v1 submitted 30 April, 2019; originally announced April 2019.

    Comments: Proceedings of the 36th International Conference on Machine Learning

  13. arXiv:1904.13264  [pdf, other]

    cs.CL cs.LG

    Don't Settle for Average, Go for the Max: Fuzzy Sets and Max-Pooled Word Vectors

    Authors: Vitalii Zhelezniak, Aleksandar Savkov, April Shen, Francesco Moramarco, Jack Flann, Nils Y. Hammerla

    Abstract: Recent literature suggests that averaged word vectors followed by simple post-processing outperform many deep learning methods on semantic textual similarity tasks. Furthermore, when averaged word vectors are trained supervised on large corpora of paraphrases, they achieve state-of-the-art results on standard STS benchmarks. Inspired by these insights, we push the limits of word embeddings even fu…

    Submitted 30 April, 2019; originally announced April 2019.

    Comments: Published as a conference paper at ICLR 2019

  14. arXiv:1904.05811  [pdf, other]

    cs.LG cs.AI stat.ML

    Relational Graph Attention Networks

    Authors: Dan Busbridge, Dane Sherburn, Pietro Cavallo, Nils Y. Hammerla

    Abstract: We investigate Relational Graph Attention Networks, a class of models that extends non-relational graph attention mechanisms to incorporate relational information, opening up these methods to a wider variety of problems. A thorough evaluation of these models is performed, and comparisons are made against established benchmarks. To provide a meaningful comparison, we retrain Relational Graph Convol…

    Submitted 11 April, 2019; originally announced April 2019.

    Comments: 10 pages + 8 pages of appendices. Layer implementation available at https://github.com/Babylonpartners/rgat/

  15. arXiv:1805.03435  [pdf, other]

    cs.AI cs.CL cs.LG

    Decoding Decoders: Finding Optimal Representation Spaces for Unsupervised Similarity Tasks

    Authors: Vitalii Zhelezniak, Dan Busbridge, April Shen, Samuel L. Smith, Nils Y. Hammerla

    Abstract: Experimental evidence indicates that simple models outperform complex deep networks on many unsupervised similarity tasks. We provide a simple yet rigorous explanation for this behaviour by introducing the concept of an optimal representation space, in which semantically close symbols are mapped to representations that are close under a similarity measure induced by the model's objective function.…

    Submitted 9 May, 2018; originally announced May 2018.

    Comments: ICLR 2018 Workshop Track, 15 pages, 3 figures, 6 tables

  16. arXiv:1804.03999  [pdf, other]

    cs.CV

    Attention U-Net: Learning Where to Look for the Pancreas

    Authors: Ozan Oktay, Jo Schlemper, Loic Le Folgoc, Matthew Lee, Mattias Heinrich, Kazunari Misawa, Kensaku Mori, Steven McDonagh, Nils Y Hammerla, Bernhard Kainz, Ben Glocker, Daniel Rueckert

    Abstract: We propose a novel attention gate (AG) model for medical imaging that automatically learns to focus on target structures of varying shapes and sizes. Models trained with AGs implicitly learn to suppress irrelevant regions in an input image while highlighting salient features useful for a specific task. This enables us to eliminate the necessity of using explicit external tissue/organ localisation…

    Submitted 20 May, 2018; v1 submitted 11 April, 2018; originally announced April 2018.

    Comments: Accepted for publication at MIDL'18 (Revised Version) / OpenReview link: https://openreview.net/forum?id=Skft7cijM

  17. arXiv:1702.03859  [pdf, other]

    cs.CL cs.AI cs.IR

    Offline bilingual word vectors, orthogonal transformations and the inverted softmax

    Authors: Samuel L. Smith, David H. P. Turban, Steven Hamblin, Nils Y. Hammerla

    Abstract: Usually bilingual word vectors are trained "online". Mikolov et al. showed they can also be found "offline", whereby two pre-trained embeddings are aligned with a linear transformation, using dictionaries compiled from expert knowledge. In this work, we prove that the linear transformation between two spaces should be orthogonal. This transformation can be obtained using the singular value decompo…

    Submitted 13 February, 2017; originally announced February 2017.

    Comments: Accepted to conference track at ICLR 2017

  18. arXiv:1606.02041  [pdf]

    cs.AI cs.CY cs.HC

    Sorting out symptoms: design and evaluation of the 'babylon check' automated triage system

    Authors: Katherine Middleton, Mobasher Butt, Nils Hammerla, Steven Hamblin, Karan Mehta, Ali Parsa

    Abstract: Prior to seeking professional medical care it is increasingly common for patients to use online resources such as automated symptom checkers. Many such systems attempt to provide a differential diagnosis based on the symptoms elucidated from the user, which may lead to anxiety if life or limb-threatening conditions are part of the list, a phenomenon termed 'cyberchondria' [1]. Systems that provide…

    Submitted 7 June, 2016; originally announced June 2016.

  19. arXiv:1604.08880  [pdf, other]

    cs.LG cs.AI cs.HC stat.ML

    Deep, Convolutional, and Recurrent Models for Human Activity Recognition using Wearables

    Authors: Nils Y. Hammerla, Shane Halloran, Thomas Ploetz

    Abstract: Human activity recognition (HAR) in ubiquitous computing is beginning to adopt deep learning to substitute for well-established analysis techniques that rely on hand-crafted feature extraction and classification techniques. From these isolated applications of custom deep architectures it is, however, difficult to gain an overview of their suitability for problems ranging from the recognition of ma…

    Submitted 29 April, 2016; originally announced April 2016.

    Comments: Extended version has been accepted for publication at the International Joint Conference on Artificial Intelligence (IJCAI)

  20. arXiv:1312.6995  [pdf, other]

    cs.LG cs.AI stat.ML

    Towards Using Unlabeled Data in a Sparse-coding Framework for Human Activity Recognition

    Authors: Sourav Bhattacharya, Petteri Nurmi, Nils Hammerla, Thomas Plötz

    Abstract: We propose a sparse-coding framework for activity recognition in ubiquitous and mobile computing that alleviates two fundamental problems of current supervised learning approaches. (i) It automatically derives a compact, sparse and meaningful feature representation of sensor data that does not rely on prior expert knowledge and generalizes extremely well across domain boundaries. (ii) It exploits…

    Submitted 23 July, 2014; v1 submitted 25 December, 2013; originally announced December 2013.

    Comments: 18 pages, 12 figures, Pervasive and Mobile Computing, 2014