Skip to main content

Showing 1–8 of 8 results for author: Foote, A

.
  1. arXiv:2506.06278  [pdf, ps, other

    cs.LG cs.AI

    Distillation Robustifies Unlearning

    Authors: Bruce W. Lee, Addie Foote, Alex Infanger, Leni Shor, Harish Kamath, Jacob Goldman-Wetzler, Bryce Woodworth, Alex Cloud, Alexander Matt Turner

    Abstract: Current LLM unlearning methods are not robust: they can be reverted easily with a few steps of finetuning. This is true even for the idealized unlearning method of training to imitate an oracle model that was never exposed to unwanted information, suggesting that output-based finetuning is insufficient to achieve robust unlearning. In a similar vein, we find that training a randomly initialized st… ▽ More

    Submitted 9 June, 2025; v1 submitted 6 June, 2025; originally announced June 2025.

  2. arXiv:2411.08166  [pdf, other

    cs.LG

    Tackling Polysemanticity with Neuron Embeddings

    Authors: Alex Foote

    Abstract: We present neuron embeddings, a representation that can be used to tackle polysemanticity by identifying the distinct semantic behaviours in a neuron's characteristic dataset examples, making downstream manual or automatic interpretation much easier. We apply our method to GPT2-small, and provide a UI for exploring the results. Neuron embeddings are computed using a model's internal representation… ▽ More

    Submitted 12 November, 2024; originally announced November 2024.

  3. arXiv:2405.10020  [pdf, other

    cs.RO cs.CL cs.CV cs.LG

    Natural Language Can Help Bridge the Sim2Real Gap

    Authors: Albert Yu, Adeline Foote, Raymond Mooney, Roberto Martín-Martín

    Abstract: The main challenge in learning image-conditioned robotic policies is acquiring a visual representation conducive to low-level control. Due to the high dimensionality of the image space, learning a good visual representation requires a considerable amount of visual data. However, when learning in the real world, data is expensive. Sim2Real is a promising paradigm for overcoming data scarcity in the… ▽ More

    Submitted 2 July, 2024; v1 submitted 16 May, 2024; originally announced May 2024.

    Comments: To appear in RSS 2024. Project website at https://robin-lab.cs.utexas.edu/lang4sim2real/

    ACM Class: I.2.9; I.2.7; I.2.6

  4. arXiv:2305.19911  [pdf, other

    cs.LG cs.CL

    Neuron to Graph: Interpreting Language Model Neurons at Scale

    Authors: Alex Foote, Neel Nanda, Esben Kran, Ioannis Konstas, Shay Cohen, Fazl Barez

    Abstract: Advances in Large Language Models (LLMs) have led to remarkable capabilities, yet their inner mechanisms remain largely unknown. To understand these models, we need to unravel the functions of individual neurons and their contribution to the network. This paper introduces a novel automated approach designed to scale interpretability techniques across a vast array of neurons within LLMs, to make th… ▽ More

    Submitted 31 May, 2023; originally announced May 2023.

  5. arXiv:2304.12918  [pdf, other

    cs.LG

    N2G: A Scalable Approach for Quantifying Interpretable Neuron Representations in Large Language Models

    Authors: Alex Foote, Neel Nanda, Esben Kran, Ionnis Konstas, Fazl Barez

    Abstract: Understanding the function of individual neurons within language models is essential for mechanistic interpretability research. We propose $\textbf{Neuron to Graph (N2G)}$, a tool which takes a neuron and its dataset examples, and automatically distills the neuron's behaviour on those examples to an interpretable graph. This presents a less labour intensive approach to interpreting neurons than cu… ▽ More

    Submitted 22 April, 2023; originally announced April 2023.

    Comments: To be published at ICLR 2023 Workshop on Trustworthy and Reliable Large-Scale Machine Learning Models

  6. arXiv:2201.12311  [pdf

    cs.LG cs.CV

    REET: Robustness Evaluation and Enhancement Toolbox for Computational Pathology

    Authors: Alex Foote, Amina Asif, Nasir Rajpoot, Fayyaz Minhas

    Abstract: Motivation: Digitization of pathology laboratories through digital slide scanners and advances in deep learning approaches for objective histological assessment have resulted in rapid progress in the field of computational pathology (CPath) with wide-ranging applications in medical and pharmaceutical research as well as clinical workflows. However, the estimation of robustness of CPath models to v… ▽ More

    Submitted 28 January, 2022; originally announced January 2022.

  7. arXiv:2106.08153  [pdf

    eess.IV cs.LG

    Now You See It, Now You Dont: Adversarial Vulnerabilities in Computational Pathology

    Authors: Alex Foote, Amina Asif, Ayesha Azam, Tim Marshall-Cox, Nasir Rajpoot, Fayyaz Minhas

    Abstract: Deep learning models are routinely employed in computational pathology (CPath) for solving problems of diagnostic and prognostic significance. Typically, the generalization performance of CPath models is analyzed using evaluation protocols such as cross-validation and testing on multi-centric cohorts. However, to ensure that such CPath solutions are robust and safe for use in a clinical setting, a… ▽ More

    Submitted 16 June, 2021; v1 submitted 14 June, 2021; originally announced June 2021.

    Comments: 10 pages

  8. The $2$-Selmer group of a number field and heuristics for narrow class groups and signature ranks of units

    Authors: David S. Dummit, John Voight, appendix with Richard Foote

    Abstract: We investigate in detail a homomorphism which we call the 2-Selmer signature map from the $2$-Selmer group of a number field $K$ to a nondegenerate symmetric space, in particular proving the image is a maximal totally isotropic subspace. Applications include precise predictions on the density of fields $K$ with given narrow class group 2-rank and with given unit group signature rank. In addition t… ▽ More

    Submitted 29 March, 2018; v1 submitted 31 January, 2017; originally announced February 2017.

    Comments: 48 pages, small corrections made