Skip to main content

Showing 1–6 of 6 results for author: Bussmann, B

Searching in archive cs. Search in all archives.
.
  1. arXiv:2503.17547  [pdf, other

    cs.LG cs.AI

    Learning Multi-Level Features with Matryoshka Sparse Autoencoders

    Authors: Bart Bussmann, Noa Nabeshima, Adam Karvonen, Neel Nanda

    Abstract: Sparse autoencoders (SAEs) have emerged as a powerful tool for interpreting neural networks by extracting the concepts represented in their activations. However, choosing the size of the SAE dictionary (i.e. number of learned concepts) creates a tension: as dictionary size increases to capture more relevant concepts, sparsity incentivizes features to be split or absorbed into more specific feature… ▽ More

    Submitted 21 March, 2025; originally announced March 2025.

  2. arXiv:2502.04878  [pdf, other

    cs.LG cs.AI

    Sparse Autoencoders Do Not Find Canonical Units of Analysis

    Authors: Patrick Leask, Bart Bussmann, Michael Pearce, Joseph Bloom, Curt Tigges, Noura Al Moubayed, Lee Sharkey, Neel Nanda

    Abstract: A common goal of mechanistic interpretability is to decompose the activations of neural networks into features: interpretable properties of the input computed by the model. Sparse autoencoders (SAEs) are a popular method for finding these features in LLMs, and it has been postulated that they can be used to find a \textit{canonical} set of units: a unique and complete list of atomic features. We c… ▽ More

    Submitted 7 February, 2025; originally announced February 2025.

    Comments: Accepted to ICLR 2025

  3. arXiv:2412.06410  [pdf, other

    cs.LG cs.AI stat.ML

    BatchTopK Sparse Autoencoders

    Authors: Bart Bussmann, Patrick Leask, Neel Nanda

    Abstract: Sparse autoencoders (SAEs) have emerged as a powerful tool for interpreting language model activations by decomposing them into sparse, interpretable features. A popular approach is the TopK SAE, that uses a fixed number of the most active latents per sample to reconstruct the model activations. We introduce BatchTopK SAEs, a training method that improves upon TopK SAEs by relaxing the top-k const… ▽ More

    Submitted 9 December, 2024; originally announced December 2024.

  4. Inferring the relationship between soil temperature and the normalized difference vegetation index with machine learning

    Authors: Steven Mortier, Amir Hamedpour, Bart Bussmann, Ruth Phoebe Tchana Wandji, Steven Latré, Bjarni D. Sigurdsson, Tom De Schepper, Tim Verdonck

    Abstract: Changes in climate can greatly affect the phenology of plants, which can have important feedback effects, such as altering the carbon cycle. These phenological feedback effects are often induced by a shift in the start or end dates of the growing season of plants. The normalized difference vegetation index (NDVI) serves as a straightforward indicator for assessing the presence of green vegetation… ▽ More

    Submitted 19 December, 2023; originally announced December 2023.

    Comments: 31 pages, 7 figures, 5 tables

  5. Neural Additive Vector Autoregression Models for Causal Discovery in Time Series

    Authors: Bart Bussmann, Jannes Nys, Steven Latré

    Abstract: Causal structure discovery in complex dynamical systems is an important challenge for many scientific domains. Although data from (interventional) experiments is usually limited, large amounts of observational time series data sets are usually available. Current methods that learn causal structure from time series often assume linear relationships. Hence, they may fail in realistic settings that c… ▽ More

    Submitted 18 October, 2021; v1 submitted 19 October, 2020; originally announced October 2020.

    Comments: 11 pages, 5 figures

    Journal ref: Discovery Science. DS 2021. Lecture Notes in Computer Science, vol 12986. Springer, Cham

  6. arXiv:1906.10918  [pdf, other

    cs.LG cs.AI cs.NE

    Towards Empathic Deep Q-Learning

    Authors: Bart Bussmann, Jacqueline Heinerman, Joel Lehman

    Abstract: As reinforcement learning (RL) scales to solve increasingly complex tasks, interest continues to grow in the fields of AI safety and machine ethics. As a contribution to these fields, this paper introduces an extension to Deep Q-Networks (DQNs), called Empathic DQN, that is loosely inspired both by empathy and the golden rule ("Do unto others as you would have them do unto you"). Empathic DQN aims… ▽ More

    Submitted 26 June, 2019; originally announced June 2019.

    Comments: To be presented as a poster at the IJCAI-19 AI Safety Workshop