Skip to main content

Showing 1–15 of 15 results for author: Butler, L

Searching in archive cs. Search in all archives.
.
  1. arXiv:2505.17495  [pdf, ps, other

    cs.LG cs.AI cs.CL

    ProxySPEX: Inference-Efficient Interpretability via Sparse Feature Interactions in LLMs

    Authors: Landon Butler, Abhineet Agarwal, Justin Singh Kang, Yigit Efe Erginbas, Bin Yu, Kannan Ramchandran

    Abstract: Large Language Models (LLMs) have achieved remarkable performance by capturing complex interactions between input features. To identify these interactions, most existing approaches require enumerating all possible combinations of features up to a given order, causing them to scale poorly with the number of inputs $n$. Recently, Kang et al. (2025) proposed SPEX, an information-theoretic approach th… ▽ More

    Submitted 23 May, 2025; originally announced May 2025.

  2. arXiv:2502.13870  [pdf, other

    cs.LG cs.AI cs.CL cs.IT

    SPEX: Scaling Feature Interaction Explanations for LLMs

    Authors: Justin Singh Kang, Landon Butler, Abhineet Agarwal, Yigit Efe Erginbas, Ramtin Pedarsani, Kannan Ramchandran, Bin Yu

    Abstract: Large language models (LLMs) have revolutionized machine learning due to their ability to capture complex interactions between input features. Popular post-hoc explanation methods like SHAP provide marginal feature attributions, while their extensions to interaction importances only scale to small input lengths ($\approx 20$). We propose Spectral Explainer (SPEX), a model-agnostic interaction attr… ▽ More

    Submitted 19 February, 2025; originally announced February 2025.

  3. arXiv:2407.16551  [pdf

    cs.DL

    Estimating global article processing charges paid to six publishers for open access between 2019 and 2023

    Authors: Stefanie Haustein, Eric Schares, Juan Pablo Alperin, Madelaine Hare, Leigh-Ann Butler, Nina Schönfelder

    Abstract: This study presents estimates of the global expenditure on article processing charges (APCs) paid to six publishers for open access between 2019 and 2023. APCs are fees charged for publishing in some fully open access journals (gold) and in subscription journals to make individual articles open access (hybrid). There is currently no way to systematically track institutional, national or global exp… ▽ More

    Submitted 23 July, 2024; originally announced July 2024.

    Comments: 21 pages, 6 figures, 4 tables

  4. arXiv:2406.08356  [pdf

    cs.DL

    An open dataset of article processing charges from six large scholarly publishers (2019-2023)

    Authors: Leigh-Ann Butler, Madelaine Hare, Nina Schönfelder, Eric Schares, Juan Pablo Alperin, Stefanie Haustein

    Abstract: This paper introduces a dataset of article processing charges (APCs) produced from the price lists of six large scholarly publishers - Elsevier, Frontiers, PLOS, MDPI, Springer Nature and Wiley - between 2019 and 2023. APC price lists were downloaded from publisher websites each year as well as via Wayback Machine snapshots to retrieve fees per journal per year. The dataset includes journal metada… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

    Comments: 13 pages, 3 figures, 4 tables

  5. arXiv:2402.02631  [pdf, other

    cs.LG

    Learning to Understand: Identifying Interactions via the Möbius Transform

    Authors: Justin S. Kang, Yigit E. Erginbas, Landon Butler, Ramtin Pedarsani, Kannan Ramchandran

    Abstract: One of the key challenges in machine learning is to find interpretable representations of learned functions. The Möbius transform is essential for this purpose, as its coefficients correspond to unique importance scores for sets of input variables. This transform is closely related to widely used game-theoretic notions of importance like the Shapley and Bhanzaf value, but it also captures crucial… ▽ More

    Submitted 15 June, 2024; v1 submitted 4 February, 2024; originally announced February 2024.

    Comments: 34 pages, 16 figures

  6. arXiv:2310.03879  [pdf, other

    cs.LG

    Non Commutative Convolutional Signal Models in Neural Networks: Stability to Small Deformations

    Authors: Alejandro Parada-Mayorga, Landon Butler, Alejandro Ribeiro

    Abstract: In this paper we discuss the results recently published in~[1] about algebraic signal models (ASMs) based on non commutative algebras and their use in convolutional neural networks. Relying on the general tools from algebraic signal processing (ASP), we study the filtering and stability properties of non commutative convolutional filters. We show how non commutative filters can be stable to small… ▽ More

    Submitted 5 October, 2023; originally announced October 2023.

  7. arXiv:2305.14117  [pdf, other

    eess.AS cs.LG

    Understanding Spoken Language Development of Children with ASD Using Pre-trained Speech Embeddings

    Authors: Anfeng Xu, Rajat Hebbar, Rimita Lahiri, Tiantian Feng, Lindsay Butler, Lue Shen, Helen Tager-Flusberg, Shrikanth Narayanan

    Abstract: Speech processing techniques are useful for analyzing speech and language development in children with Autism Spectrum Disorder (ASD), who are often varied and delayed in acquiring these skills. Early identification and intervention are crucial, but traditional assessment methodologies such as caregiver reports are not adequate for the requisite behavioral phenotyping. Natural Language Sample (NLS… ▽ More

    Submitted 31 May, 2023; v1 submitted 23 May, 2023; originally announced May 2023.

    Comments: Accepted to Interspeech 2023, 5 pages

  8. arXiv:2212.08571  [pdf, other

    cs.SD cs.LG eess.AS stat.AP

    Statistical Design and Analysis for Robust Machine Learning: A Case Study from COVID-19

    Authors: Davide Pigoli, Kieran Baker, Jobie Budd, Lorraine Butler, Harry Coppock, Sabrina Egglestone, Steven G. Gilmour, Chris Holmes, David Hurley, Radka Jersakova, Ivan Kiskin, Vasiliki Koutra, Jonathon Mellor, George Nicholson, Joe Packham, Selina Patel, Richard Payne, Stephen J. Roberts, Björn W. Schuller, Ana Tendero-Cañadas, Tracey Thornley, Alexander Titcomb

    Abstract: Since early in the coronavirus disease 2019 (COVID-19) pandemic, there has been interest in using artificial intelligence methods to predict COVID-19 infection status based on vocal audio signals, for example cough recordings. However, existing studies have limitations in terms of data collection and of the assessment of the performances of the proposed predictive models. This paper rigorously ass… ▽ More

    Submitted 27 February, 2023; v1 submitted 15 December, 2022; originally announced December 2022.

  9. arXiv:2212.08570  [pdf, other

    cs.SD cs.LG eess.AS

    Audio-based AI classifiers show no evidence of improved COVID-19 screening over simple symptoms checkers

    Authors: Harry Coppock, George Nicholson, Ivan Kiskin, Vasiliki Koutra, Kieran Baker, Jobie Budd, Richard Payne, Emma Karoune, David Hurley, Alexander Titcomb, Sabrina Egglestone, Ana Tendero Cañadas, Lorraine Butler, Radka Jersakova, Jonathon Mellor, Selina Patel, Tracey Thornley, Peter Diggle, Sylvia Richardson, Josef Packham, Björn W. Schuller, Davide Pigoli, Steven Gilmour, Stephen Roberts, Chris Holmes

    Abstract: Recent work has reported that AI classifiers trained on audio recordings can accurately predict severe acute respiratory syndrome coronavirus 2 (SARSCoV2) infection status. Here, we undertake a large scale study of audio-based deep learning classifiers, as part of the UK governments pandemic response. We collect and analyse a dataset of audio recordings from 67,842 individuals with linked metadata… ▽ More

    Submitted 2 March, 2023; v1 submitted 15 December, 2022; originally announced December 2022.

  10. arXiv:2212.07738  [pdf

    cs.SD cs.LG eess.AS

    A large-scale and PCR-referenced vocal audio dataset for COVID-19

    Authors: Jobie Budd, Kieran Baker, Emma Karoune, Harry Coppock, Selina Patel, Ana Tendero Cañadas, Alexander Titcomb, Richard Payne, David Hurley, Sabrina Egglestone, Lorraine Butler, Jonathon Mellor, George Nicholson, Ivan Kiskin, Vasiliki Koutra, Radka Jersakova, Rachel A. McKendry, Peter Diggle, Sylvia Richardson, Björn W. Schuller, Steven Gilmour, Davide Pigoli, Stephen Roberts, Josef Packham, Tracey Thornley , et al. (1 additional authors not shown)

    Abstract: The UK COVID-19 Vocal Audio Dataset is designed for the training and evaluation of machine learning models that classify SARS-CoV-2 infection status or associated respiratory symptoms using vocal audio. The UK Health Security Agency recruited voluntary participants through the national Test and Trace programme and the REACT-1 survey in England from March 2021 to March 2022, during dominant transmi… ▽ More

    Submitted 3 November, 2023; v1 submitted 15 December, 2022; originally announced December 2022.

    Comments: 39 pages, 4 figures

  11. arXiv:2210.16272  [pdf, other

    eess.SP cs.LG

    Learning with Multigraph Convolutional Filters

    Authors: Landon Butler, Alejandro Parada-Mayorga, Alejandro Ribeiro

    Abstract: In this paper, we introduce a convolutional architecture to perform learning when information is supported on multigraphs. Exploiting algebraic signal processing (ASP), we propose a convolutional signal processing model on multigraphs (MSP). Then, we introduce multigraph convolutional neural networks (MGNNs) as stacked and layered structures where information is processed according to an MSP model… ▽ More

    Submitted 28 October, 2022; originally announced October 2022.

    Comments: arXiv admin note: text overlap with arXiv:2209.11354

  12. Convolutional Learning on Multigraphs

    Authors: Landon Butler, Alejandro Parada-Mayorga, Alejandro Ribeiro

    Abstract: Graph convolutional learning has led to many exciting discoveries in diverse areas. However, in some applications, traditional graphs are insufficient to capture the structure and intricacies of the data. In such scenarios, multigraphs arise naturally as discrete structures in which complex dynamics can be embedded. In this paper, we develop convolutional information processing on multigraphs and… ▽ More

    Submitted 8 February, 2023; v1 submitted 22 September, 2022; originally announced September 2022.

  13. arXiv:2108.09923  [pdf, other

    cs.LG

    Convolutional Filtering and Neural Networks with Non Commutative Algebras

    Authors: Alejandro Parada-Mayorga, Landon Butler, Alejandro Ribeiro

    Abstract: In this paper we introduce and study the algebraic generalization of non commutative convolutional neural networks. We leverage the theory of algebraic signal processing to model convolutional non commutative architectures, and we derive concrete stability bounds that extend those obtained in the literature for commutative convolutional neural networks. We show that non commutative convolutional a… ▽ More

    Submitted 6 July, 2023; v1 submitted 23 August, 2021; originally announced August 2021.

  14. arXiv:2103.05091  [pdf, ps, other

    cs.RO cs.LG cs.MA cs.NI

    Learning Connectivity for Data Distribution in Robot Teams

    Authors: Ekaterina Tolstaya, Landon Butler, Daniel Mox, James Paulos, Vijay Kumar, Alejandro Ribeiro

    Abstract: Many algorithms for control of multi-robot teams operate under the assumption that low-latency, global state information necessary to coordinate agent actions can readily be disseminated among the team. However, in harsh environments with no existing communication infrastructure, robots must form ad-hoc networks, forcing the team to operate in a distributed fashion. To overcome this challenge, we… ▽ More

    Submitted 30 July, 2021; v1 submitted 8 March, 2021; originally announced March 2021.

  15. arXiv:2008.11147  [pdf, other

    cs.SE cs.CY cs.HC

    A Tale of Two Cities: Software Developers Working from Home During the COVID-19 Pandemic

    Authors: Denae Ford, Margaret-Anne Storey, Thomas Zimmermann, Christian Bird, Sonia Jaffe, Chandra Maddila, Jenna L. Butler, Brian Houck, Nachiappan Nagappan

    Abstract: The COVID-19 pandemic has shaken the world to its core and has provoked an overnight exodus of developers that normally worked in an office setting to working from home. The magnitude of this shift and the factors that have accompanied this new unplanned work setting go beyond what the software engineering community has previously understood to be remote work. To find out how developers and their… ▽ More

    Submitted 10 September, 2021; v1 submitted 25 August, 2020; originally announced August 2020.

    Comments: 36 pages, 1 figure, 6 tables

    Journal ref: ACM Transactions on Software Engineering and Methodology, Volume 31, Issue 2 (April 2022)