Skip to main content

Showing 1–50 of 206 results for author: Wilson, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2506.08316  [pdf, ps, other

    cs.LG stat.ML

    Why Masking Diffusion Works: Condition on the Jump Schedule for Improved Discrete Diffusion

    Authors: Alan N. Amin, Nate Gruver, Andrew Gordon Wilson

    Abstract: Discrete diffusion models, like continuous diffusion models, generate high-quality samples by gradually undoing noise applied to datapoints with a Markov process. Gradual generation in theory comes with many conceptual benefits; for example, inductive biases can be incorporated into the noising Markov process, and access to improved sampling algorithms. In practice, however, the consistently best… ▽ More

    Submitted 9 June, 2025; originally announced June 2025.

    Comments: Code available at: https://github.com/AlanNawzadAmin/SCUD

  2. arXiv:2506.08137  [pdf, ps, other

    cs.CV cs.AI

    IGraSS: Learning to Identify Infrastructure Networks from Satellite Imagery by Iterative Graph-constrained Semantic Segmentation

    Authors: Oishee Bintey Hoque, Abhijin Adiga, Aniruddha Adiga, Siddharth Chaudhary, Madhav V. Marathe, S. S. Ravi, Kirti Rajagopalan, Amanda Wilson, Samarth Swarup

    Abstract: Accurate canal network mapping is essential for water management, including irrigation planning and infrastructure maintenance. State-of-the-art semantic segmentation models for infrastructure mapping, such as roads, rely on large, well-annotated remote sensing datasets. However, incomplete or inadequate ground truth can hinder these learning approaches. Many infrastructure networks have graph-lev… ▽ More

    Submitted 10 June, 2025; v1 submitted 9 June, 2025; originally announced June 2025.

  3. arXiv:2505.24603  [pdf, ps, other

    cs.LG

    The Gaussian Mixing Mechanism: Renyi Differential Privacy via Gaussian Sketches

    Authors: Omri Lev, Vishwak Srinivasan, Moshe Shenfeld, Katrina Ligett, Ayush Sekhari, Ashia C. Wilson

    Abstract: Gaussian sketching, which consists of pre-multiplying the data with a random Gaussian matrix, is a widely used technique for multiple problems in data science and machine learning, with applications spanning computationally efficient optimization, coded computing, and federated learning. This operation also provides differential privacy guarantees due to its inherent randomness. In this work, we r… ▽ More

    Submitted 4 June, 2025; v1 submitted 30 May, 2025; originally announced May 2025.

  4. arXiv:2505.24257  [pdf, ps, other

    cs.CV

    Out of Sight, Not Out of Context? Egocentric Spatial Reasoning in VLMs Across Disjoint Frames

    Authors: Sahithya Ravi, Gabriel Sarch, Vibhav Vineet, Andrew D. Wilson, Balasaravanan Thoravi Kumaravel

    Abstract: An embodied AI assistant operating on egocentric video must integrate spatial cues across time - for instance, determining where an object A, glimpsed a few moments ago lies relative to an object B encountered later. We introduce Disjoint-3DQA , a generative QA benchmark that evaluates this ability of VLMs by posing questions about object pairs that are not co-visible in the same frame. We evaluat… ▽ More

    Submitted 30 May, 2025; originally announced May 2025.

  5. arXiv:2505.22673  [pdf, ps, other

    q-bio.TO cs.AI cs.CV

    Physiology-Informed Generative Multi-Task Network for Contrast-Free CT Perfusion

    Authors: Wasif Khan, Kyle B. See, Simon Kato, Ziqian Huang, Amy Lazarte, Kyle Douglas, Xiangyang Lou, Teng J. Peng, Dhanashree Rajderkar, John Rees, Pina Sanelli, Amita Singh, Ibrahim Tuna, Christina A. Wilson, Ruogu Fang

    Abstract: Perfusion imaging is extensively utilized to assess hemodynamic status and tissue perfusion in various organs. Computed tomography perfusion (CTP) imaging plays a key role in the early assessment and planning of stroke treatment. While CTP provides essential perfusion parameters to identify abnormal blood flow in the brain, the use of contrast agents in CTP can lead to allergic reactions and adver… ▽ More

    Submitted 12 May, 2025; originally announced May 2025.

    Comments: Under Review

  6. arXiv:2505.09500  [pdf, ps, other

    cs.LG

    Layered Unlearning for Adversarial Relearning

    Authors: Timothy Qian, Vinith Suriyakumar, Ashia Wilson, Dylan Hadfield-Menell

    Abstract: Our goal is to understand how post-training methods, such as fine-tuning, alignment, and unlearning, modify language model behavior and representations. We are particularly interested in the brittle nature of these modifications that makes them easy to bypass through prompt engineering or relearning. Recent results suggest that post-training induces shallow context-dependent ``circuits'' that supp… ▽ More

    Submitted 14 May, 2025; originally announced May 2025.

    Comments: 37 pages, 8 figures

  7. arXiv:2505.08302  [pdf, ps, other

    cs.CV

    Knowledge-Informed Deep Learning for Irrigation Type Mapping from Remote Sensing

    Authors: Oishee Bintey Hoque, Nibir Chandra Mandal, Abhijin Adiga, Samarth Swarup, Sayjro Kossi Nouwakpo, Amanda Wilson, Madhav Marathe

    Abstract: Accurate mapping of irrigation methods is crucial for sustainable agricultural practices and food systems. However, existing models that rely solely on spectral features from satellite imagery are ineffective due to the complexity of agricultural landscapes and limited training data, making this a challenging problem. We present Knowledge-Informed Irrigation Mapping (KIIM), a novel Swin-Transforme… ▽ More

    Submitted 5 June, 2025; v1 submitted 13 May, 2025; originally announced May 2025.

    Comments: Full version of the paper will be appearing at the Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence (IJCAI-25), Special Track on AI for Good

  8. arXiv:2505.01578  [pdf, other

    cs.CV

    Grounding Task Assistance with Multimodal Cues from a Single Demonstration

    Authors: Gabriel Sarch, Balasaravanan Thoravi Kumaravel, Sahithya Ravi, Vibhav Vineet, Andrew D. Wilson

    Abstract: A person's demonstration often serves as a key reference for others learning the same task. However, RGB video, the dominant medium for representing these demonstrations, often fails to capture fine-grained contextual cues such as intent, safety-critical environmental factors, and subtle preferences embedded in human behavior. This sensory gap fundamentally limits the ability of Vision Language Mo… ▽ More

    Submitted 2 May, 2025; originally announced May 2025.

  9. arXiv:2504.15208  [pdf, other

    cs.LG cs.AI

    Compute-Optimal LLMs Provably Generalize Better With Scale

    Authors: Marc Finzi, Sanyam Kapoor, Diego Granziol, Anming Gu, Christopher De Sa, J. Zico Kolter, Andrew Gordon Wilson

    Abstract: Why do larger language models generalize better? To investigate this question, we develop generalization bounds on the pretraining objective of large language models (LLMs) in the compute-optimal regime, as described by the Chinchilla scaling laws. We introduce a novel, fully empirical Freedman-type martingale concentration inequality that tightens existing bounds by accounting for the variance of… ▽ More

    Submitted 21 April, 2025; originally announced April 2025.

    Comments: ICLR 2025

  10. arXiv:2504.04528  [pdf, other

    cs.LG cs.AI stat.ME stat.ML

    A Consequentialist Critique of Binary Classification Evaluation Practices

    Authors: Gerardo Flores, Abigail Schiff, Alyssa H. Smith, Julia A Fukuyama, Ashia C. Wilson

    Abstract: ML-supported decisions, such as ordering tests or determining preventive custody, often involve binary classification based on probabilistic forecasts. Evaluation frameworks for such forecasts typically consider whether to prioritize independent-decision metrics (e.g., Accuracy) or top-K metrics (e.g., Precision@K), and whether to focus on fixed thresholds or threshold-agnostic measures like AUC-R… ▽ More

    Submitted 6 April, 2025; originally announced April 2025.

  11. arXiv:2503.18928  [pdf

    cs.SD eess.AS

    A Reliable and Efficient Detection Pipeline for Rodent Ultrasonic Vocalizations

    Authors: Sabah Shahnoor Anis, Devin M. Kellis, Kris Ford Kaigler, Marlene A. Wilson, Christian O'Reilly

    Abstract: Analyzing ultrasonic vocalizations (USVs) is crucial for understanding rodents' affective states and social behaviors, but the manual analysis is time-consuming and prone to errors. Automated USV detection systems have been developed to address these challenges. Yet, these systems often rely on machine learning and fail to generalize effectively to new datasets. To tackle these shortcomings, we in… ▽ More

    Submitted 24 March, 2025; originally announced March 2025.

    Comments: Accepted for publication in the proceeding of the 7th International Conference on Advances in Signal Processing and Artificial Intelligence (ASPAI' 2025), 8-10 April 2025, Innsbruck, Austria

  12. Allocation Multiplicity: Evaluating the Promises of the Rashomon Set

    Authors: Shomik Jain, Margaret Wang, Kathleen Creel, Ashia Wilson

    Abstract: The Rashomon set of equally-good models promises less discriminatory algorithms, reduced outcome homogenization, and fairer decisions through model ensembles or reconciliation. However, we argue from the perspective of allocation multiplicity that these promises may remain unfulfilled. When there are more qualified candidates than resources available, many different allocations of scarce resources… ▽ More

    Submitted 25 May, 2025; v1 submitted 20 March, 2025; originally announced March 2025.

    Comments: To appear in the proceedings of the ACM Conference on Fairness, Accountability, and Transparency (FAccT 2025)

    ACM Class: K.4.0

  13. arXiv:2503.15634  [pdf, other

    cs.GT cs.CY

    Homogeneous Algorithms Can Reduce Competition in Personalized Pricing

    Authors: Nathanael Jo, Kathleen Creel, Ashia Wilson, Manish Raghavan

    Abstract: Firms' algorithm development practices are often homogeneous. Whether firms train algorithms on similar data, aim at similar benchmarks, or rely on similar pre-trained models, the result is correlated predictions. We model the impact of correlated algorithms on competition in the context of personalized pricing. Our analysis reveals that (1) higher correlation diminishes consumer welfare and (2) a… ▽ More

    Submitted 19 March, 2025; originally announced March 2025.

  14. arXiv:2503.13577  [pdf, other

    cs.MA cs.CY cs.LG

    When Should We Orchestrate Multiple Agents?

    Authors: Umang Bhatt, Sanyam Kapoor, Mihir Upadhyay, Ilia Sucholutsky, Francesco Quinzan, Katherine M. Collins, Adrian Weller, Andrew Gordon Wilson, Muhammad Bilal Zafar

    Abstract: Strategies for orchestrating the interactions between multiple agents, both human and artificial, can wildly overestimate performance and underestimate the cost of orchestration. We design a framework to orchestrate agents under realistic conditions, such as inference costs or availability constraints. We show theoretically that orchestration is only effective if there are performance or cost diff… ▽ More

    Submitted 17 March, 2025; originally announced March 2025.

  15. arXiv:2503.02113  [pdf, other

    cs.LG stat.ML

    Deep Learning is Not So Mysterious or Different

    Authors: Andrew Gordon Wilson

    Abstract: Deep neural networks are often seen as different from other model classes by defying conventional notions of generalization. Popular examples of anomalous generalization behaviour include benign overfitting, double descent, and the success of overparametrization. We argue that these phenomena are not distinct to neural networks, or particularly mysterious. Moreover, this generalization behaviour c… ▽ More

    Submitted 3 March, 2025; originally announced March 2025.

  16. arXiv:2502.17495  [pdf, other

    cs.LG physics.ao-ph stat.AP stat.ML

    Spatiotemporal Forecasting in Climate Data Using EOFs and Machine Learning Models: A Case Study in Chile

    Authors: Mauricio Herrera, Francisca Kleisinger, Andrés Wilsón

    Abstract: Effective resource management and environmental planning in regions with high climatic variability, such as Chile, demand advanced predictive tools. This study addresses this challenge by employing an innovative and computationally efficient hybrid methodology that integrates machine learning (ML) methods for time series forecasting with established statistical techniques. The spatiotemporal data… ▽ More

    Submitted 20 February, 2025; originally announced February 2025.

    Comments: 25 pages, 6 figures

  17. arXiv:2502.02672  [pdf, other

    cs.CL cs.LG

    Transformers Boost the Performance of Decision Trees on Tabular Data across Sample Sizes

    Authors: Mayuka Jayawardhana, Renbo, Samuel Dooley, Valeriia Cherepanova, Andrew Gordon Wilson, Frank Hutter, Colin White, Tom Goldstein, Micah Goldblum

    Abstract: Large language models (LLMs) perform remarkably well on tabular datasets in zero- and few-shot settings, since they can extract meaning from natural language column headers that describe features and labels. Similarly, TabPFN, a recent non-LLM transformer pretrained on numerous tables for in-context learning, has demonstrated excellent performance for dataset sizes up to a thousand samples. In con… ▽ More

    Submitted 5 February, 2025; v1 submitted 4 February, 2025; originally announced February 2025.

    Comments: 12 pages, 6 figures

    ACM Class: I.2.m; I.2.6; I.2.7

  18. arXiv:2412.19284  [pdf, other

    cs.LG cs.AI

    PearSAN: A Machine Learning Method for Inverse Design using Pearson Correlated Surrogate Annealing

    Authors: Michael Bezick, Blake A. Wilson, Vaishnavi Iyer, Yuheng Chen, Vladimir M. Shalaev, Sabre Kais, Alexander V. Kildishev, Alexandra Boltasseva, Brad Lackey

    Abstract: PearSAN is a machine learning-assisted optimization algorithm applicable to inverse design problems with large design spaces, where traditional optimizers struggle. The algorithm leverages the latent space of a generative model for rapid sampling and employs a Pearson correlated surrogate model to predict the figure of merit of the true design metric. As a showcase example, PearSAN is applied to t… ▽ More

    Submitted 26 December, 2024; originally announced December 2024.

  19. arXiv:2412.16098  [pdf, other

    cs.LG cs.AI

    Explainable AI for Multivariate Time Series Pattern Exploration: Latent Space Visual Analytics with Temporal Fusion Transformer and Variational Autoencoders in Power Grid Event Diagnosis

    Authors: Haowen Xu, Ali Boyaci, Jianming Lian, Aaron Wilson

    Abstract: Detecting and analyzing complex patterns in multivariate time-series data is crucial for decision-making in urban and environmental system operations. However, challenges arise from the high dimensionality, intricate complexity, and interconnected nature of complex patterns, which hinder the understanding of their underlying physical processes. Existing AI methods often face limitations in interpr… ▽ More

    Submitted 24 December, 2024; v1 submitted 20 December, 2024; originally announced December 2024.

  20. arXiv:2412.07763  [pdf, other

    stat.ML cs.LG q-bio.BM

    Bayesian Optimization of Antibodies Informed by a Generative Model of Evolving Sequences

    Authors: Alan Nawzad Amin, Nate Gruver, Yilun Kuang, Lily Li, Hunter Elliott, Calvin McCarter, Aniruddh Raghu, Peyton Greenside, Andrew Gordon Wilson

    Abstract: To build effective therapeutics, biologists iteratively mutate antibody sequences to improve binding and stability. Proposed mutations can be informed by previous measurements or by learning from large antibody databases to predict only typical antibodies. Unfortunately, the space of typical antibodies is enormous to search, and experiments often fail to find suitable antibodies on a budget. We in… ▽ More

    Submitted 10 December, 2024; originally announced December 2024.

    Comments: Code available at https://github.com/AlanNawzadAmin/CloneBO

  21. arXiv:2412.05244  [pdf, other

    cs.LG cs.AI

    Enhancing Foundation Models for Time Series Forecasting via Wavelet-based Tokenization

    Authors: Luca Masserano, Abdul Fatir Ansari, Boran Han, Xiyuan Zhang, Christos Faloutsos, Michael W. Mahoney, Andrew Gordon Wilson, Youngsuk Park, Syama Rangapuram, Danielle C. Maddix, Yuyang Wang

    Abstract: How to best develop foundational models for time series forecasting remains an important open question. Tokenization is a crucial consideration in this effort: what is an effective discrete vocabulary for a real-valued sequential input? To address this question, we develop WaveToken, a wavelet-based tokenizer that allows models to learn complex representations directly in the space of time-localiz… ▽ More

    Submitted 6 December, 2024; originally announced December 2024.

    Comments: 25 pages, 15 figures

  22. arXiv:2412.02525  [pdf, other

    cs.LG cs.CL

    LLMForecaster: Improving Seasonal Event Forecasts with Unstructured Textual Data

    Authors: Hanyu Zhang, Chuck Arvin, Dmitry Efimov, Michael W. Mahoney, Dominique Perrault-Joncas, Shankar Ramasubramanian, Andrew Gordon Wilson, Malcolm Wolff

    Abstract: Modern time-series forecasting models often fail to make full use of rich unstructured information about the time series themselves. This lack of proper conditioning can lead to obvious model failures; for example, models may be unaware of the details of a particular product, and hence fail to anticipate seasonal surges in customer demand in the lead up to major exogenous events like holidays for… ▽ More

    Submitted 3 December, 2024; originally announced December 2024.

    Comments: Presented at NeurIPS Time Series in the Age of Large Models (2024)

  23. arXiv:2411.05359  [pdf, other

    cs.CV cs.AI cs.CY

    Agricultural Landscape Understanding At Country-Scale

    Authors: Radhika Dua, Nikita Saxena, Aditi Agarwal, Alex Wilson, Gaurav Singh, Hoang Tran, Ishan Deshpande, Amandeep Kaur, Gaurav Aggarwal, Chandan Nath, Arnab Basu, Vishal Batchu, Sharath Holla, Bindiya Kurle, Olana Missura, Rahul Aggarwal, Shubhika Garg, Nishi Shah, Avneet Singh, Dinesh Tewari, Agata Dondzik, Bharat Adsul, Milind Sohoni, Asim Rama Praveen, Aaryan Dangi , et al. (10 additional authors not shown)

    Abstract: Agricultural landscapes are quite complex, especially in the Global South where fields are smaller, and agricultural practices are more varied. In this paper we report on our progress in digitizing the agricultural landscape (natural and man-made) in our study region of India. We use high resolution imagery and a UNet style segmentation model to generate the first of its kind national-scale multi-… ▽ More

    Submitted 8 November, 2024; originally announced November 2024.

    Comments: 34 pages, 7 tables, 15 figs

  24. arXiv:2410.20468  [pdf, other

    cs.HC

    Understanding Communication Preferences of Information Workers in Engagement with Text-Based Conversational Agents

    Authors: Ananya Bhattacharjee, Jina Suh, Mahsa Ershadi, Shamsi T. Iqbal, Andrew D. Wilson, Javier Hernandez

    Abstract: Communication traits in text-based human-AI conversations play pivotal roles in shaping user experiences and perceptions of systems. With the advancement of large language models (LLMs), it is now feasible to analyze these traits at a more granular level. In this study, we explore the preferences of information workers regarding chatbot communication traits across seven applications. Participants… ▽ More

    Submitted 1 November, 2024; v1 submitted 27 October, 2024; originally announced October 2024.

  25. arXiv:2410.13043  [pdf, other

    eess.IV cs.CV

    UniCoN: Universal Conditional Networks for Multi-Age Embryonic Cartilage Segmentation with Sparsely Annotated Data

    Authors: Nishchal Sapkota, Yejia Zhang, Zihao Zhao, Maria Gomez, Yuhan Hsi, Jordan A. Wilson, Kazuhiko Kawasaki, Greg Holmes, Meng Wu, Ethylin Wang Jabs, Joan T. Richtsmeier, Susan M. Motch Perrine, Danny Z. Chen

    Abstract: Osteochondrodysplasia, affecting 2-3% of newborns globally, is a group of bone and cartilage disorders that often result in head malformations, contributing to childhood morbidity and reduced quality of life. Current research on this disease using mouse models faces challenges since it involves accurately segmenting the developing cartilage in 3D micro-CT images of embryonic mice. Tackling this se… ▽ More

    Submitted 16 October, 2024; originally announced October 2024.

  26. arXiv:2410.08074  [pdf, other

    cs.LG cs.CR cs.CV

    Unstable Unlearning: The Hidden Risk of Concept Resurgence in Diffusion Models

    Authors: Vinith M. Suriyakumar, Rohan Alur, Ayush Sekhari, Manish Raghavan, Ashia C. Wilson

    Abstract: Text-to-image diffusion models rely on massive, web-scale datasets. Training them from scratch is computationally expensive, and as a result, developers often prefer to make incremental updates to existing models. These updates often compose fine-tuning steps (to learn new concepts or improve model performance) with "unlearning" steps (to "forget" existing concepts, such as copyrighted works or ex… ▽ More

    Submitted 10 February, 2025; v1 submitted 10 October, 2024; originally announced October 2024.

    Comments: 20 pages, 13 figures

  27. arXiv:2410.02117  [pdf, other

    cs.LG stat.ML

    Searching for Efficient Linear Layers over a Continuous Space of Structured Matrices

    Authors: Andres Potapczynski, Shikai Qiu, Marc Finzi, Christopher Ferri, Zixi Chen, Micah Goldblum, Bayan Bruss, Christopher De Sa, Andrew Gordon Wilson

    Abstract: Dense linear layers are the dominant computational bottleneck in large neural networks, presenting a critical need for more efficient alternatives. Previous efforts focused on a small number of hand-crafted structured matrices and neglected to investigate whether these structures can surpass dense layers in terms of compute-optimal scaling laws when both the model size and training examples are op… ▽ More

    Submitted 4 October, 2024; v1 submitted 2 October, 2024; originally announced October 2024.

    Comments: NeurIPS 2024. Code available at https://github.com/AndPotap/einsum-search

  28. SpaceBlender: Creating Context-Rich Collaborative Spaces Through Generative 3D Scene Blending

    Authors: Nels Numan, Shwetha Rajaram, Balasaravanan Thoravi Kumaravel, Nicolai Marquardt, Andrew D. Wilson

    Abstract: There is increased interest in using generative AI to create 3D spaces for Virtual Reality (VR) applications. However, today's models produce artificial environments, falling short of supporting collaborative tasks that benefit from incorporating the user's physical context. To generate environments that support VR telepresence, we introduce SpaceBlender, a novel pipeline that utilizes generative… ▽ More

    Submitted 20 September, 2024; originally announced September 2024.

  29. arXiv:2409.10563  [pdf, other

    cs.CR cs.LG

    Applying Action Masking and Curriculum Learning Techniques to Improve Data Efficiency and Overall Performance in Operational Technology Cyber Security using Reinforcement Learning

    Authors: Alec Wilson, William Holmes, Ryan Menzies, Kez Smithson Whitehead

    Abstract: In previous work, the IPMSRL environment (Integrated Platform Management System Reinforcement Learning environment) was developed with the aim of training defensive RL agents in a simulator representing a subset of an IPMS on a maritime vessel under a cyber-attack. This paper extends the use of IPMSRL to enhance realism including the additional dynamics of false positive alerts and alert delay. Ap… ▽ More

    Submitted 13 September, 2024; originally announced September 2024.

    Comments: 14 pages, 9 figures, CAMLIS'24: Conference on Applied Machine Learning for Information Security, October 24--25, 2024, Arlington, VA

  30. arXiv:2408.14400  [pdf, other

    cs.CV cs.LG

    Satellite Sunroof: High-res Digital Surface Models and Roof Segmentation for Global Solar Mapping

    Authors: Vishal Batchu, Alex Wilson, Betty Peng, Carl Elkin, Umangi Jain, Christopher Van Arsdale, Ross Goroshin, Varun Gulshan

    Abstract: The transition to renewable energy, particularly solar, is key to mitigating climate change. Google's Solar API aids this transition by estimating solar potential from aerial imagery, but its impact is constrained by geographical coverage. This paper proposes expanding the API's reach using satellite imagery, enabling global solar potential assessment. We tackle challenges involved in building a D… ▽ More

    Submitted 29 August, 2024; v1 submitted 26 August, 2024; originally announced August 2024.

    Comments: 14 pages

  31. arXiv:2408.13150  [pdf, other

    math.OC cs.LG

    Adaptive Backtracking Line Search

    Authors: Joao V. Cavalcanti, Laurent Lessard, Ashia C. Wilson

    Abstract: Backtracking line search is foundational in numerical optimization. The basic idea is to adjust the step-size of an algorithm by a constant factor until some chosen criterion (e.g. Armijo, Descent Lemma) is satisfied. We propose a novel way to adjust step-sizes, replacing the constant factor used in regular backtracking with one that takes into account the degree to which the chosen criterion is v… ▽ More

    Submitted 26 May, 2025; v1 submitted 23 August, 2024; originally announced August 2024.

  32. arXiv:2408.08477  [pdf, other

    cs.CY

    Automating Transparency Mechanisms in the Judicial System Using LLMs: Opportunities and Challenges

    Authors: Ishana Shastri, Shomik Jain, Barbara Engelhardt, Ashia Wilson

    Abstract: Bringing more transparency to the judicial system for the purposes of increasing accountability often demands extensive effort from auditors who must meticulously sift through numerous disorganized legal case files to detect patterns of bias and errors. For example, the high-profile investigation into the Curtis Flowers case took seven reporters a full year to assemble evidence about the prosecuto… ▽ More

    Submitted 15 August, 2024; originally announced August 2024.

    Comments: Accepted at the Seventh AAAI/ACM Conference on AI, Ethics, and Society (AIES 2024)

  33. arXiv:2407.18158  [pdf, other

    stat.ML cs.LG

    Unlocking Tokens as Data Points for Generalization Bounds on Larger Language Models

    Authors: Sanae Lotfi, Yilun Kuang, Brandon Amos, Micah Goldblum, Marc Finzi, Andrew Gordon Wilson

    Abstract: Large language models (LLMs) with billions of parameters excel at predicting the next token in a sequence. Recent work computes non-vacuous compression-based generalization bounds for LLMs, but these bounds are vacuous for large models at the billion-parameter scale. Moreover, these bounds are obtained through restrictive compression techniques, bounding compressed models that generate low-quality… ▽ More

    Submitted 25 July, 2024; originally announced July 2024.

  34. arXiv:2407.13830  [pdf, other

    quant-ph cs.LG

    Non-native Quantum Generative Optimization with Adversarial Autoencoders

    Authors: Blake A. Wilson, Jonathan Wurtz, Vahagn Mkhitaryan, Michael Bezick, Sheng-Tao Wang, Sabre Kais, Vladimir M. Shalaev, Alexandra Boltasseva

    Abstract: Large-scale optimization problems are prevalent in several fields, including engineering, finance, and logistics. However, most optimization problems cannot be efficiently encoded onto a physical system because the existing quantum samplers have too few qubits. Another typical limiting factor is that the optimization constraints are not compatible with the native cost Hamiltonian. This work presen… ▽ More

    Submitted 18 July, 2024; originally announced July 2024.

    Comments: 7 + 3 pages, 7 figures

  35. arXiv:2407.08169  [pdf, other

    cs.LG cs.AI

    The Approximate Fisher Influence Function: Faster Estimation of Data Influence in Statistical Models

    Authors: Omri Lev, Ashia C. Wilson

    Abstract: Quantifying the influence of infinitesimal changes in training data on model performance is crucial for understanding and improving machine learning models. In this work, we reformulate this problem as a weighted empirical risk minimization and enhance existing influence function-based methods by using information geometry to derive a new algorithm to estimate influence. Our formulation proves ver… ▽ More

    Submitted 9 April, 2025; v1 submitted 11 July, 2024; originally announced July 2024.

  36. arXiv:2406.11463  [pdf, other

    cs.LG stat.ML

    Just How Flexible are Neural Networks in Practice?

    Authors: Ravid Shwartz-Ziv, Micah Goldblum, Arpit Bansal, C. Bayan Bruss, Yann LeCun, Andrew Gordon Wilson

    Abstract: It is widely believed that a neural network can fit a training set containing at least as many samples as it has parameters, underpinning notions of overparameterized and underparameterized models. In practice, however, we only find solutions accessible via our training procedure, including the optimizer and regularizers, limiting flexibility. Moreover, the exact parameterization of the function c… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

  37. arXiv:2406.09177  [pdf, other

    stat.ML cs.LG

    Scalable and Flexible Causal Discovery with an Efficient Test for Adjacency

    Authors: Alan Nawzad Amin, Andrew Gordon Wilson

    Abstract: To make accurate predictions, understand mechanisms, and design interventions in systems of many variables, we wish to learn causal graphs from large scale data. Unfortunately the space of all possible causal graphs is enormous so scalably and accurately searching for the best fit to the data is a challenge. In principle we could substantially decrease the search space, or learn the graph entirely… ▽ More

    Submitted 18 June, 2024; v1 submitted 13 June, 2024; originally announced June 2024.

    Comments: ICML 2024; Code at https://github.com/AlanNawzadAmin/DAT-graph

  38. arXiv:2406.08391  [pdf, other

    cs.LG cs.AI cs.CL stat.ML

    Large Language Models Must Be Taught to Know What They Don't Know

    Authors: Sanyam Kapoor, Nate Gruver, Manley Roberts, Katherine Collins, Arka Pal, Umang Bhatt, Adrian Weller, Samuel Dooley, Micah Goldblum, Andrew Gordon Wilson

    Abstract: When using large language models (LLMs) in high-stakes applications, we need to know when we can trust their predictions. Some works argue that prompting high-performance LLMs is sufficient to produce calibrated uncertainties, while others introduce sampling methods that can be prohibitively expensive. In this work, we first argue that prompting on its own is insufficient to achieve good calibrati… ▽ More

    Submitted 5 December, 2024; v1 submitted 12 June, 2024; originally announced June 2024.

    Comments: NeurIPS 2024 Camera Ready

  39. arXiv:2406.07337  [pdf, other

    cs.LG

    Transferring Knowledge from Large Foundation Models to Small Downstream Models

    Authors: Shikai Qiu, Boran Han, Danielle C. Maddix, Shuai Zhang, Yuyang Wang, Andrew Gordon Wilson

    Abstract: How do we transfer the relevant knowledge from ever larger foundation models into small, task-specific downstream models that can run at much lower costs? Standard transfer learning using pre-trained weights as the initialization transfers limited information and commits us to often massive pre-trained architectures. This procedure also precludes combining multiple pre-trained models that learn co… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

    Comments: ICML 2024. Code available at https://github.com/amazon-science/adaptive-feature-transfer

  40. arXiv:2406.06248  [pdf, other

    cs.LG

    Compute Better Spent: Replacing Dense Layers with Structured Matrices

    Authors: Shikai Qiu, Andres Potapczynski, Marc Finzi, Micah Goldblum, Andrew Gordon Wilson

    Abstract: Dense linear layers are the dominant computational bottleneck in foundation models. Identifying more efficient alternatives to dense matrices has enormous potential for building more compute-efficient models, as exemplified by the success of convolutional networks in the image domain. In this work, we systematically explore structured matrices as replacements for dense matrices. We show that diffe… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: ICML 24. Code available at https://github.com/shikaiqiu/compute-better-spent

  41. arXiv:2405.14812  [pdf, other

    cs.CY

    As an AI Language Model, "Yes I Would Recommend Calling the Police": Norm Inconsistency in LLM Decision-Making

    Authors: Shomik Jain, D Calacci, Ashia Wilson

    Abstract: We investigate the phenomenon of norm inconsistency: where LLMs apply different norms in similar situations. Specifically, we focus on the high-risk application of deciding whether to call the police in Amazon Ring home surveillance videos. We evaluate the decisions of three state-of-the-art LLMs -- GPT-4, Gemini 1.0, and Claude 3 Sonnet -- in relation to the activities portrayed in the videos, th… ▽ More

    Submitted 17 August, 2024; v1 submitted 23 May, 2024; originally announced May 2024.

    Comments: To appear in the proceedings of the AAAI/ACM Conference on AI, Ethics, and Society (AIES 2024)

  42. arXiv:2405.00740  [pdf, other

    cs.CV cs.AI cs.CL cs.LG

    Modeling Caption Diversity in Contrastive Vision-Language Pretraining

    Authors: Samuel Lavoie, Polina Kirichenko, Mark Ibrahim, Mahmoud Assran, Andrew Gordon Wilson, Aaron Courville, Nicolas Ballas

    Abstract: There are a thousand ways to caption an image. Contrastive Language Pretraining (CLIP) on the other hand, works by mapping an image and its caption to a single vector -- limiting how well CLIP-like models can represent the diverse ways to describe an image. In this work, we introduce Llip, Latent Language Image Pretraining, which models the diversity of captions that could match an image. Llip's v… ▽ More

    Submitted 29 March, 2025; v1 submitted 29 April, 2024; originally announced May 2024.

    Comments: 14 pages, 8 figures, 7 tables, to be published at ICML2024

  43. arXiv:2404.14952  [pdf, other

    cs.CV cs.AI

    Leveraging Speech for Gesture Detection in Multimodal Communication

    Authors: Esam Ghaleb, Ilya Burenko, Marlou Rasenberg, Wim Pouw, Ivan Toni, Peter Uhrig, Anna Wilson, Judith Holler, Aslı Özyürek, Raquel Fernández

    Abstract: Gestures are inherent to human interaction and often complement speech in face-to-face communication, forming a multimodal communication system. An important task in gesture analysis is detecting a gesture's beginning and end. Research on automatic gesture detection has primarily focused on visual and kinematic information to detect a limited set of isolated or silent gestures with low variability… ▽ More

    Submitted 23 April, 2024; originally announced April 2024.

  44. arXiv:2404.08592  [pdf, other

    cs.CY

    Scarce Resource Allocations That Rely On Machine Learning Should Be Randomized

    Authors: Shomik Jain, Kathleen Creel, Ashia Wilson

    Abstract: Contrary to traditional deterministic notions of algorithmic fairness, this paper argues that fairly allocating scarce resources using machine learning often requires randomness. We address why, when, and how to randomize by proposing stochastic procedures that more adequately account for all of the claims that individuals have to allocations of social goods or opportunities.

    Submitted 19 June, 2024; v1 submitted 12 April, 2024; originally announced April 2024.

    Comments: To appear in the proceedings of the International Conference on Machine Learning (ICML 2024)

    ACM Class: K.4.0

  45. arXiv:2403.16365  [pdf, other

    cs.LG cs.CR cs.CV

    Generating Potent Poisons and Backdoors from Scratch with Guided Diffusion

    Authors: Hossein Souri, Arpit Bansal, Hamid Kazemi, Liam Fowl, Aniruddha Saha, Jonas Geiping, Andrew Gordon Wilson, Rama Chellappa, Tom Goldstein, Micah Goldblum

    Abstract: Modern neural networks are often trained on massive datasets that are web scraped with minimal human inspection. As a result of this insecure curation pipeline, an adversary can poison or backdoor the resulting model by uploading malicious data to the internet and waiting for a victim to scrape and train on it. Existing approaches for creating poisons and backdoors start with randomly sampled clea… ▽ More

    Submitted 24 March, 2024; originally announced March 2024.

  46. arXiv:2403.14029  [pdf, other

    cs.RO

    Quadcopter Team Configurable Motion Guided by a Quadruped

    Authors: Mohammad Ghufran, Sourish Tetakayala, Jack Hughes, Aron Wilson, Hossein Rastgoftar

    Abstract: The paper focuses on modeling and experimental evaluation of a quadcopter team configurable coordination guided by a single quadruped robot. We consider the quadcopter team as particles of a two-dimensional deformable body and propose a two-dimensional affine transformation model for safe and collision-free configurable coordination of this heterogeneous robotic system. The proposed affine transfo… ▽ More

    Submitted 20 March, 2024; originally announced March 2024.

  47. BlendScape: Enabling End-User Customization of Video-Conferencing Environments through Generative AI

    Authors: Shwetha Rajaram, Nels Numan, Balasaravanan Thoravi Kumaravel, Nicolai Marquardt, Andrew D. Wilson

    Abstract: Today's video-conferencing tools support a rich range of professional and social activities, but their generic meeting environments cannot be dynamically adapted to align with distributed collaborators' needs. To enable end-user customization, we developed BlendScape, a rendering and composition system for video-conferencing participants to tailor environments to their meeting context by leveragin… ▽ More

    Submitted 1 October, 2024; v1 submitted 20 March, 2024; originally announced March 2024.

    Comments: ACM UIST 2024

  48. arXiv:2403.09869  [pdf, other

    stat.ML cs.AI cs.LG stat.ME

    Mind the GAP: Improving Robustness to Subpopulation Shifts with Group-Aware Priors

    Authors: Tim G. J. Rudner, Ya Shi Zhang, Andrew Gordon Wilson, Julia Kempe

    Abstract: Machine learning models often perform poorly under subpopulation shifts in the data distribution. Developing methods that allow machine learning models to better generalize to such shifts is crucial for safe deployment in real-world settings. In this paper, we develop a family of group-aware prior (GAP) distributions over neural network parameters that explicitly favor models that generalize well… ▽ More

    Submitted 14 March, 2024; originally announced March 2024.

    Comments: Published in Proceedings of the 27th International Conference on Artificial Intelligence and Statistics (AISTATS 2024)

  49. arXiv:2403.07815  [pdf, other

    cs.LG cs.AI

    Chronos: Learning the Language of Time Series

    Authors: Abdul Fatir Ansari, Lorenzo Stella, Caner Turkmen, Xiyuan Zhang, Pedro Mercado, Huibin Shen, Oleksandr Shchur, Syama Sundar Rangapuram, Sebastian Pineda Arango, Shubham Kapoor, Jasper Zschiegner, Danielle C. Maddix, Hao Wang, Michael W. Mahoney, Kari Torkkola, Andrew Gordon Wilson, Michael Bohlke-Schneider, Yuyang Wang

    Abstract: We introduce Chronos, a simple yet effective framework for pretrained probabilistic time series models. Chronos tokenizes time series values using scaling and quantization into a fixed vocabulary and trains existing transformer-based language model architectures on these tokenized time series via the cross-entropy loss. We pretrained Chronos models based on the T5 family (ranging from 20M to 710M… ▽ More

    Submitted 4 November, 2024; v1 submitted 12 March, 2024; originally announced March 2024.

    Comments: Code and model checkpoints available at https://github.com/amazon-science/chronos-forecasting

  50. arXiv:2403.02695  [pdf, other

    cs.LG

    Controllable Prompt Tuning For Balancing Group Distributional Robustness

    Authors: Hoang Phan, Andrew Gordon Wilson, Qi Lei

    Abstract: Models trained on data composed of different groups or domains can suffer from severe performance degradation under distribution shifts. While recent methods have largely focused on optimizing the worst-group objective, this often comes at the expense of good performance on other groups. To address this problem, we introduce an optimization scheme to achieve good performance across groups and find… ▽ More

    Submitted 4 June, 2024; v1 submitted 5 March, 2024; originally announced March 2024.

    Comments: Proceedings of the 41st International Conference on Machine Learning