Skip to main content

Showing 1–50 of 517 results for author: Jordan, M

.
  1. arXiv:2506.13488  [pdf, ps, other

    cs.LG physics.optics quant-ph

    Imaging at the quantum limit with convolutional neural networks

    Authors: Andrew H. Proppe, Aaron Z. Goldberg, Guillaume Thekkadath, Noah Lupu-Gladstein, Kyle M. Jordan, Philip J. Bustard, Frédéric Bouchard, Duncan England, Khabat Heshami, Jeff S. Lundeen, Benjamin J. Sussman

    Abstract: Deep neural networks have been shown to achieve exceptional performance for computer vision tasks like image recognition, segmentation, and reconstruction or denoising. Here, we evaluate the ultimate performance limits of deep convolutional neural network models for image reconstruction, by comparing them against the standard quantum limit set by shot-noise and the Heisenberg limit on precision. W… ▽ More

    Submitted 16 June, 2025; originally announced June 2025.

  2. arXiv:2506.10887  [pdf, ps, other

    cs.CL cs.LG

    Generalization or Hallucination? Understanding Out-of-Context Reasoning in Transformers

    Authors: Yixiao Huang, Hanlin Zhu, Tianyu Guo, Jiantao Jiao, Somayeh Sojoudi, Michael I. Jordan, Stuart Russell, Song Mei

    Abstract: Large language models (LLMs) can acquire new knowledge through fine-tuning, but this process exhibits a puzzling duality: models can generalize remarkably from new facts, yet are also prone to hallucinating incorrect information. However, the reasons for this phenomenon remain poorly understood. In this work, we argue that both behaviors stem from a single mechanism known as out-of-context reasoni… ▽ More

    Submitted 12 June, 2025; originally announced June 2025.

  3. arXiv:2506.10354  [pdf, ps, other

    math.ST cs.IT

    Revisiting mean estimation over $\ell_p$ balls: Is the MLE optimal?

    Authors: Liviu Aolaritei, Michael I. Jordan, Reese Pathak, Annie Ulichney

    Abstract: We revisit the problem of mean estimation on $\ell_p$ balls under additive Gaussian noise. When $p$ is strictly less than $2$, it is well understood that rate-optimal estimators must be nonlinear in the observations. In this work, we study the maximum likelihood estimator (MLE), which may be viewed as a nonlinear shrinkage procedure for mean estimation over $\ell_p$ balls. We demonstrate two pheno… ▽ More

    Submitted 12 June, 2025; originally announced June 2025.

    Comments: 37 pages, 3 figures

  4. arXiv:2506.05295  [pdf, ps, other

    cs.LG cs.AI stat.ML

    Sample Complexity and Representation Ability of Test-time Scaling Paradigms

    Authors: Baihe Huang, Shanda Li, Tianhao Wu, Yiming Yang, Ameet Talwalkar, Kannan Ramchandran, Michael I. Jordan, Jiantao Jiao

    Abstract: Test-time scaling paradigms have significantly advanced the capabilities of large language models (LLMs) on complex tasks. Despite their empirical success, theoretical understanding of the sample efficiency of various test-time strategies -- such as self-consistency, best-of-$n$, and self-correction -- remains limited. In this work, we first establish a separation result between two repeated sampl… ▽ More

    Submitted 12 June, 2025; v1 submitted 5 June, 2025; originally announced June 2025.

  5. arXiv:2505.18223  [pdf, ps, other

    cs.CL cs.AI

    IDA-Bench: Evaluating LLMs on Interactive Guided Data Analysis

    Authors: Hanyu Li, Haoyu Liu, Tingyu Zhu, Tianyu Guo, Zeyu Zheng, Xiaotie Deng, Michael I. Jordan

    Abstract: Large Language Models (LLMs) show promise as data analysis agents, but existing benchmarks overlook the iterative nature of the field, where experts' decisions evolve with deeper insights of the dataset. To address this, we introduce IDA-Bench, a novel benchmark evaluating LLM agents in multi-round interactive scenarios. Derived from complex Kaggle notebooks, tasks are presented as sequential natu… ▽ More

    Submitted 6 June, 2025; v1 submitted 23 May, 2025; originally announced May 2025.

  6. arXiv:2505.13732  [pdf, ps, other

    stat.ML cs.LG

    Backward Conformal Prediction

    Authors: Etienne Gauthier, Francis Bach, Michael I. Jordan

    Abstract: We introduce $\textit{Backward Conformal Prediction}$, a method that guarantees conformal coverage while providing flexible control over the size of prediction sets. Unlike standard conformal prediction, which fixes the coverage level and allows the conformal set size to vary, our approach defines a rule that constrains how prediction set sizes behave based on the observed data, and adapts the cov… ▽ More

    Submitted 19 May, 2025; originally announced May 2025.

    Comments: Code available at: https://github.com/GauthierE/backward-cp

  7. arXiv:2505.13564  [pdf, ps, other

    cs.LG stat.ML

    Online Decision-Focused Learning

    Authors: Aymeric Capitaine, Maxime Haddouche, Eric Moulines, Michael I. Jordan, Etienne Boursier, Alain Durmus

    Abstract: Decision-focused learning (DFL) is an increasingly popular paradigm for training predictive models whose outputs are used in decision-making tasks. Instead of merely optimizing for predictive accuracy, DFL trains models to directly minimize the loss associated with downstream decisions. This end-to-end strategy holds promise for tackling complex combinatorial problems; however, existing studies fo… ▽ More

    Submitted 19 May, 2025; originally announced May 2025.

  8. arXiv:2505.05145  [pdf, other

    cs.LG cs.AI cs.CL

    Understanding In-context Learning of Addition via Activation Subspaces

    Authors: Xinyan Hu, Kayo Yin, Michael I. Jordan, Jacob Steinhardt, Lijie Chen

    Abstract: To perform in-context learning, language models must extract signals from individual few-shot examples, aggregate these into a learned prediction rule, and then apply this rule to new examples. How is this implemented in the forward pass of modern transformer models? To study this, we consider a structured family of few-shot learning tasks for which the true prediction rule is to add an integer… ▽ More

    Submitted 15 May, 2025; v1 submitted 8 May, 2025; originally announced May 2025.

    Comments: 20 pages

  9. arXiv:2505.04607  [pdf, other

    quant-ph

    Experimental demonstration of a multi-particle collective measurement for optimal quantum state estimation

    Authors: Arman Mansouri, Kyle M. Jordan, Raphael A. Abrahao, Jeff S. Lundeen

    Abstract: We experimentally demonstrate a two-particle collective measurement proposed as the optimal solution to a quantum state estimation game. Our results suggest that, in practice, the collective measurement strategy is at least as good as the best local approach, and it achieves a higher average fidelity when accounting for systematic errors. This photonic implementation uses a recently developed univ… ▽ More

    Submitted 13 May, 2025; v1 submitted 7 May, 2025; originally announced May 2025.

  10. arXiv:2504.03560  [pdf, other

    math.OC cs.LG math.ST stat.ML

    Stochastic Optimization with Optimal Importance Sampling

    Authors: Liviu Aolaritei, Bart P. G. Van Parys, Henry Lam, Michael I. Jordan

    Abstract: Importance Sampling (IS) is a widely used variance reduction technique for enhancing the efficiency of Monte Carlo methods, particularly in rare-event simulation and related applications. Despite its power, the performance of IS is often highly sensitive to the choice of the proposal distribution and frequently requires stochastic calibration techniques. While the design and analysis of IS have be… ▽ More

    Submitted 4 April, 2025; originally announced April 2025.

  11. arXiv:2504.02818  [pdf, other

    math.ST stat.ME

    Universal Log-Optimality for General Classes of e-processes and Sequential Hypothesis Tests

    Authors: Ian Waudby-Smith, Ricardo Sandoval, Michael I. Jordan

    Abstract: We consider the problem of sequential hypothesis testing by betting. For a general class of composite testing problems -- which include bounded mean testing, equal mean testing for bounded random tuples, and some key ingredients of two-sample and independence testing as special cases -- we show that any $e$-process satisfying a certain sublinear regret bound is adaptively, asymptotically, and almo… ▽ More

    Submitted 3 April, 2025; originally announced April 2025.

  12. arXiv:2503.19068  [pdf, other

    stat.ML cs.AI cs.LG stat.ME stat.OT

    Minimum Volume Conformal Sets for Multivariate Regression

    Authors: Sacha Braun, Liviu Aolaritei, Michael I. Jordan, Francis Bach

    Abstract: Conformal prediction provides a principled framework for constructing predictive sets with finite-sample validity. While much of the focus has been on univariate response variables, existing multivariate methods either impose rigid geometric assumptions or rely on flexible but computationally expensive approaches that do not explicitly optimize prediction set volume. We propose an optimization-dri… ▽ More

    Submitted 24 March, 2025; originally announced March 2025.

  13. arXiv:2503.13050  [pdf, other

    stat.ML cs.LG

    E-Values Expand the Scope of Conformal Prediction

    Authors: Etienne Gauthier, Francis Bach, Michael I. Jordan

    Abstract: Conformal prediction is a powerful framework for distribution-free uncertainty quantification. The standard approach to conformal prediction relies on comparing the ranks of prediction scores: under exchangeability, the rank of a future test point cannot be too extreme relative to a calibration set. This rank-based method can be reformulated in terms of p-values. In this paper, we explore an alter… ▽ More

    Submitted 6 May, 2025; v1 submitted 17 March, 2025; originally announced March 2025.

    Comments: Code available at: https://github.com/GauthierE/evalues-expand-cp

  14. arXiv:2503.11895  [pdf, ps, other

    cs.CL cs.AI cs.LG

    Resolving UnderEdit & OverEdit with Iterative & Neighbor-Assisted Model Editing

    Authors: Bhiman Kumar Baghel, Scott M. Jordan, Zheyuan Ryan Shi, Xiang Lorraine Li

    Abstract: Large Language Models (LLMs) are widely deployed in downstream tasks, but keeping their knowledge up-to-date via retraining or fine-tuning is often computationally expensive. Model editing provides a more efficient alternative by updating a targeted subset of parameters, which often follows the locate-and-edit paradigm. Despite this efficiency, existing methods are limited: edits may fail to injec… ▽ More

    Submitted 17 June, 2025; v1 submitted 14 March, 2025; originally announced March 2025.

    Comments: Under Review

  15. arXiv:2503.07879  [pdf, other

    cs.CL cs.LG

    Datasets, Documents, and Repetitions: The Practicalities of Unequal Data Quality

    Authors: Alex Fang, Hadi Pouransari, Matt Jordan, Alexander Toshev, Vaishaal Shankar, Ludwig Schmidt, Tom Gunter

    Abstract: Data filtering has become a powerful tool for improving model performance while reducing computational cost. However, as large language model compute budgets continue to grow, the limited data volume provided by heavily filtered and deduplicated datasets will become a practical constraint. In efforts to better understand how to proceed, we study model performance at various compute budgets and acr… ▽ More

    Submitted 10 March, 2025; originally announced March 2025.

  16. arXiv:2503.06582  [pdf, other

    econ.TH cs.GT

    The Role of the Marketplace Operator in Inducing Competition

    Authors: Tiffany Ding, Dominique Perrault-Joncas, Orit Ronen, Michael I. Jordan, Dirk Bergemann, Dean Foster, Omer Gottesman

    Abstract: The steady rise of e-commerce marketplaces underscores the need to study a market structure that captures the key features of this setting. To this end, we consider a price-quantity Stackelberg duopoly in which the leader is the marketplace operator and the follower is an independent seller. The objective of the marketplace operator is to maximize a weighted sum of profit and a term capturing posi… ▽ More

    Submitted 9 March, 2025; originally announced March 2025.

  17. arXiv:2502.17814  [pdf, other

    stat.ML cs.AI cs.CL cs.LG

    An Overview of Large Language Models for Statisticians

    Authors: Wenlong Ji, Weizhe Yuan, Emily Getzen, Kyunghyun Cho, Michael I. Jordan, Song Mei, Jason E Weston, Weijie J. Su, Jing Xu, Linjun Zhang

    Abstract: Large Language Models (LLMs) have emerged as transformative tools in artificial intelligence (AI), exhibiting remarkable capabilities across diverse tasks such as text generation, reasoning, and decision-making. While their success has primarily been driven by advances in computational power and deep learning architectures, emerging problems -- in areas such as uncertainty quantification, decision… ▽ More

    Submitted 24 February, 2025; originally announced February 2025.

  18. arXiv:2502.14105  [pdf, other

    stat.ML cs.LG math.ST stat.ME

    Conformal Prediction under Levy-Prokhorov Distribution Shifts: Robustness to Local and Global Perturbations

    Authors: Liviu Aolaritei, Zheyu Oliver Wang, Julie Zhu, Michael I. Jordan, Youssef Marzouk

    Abstract: Conformal prediction provides a powerful framework for constructing prediction intervals with finite-sample guarantees, yet its robustness under distribution shifts remains a significant challenge. This paper addresses this limitation by modeling distribution shifts using Levy-Prokhorov (LP) ambiguity sets, which capture both local and global perturbations. We provide a self-contained overview of… ▽ More

    Submitted 18 May, 2025; v1 submitted 19 February, 2025; originally announced February 2025.

  19. arXiv:2502.13913  [pdf, other

    cs.CL cs.AI

    How Do LLMs Perform Two-Hop Reasoning in Context?

    Authors: Tianyu Guo, Hanlin Zhu, Ruiqi Zhang, Jiantao Jiao, Song Mei, Michael I. Jordan, Stuart Russell

    Abstract: ``Socrates is human. All humans are mortal. Therefore, Socrates is mortal.'' This form of argument illustrates a typical pattern of two-hop reasoning. Formally, two-hop reasoning refers to the process of inferring a conclusion by making two logical steps, each connecting adjacent concepts, such that the final conclusion depends on the integration of both steps. It is one of the most fundamental co… ▽ More

    Submitted 28 May, 2025; v1 submitted 19 February, 2025; originally announced February 2025.

  20. arXiv:2502.04879  [pdf, other

    stat.ML cs.LG

    Statistical Collusion by Collectives on Learning Platforms

    Authors: Etienne Gauthier, Francis Bach, Michael I. Jordan

    Abstract: As platforms increasingly rely on learning algorithms, collectives may form and seek ways to influence these platforms to align with their own interests. This can be achieved by coordinated submission of altered data. To evaluate the potential impact of such behavior, it is essential to understand the computations that collectives must perform to impact platforms in this way. In particular, collec… ▽ More

    Submitted 25 May, 2025; v1 submitted 7 February, 2025; originally announced February 2025.

    Comments: Code available at: https://github.com/GauthierE/statistical-collusion

  21. arXiv:2501.19388  [pdf, other

    cs.GT

    Online Decision-Making in Tree-Like Multi-Agent Games with Transfers

    Authors: Antoine Scheid, Etienne Boursier, Alain Durmus, Eric Moulines, Michael Jordan

    Abstract: The widespread deployment of Machine Learning systems everywhere raises challenges, such as dealing with interactions or competition between multiple learners. In that goal, we study multi-agent sequential decision-making by considering principal-agent interactions in a tree structure. In this problem, the reward of a player is influenced by the actions of her children, who are all self-interested… ▽ More

    Submitted 27 May, 2025; v1 submitted 31 January, 2025; originally announced January 2025.

  22. arXiv:2501.19195  [pdf, other

    cs.LG cs.AI

    Rethinking Early Stopping: Refine, Then Calibrate

    Authors: Eugène Berta, David Holzmüller, Michael I. Jordan, Francis Bach

    Abstract: Machine learning classifiers often produce probabilistic predictions that are critical for accurate and interpretable decision-making in various domains. The quality of these predictions is generally evaluated with proper losses like cross-entropy, which decompose into two components: calibration error assesses general under/overconfidence, while refinement error measures the ability to distinguis… ▽ More

    Submitted 31 January, 2025; originally announced January 2025.

  23. arXiv:2501.19144  [pdf, other

    cs.GT

    Prediction-Aware Learning in Multi-Agent Systems

    Authors: Aymeric Capitaine, Etienne Boursier, Eric Moulines, Michael I. Jordan, Alain Durmus

    Abstract: The framework of uncoupled online learning in multiplayer games has made significant progress in recent years. In particular, the development of time-varying games has considerably expanded its modeling capabilities. However, current regret bounds quickly become vacuous when the game undergoes significant variations over time, even when these variations are easy to predict. Intuitively, the abilit… ▽ More

    Submitted 31 January, 2025; originally announced January 2025.

  24. arXiv:2501.15910  [pdf, ps, other

    cs.LG eess.SY math.OC stat.ML

    The Sample Complexity of Online Reinforcement Learning: A Multi-model Perspective

    Authors: Michael Muehlebach, Zhiyu He, Michael I. Jordan

    Abstract: We study the sample complexity of online reinforcement learning in the general setting of nonlinear dynamical systems with continuous state and action spaces. Our analysis accommodates a large class of dynamical systems ranging from a finite set of nonlinear candidate models to models with bounded and Lipschitz continuous dynamics, to systems that are parametrized by a compact and real-valued set… ▽ More

    Submitted 20 May, 2025; v1 submitted 27 January, 2025; originally announced January 2025.

    Comments: 29 pages, 3 figures

  25. arXiv:2501.10139  [pdf, other

    cs.LG cs.AI stat.ME stat.ML

    Conformal Prediction Sets with Improved Conditional Coverage using Trust Scores

    Authors: Jivat Neet Kaur, Michael I. Jordan, Ahmed Alaa

    Abstract: Standard conformal prediction offers a marginal guarantee on coverage, but for prediction sets to be truly useful, they should ideally ensure coverage conditional on each test point. Unfortunately, it is impossible to achieve exact, distribution-free conditional coverage in finite samples. In this work, we propose an alternative conformal prediction algorithm that targets coverage where it matters… ▽ More

    Submitted 9 February, 2025; v1 submitted 17 January, 2025; originally announced January 2025.

  26. arXiv:2501.08330  [pdf, other

    cs.LG math.OC math.ST stat.ML

    Gradient Equilibrium in Online Learning: Theory and Applications

    Authors: Anastasios N. Angelopoulos, Michael I. Jordan, Ryan J. Tibshirani

    Abstract: We present a new perspective on online learning that we refer to as gradient equilibrium: a sequence of iterates achieves gradient equilibrium if the average of gradients of losses along the sequence converges to zero. In general, this condition is not implied by, nor implies, sublinear regret. It turns out that gradient equilibrium is achievable by standard online learning methods such as gradien… ▽ More

    Submitted 18 February, 2025; v1 submitted 14 January, 2025; originally announced January 2025.

    Comments: Code available at https://github.com/aangelopoulos/gradient-equilibrium/

  27. arXiv:2501.00656  [pdf, other

    cs.CL cs.LG

    2 OLMo 2 Furious

    Authors: Team OLMo, Pete Walsh, Luca Soldaini, Dirk Groeneveld, Kyle Lo, Shane Arora, Akshita Bhagia, Yuling Gu, Shengyi Huang, Matt Jordan, Nathan Lambert, Dustin Schwenk, Oyvind Tafjord, Taira Anderson, David Atkinson, Faeze Brahman, Christopher Clark, Pradeep Dasigi, Nouha Dziri, Michal Guerquin, Hamish Ivison, Pang Wei Koh, Jiacheng Liu, Saumya Malik, William Merrill , et al. (15 additional authors not shown)

    Abstract: We present OLMo 2, the next generation of our fully open language models. OLMo 2 includes dense autoregressive models with improved architecture and training recipe, pretraining data mixtures, and instruction tuning recipes. Our modified model architecture and training recipe achieve both better training stability and improved per-token efficiency. Our updated pretraining data mixture introduces a… ▽ More

    Submitted 14 January, 2025; v1 submitted 31 December, 2024; originally announced January 2025.

    Comments: Model demo available at playground.allenai.org

  28. arXiv:2412.08060  [pdf, ps, other

    stat.ML cs.LG math.OC

    An Optimistic Algorithm for Online Convex Optimization with Adversarial Constraints

    Authors: Jordan Lekeufack, Michael I. Jordan

    Abstract: We study Online Convex Optimization (OCO) with adversarial constraints, where an online algorithm must make sequential decisions to minimize both convex loss functions and cumulative constraint violations. We focus on a setting where the algorithm has access to predictions of the loss and constraint functions. Our results show that we can improve the current best bounds of $ O(\sqrt{T}) $ regret a… ▽ More

    Submitted 12 March, 2025; v1 submitted 10 December, 2024; originally announced December 2024.

    Comments: 18 pages

  29. arXiv:2412.02556  [pdf, other

    physics.plasm-ph

    Quadrupolar Density Structures in Driven Magnetic Reconnection Experiments with a Guide Field

    Authors: T. W. O. Varnish, J. Chen, S. Chowdhry, R. Datta, G. V. Dowhan, L. S. Horan IV, N. M. Jordan, E. R. Neill, A. P. Shah, B. J. Sporer, R. Shapovalov, R. D. McBride, J. D. Hare

    Abstract: Magnetic reconnection is a ubiquitous process in plasma physics, driving rapid and energetic events such as coronal mass ejections. Reconnection between magnetic fields with arbitrary shear can be decomposed into an anti-parallel, reconnecting component, and a non-reconnecting guide-field component which is parallel to the reconnecting electric field. This guide field modifies the structure of the… ▽ More

    Submitted 3 December, 2024; originally announced December 2024.

    Comments: 12 pages, 9 figures. Submitted to Physics of Plasmas for review

  30. arXiv:2411.19073  [pdf, other

    astro-ph.EP astro-ph.IM

    Hydrodynamical simulations with strong indirect terms in Fargo-like codes: Numerical aspects of non-inertial frame and artificial viscosity

    Authors: Lucas M. Jordan, Thomas Rometsch

    Abstract: Context. Binary star systems allow us to study the planet formation process under extreme conditions. In the early stages, these systems contain a circumbinary disk and a disk around each star. To model the interactions between these disks in the frame of one of the stars, strong fictitious forces must be included in the simulations. The original Fargo and the Fargo3D codes fail to correctly simul… ▽ More

    Submitted 28 November, 2024; originally announced November 2024.

    Comments: 13 pages, 13 figures, accepted by A&A

    Journal ref: A&A 693, A177 (2025)

  31. arXiv:2411.16066  [pdf

    physics.plasm-ph

    Stability of Crossed-Field Amplifiers

    Authors: Christopher Swenson, Ryan Revolinsky, Adam Brusstar, Emma Guerin, Nicholas M. Jordan, Y. Y. Lau, Ronald Gilgenbach

    Abstract: This research examines the stability of crossed-field amplifiers (CFAs) and characterizes their different modes of operation: amplification, driven oscillation, and self-excited oscillation. The CFA used in this paper is the Recirculating Planar Crossed-Field Amplifier (RPCFA), which is a high power (MW) pulsed (300 ns) amplifier that operates around 3 GHz. Initially, the RPCFA is shown to be a st… ▽ More

    Submitted 4 December, 2024; v1 submitted 24 November, 2024; originally announced November 2024.

  32. arXiv:2411.00775  [pdf, ps, other

    cs.LG stat.ML

    Dimension-free Private Mean Estimation for Anisotropic Distributions

    Authors: Yuval Dagan, Michael I. Jordan, Xuelin Yang, Lydia Zakynthinou, Nikita Zhivotovskiy

    Abstract: We present differentially private algorithms for high-dimensional mean estimation. Previous private estimators on distributions over $\mathbb{R}^d$ suffer from a curse of dimensionality, as they require $Ω(d^{1/2})$ samples to achieve non-trivial error, even in cases where $O(1)$ samples suffice without privacy. This rate is unavoidable when the distribution is isotropic, namely, when the covarian… ▽ More

    Submitted 1 November, 2024; originally announced November 2024.

  33. arXiv:2410.20649  [pdf, ps, other

    cs.LG math.OC stat.ML

    Learning Variational Inequalities from Data: Fast Generalization Rates under Strong Monotonicity

    Authors: Eric Zhao, Tatjana Chavdarova, Michael Jordan

    Abstract: Variational inequalities (VIs) are a broad class of optimization problems encompassing machine learning problems ranging from standard convex minimization to more complex scenarios like min-max optimization and computing the equilibria of multi-player games. In convex optimization, strong convexity allows for fast statistical learning rates requiring only $Θ(1/ε)$ stochastic first-order oracle cal… ▽ More

    Submitted 18 February, 2025; v1 submitted 27 October, 2024; originally announced October 2024.

  34. arXiv:2410.18404  [pdf, other

    cs.LG cs.CR stat.ML

    Enhancing Feature-Specific Data Protection via Bayesian Coordinate Differential Privacy

    Authors: Maryam Aliakbarpour, Syomantak Chaudhuri, Thomas A. Courtade, Alireza Fallah, Michael I. Jordan

    Abstract: Local Differential Privacy (LDP) offers strong privacy guarantees without requiring users to trust external parties. However, LDP applies uniform protection to all data features, including less sensitive ones, which degrades performance of downstream tasks. To overcome this limitation, we propose a Bayesian framework, Bayesian Coordinate Differential Privacy (BCDP), that enables feature-specific p… ▽ More

    Submitted 23 October, 2024; originally announced October 2024.

  35. arXiv:2410.17055  [pdf, other

    cs.LG stat.ML

    Optimal Design for Reward Modeling in RLHF

    Authors: Antoine Scheid, Etienne Boursier, Alain Durmus, Michael I. Jordan, Pierre Ménard, Eric Moulines, Michal Valko

    Abstract: Reinforcement Learning from Human Feedback (RLHF) has become a popular approach to align language models (LMs) with human preferences. This method involves collecting a large dataset of human pairwise preferences across various text generations and using it to infer (implicitly or explicitly) a reward model. Numerous methods have been proposed to learn the reward model and align a LM with it. Howe… ▽ More

    Submitted 23 October, 2024; v1 submitted 22 October, 2024; originally announced October 2024.

  36. arXiv:2410.13835  [pdf, other

    cs.LG

    Active-Dormant Attention Heads: Mechanistically Demystifying Extreme-Token Phenomena in LLMs

    Authors: Tianyu Guo, Druv Pai, Yu Bai, Jiantao Jiao, Michael I. Jordan, Song Mei

    Abstract: Practitioners have consistently observed three puzzling phenomena in transformer-based large language models (LLMs): attention sinks, value-state drains, and residual-state peaks, collectively referred to as extreme-token phenomena. These phenomena are characterized by certain so-called "sink tokens" receiving disproportionately high attention weights, exhibiting significantly smaller value states… ▽ More

    Submitted 7 November, 2024; v1 submitted 17 October, 2024; originally announced October 2024.

  37. arXiv:2409.16528  [pdf, other

    physics.app-ph physics.ins-det quant-ph

    Wide-field microwave magnetic field imaging with nitrogen-vacancy centers in diamond

    Authors: Luca Basso, Pauli Kehayias, Jacob Henshaw, Gajadhar Joshi, Michael P. Lilly, Matthew B. Jordan, Andrew M. Mounce

    Abstract: Non-invasive imaging of microwave (MW) magnetic fields with microscale lateral resolution is pivotal for various applications, such as MW technologies and integrated circuit failure analysis. Diamond nitrogen-vacancy (NV) center magnetometry has emerged as an ideal tool, offering $μ$m-scale resolution, millimeter-scale field of view, high sensitivity, and non-invasive imaging compatible with diver… ▽ More

    Submitted 18 October, 2024; v1 submitted 24 September, 2024; originally announced September 2024.

  38. arXiv:2409.03734  [pdf, other

    cs.LG cs.CY econ.GN stat.ML

    Safety vs. Performance: How Multi-Objective Learning Reduces Barriers to Market Entry

    Authors: Meena Jagadeesan, Michael I. Jordan, Jacob Steinhardt

    Abstract: Emerging marketplaces for large language models and other large-scale machine learning (ML) models appear to exhibit market concentration, which has raised concerns about whether there are insurmountable barriers to entry in such markets. In this work, we study this issue from both an economic and an algorithmic point of view, focusing on a phenomenon that reduces barriers to entry. Specifically,… ▽ More

    Submitted 5 September, 2024; originally announced September 2024.

  39. arXiv:2409.01855  [pdf, other

    cs.CR

    Graph-based Modeling and Simulation of Emergency Services Communication Systems

    Authors: Jardi Martinez Jordan, Michael Stiber

    Abstract: Emergency Services Communication Systems (ESCS) are evolving into Internet Protocol based communication networks, promising enhancements to their function, availability, and resilience. This increase in complexity and cyber-attack surface demands better understanding of these systems' breakdown dynamics under extreme circumstances. Existing ESCS research largely overlooks simulation and the little… ▽ More

    Submitted 3 September, 2024; originally announced September 2024.

    Comments: 8 pages, 3 figures

  40. arXiv:2408.11974  [pdf, other

    cs.LG math.OC

    Two-Timescale Gradient Descent Ascent Algorithms for Nonconvex Minimax Optimization

    Authors: Tianyi Lin, Chi Jin, Michael. I. Jordan

    Abstract: We provide a unified analysis of two-timescale gradient descent ascent (TTGDA) for solving structured nonconvex minimax optimization problems in the form of $\min_\textbf{x} \max_{\textbf{y} \in Y} f(\textbf{x}, \textbf{y})$, where the objective function $f(\textbf{x}, \textbf{y})$ is nonconvex in $\textbf{x}$ and concave in $\textbf{y}$, and the constraint set $Y \subseteq \mathbb{R}^n$ is convex… ▽ More

    Submitted 27 January, 2025; v1 submitted 21 August, 2024; originally announced August 2024.

    Comments: Accepted by Journal of Machine Learning Research; A preliminary version [arXiv:1906.00331] of this paper, with a subset of the results that are presented here, was presented at ICML 2020; 44 Pages, 10 Figures

  41. arXiv:2408.10389  [pdf, other

    physics.app-ph

    Enhancement of Photoresponse for InGaAs Infrared Photodetectors Using Plasmonic WO3-x/CsyWO3-x Nanocrystals

    Authors: Zach D. Merino, Gyorgy Jaics, Andrew W. M. Jordan, Arjun Shetty, Penghui Yin, Man C. Tam, Xinning Wang, Zbig. R. Wasilewski, Pavle V. Radovanovic, Jonathan Baugh

    Abstract: Fast and accurate detection of light in the near-infrared (NIR) spectral range plays a crucial role in modern society, from alleviating speed and capacity bottlenecks in optical communications to enhancing the control and safety of autonomous vehicles through NIR imaging systems. Several technological platforms are currently under investigation to improve NIR photodetection, aiming to surpass the… ▽ More

    Submitted 26 August, 2024; v1 submitted 19 August, 2024; originally announced August 2024.

  42. Two-dimensional simulations of disks in close binaries: Simulating outburst cycles in cataclysmic variables

    Authors: Lucas M. Jordan, Dennis Wehner, Rolf Kuiper

    Abstract: Previous simulations of cataclysmic variables studied either the quiescence, or the outburst state in multiple dimensions or they simulated complete outburst cycles in one dimension using simplified models for the gravitational torques. We self-consistently simulate complete outburst cycles of normal and superoutbursts in cataclysmic variable systems in two dimensions. We study the effect of diffe… ▽ More

    Submitted 23 July, 2024; originally announced July 2024.

    Comments: Accepted in A&A. 21 pages, 27 figures

    Journal ref: A&A 689, A354 (2024)

  43. arXiv:2407.14332  [pdf, ps, other

    cs.GT

    Unravelling in Collaborative Learning

    Authors: Aymeric Capitaine, Etienne Boursier, Antoine Scheid, Eric Moulines, Michael I. Jordan, El-Mahdi El-Mhamdi, Alain Durmus

    Abstract: Collaborative learning offers a promising avenue for leveraging decentralized data. However, collaboration in groups of strategic learners is not a given. In this work, we consider strategic agents who wish to train a model together but have sampling distributions of different quality. The collaboration is organized by a benevolent aggregator who gathers samples so as to maximize total welfare, bu… ▽ More

    Submitted 10 December, 2024; v1 submitted 19 July, 2024; originally announced July 2024.

  44. arXiv:2406.19824  [pdf, other

    cs.GT stat.ML

    Learning to Mitigate Externalities: the Coase Theorem with Hindsight Rationality

    Authors: Antoine Scheid, Aymeric Capitaine, Etienne Boursier, Eric Moulines, Michael I Jordan, Alain Durmus

    Abstract: In economic theory, the concept of externality refers to any indirect effect resulting from an interaction between players that affects the social welfare. Most of the models within which externality has been studied assume that agents have perfect knowledge of their environment and preferences. This is a major hindrance to the practical implementation of many proposed solutions. To address this i… ▽ More

    Submitted 28 January, 2025; v1 submitted 28 June, 2024; originally announced June 2024.

  45. arXiv:2406.17819  [pdf, other

    cs.LG cs.AI

    Automatically Adaptive Conformal Risk Control

    Authors: Vincent Blot, Anastasios N Angelopoulos, Michael I Jordan, Nicolas J-B Brunel

    Abstract: Science and technology have a growing need for effective mechanisms that ensure reliable, controlled performance from black-box machine learning algorithms. These performance guarantees should ideally hold conditionally on the input-that is the performance guarantees should hold, at least approximately, no matter what the input. However, beyond stylized discrete groupings such as ethnicity and gen… ▽ More

    Submitted 27 March, 2025; v1 submitted 25 June, 2024; originally announced June 2024.

  46. arXiv:2406.16241  [pdf, other

    cs.LG stat.ME

    Position: Benchmarking is Limited in Reinforcement Learning Research

    Authors: Scott M. Jordan, Adam White, Bruno Castro da Silva, Martha White, Philip S. Thomas

    Abstract: Novel reinforcement learning algorithms, or improvements on existing ones, are commonly justified by evaluating their performance on benchmark environments and are compared to an ever-changing set of standard algorithms. However, despite numerous calls for improvements, experimental practices continue to produce misleading or unsupported claims. One reason for the ongoing substandard practices is… ▽ More

    Submitted 23 June, 2024; originally announced June 2024.

    Comments: 19 pages, 13 figures, The Forty-first International Conference on Machine Learning (ICML 2024)

  47. arXiv:2406.15898  [pdf, other

    cs.GT cs.LG

    Defection-Free Collaboration between Competitors in a Learning System

    Authors: Mariel Werner, Sai Praneeth Karimireddy, Michael I. Jordan

    Abstract: We study collaborative learning systems in which the participants are competitors who will defect from the system if they lose revenue by collaborating. As such, we frame the system as a duopoly of competitive firms who are each engaged in training machine-learning models and selling their predictions to a market of consumers. We first examine a fully collaborative scheme in which both firms share… ▽ More

    Submitted 22 June, 2024; originally announced June 2024.

  48. arXiv:2406.11794  [pdf, other

    cs.LG cs.CL

    DataComp-LM: In search of the next generation of training sets for language models

    Authors: Jeffrey Li, Alex Fang, Georgios Smyrnis, Maor Ivgi, Matt Jordan, Samir Gadre, Hritik Bansal, Etash Guha, Sedrick Keh, Kushal Arora, Saurabh Garg, Rui Xin, Niklas Muennighoff, Reinhard Heckel, Jean Mercat, Mayee Chen, Suchin Gururangan, Mitchell Wortsman, Alon Albalak, Yonatan Bitton, Marianna Nezhurina, Amro Abbas, Cheng-Yu Hsieh, Dhruba Ghosh, Josh Gardner , et al. (34 additional authors not shown)

    Abstract: We introduce DataComp for Language Models (DCLM), a testbed for controlled dataset experiments with the goal of improving language models. As part of DCLM, we provide a standardized corpus of 240T tokens extracted from Common Crawl, effective pretraining recipes based on the OpenLM framework, and a broad suite of 53 downstream evaluations. Participants in the DCLM benchmark can experiment with dat… ▽ More

    Submitted 21 April, 2025; v1 submitted 17 June, 2024; originally announced June 2024.

    Comments: Project page: https://www.datacomp.ai/dclm/

  49. arXiv:2406.11271  [pdf, other

    cs.CV cs.LG

    MINT-1T: Scaling Open-Source Multimodal Data by 10x: A Multimodal Dataset with One Trillion Tokens

    Authors: Anas Awadalla, Le Xue, Oscar Lo, Manli Shu, Hannah Lee, Etash Kumar Guha, Matt Jordan, Sheng Shen, Mohamed Awadalla, Silvio Savarese, Caiming Xiong, Ran Xu, Yejin Choi, Ludwig Schmidt

    Abstract: Multimodal interleaved datasets featuring free-form interleaved sequences of images and text are crucial for training frontier large multimodal models (LMMs). Despite the rapid progression of open-source LMMs, there remains a pronounced scarcity of large-scale, diverse open-source multimodal interleaved datasets. In response, we introduce MINT-1T, the most extensive and diverse open-source Multimo… ▽ More

    Submitted 30 October, 2024; v1 submitted 17 June, 2024; originally announced June 2024.

  50. arXiv:2406.07029  [pdf, other

    cs.LG

    Fairness-Aware Meta-Learning via Nash Bargaining

    Authors: Yi Zeng, Xuelin Yang, Li Chen, Cristian Canton Ferrer, Ming Jin, Michael I. Jordan, Ruoxi Jia

    Abstract: To address issues of group-level fairness in machine learning, it is natural to adjust model parameters based on specific fairness objectives over a sensitive-attributed validation set. Such an adjustment procedure can be cast within a meta-learning framework. However, naive integration of fairness goals via meta-learning can cause hypergradient conflicts for subgroups, resulting in unstable conve… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.