Showing 1–50 of 294 results for author: Jordan, I

Searching in archive stat.
  1. arXiv:2506.20173  [pdf, ps, other]

    stat.ML cs.AI cs.LG stat.ME stat.OT

    Valid Selection among Conformal Sets

    Authors: Mahmoud Hegazy, Liviu Aolaritei, Michael I. Jordan, Aymeric Dieuleveut

    Abstract: Conformal prediction offers a distribution-free framework for constructing prediction sets with coverage guarantees. In practice, multiple valid conformal prediction sets may be available, arising from different models or methodologies. However, selecting the most desirable set, such as the smallest, can invalidate the coverage guarantees. To address this challenge, we propose a stability-based ap…

    Submitted 25 June, 2025; originally announced June 2025.

  2. arXiv:2506.05295  [pdf, ps, other]

    cs.LG cs.AI stat.ML

    Sample Complexity and Representation Ability of Test-time Scaling Paradigms

    Authors: Baihe Huang, Shanda Li, Tianhao Wu, Yiming Yang, Ameet Talwalkar, Kannan Ramchandran, Michael I. Jordan, Jiantao Jiao

    Abstract: Test-time scaling paradigms have significantly advanced the capabilities of large language models (LLMs) on complex tasks. Despite their empirical success, theoretical understanding of the sample efficiency of various test-time strategies -- such as self-consistency, best-of-$n$, and self-correction -- remains limited. In this work, we first establish a separation result between two repeated sampl…

    Submitted 12 June, 2025; v1 submitted 5 June, 2025; originally announced June 2025.

  3. arXiv:2506.03744  [pdf, ps, other]

    stat.AP physics.ao-ph stat.ML

    Probabilistic measures afford fair comparisons of AIWP and NWP model output

    Authors: Tilmann Gneiting, Tobias Biegert, Kristof Kraus, Eva-Maria Walz, Alexander I. Jordan, Sebastian Lerch

    Abstract: We introduce a new measure for fair and meaningful comparisons of single-valued output from artificial intelligence based weather prediction (AIWP) and numerical weather prediction (NWP) models, called potential continuous ranked probability score (PC). In a nutshell, we subject the deterministic backbone of physics-based and data-driven models post hoc to the same statistical postprocessing techn…

    Submitted 4 June, 2025; originally announced June 2025.

  4. arXiv:2505.13732  [pdf, ps, other]

    stat.ML cs.LG

    Backward Conformal Prediction

    Authors: Etienne Gauthier, Francis Bach, Michael I. Jordan

    Abstract: We introduce $\textit{Backward Conformal Prediction}$, a method that guarantees conformal coverage while providing flexible control over the size of prediction sets. Unlike standard conformal prediction, which fixes the coverage level and allows the conformal set size to vary, our approach defines a rule that constrains how prediction set sizes behave based on the observed data, and adapts the cov…

    Submitted 19 May, 2025; originally announced May 2025.

    Comments: Code available at: https://github.com/GauthierE/backward-cp

  5. arXiv:2505.13564  [pdf, ps, other]

    cs.LG stat.ML

    Online Decision-Focused Learning

    Authors: Aymeric Capitaine, Maxime Haddouche, Eric Moulines, Michael I. Jordan, Etienne Boursier, Alain Durmus

    Abstract: Decision-focused learning (DFL) is an increasingly popular paradigm for training predictive models whose outputs are used in decision-making tasks. Instead of merely optimizing for predictive accuracy, DFL trains models to directly minimize the loss associated with downstream decisions. This end-to-end strategy holds promise for tackling complex combinatorial problems; however, existing studies fo…

    Submitted 19 May, 2025; originally announced May 2025.

  6. arXiv:2504.03560  [pdf, other]

    math.OC cs.LG math.ST stat.ML

    Stochastic Optimization with Optimal Importance Sampling

    Authors: Liviu Aolaritei, Bart P. G. Van Parys, Henry Lam, Michael I. Jordan

    Abstract: Importance Sampling (IS) is a widely used variance reduction technique for enhancing the efficiency of Monte Carlo methods, particularly in rare-event simulation and related applications. Despite its power, the performance of IS is often highly sensitive to the choice of the proposal distribution and frequently requires stochastic calibration techniques. While the design and analysis of IS have be…

    Submitted 4 April, 2025; originally announced April 2025.

  7. arXiv:2504.02818  [pdf, other]

    math.ST stat.ME

    Universal Log-Optimality for General Classes of e-processes and Sequential Hypothesis Tests

    Authors: Ian Waudby-Smith, Ricardo Sandoval, Michael I. Jordan

    Abstract: We consider the problem of sequential hypothesis testing by betting. For a general class of composite testing problems -- which include bounded mean testing, equal mean testing for bounded random tuples, and some key ingredients of two-sample and independence testing as special cases -- we show that any $e$-process satisfying a certain sublinear regret bound is adaptively, asymptotically, and almo…

    Submitted 3 April, 2025; originally announced April 2025.

  8. arXiv:2503.19068  [pdf, other]

    stat.ML cs.AI cs.LG stat.ME stat.OT

    Minimum Volume Conformal Sets for Multivariate Regression

    Authors: Sacha Braun, Liviu Aolaritei, Michael I. Jordan, Francis Bach

    Abstract: Conformal prediction provides a principled framework for constructing predictive sets with finite-sample validity. While much of the focus has been on univariate response variables, existing multivariate methods either impose rigid geometric assumptions or rely on flexible but computationally expensive approaches that do not explicitly optimize prediction set volume. We propose an optimization-dri…

    Submitted 24 March, 2025; originally announced March 2025.

  9. arXiv:2503.13050  [pdf, other]

    stat.ML cs.LG

    E-Values Expand the Scope of Conformal Prediction

    Authors: Etienne Gauthier, Francis Bach, Michael I. Jordan

    Abstract: Conformal prediction is a powerful framework for distribution-free uncertainty quantification. The standard approach to conformal prediction relies on comparing the ranks of prediction scores: under exchangeability, the rank of a future test point cannot be too extreme relative to a calibration set. This rank-based method can be reformulated in terms of p-values. In this paper, we explore an alter…

    Submitted 6 May, 2025; v1 submitted 17 March, 2025; originally announced March 2025.

    Comments: Code available at: https://github.com/GauthierE/evalues-expand-cp

  10. arXiv:2502.17814  [pdf, other]

    stat.ML cs.AI cs.CL cs.LG

    An Overview of Large Language Models for Statisticians

    Authors: Wenlong Ji, Weizhe Yuan, Emily Getzen, Kyunghyun Cho, Michael I. Jordan, Song Mei, Jason E Weston, Weijie J. Su, Jing Xu, Linjun Zhang

    Abstract: Large Language Models (LLMs) have emerged as transformative tools in artificial intelligence (AI), exhibiting remarkable capabilities across diverse tasks such as text generation, reasoning, and decision-making. While their success has primarily been driven by advances in computational power and deep learning architectures, emerging problems -- in areas such as uncertainty quantification, decision…

    Submitted 24 February, 2025; originally announced February 2025.

  11. arXiv:2502.14105  [pdf, other]

    stat.ML cs.LG math.ST stat.ME

    Conformal Prediction under Levy-Prokhorov Distribution Shifts: Robustness to Local and Global Perturbations

    Authors: Liviu Aolaritei, Zheyu Oliver Wang, Julie Zhu, Michael I. Jordan, Youssef Marzouk

    Abstract: Conformal prediction provides a powerful framework for constructing prediction intervals with finite-sample guarantees, yet its robustness under distribution shifts remains a significant challenge. This paper addresses this limitation by modeling distribution shifts using Levy-Prokhorov (LP) ambiguity sets, which capture both local and global perturbations. We provide a self-contained overview of…

    Submitted 18 May, 2025; v1 submitted 19 February, 2025; originally announced February 2025.

  12. arXiv:2502.04879  [pdf, other]

    stat.ML cs.LG

    Statistical Collusion by Collectives on Learning Platforms

    Authors: Etienne Gauthier, Francis Bach, Michael I. Jordan

    Abstract: As platforms increasingly rely on learning algorithms, collectives may form and seek ways to influence these platforms to align with their own interests. This can be achieved by coordinated submission of altered data. To evaluate the potential impact of such behavior, it is essential to understand the computations that collectives must perform to impact platforms in this way. In particular, collec…

    Submitted 25 May, 2025; v1 submitted 7 February, 2025; originally announced February 2025.

    Comments: Code available at: https://github.com/GauthierE/statistical-collusion

  13. arXiv:2501.15910  [pdf, ps, other]

    cs.LG eess.SY math.OC stat.ML

    The Sample Complexity of Online Reinforcement Learning: A Multi-model Perspective

    Authors: Michael Muehlebach, Zhiyu He, Michael I. Jordan

    Abstract: We study the sample complexity of online reinforcement learning in the general setting of nonlinear dynamical systems with continuous state and action spaces. Our analysis accommodates a large class of dynamical systems ranging from a finite set of nonlinear candidate models to models with bounded and Lipschitz continuous dynamics, to systems that are parametrized by a compact and real-valued set…

    Submitted 20 May, 2025; v1 submitted 27 January, 2025; originally announced January 2025.

    Comments: 29 pages, 3 figures

  14. arXiv:2501.10139  [pdf, other]

    cs.LG cs.AI stat.ME stat.ML

    Conformal Prediction Sets with Improved Conditional Coverage using Trust Scores

    Authors: Jivat Neet Kaur, Michael I. Jordan, Ahmed Alaa

    Abstract: Standard conformal prediction offers a marginal guarantee on coverage, but for prediction sets to be truly useful, they should ideally ensure coverage conditional on each test point. Unfortunately, it is impossible to achieve exact, distribution-free conditional coverage in finite samples. In this work, we propose an alternative conformal prediction algorithm that targets coverage where it matters…

    Submitted 9 February, 2025; v1 submitted 17 January, 2025; originally announced January 2025.

  15. arXiv:2501.08330  [pdf, other]

    cs.LG math.OC math.ST stat.ML

    Gradient Equilibrium in Online Learning: Theory and Applications

    Authors: Anastasios N. Angelopoulos, Michael I. Jordan, Ryan J. Tibshirani

    Abstract: We present a new perspective on online learning that we refer to as gradient equilibrium: a sequence of iterates achieves gradient equilibrium if the average of gradients of losses along the sequence converges to zero. In general, this condition is not implied by, nor implies, sublinear regret. It turns out that gradient equilibrium is achievable by standard online learning methods such as gradien…

    Submitted 18 February, 2025; v1 submitted 14 January, 2025; originally announced January 2025.

    Comments: Code available at https://github.com/aangelopoulos/gradient-equilibrium/

  16. arXiv:2412.08060  [pdf, ps, other]

    stat.ML cs.LG math.OC

    An Optimistic Algorithm for Online Convex Optimization with Adversarial Constraints

    Authors: Jordan Lekeufack, Michael I. Jordan

    Abstract: We study Online Convex Optimization (OCO) with adversarial constraints, where an online algorithm must make sequential decisions to minimize both convex loss functions and cumulative constraint violations. We focus on a setting where the algorithm has access to predictions of the loss and constraint functions. Our results show that we can improve the current best bounds of $ O(\sqrt{T}) $ regret a…

    Submitted 12 March, 2025; v1 submitted 10 December, 2024; originally announced December 2024.

    Comments: 18 pages

  17. arXiv:2411.00775  [pdf, ps, other]

    cs.LG stat.ML

    Dimension-free Private Mean Estimation for Anisotropic Distributions

    Authors: Yuval Dagan, Michael I. Jordan, Xuelin Yang, Lydia Zakynthinou, Nikita Zhivotovskiy

    Abstract: We present differentially private algorithms for high-dimensional mean estimation. Previous private estimators on distributions over $\mathbb{R}^d$ suffer from a curse of dimensionality, as they require $Ω(d^{1/2})$ samples to achieve non-trivial error, even in cases where $O(1)$ samples suffice without privacy. This rate is unavoidable when the distribution is isotropic, namely, when the covarian…

    Submitted 1 November, 2024; originally announced November 2024.

  18. arXiv:2410.18404  [pdf, other]

    cs.LG cs.CR stat.ML

    Enhancing Feature-Specific Data Protection via Bayesian Coordinate Differential Privacy

    Authors: Maryam Aliakbarpour, Syomantak Chaudhuri, Thomas A. Courtade, Alireza Fallah, Michael I. Jordan

    Abstract: Local Differential Privacy (LDP) offers strong privacy guarantees without requiring users to trust external parties. However, LDP applies uniform protection to all data features, including less sensitive ones, which degrades performance of downstream tasks. To overcome this limitation, we propose a Bayesian framework, Bayesian Coordinate Differential Privacy (BCDP), that enables feature-specific p…

    Submitted 23 October, 2024; originally announced October 2024.

  19. arXiv:2410.17055  [pdf, other]

    cs.LG stat.ML

    Optimal Design for Reward Modeling in RLHF

    Authors: Antoine Scheid, Etienne Boursier, Alain Durmus, Michael I. Jordan, Pierre Ménard, Eric Moulines, Michal Valko

    Abstract: Reinforcement Learning from Human Feedback (RLHF) has become a popular approach to align language models (LMs) with human preferences. This method involves collecting a large dataset of human pairwise preferences across various text generations and using it to infer (implicitly or explicitly) a reward model. Numerous methods have been proposed to learn the reward model and align a LM with it. Howe…

    Submitted 23 October, 2024; v1 submitted 22 October, 2024; originally announced October 2024.

  20. arXiv:2409.03734  [pdf, other]

    cs.LG cs.CY econ.GN stat.ML

    Safety vs. Performance: How Multi-Objective Learning Reduces Barriers to Market Entry

    Authors: Meena Jagadeesan, Michael I. Jordan, Jacob Steinhardt

    Abstract: Emerging marketplaces for large language models and other large-scale machine learning (ML) models appear to exhibit market concentration, which has raised concerns about whether there are insurmountable barriers to entry in such markets. In this work, we study this issue from both an economic and an algorithmic point of view, focusing on a phenomenon that reduces barriers to entry. Specifically,…

    Submitted 5 September, 2024; originally announced September 2024.

  21. arXiv:2406.19824  [pdf, other]

    cs.GT stat.ML

    Learning to Mitigate Externalities: the Coase Theorem with Hindsight Rationality

    Authors: Antoine Scheid, Aymeric Capitaine, Etienne Boursier, Eric Moulines, Michael I. Jordan, Alain Durmus

    Abstract: In economic theory, the concept of externality refers to any indirect effect resulting from an interaction between players that affects the social welfare. Most of the models within which externality has been studied assume that agents have perfect knowledge of their environment and preferences. This is a major hindrance to the practical implementation of many proposed solutions. To address this i…

    Submitted 28 January, 2025; v1 submitted 28 June, 2024; originally announced June 2024.

  22. arXiv:2404.18490  [pdf, other]

    cs.LG stat.ML

    Reduced-Rank Multi-objective Policy Learning and Optimization

    Authors: Ezinne Nwankwo, Michael I. Jordan, Angela Zhou

    Abstract: Evaluating the causal impacts of possible interventions is crucial for informing decision-making, especially towards improving access to opportunity. However, if causal effects are heterogeneous and predictable from covariates, personalized treatment decisions can improve individual outcomes and contribute to both efficiency and equity. In practice, however, causal researchers do not have a single…

    Submitted 29 April, 2024; originally announced April 2024.

  23. arXiv:2404.15746  [pdf, other]

    stat.ML cs.CR cs.LG

    Collaborative Heterogeneous Causal Inference Beyond Meta-analysis

    Authors: Tianyu Guo, Sai Praneeth Karimireddy, Michael I. Jordan

    Abstract: Collaboration between different data centers is often challenged by heterogeneity across sites. To account for the heterogeneity, the state-of-the-art method is to re-weight the covariate distributions in each site to match the distribution of the target population. Nevertheless, this method could easily fail when a certain site couldn't cover the entire population. Moreover, it still relies on th…

    Submitted 24 April, 2024; originally announced April 2024.

    Comments: submitted to ICML

  24. arXiv:2403.19605  [pdf, other]

    stat.ME cs.LG

    Data-Adaptive Tradeoffs among Multiple Risks in Distribution-Free Prediction

    Authors: Drew T. Nguyen, Reese Pathak, Anastasios N. Angelopoulos, Stephen Bates, Michael I. Jordan

    Abstract: Decision-making pipelines are generally characterized by tradeoffs among various risk functions. It is often desirable to manage such tradeoffs in a data-adaptive manner. As we demonstrate, if this is done naively, state-of-the-art uncertainty quantification methods can lead to significant violations of putative risk guarantees. To address this issue, we develop methods that permit valid control…

    Submitted 28 March, 2024; originally announced March 2024.

    Comments: 27 pages, 10 figures

  25. arXiv:2403.07008  [pdf, other]

    cs.LG cs.AI cs.CL stat.ME

    AutoEval Done Right: Using Synthetic Data for Model Evaluation

    Authors: Pierre Boyeau, Anastasios N. Angelopoulos, Nir Yosef, Jitendra Malik, Michael I. Jordan

    Abstract: The evaluation of machine learning models using human-labeled validation data can be expensive and time-consuming. AI-labeled synthetic data can be used to decrease the number of human annotations required for this purpose in a process called autoevaluation. We suggest efficient and statistically principled algorithms for this purpose that improve sample efficiency while remaining unbiased. These…

    Submitted 28 May, 2024; v1 submitted 8 March, 2024; originally announced March 2024.

    Comments: New experiments, fix fig 1

  26. arXiv:2403.03811  [pdf, other]

    stat.ML cs.GT cs.LG

    Incentivized Learning in Principal-Agent Bandit Games

    Authors: Antoine Scheid, Daniil Tiapkin, Etienne Boursier, Aymeric Capitaine, El Mahdi El Mhamdi, Eric Moulines, Michael I. Jordan, Alain Durmus

    Abstract: This work considers a repeated principal-agent bandit game, where the principal can only interact with her environment through the agent. The principal and the agent have misaligned objectives and the choice of action is only left to the agent. However, the principal can influence the agent's decisions by offering incentives which add up to his rewards. The principal aims to iteratively learn an i…

    Submitted 6 March, 2024; originally announced March 2024.

  27. arXiv:2401.16335  [pdf, other]

    cs.LG cs.AI cs.CL stat.ML

    Iterative Data Smoothing: Mitigating Reward Overfitting and Overoptimization in RLHF

    Authors: Banghua Zhu, Michael I. Jordan, Jiantao Jiao

    Abstract: Reinforcement Learning from Human Feedback (RLHF) is a pivotal technique that aligns language models closely with human-centric values. The initial phase of RLHF involves learning human values using a reward model from ranking data. It is observed that the performance of the reward model degrades after one epoch of training, and optimizing too much against the learned reward model eventually hinde…

    Submitted 29 January, 2024; originally announced January 2024.

  28. arXiv:2312.07930  [pdf, other]

    cs.LG cs.CL cs.CR cs.IT stat.ML

    Towards Optimal Statistical Watermarking

    Authors: Baihe Huang, Hanlin Zhu, Banghua Zhu, Kannan Ramchandran, Michael I. Jordan, Jason D. Lee, Jiantao Jiao

    Abstract: We study statistical watermarking by formulating it as a hypothesis testing problem, a general framework which subsumes all previous statistical watermarking methods. Key to our formulation is a coupling of the output tokens and the rejection region, realized by pseudo-random generators in practice, that allows non-trivial trade-offs between the Type I error and Type II error. We characterize the…

    Submitted 6 February, 2024; v1 submitted 13 December, 2023; originally announced December 2023.

  29. arXiv:2310.05921  [pdf, other]

    stat.ML cs.LG cs.RO stat.ME

    Conformal Decision Theory: Safe Autonomous Decisions from Imperfect Predictions

    Authors: Jordan Lekeufack, Anastasios N. Angelopoulos, Andrea Bajcsy, Michael I. Jordan, Jitendra Malik

    Abstract: We introduce Conformal Decision Theory, a framework for producing safe autonomous decisions despite imperfect machine learning predictions. Examples of such decisions are ubiquitous, from robot planning algorithms that rely on pedestrian predictions, to calibrating autonomous manufacturing to exhibit high throughput and low error, to the choice of trusting a nominal policy versus switching to a sa…

    Submitted 2 May, 2024; v1 submitted 9 October, 2023; originally announced October 2023.

    Comments: 8 pages, 5 figures

  30. arXiv:2309.04877  [pdf, other]

    cs.LG stat.ML

    A Gentle Introduction to Gradient-Based Optimization and Variational Inequalities for Machine Learning

    Authors: Neha S. Wadia, Yatin Dandi, Michael I. Jordan

    Abstract: The rapid progress in machine learning in recent years has been based on a highly productive connection to gradient-based optimization. Further progress hinges in part on a shift in focus from pattern recognition to decision-making and multi-agent problems. In these broader settings, new mathematical challenges emerge that involve equilibria and game theory instead of optima. Gradient-based method…

    Submitted 26 February, 2024; v1 submitted 9 September, 2023; originally announced September 2023.

    Comments: 36 pages, 7 figures; minor corrections

  31. arXiv:2309.01837  [pdf, other]

    cs.LG stat.ML

    Delegating Data Collection in Decentralized Machine Learning

    Authors: Nivasini Ananthakrishnan, Stephen Bates, Michael I. Jordan, Nika Haghtalab

    Abstract: Motivated by the emergence of decentralized machine learning (ML) ecosystems, we study the delegation of data collection. Taking the field of contract theory as our starting point, we design optimal and near-optimal contracts that deal with two fundamental information asymmetries that arise in decentralized ML: uncertainty in the assessment of model quality and uncertainty regarding the optimal pe…

    Submitted 20 November, 2024; v1 submitted 4 September, 2023; originally announced September 2023.

  32. arXiv:2307.13381  [pdf, other]

    cs.LG cs.DC math.OC stat.ML

    Scaff-PD: Communication Efficient Fair and Robust Federated Learning

    Authors: Yaodong Yu, Sai Praneeth Karimireddy, Yi Ma, Michael I. Jordan

    Abstract: We present Scaff-PD, a fast and communication-efficient algorithm for distributionally robust federated learning. Our approach improves fairness by optimizing a family of distributionally robust objectives tailored to heterogeneous clients. We leverage the special structure of these objectives, and design an accelerated primal dual (APD) algorithm which uses bias corrected local steps (as in Scaff…

    Submitted 25 July, 2023; originally announced July 2023.

    MSC Class: 68W40; 68W15; 90C25; 90C06

    ACM Class: G.1.6; F.2.1; E.4

  33. arXiv:2307.03748  [pdf, other]

    stat.ME cs.GT cs.LG stat.ML

    Incentive-Theoretic Bayesian Inference for Collaborative Science

    Authors: Stephen Bates, Michael I. Jordan, Michael Sklar, Jake A. Soloff

    Abstract: Contemporary scientific research is a distributed, collaborative endeavor, carried out by teams of researchers, regulatory institutions, funding agencies, commercial partners, and scientific bodies, all interacting with each other and facing different incentives. To maintain scientific rigor, statistical methods should acknowledge this state of affairs. To this end, we study hypothesis testing whe…

    Submitted 8 February, 2024; v1 submitted 7 July, 2023; originally announced July 2023.

  34. arXiv:2307.00126  [pdf, other]

    math.OC cs.LG stat.ML

    Accelerating Inexact HyperGradient Descent for Bilevel Optimization

    Authors: Haikuo Yang, Luo Luo, Chris Junchi Li, Michael I. Jordan

    Abstract: We present a method for solving general nonconvex-strongly-convex bilevel optimization problems. Our method -- the \emph{Restarted Accelerated HyperGradient Descent} (\texttt{RAHGD}) method -- finds an $ε$-first-order stationary point of the objective with $\tilde{\mathcal{O}}(κ^{3.25}ε^{-1.75})$ oracle complexity, where $κ$ is the condition number of the lower-level objective and $ε$ is the desir…

    Submitted 30 June, 2023; originally announced July 2023.

  35. arXiv:2306.14670  [pdf, other]

    cs.GT cs.CY cs.LG stat.ML

    Improved Bayes Risk Can Yield Reduced Social Welfare Under Competition

    Authors: Meena Jagadeesan, Michael I. Jordan, Jacob Steinhardt, Nika Haghtalab

    Abstract: As the scale of machine learning models increases, trends such as scaling laws anticipate consistent downstream improvements in predictive accuracy. However, these trends take the perspective of a single model-provider in isolation, while in reality providers often compete with each other for users. In this work, we demonstrate that competition can fundamentally alter the behavior of these scaling…

    Submitted 6 February, 2024; v1 submitted 26 June, 2023; originally announced June 2023.

    Comments: Appeared at NeurIPS 2023; this is the full version

  36. arXiv:2306.09335  [pdf, other]

    stat.ML cs.CV cs.LG stat.ME

    Class-Conditional Conformal Prediction with Many Classes

    Authors: Tiffany Ding, Anastasios N. Angelopoulos, Stephen Bates, Michael I. Jordan, Ryan J. Tibshirani

    Abstract: Standard conformal prediction methods provide a marginal coverage guarantee, which means that for a random test point, the conformal prediction set contains the true label with a user-specified probability. In many classification problems, we would like to obtain a stronger guarantee--that for test points of a specific class, the prediction set contains the true label with the same user-chosen pro…

    Submitted 27 October, 2023; v1 submitted 15 June, 2023; originally announced June 2023.

  37. arXiv:2306.07479  [pdf, ps, other]

    cs.GT cs.IR cs.LG stat.ML

    Incentivizing High-Quality Content in Online Recommender Systems

    Authors: Xinyan Hu, Meena Jagadeesan, Michael I. Jordan, Jacob Steinhardt

    Abstract: In content recommender systems such as TikTok and YouTube, the platform's recommendation algorithm shapes content producer incentives. Many platforms employ online learning, which generates intertemporal incentives, since content produced today affects recommendations of future content. We study the game between producers and analyze the content created at equilibrium. We show that standard online…

    Submitted 21 June, 2024; v1 submitted 12 June, 2023; originally announced June 2023.

    Comments: Updated version with revised and expanded content

  38. arXiv:2306.02003  [pdf, other]

    cs.LG cs.AI cs.PF eess.SY stat.ML

    On Optimal Caching and Model Multiplexing for Large Model Inference

    Authors: Banghua Zhu, Ying Sheng, Lianmin Zheng, Clark Barrett, Michael I. Jordan, Jiantao Jiao

    Abstract: Large Language Models (LLMs) and other large foundation models have achieved noteworthy success, but their size exacerbates existing resource consumption and latency challenges. In particular, the large-scale deployment of these models is hindered by the significant resource requirements during inference. In this paper, we study two approaches for mitigating these challenges: employing a cache to…

    Submitted 28 August, 2023; v1 submitted 3 June, 2023; originally announced June 2023.

  39. arXiv:2303.06317  [pdf, ps, other]

    stat.ME

    Evaluating Sensitivity to the Stick-Breaking Prior in Bayesian Nonparametrics (Rejoinder)

    Authors: Ryan Giordano, Runjing Liu, Michael I. Jordan, Tamara Broderick

    Abstract: One can typically form a local robustness metric for a particular problem quite directly, for Markov chain Monte Carlo applications as well as optimization problems such as variational Bayes. However, we argue that simply forming a local robustness metric is not enough: the hard work is showing that it is useful. Computability, interpretability, and the ability of a local robustness metric to extr…

    Submitted 11 March, 2023; originally announced March 2023.

    Comments: Rejoinder for the discussion article "Evaluating Sensitivity to the Stick-Breaking Prior in Bayesian Nonparametrics" in Bayesian Analysis

  40. arXiv:2302.00316  [pdf, other]

    math.OC cs.LG eess.SP stat.ML

    Accelerated First-Order Optimization under Nonlinear Constraints

    Authors: Michael Muehlebach, Michael I. Jordan

    Abstract: We exploit analogies between first-order algorithms for constrained optimization and non-smooth dynamical systems to design a new class of accelerated first-order algorithms for constrained optimization. Unlike Frank-Wolfe or projected gradients, these algorithms avoid optimization over the entire feasible set at each iteration. We prove convergence to stationary points even in a nonconvex setting…

    Submitted 1 May, 2025; v1 submitted 1 February, 2023; originally announced February 2023.

    Comments: 44 pages, 6 figures

  41. arXiv:2301.11270  [pdf, other]

    cs.LG cs.AI cs.HC math.ST stat.ML

    Principled Reinforcement Learning with Human Feedback from Pairwise or $K$-wise Comparisons

    Authors: Banghua Zhu, Jiantao Jiao, Michael I. Jordan

    Abstract: We provide a theoretical framework for Reinforcement Learning with Human Feedback (RLHF). Our analysis shows that when the true reward function is linear, the widely used maximum likelihood estimator (MLE) converges under both the Bradley-Terry-Luce (BTL) model and the Plackett-Luce (PL) model. However, we show that when training a policy based on the learned reward model, MLE fails while a pessim…

    Submitted 7 February, 2024; v1 submitted 26 January, 2023; originally announced January 2023.

  42. arXiv:2301.10803  [pdf, other]

    stat.ME cs.LG stat.ML

    Evaluating Probabilistic Classifiers: The Triptych

    Authors: Timo Dimitriadis, Tilmann Gneiting, Alexander I. Jordan, Peter Vogel

    Abstract: Probability forecasts for binary outcomes, often referred to as probabilistic classifiers or confidence scores, are ubiquitous in science and society, and methods for evaluating and comparing them are in great demand. We propose and study a triptych of diagnostic graphics that focus on distinct and complementary aspects of forecast performance: The reliability diagram addresses calibration, the re…

    Submitted 25 January, 2023; originally announced January 2023.

  43. arXiv:2301.09633  [pdf, other]

    stat.ML cs.AI cs.LG q-bio.QM stat.ME

    Prediction-Powered Inference

    Authors: Anastasios N. Angelopoulos, Stephen Bates, Clara Fannjiang, Michael I. Jordan, Tijana Zrnic

    Abstract: Prediction-powered inference is a framework for performing valid statistical inference when an experimental dataset is supplemented with predictions from a machine-learning system. The framework yields simple algorithms for computing provably valid confidence intervals for quantities such as means, quantiles, and linear and logistic regression coefficients, without making any assumptions on the ma…

    Submitted 9 November, 2023; v1 submitted 23 January, 2023; originally announced January 2023.

    Comments: Code is available at https://github.com/aangelopoulos/ppi_py

  44. arXiv:2211.15381  [pdf, other]

    cs.IR cs.LG stat.ML

    Incentive-Aware Recommender Systems in Two-Sided Markets

    Authors: Xiaowu Dai, Wenlu Xu, Yuan Qi, Michael I. Jordan

    Abstract: Online platforms in the Internet Economy commonly incorporate recommender systems that recommend products (or "arms") to users (or "agents"). A key challenge in this domain arises from myopic agents who are naturally incentivized to exploit by choosing the optimal arm based on current information, rather than exploring various alternatives to gather information that benefits the collective. We pro…

    Submitted 18 June, 2024; v1 submitted 23 November, 2022; originally announced November 2022.

  45. arXiv:2210.17550  [pdf, other]

    math.OC cs.GT cs.LG stat.ML

    Nesterov Meets Optimism: Rate-Optimal Separable Minimax Optimization

    Authors: Chris Junchi Li, Angela Yuan, Gauthier Gidel, Quanquan Gu, Michael I. Jordan

    Abstract: We propose a new first-order optimization algorithm -- AcceleratedGradient-OptimisticGradient (AG-OG) Descent Ascent -- for separable convex-concave minimax optimization. The main idea of our algorithm is to carefully leverage the structure of the minimax problem, performing Nesterov acceleration on the individual component and optimistic gradient on the coupling component. Equipped with proper re…

    Submitted 14 August, 2023; v1 submitted 31 October, 2022; originally announced October 2022.

    Comments: 44 pages. This version matches the camera-ready that appeared at ICML 2023 under the same title

  46. arXiv:2210.15659  [pdf, other]

    stat.ML cs.LG

    A Primal-Dual Approach to Solving Variational Inequalities with General Constraints

    Authors: Tatjana Chavdarova, Tong Yang, Matteo Pagliardini, Michael I. Jordan

    Abstract: Yang et al. (2023) recently showed how to use first-order gradient methods to solve general variational inequalities (VIs) under a limiting assumption that analytic solutions of specific subproblems are available. In this paper, we circumvent this assumption via a warm-starting technique where we solve subproblems approximately and initialize variables with the approximate solution found at the pr…

    Submitted 3 August, 2024; v1 submitted 27 October, 2022; originally announced October 2022.

    Comments: Source code at https://github.com/Chavdarova/I-ACVI

    Journal ref: ICLR 2024

  47. arXiv:2210.10278  [pdf, other]

    cs.LG cs.GT stat.ML

    A Reinforcement Learning Approach in Multi-Phase Second-Price Auction Design

    Authors: Rui Ai, Boxiang Lyu, Zhaoran Wang, Zhuoran Yang, Michael I. Jordan

    Abstract: We study reserve price optimization in multi-phase second price auctions, where seller's prior actions affect the bidders' later valuations through a Markov Decision Process (MDP). Compared to the bandit setting in existing works, the setting in ours involves three challenges. First, from the seller's perspective, we need to efficiently explore the environment in the presence of potentially nontru…

    Submitted 18 October, 2022; originally announced October 2022.

  48. arXiv:2210.04334  [pdf, ps, other]

    stat.ME cs.LG eess.SP

    QuTE: decentralized multiple testing on sensor networks with false discovery rate control

    Authors: Aaditya Ramdas, Jianbo Chen, Martin J. Wainwright, Michael I. Jordan

    Abstract: This paper designs methods for decentralized multiple hypothesis testing on graphs that are equipped with provable guarantees on the false discovery rate (FDR). We consider the setting where distinct agents reside on the nodes of an undirected graph, and each agent possesses p-values corresponding to one or more hypotheses local to its node. Each agent must individually decide whether to reject on…

    Submitted 7 July, 2025; v1 submitted 9 October, 2022; originally announced October 2022.

    Comments: This paper appeared in the IEEE CDC'17 conference proceedings. The last two sections were then developed in 2018, and it is now being put on arXiv simply for easier access. The latest version fixed some figures

  49. arXiv:2209.15634  [pdf, other]

    cs.LG cs.AI stat.ML

    A General Framework for Sample-Efficient Function Approximation in Reinforcement Learning

    Authors: Zixiang Chen, Chris Junchi Li, Angela Yuan, Quanquan Gu, Michael I. Jordan

    Abstract: With the increasing need for handling large state and action spaces, general function approximation has become a key technique in reinforcement learning (RL). In this paper, we propose a general framework that unifies model-based and model-free RL, and an Admissible Bellman Characterization (ABC) class that subsumes nearly all Markov Decision Process (MDP) models in the literature for tractable RL…

    Submitted 30 September, 2022; originally announced September 2022.

  50. arXiv:2208.13701  [pdf, other]

    stat.ME cs.LG math.OC stat.ML

    Data-Driven Influence Functions for Optimization-Based Causal Inference

    Authors: Michael I. Jordan, Yixin Wang, Angela Zhou

    Abstract: We study a constructive algorithm that approximates Gateaux derivatives for statistical functionals by finite differencing, with a focus on functionals that arise in causal inference. We study the case where probability distributions are not known a priori but need to be estimated from data. These estimated distributions lead to empirical Gateaux derivatives, and we study the relationships betwe…

    Submitted 15 June, 2023; v1 submitted 29 August, 2022; originally announced August 2022.

    Comments: Extended version of conference version "Empirical Gateaux Derivatives for Causal Inference" accepted at NeurIPS 2022; new results on optimization and sensitivity analysis