Showing 1–19 of 19 results for author: Angelopoulos, A N

Searching in archive stat.
  1. arXiv:2501.08330  [pdf, other]

    cs.LG math.OC math.ST stat.ML

    Gradient Equilibrium in Online Learning: Theory and Applications

    Authors: Anastasios N. Angelopoulos, Michael I. Jordan, Ryan J. Tibshirani

    Abstract: We present a new perspective on online learning that we refer to as gradient equilibrium: a sequence of iterates achieves gradient equilibrium if the average of gradients of losses along the sequence converges to zero. In general, this condition is not implied by, nor implies, sublinear regret. It turns out that gradient equilibrium is achievable by standard online learning methods such as gradien…

    Submitted 18 February, 2025; v1 submitted 14 January, 2025; originally announced January 2025.

    Comments: Code available at https://github.com/aangelopoulos/gradient-equilibrium/

  2. arXiv:2411.11824  [pdf, ps, other]

    math.ST stat.ME stat.ML

    Theoretical Foundations of Conformal Prediction

    Authors: Anastasios N. Angelopoulos, Rina Foygel Barber, Stephen Bates

    Abstract: This book is about conformal prediction and related inferential techniques that build on permutation tests and exchangeability. These techniques are useful in a diverse array of tasks, including hypothesis testing and providing uncertainty quantification guarantees for machine learning systems. Much of the current interest in conformal prediction is due to its ability to integrate into complex mac…

    Submitted 3 June, 2025; v1 submitted 18 November, 2024; originally announced November 2024.

    Comments: This material will be published by Cambridge University Press as Theoretical Foundations of Conformal Prediction by Anastasios N. Angelopoulos, Rina Foygel Barber, and Stephen Bates. This prepublication version is free to view/download for personal use only. Not for redistribution/resale/use in derivative works. Copyright Anastasios N. Angelopoulos, Rina Foygel Barber, and Stephen Bates, 2025

  3. arXiv:2403.19605  [pdf, other]

    stat.ME cs.LG

    Data-Adaptive Tradeoffs among Multiple Risks in Distribution-Free Prediction

    Authors: Drew T. Nguyen, Reese Pathak, Anastasios N. Angelopoulos, Stephen Bates, Michael I. Jordan

    Abstract: Decision-making pipelines are generally characterized by tradeoffs among various risk functions. It is often desirable to manage such tradeoffs in a data-adaptive manner. As we demonstrate, if this is done naively, state-of-the-art uncertainty quantification methods can lead to significant violations of putative risk guarantees. To address this issue, we develop methods that permit valid control…

    Submitted 28 March, 2024; originally announced March 2024.

    Comments: 27 pages, 10 figures

  4. arXiv:2403.07008  [pdf, other]

    cs.LG cs.AI cs.CL stat.ME

    AutoEval Done Right: Using Synthetic Data for Model Evaluation

    Authors: Pierre Boyeau, Anastasios N. Angelopoulos, Nir Yosef, Jitendra Malik, Michael I. Jordan

    Abstract: The evaluation of machine learning models using human-labeled validation data can be expensive and time-consuming. AI-labeled synthetic data can be used to decrease the number of human annotations required for this purpose in a process called autoevaluation. We suggest efficient and statistically principled algorithms for this purpose that improve sample efficiency while remaining unbiased. These…

    Submitted 28 May, 2024; v1 submitted 8 March, 2024; originally announced March 2024.

    Comments: New experiments, fix fig 1

  5. arXiv:2402.01139  [pdf, other]

    stat.ML cs.LG stat.ME

    Online conformal prediction with decaying step sizes

    Authors: Anastasios N. Angelopoulos, Rina Foygel Barber, Stephen Bates

    Abstract: We introduce a method for online conformal prediction with decaying step sizes. Like previous methods, ours possesses a retrospective guarantee of coverage for arbitrary sequences. However, unlike previous methods, we can simultaneously estimate a population quantile when it exists. Our theory and experiments indicate substantially improved practical properties: in particular, when the distributio…

    Submitted 28 May, 2024; v1 submitted 1 February, 2024; originally announced February 2024.

  6. arXiv:2311.01453  [pdf, other]

    stat.ML cs.LG stat.ME

    PPI++: Efficient Prediction-Powered Inference

    Authors: Anastasios N. Angelopoulos, John C. Duchi, Tijana Zrnic

    Abstract: We present PPI++: a computationally lightweight methodology for estimation and inference based on a small labeled dataset and a typically much larger dataset of machine-learning predictions. The methods automatically adapt to the quality of available predictions, yielding easy-to-compute confidence sets -- for parameters of any dimensionality -- that always improve on classical intervals using onl…

    Submitted 25 March, 2024; v1 submitted 2 November, 2023; originally announced November 2023.

    Comments: Code available at https://github.com/aangelopoulos/ppi_py

  7. arXiv:2310.05921  [pdf, other]

    stat.ML cs.LG cs.RO stat.ME

    Conformal Decision Theory: Safe Autonomous Decisions from Imperfect Predictions

    Authors: Jordan Lekeufack, Anastasios N. Angelopoulos, Andrea Bajcsy, Michael I. Jordan, Jitendra Malik

    Abstract: We introduce Conformal Decision Theory, a framework for producing safe autonomous decisions despite imperfect machine learning predictions. Examples of such decisions are ubiquitous, from robot planning algorithms that rely on pedestrian predictions, to calibrating autonomous manufacturing to exhibit high throughput and low error, to the choice of trusting a nominal policy versus switching to a sa…

    Submitted 2 May, 2024; v1 submitted 9 October, 2023; originally announced October 2023.

    Comments: 8 pages, 5 figures

  8. arXiv:2307.16895  [pdf, other]

    cs.LG eess.SY stat.ME stat.ML

    Conformal PID Control for Time Series Prediction

    Authors: Anastasios N. Angelopoulos, Emmanuel J. Candes, Ryan J. Tibshirani

    Abstract: We study the problem of uncertainty quantification for time series prediction, with the goal of providing easy-to-use algorithms with formal guarantees. The algorithms we present build upon ideas from conformal prediction and control theory, are able to prospectively model conformal scores in an online setting, and adapt to the presence of systematic errors due to seasonality, trends, and general…

    Submitted 31 July, 2023; originally announced July 2023.

    Comments: Code available at https://github.com/aangelopoulos/conformal-time-series

  9. arXiv:2306.09335  [pdf, other]

    stat.ML cs.CV cs.LG stat.ME

    Class-Conditional Conformal Prediction with Many Classes

    Authors: Tiffany Ding, Anastasios N. Angelopoulos, Stephen Bates, Michael I. Jordan, Ryan J. Tibshirani

    Abstract: Standard conformal prediction methods provide a marginal coverage guarantee, which means that for a random test point, the conformal prediction set contains the true label with a user-specified probability. In many classification problems, we would like to obtain a stronger guarantee--that for test points of a specific class, the prediction set contains the true label with the same user-chosen pro…

    Submitted 27 October, 2023; v1 submitted 15 June, 2023; originally announced June 2023.

  10. arXiv:2301.09633  [pdf, other]

    stat.ML cs.AI cs.LG q-bio.QM stat.ME

    Prediction-Powered Inference

    Authors: Anastasios N. Angelopoulos, Stephen Bates, Clara Fannjiang, Michael I. Jordan, Tijana Zrnic

    Abstract: Prediction-powered inference is a framework for performing valid statistical inference when an experimental dataset is supplemented with predictions from a machine-learning system. The framework yields simple algorithms for computing provably valid confidence intervals for quantities such as means, quantiles, and linear and logistic regression coefficients, without making any assumptions on the ma…

    Submitted 9 November, 2023; v1 submitted 23 January, 2023; originally announced January 2023.

    Comments: Code is available at https://github.com/aangelopoulos/ppi_py

  11. arXiv:2209.14295  [pdf, other]

    cs.LG cs.AI math.ST stat.ME stat.ML

    Label Noise Robustness of Conformal Prediction

    Authors: Bat-Sheva Einbinder, Shai Feldman, Stephen Bates, Anastasios N. Angelopoulos, Asaf Gendler, Yaniv Romano

    Abstract: We study the robustness of conformal prediction, a powerful tool for uncertainty quantification, to label noise. Our analysis tackles both regression and classification problems, characterizing when and how it is possible to construct uncertainty sets that correctly cover the unobserved noiseless ground truth labels. We further extend our theory and formulate the requirements for correctly control…

    Submitted 26 November, 2024; v1 submitted 28 September, 2022; originally announced September 2022.

  12. arXiv:2208.02814  [pdf, ps, other]

    stat.ME cs.AI cs.LG math.ST stat.ML

    Conformal Risk Control

    Authors: Anastasios N. Angelopoulos, Stephen Bates, Adam Fisch, Lihua Lei, Tal Schuster

    Abstract: We extend conformal prediction to control the expected value of any monotone loss function. The algorithm generalizes split conformal prediction together with its coverage guarantee. Like conformal prediction, the conformal risk control procedure is tight up to an $\mathcal{O}(1/n)$ factor. We also introduce extensions of the idea to distribution shift, quantile risk control, multiple and adversar…

    Submitted 13 June, 2025; v1 submitted 4 August, 2022; originally announced August 2022.

    Comments: Code available at https://github.com/aangelopoulos/conformal-risk

  13. arXiv:2207.10074  [pdf, other]

    cs.CV cs.AI cs.LG stat.ML

    Semantic uncertainty intervals for disentangled latent spaces

    Authors: Swami Sankaranarayanan, Anastasios N. Angelopoulos, Stephen Bates, Yaniv Romano, Phillip Isola

    Abstract: Meaningful uncertainty quantification in computer vision requires reasoning about semantic information -- say, the hair color of the person in a photo or the location of a car on the street. To this end, recent breakthroughs in generative modeling allow us to represent semantic information in disentangled latent spaces, but providing uncertainties on the semantic latent variables has remained chal…

    Submitted 30 November, 2022; v1 submitted 20 July, 2022; originally announced July 2022.

    Comments: Accepted to NeurIPS 2022. Project page: https://swamiviv.github.io/semantic_uncertainty_intervals/

  14. arXiv:2207.01609  [pdf, other]

    cs.IR cs.LG stat.ML

    Recommendation Systems with Distribution-Free Reliability Guarantees

    Authors: Anastasios N. Angelopoulos, Karl Krauth, Stephen Bates, Yixin Wang, Michael I. Jordan

    Abstract: When building recommendation systems, we seek to output a helpful set of items to the user. Under the hood, a ranking model predicts which of two candidate items is better, and we must distill these pairwise comparisons into the user-facing output. However, a learned ranking model is never perfect, so taking its predictions at face value gives no guarantee that the user-facing output is reliable.…

    Submitted 4 July, 2022; originally announced July 2022.

  15. arXiv:2202.05265  [pdf, other]

    cs.LG cs.CV eess.IV q-bio.QM stat.ML

    Image-to-Image Regression with Distribution-Free Uncertainty Quantification and Applications in Imaging

    Authors: Anastasios N Angelopoulos, Amit P Kohli, Stephen Bates, Michael I Jordan, Jitendra Malik, Thayer Alshaabi, Srigokul Upadhyayula, Yaniv Romano

    Abstract: Image-to-image regression is an important learning task, used frequently in biological imaging. Current algorithms, however, do not generally offer statistical guarantees that protect against a model's mistakes and hallucinations. To address this, we develop uncertainty quantification techniques with rigorous statistical guarantees for image-to-image regression problems. In particular, we show how…

    Submitted 10 February, 2022; originally announced February 2022.

    Comments: Code available at https://github.com/aangelopoulos/im2im-uq

  16. arXiv:2202.03613  [pdf, other]

    cs.LG q-bio.QM stat.ME

    Conformal Prediction Under Feedback Covariate Shift for Biomolecular Design

    Authors: Clara Fannjiang, Stephen Bates, Anastasios N. Angelopoulos, Jennifer Listgarten, Michael I. Jordan

    Abstract: Many applications of machine learning methods involve an iterative protocol in which data are collected, a model is trained, and then outputs of that model are used to choose what data to consider next. For example, one data-driven approach for designing proteins is to train a regression model to predict the fitness of protein sequences, then use it to propose new sequences believed to exhibit gre…

    Submitted 3 April, 2025; v1 submitted 7 February, 2022; originally announced February 2022.

    Comments: Code at https://github.com/clarafy/conformal-for-design. Updated title to match published version

    Journal ref: Proc. Natl. Acad. Sci. 119 (43) e2204569119 (2022)

  17. arXiv:2110.01052  [pdf, other]

    cs.LG cs.AI cs.CV stat.ME stat.ML

    Learn then Test: Calibrating Predictive Algorithms to Achieve Risk Control

    Authors: Anastasios N. Angelopoulos, Stephen Bates, Emmanuel J. Candès, Michael I. Jordan, Lihua Lei

    Abstract: We introduce a framework for calibrating machine learning models so that their predictions satisfy explicit, finite-sample statistical guarantees. Our calibration algorithms work with any underlying model and (unknown) data-generating distribution and do not require model refitting. The framework addresses, among other examples, false discovery rate control in multi-label classification, intersect…

    Submitted 29 September, 2022; v1 submitted 3 October, 2021; originally announced October 2021.

    Comments: Code available at https://github.com/aangelopoulos/ltt

  18. arXiv:2107.07511  [pdf, other]

    cs.LG cs.AI math.ST stat.ME stat.ML

    A Gentle Introduction to Conformal Prediction and Distribution-Free Uncertainty Quantification

    Authors: Anastasios N. Angelopoulos, Stephen Bates

    Abstract: Black-box machine learning models are now routinely used in high-risk settings, like medical diagnostics, which demand uncertainty quantification to avoid consequential model failures. Conformal prediction is a user-friendly paradigm for creating statistically rigorous uncertainty sets/intervals for the predictions of such models. Critically, the sets are valid in a distribution-free sense: they p…

    Submitted 7 December, 2022; v1 submitted 15 July, 2021; originally announced July 2021.

    Comments: Blog and tutorial video at http://angelopoulos.ai/blog/posts/gentle-intro/ ; Code is available at https://github.com/aangelopoulos/conformal-prediction

  19. arXiv:2102.06202  [pdf, other]

    cs.LG cs.AI cs.CR stat.ME stat.ML

    Private Prediction Sets

    Authors: Anastasios N. Angelopoulos, Stephen Bates, Tijana Zrnic, Michael I. Jordan

    Abstract: In real-world settings involving consequential decision-making, the deployment of machine learning systems generally requires both reliable uncertainty quantification and protection of individuals' privacy. We present a framework that treats these two desiderata jointly. Our framework is based on conformal prediction, a methodology that augments predictive models to return prediction sets that pro…

    Submitted 3 March, 2024; v1 submitted 11 February, 2021; originally announced February 2021.

    Comments: Code available at https://github.com/aangelopoulos/private_prediction_sets

    Journal ref: Harvard Data Science Review, 4(2). 2022