Skip to main content

Showing 1–4 of 4 results for author: Chidambaram, M

Searching in archive stat. Search in all archives.
.
  1. arXiv:2409.13074  [pdf, other

    cs.LG cs.CV stat.ML

    What does guidance do? A fine-grained analysis in a simple setting

    Authors: Muthu Chidambaram, Khashayar Gatmiry, Sitan Chen, Holden Lee, Jianfeng Lu

    Abstract: The use of guidance in diffusion models was originally motivated by the premise that the guidance-modified score is that of the data distribution tilted by a conditional likelihood raised to some power. In this work we clarify this misconception by rigorously proving that guidance fails to sample from the intended tilted distribution. Our main result is to give a fine-grained characterization of… ▽ More

    Submitted 19 September, 2024; originally announced September 2024.

  2. arXiv:2406.04068  [pdf, other

    cs.LG math.ST stat.ML

    Reassessing How to Compare and Improve the Calibration of Machine Learning Models

    Authors: Muthu Chidambaram, Rong Ge

    Abstract: A machine learning model is calibrated if its predicted probability for an outcome matches the observed frequency for that outcome conditional on the model prediction. This property has become increasingly important as the impact of machine learning models has continued to spread to various domains. As a result, there are now a dizzying number of recent papers on measuring and improving the calibr… ▽ More

    Submitted 23 February, 2025; v1 submitted 6 June, 2024; originally announced June 2024.

    Comments: ICLR 2025, 29 pages, 14 figures

  3. arXiv:2306.00740  [pdf, other

    cs.LG stat.ML

    On the Limitations of Temperature Scaling for Distributions with Overlaps

    Authors: Muthu Chidambaram, Rong Ge

    Abstract: Despite the impressive generalization capabilities of deep neural networks, they have been repeatedly shown to be overconfident when they are wrong. Fixing this issue is known as model calibration, and has consequently received much attention in the form of modified training schemes and post-training calibration procedures such as temperature scaling. While temperature scaling is frequently used b… ▽ More

    Submitted 13 February, 2024; v1 submitted 1 June, 2023; originally announced June 2023.

    Comments: 27 pages, 9 Figures, published in ICLR 2024

  4. arXiv:2210.13512  [pdf, other

    cs.LG cs.AI cs.CV math.OC stat.ML

    Provably Learning Diverse Features in Multi-View Data with Midpoint Mixup

    Authors: Muthu Chidambaram, Xiang Wang, Chenwei Wu, Rong Ge

    Abstract: Mixup is a data augmentation technique that relies on training using random convex combinations of data points and their labels. In recent years, Mixup has become a standard primitive used in the training of state-of-the-art image classification models due to its demonstrated benefits over empirical risk minimization with regards to generalization and robustness. In this work, we try to explain so… ▽ More

    Submitted 4 November, 2024; v1 submitted 24 October, 2022; originally announced October 2022.

    Comments: 37 pages, 2 figures, ICML 2023, minor corrections in latest version