Skip to main content

Showing 1–6 of 6 results for author: Roelofs, R

Searching in archive stat. Search in all archives.
.
  1. arXiv:2012.08668  [pdf, other

    cs.LG cs.AI cs.CV stat.ML

    Mitigating Bias in Calibration Error Estimation

    Authors: Rebecca Roelofs, Nicholas Cain, Jonathon Shlens, Michael C. Mozer

    Abstract: For an AI system to be reliable, the confidence it expresses in its decisions must match its accuracy. To assess the degree of match, examples are typically binned by confidence and the per-bin mean confidence and accuracy are compared. Most research in calibration focuses on techniques to reduce this empirical measure of calibration error, ECE_bin. We instead focus on assessing statistical bias i… ▽ More

    Submitted 10 February, 2022; v1 submitted 15 December, 2020; originally announced December 2020.

    Comments: To be published in AISTATS 2022. Code is available https://github.com/google-research/google-research/tree/master/caltrain

  2. arXiv:1906.02168  [pdf, other

    cs.LG cs.CV stat.ML

    Do Image Classifiers Generalize Across Time?

    Authors: Vaishaal Shankar, Achal Dave, Rebecca Roelofs, Deva Ramanan, Benjamin Recht, Ludwig Schmidt

    Abstract: We study the robustness of image classifiers to temporal perturbations derived from videos. As part of this study, we construct two datasets, ImageNet-Vid-Robust and YTBB-Robust , containing a total 57,897 images grouped into 3,139 sets of perceptually similar images. Our datasets were derived from ImageNet-Vid and Youtube-BB respectively and thoroughly re-annotated by human experts for image simi… ▽ More

    Submitted 9 December, 2019; v1 submitted 5 June, 2019; originally announced June 2019.

    Comments: 23 pages, 11 tables, 11 figures. Paper Website: https://modestyachts.github.io/natural-perturbations-website/

  3. arXiv:1902.10811  [pdf, other

    cs.CV cs.LG stat.ML

    Do ImageNet Classifiers Generalize to ImageNet?

    Authors: Benjamin Recht, Rebecca Roelofs, Ludwig Schmidt, Vaishaal Shankar

    Abstract: We build new test sets for the CIFAR-10 and ImageNet datasets. Both benchmarks have been the focus of intense research for almost a decade, raising the danger of overfitting to excessively re-used test sets. By closely following the original dataset creation processes, we test to what extent current classification models generalize to new data. We evaluate a broad range of models and find accuracy… ▽ More

    Submitted 12 June, 2019; v1 submitted 13 February, 2019; originally announced February 2019.

  4. arXiv:1806.00451  [pdf, other

    cs.LG stat.ML

    Do CIFAR-10 Classifiers Generalize to CIFAR-10?

    Authors: Benjamin Recht, Rebecca Roelofs, Ludwig Schmidt, Vaishaal Shankar

    Abstract: Machine learning is currently dominated by largely experimental work focused on improvements in a few key tasks. However, the impressive accuracy numbers of the best performing models are questionable because the same test sets have been used to select these models for multiple years now. To understand the danger of overfitting, we measure the accuracy of CIFAR-10 classifiers by creating a new tes… ▽ More

    Submitted 1 June, 2018; originally announced June 2018.

  5. arXiv:1705.08292  [pdf, other

    stat.ML cs.LG

    The Marginal Value of Adaptive Gradient Methods in Machine Learning

    Authors: Ashia C. Wilson, Rebecca Roelofs, Mitchell Stern, Nathan Srebro, Benjamin Recht

    Abstract: Adaptive optimization methods, which perform local optimization with a metric constructed from the history of iterates, are becoming increasingly popular for training deep neural networks. Examples include AdaGrad, RMSProp, and Adam. We show that for simple overparameterized problems, adaptive methods often find drastically different solutions than gradient descent (GD) or stochastic gradient desc… ▽ More

    Submitted 21 May, 2018; v1 submitted 23 May, 2017; originally announced May 2017.

  6. arXiv:1602.05310  [pdf, other

    cs.LG math.OC stat.ML

    Large Scale Kernel Learning using Block Coordinate Descent

    Authors: Stephen Tu, Rebecca Roelofs, Shivaram Venkataraman, Benjamin Recht

    Abstract: We demonstrate that distributed block coordinate descent can quickly solve kernel regression and classification problems with millions of data points. Armed with this capability, we conduct a thorough comparison between the full kernel, the Nyström method, and random features on three large classification tasks from various domains. Our results suggest that the Nyström method generally achieves be… ▽ More

    Submitted 17 February, 2016; originally announced February 2016.