Showing 1–9 of 9 results for author: Scott, C D

  1. arXiv:2302.07321

    stat.ML cs.LG

    On Classification-Calibration of Gamma-Phi Losses

    Authors: Yutong Wang, Clayton D. Scott

    Abstract: Gamma-Phi losses constitute a family of multiclass classification loss functions that generalize the logistic and other common losses, and have found application in the boosting literature. We establish the first general sufficient condition for the classification-calibration (CC) of such losses. To our knowledge, this sufficient condition gives the first family of nonconvex multiclass surrogate l…

    Submitted 12 December, 2023; v1 submitted 14 February, 2023; originally announced February 2023.

    Comments: Appeared in COLT 2023

  2. arXiv:2205.09342

    stat.ML cs.LG

    Consistent Interpolating Ensembles via the Manifold-Hilbert Kernel

    Authors: Yutong Wang, Clayton D. Scott

    Abstract: Recent research in the theory of overparametrized learning has sought to establish generalization guarantees in the interpolating regime. Such results have been established for a few common classes of methods, but so far not for ensemble methods. We devise an ensemble classification method that simultaneously interpolates the training data, and is consistent for a broad class of data distributions…

    Submitted 19 May, 2022; originally announced May 2022.

  3. arXiv:2110.02456

    stat.ML cs.LG

    VC dimension of partially quantized neural networks in the overparametrized regime

    Authors: Yutong Wang, Clayton D. Scott

    Abstract: Vapnik-Chervonenkis (VC) theory has so far been unable to explain the small generalization error of overparametrized neural networks. Indeed, existing applications of VC theory to large networks obtain upper bounds on VC dimension that are proportional to the number of weights, and for a large class of networks, these upper bounds are known to be tight. In this work, we focus on a class of partiall…

    Submitted 5 October, 2021; originally announced October 2021.

  4. arXiv:2102.05640

    stat.ML cs.LG

    An Exact Solver for the Weston-Watkins SVM Subproblem

    Authors: Yutong Wang, Clayton D. Scott

    Abstract: Recent empirical evidence suggests that the Weston-Watkins support vector machine is among the best performing multiclass extensions of the binary SVM. Current state-of-the-art solvers repeatedly solve a particular subproblem approximately using an iterative strategy. In this work, we propose an algorithm that solves the subproblem exactly using a novel reparametrization of the Weston-Watkins dual…

    Submitted 7 June, 2021; v1 submitted 10 February, 2021; originally announced February 2021.

    Comments: ICML 2021

  5. arXiv:2006.07346

    stat.ML cs.LG math.OC

    Weston-Watkins Hinge Loss and Ordered Partitions

    Authors: Yutong Wang, Clayton D. Scott

    Abstract: Multiclass extensions of the support vector machine (SVM) have been formulated in a variety of ways. A recent empirical comparison of nine such formulations [Doǧan et al. 2016] recommends the variant proposed by Weston and Watkins (WW), despite the fact that the WW-hinge loss is not calibrated with respect to the 0-1 loss. In this work we introduce a novel discrete loss function for multiclass cla…

    Submitted 12 June, 2020; originally announced June 2020.

    Comments: 38 pages, 3 figures

  6. arXiv:1607.00071

    stat.ML math.ST

    An Operator Theoretic Approach to Nonparametric Mixture Models

    Authors: Robert A. Vandermeulen, Clayton D. Scott

    Abstract: When estimating finite mixture models, it is common to make assumptions on the mixture components, such as parametric assumptions. In this work, we make no distributional assumptions on the mixture components and instead assume that observations from the mixture model are grouped, such that observations in the same group are known to be drawn from the same mixture component. We precisely character…

    Submitted 12 October, 2016; v1 submitted 30 June, 2016; originally announced July 2016.

    Comments: Contains and greatly extends the results from our previous work, arXiv:1502.06644, and thus contains some overlap with that work. This version contains some small grammatical and technical corrections as well as some changes for improved clarity

  7. arXiv:1502.06644

    stat.ML cs.LG math.ST

    On The Identifiability of Mixture Models from Grouped Samples

    Authors: Robert A. Vandermeulen, Clayton D. Scott

    Abstract: Finite mixture models are statistical models which appear in many problems in statistics and machine learning. In such models it is assumed that data are drawn from random probability measures, called mixture components, which are themselves drawn from a probability measure P over probability measures. When estimating mixture models, it is common to make assumptions on the mixture components, such…

    Submitted 2 April, 2022; v1 submitted 23 February, 2015; originally announced February 2015.

    Comments: The work was subsumed and expanded upon in our Annals of Statistics publication "An Operator Theoretic Approach to Nonparametric Mixture Models."

  8. arXiv:1411.4378

    stat.ML

    Robust Kernel Density Estimation by Scaling and Projection in Hilbert Space

    Authors: Robert A. Vandermeulen, Clayton D. Scott

    Abstract: While robust parameter estimation has been well studied in parametric density estimation, there has been little investigation into robust density estimation in the nonparametric setting. We present a robust version of the popular kernel density estimator (KDE). As with other estimators, a robust version of the KDE is useful since sample contamination is a common issue with datasets. What "robustne…

    Submitted 17 November, 2014; originally announced November 2014.

    Comments: Extended version of NIPS 2014 paper

  9. arXiv:1107.3133

    stat.ML cs.LG stat.ME

    Robust Kernel Density Estimation

    Authors: JooSeuk Kim, Clayton D. Scott

    Abstract: We propose a method for nonparametric density estimation that exhibits robustness to contamination of the training sample. This method achieves robustness by combining a traditional kernel density estimator (KDE) with ideas from classical $M$-estimation. We interpret the KDE based on a radial, positive semi-definite kernel as a sample mean in the associated reproducing kernel Hilbert space. Since…

    Submitted 5 September, 2011; v1 submitted 15 July, 2011; originally announced July 2011.