Skip to main content

Showing 1–8 of 8 results for author: Mixon, D G

Searching in archive stat. Search in all archives.
.
  1. arXiv:2211.15744  [pdf, other

    cs.LG cs.DS cs.IT math.OC math.ST stat.ML

    Sketch-and-solve approaches to k-means clustering by semidefinite programming

    Authors: Charles Clum, Dustin G. Mixon, Soledad Villar, Kaiying Xie

    Abstract: We introduce a sketch-and-solve approach to speed up the Peng-Wei semidefinite relaxation of k-means clustering. When the data is appropriately separated we identify the k-means optimal clustering. Otherwise, our approach provides a high-confidence lower bound on the optimal k-means value. This lower bound is data-driven; it does not make any assumption on the data nor how it is generated. We prov… ▽ More

    Submitted 28 November, 2022; originally announced November 2022.

  2. arXiv:2008.04278  [pdf, other

    cs.LG math.OC stat.ML

    Lie PCA: Density estimation for symmetric manifolds

    Authors: Jameson Cahill, Dustin G. Mixon, Hans Parshall

    Abstract: We introduce an extension to local principal component analysis for learning symmetric manifolds. In particular, we use a spectral method to approximate the Lie algebra corresponding to the symmetry group of the underlying manifold. We derive the sample complexity of our method for a variety of manifolds before applying it to various data sets for improved density estimation.

    Submitted 13 September, 2020; v1 submitted 10 August, 2020; originally announced August 2020.

  3. arXiv:1812.02768  [pdf, other

    stat.ML cs.LG math.OC

    SqueezeFit: Label-aware dimensionality reduction by semidefinite programming

    Authors: Culver McWhirter, Dustin G. Mixon, Soledad Villar

    Abstract: Given labeled points in a high-dimensional vector space, we seek a low-dimensional subspace such that projecting onto this subspace maintains some prescribed distance between points of differing labels. Intended applications include compressive classification. Taking inspiration from large margin nearest neighbor classification, this paper introduces a semidefinite relaxation of this problem. Unli… ▽ More

    Submitted 6 December, 2018; originally announced December 2018.

  4. arXiv:1803.09319  [pdf, other

    cs.LG stat.ML

    SUNLayer: Stable denoising with generative networks

    Authors: Dustin G. Mixon, Soledad Villar

    Abstract: It has been experimentally established that deep neural networks can be used to produce good generative models for real world data. It has also been established that such generative models can be exploited to solve classical inverse problems like compressed sensing and super resolution. In this work we focus on the classical signal processing problem of image denoising. We propose a theoretical se… ▽ More

    Submitted 25 March, 2018; originally announced March 2018.

  5. arXiv:1710.00956  [pdf, other

    stat.ML math.OC

    Monte Carlo approximation certificates for k-means clustering

    Authors: Dustin G. Mixon, Soledad Villar

    Abstract: Efficient algorithms for $k$-means clustering frequently converge to suboptimal partitions, and given a partition, it is difficult to detect $k$-means optimality. In this paper, we develop an a posteriori certifier of approximate optimality for $k$-means clustering. The certifier is a sub-linear Monte Carlo algorithm based on Peng and Wei's semidefinite relaxation of $k$-means. In particular, solv… ▽ More

    Submitted 2 October, 2017; originally announced October 2017.

    Comments: 8 pages

  6. arXiv:1602.06612  [pdf, other

    stat.ML cs.DS cs.IT cs.LG math.ST

    Clustering subgaussian mixtures by semidefinite programming

    Authors: Dustin G. Mixon, Soledad Villar, Rachel Ward

    Abstract: We introduce a model-free relax-and-round algorithm for k-means clustering based on a semidefinite relaxation due to Peng and Wei. The algorithm interprets the SDP output as a denoised version of the original data and then rounds this output to a hard clustering. We provide a generic method for proving performance guarantees for this algorithm, and we analyze the algorithm in the context of subgau… ▽ More

    Submitted 10 May, 2016; v1 submitted 21 February, 2016; originally announced February 2016.

  7. arXiv:1505.04778  [pdf, other

    cs.IT cs.DS cs.LG math.ST stat.ML

    On the tightness of an SDP relaxation of k-means

    Authors: Takayuki Iguchi, Dustin G. Mixon, Jesse Peterson, Soledad Villar

    Abstract: Recently, Awasthi et al. introduced an SDP relaxation of the $k$-means problem in $\mathbb R^m$. In this work, we consider a random model for the data points in which $k$ balls of unit radius are deterministically distributed throughout $\mathbb R^m$, and then in each ball, $n$ points are drawn according to a common rotationally invariant probability distribution. For any fixed ball configuration… ▽ More

    Submitted 18 May, 2015; originally announced May 2015.

  8. arXiv:1210.2440  [pdf, ps, other

    math.ST cs.IT stat.ML

    Group Model Selection Using Marginal Correlations: The Good, the Bad and the Ugly

    Authors: Waheed U. Bajwa, Dustin G. Mixon

    Abstract: Group model selection is the problem of determining a small subset of groups of predictors (e.g., the expression data of genes) that are responsible for majority of the variation in a response variable (e.g., the malignancy of a tumor). This paper focuses on group model selection in high-dimensional linear models, in which the number of predictors far exceeds the number of samples of the response… ▽ More

    Submitted 8 October, 2012; originally announced October 2012.

    Comments: Accepted for publication in Proc. 50th Annu. Allerton Conf. Communication, Control, and Computing, Monticello, IL, Oct. 1-5, 2012; 8 pages and 4 figures

    Journal ref: Proc. 50th Annu. Allerton Conf. Communication, Control, and Computing, Monticello, IL, Oct. 1-5, 2012, pp. 494-501