Skip to main content

Showing 1–15 of 15 results for author: Kulis, B

Searching in archive stat. Search in all archives.
.
  1. arXiv:2111.01348  [pdf, other

    stat.ML cs.LG

    Faster Algorithms for Learning Convex Functions

    Authors: Ali Siahkamari, Durmus Alp Emre Acar, Christopher Liao, Kelly Geyer, Venkatesh Saligrama, Brian Kulis

    Abstract: The task of approximating an arbitrary convex function arises in several learning problems such as convex regression, learning with a difference of convex (DC) functions, and learning Bregman or $f$-divergences. In this paper, we develop and analyze an approach for solving a broad range of convex function learning problems that is faster than state-of-the-art approaches. Our approach is based on a… ▽ More

    Submitted 19 June, 2022; v1 submitted 1 November, 2021; originally announced November 2021.

    Comments: 21 pages, 3 figures. Proceedings of the 39 th International Conference on Machine Learning, Baltimore, Maryland, USA, PMLR 162, 2022. Copy- right 2022 by the author(s)

  2. arXiv:2007.02422  [pdf, ps, other

    stat.ML cs.LG

    Piecewise Linear Regression via a Difference of Convex Functions

    Authors: Ali Siahkamari, Aditya Gangrade, Brian Kulis, Venkatesh Saligrama

    Abstract: We present a new piecewise linear regression methodology that utilizes fitting a difference of convex functions (DC functions) to the data. These are functions $f$ that may be represented as the difference $φ_1 - φ_2$ for a choice of convex functions $φ_1, φ_2$. The method proceeds by estimating piecewise-liner convex functions, in a manner similar to max-affine regression, whose difference approx… ▽ More

    Submitted 13 November, 2020; v1 submitted 5 July, 2020; originally announced July 2020.

    Comments: Published in International Conference on Machine Learning (ICML2020) Proceedings

  3. arXiv:2005.02612  [pdf, other

    cs.LG stat.ML

    Deep Divergence Learning

    Authors: Kubra Cilingir, Rachel Manzelli, Brian Kulis

    Abstract: Classical linear metric learning methods have recently been extended along two distinct lines: deep metric learning methods for learning embeddings of the data using neural networks, and Bregman divergence learning approaches for extending learning Euclidean distances to more general divergence measures such as divergences over distributions. In this paper, we introduce deep Bregman divergences, w… ▽ More

    Submitted 6 May, 2020; originally announced May 2020.

    Comments: Under review

  4. arXiv:1908.07116  [pdf, other

    cs.LG cs.CV stat.ML

    Protecting Neural Networks with Hierarchical Random Switching: Towards Better Robustness-Accuracy Trade-off for Stochastic Defenses

    Authors: Xiao Wang, Siyue Wang, Pin-Yu Chen, Yanzhi Wang, Brian Kulis, Xue Lin, Peter Chin

    Abstract: Despite achieving remarkable success in various domains, recent studies have uncovered the vulnerability of deep neural networks to adversarial perturbations, creating concerns on model generalizability and new threats such as prediction-evasive misclassification or stealthy reprogramming. Among different defense proposals, stochastic network defenses such as random neuron activation pruning or ra… ▽ More

    Submitted 19 August, 2019; originally announced August 2019.

    Comments: Published as Conference Paper @ IJCAI 2019

  5. arXiv:1905.11545  [pdf, other

    stat.ML cs.LG

    Learning to Approximate a Bregman Divergence

    Authors: Ali Siahkamari, Xide Xia, Venkatesh Saligrama, David Castanon, Brian Kulis

    Abstract: Bregman divergences generalize measures such as the squared Euclidean distance and the KL divergence, and arise throughout many areas of machine learning. In this paper, we focus on the problem of approximating an arbitrary Bregman divergence from supervision, and we provide a well-principled approach to analyzing such approximations. We develop a formulation and algorithm for learning arbitrary B… ▽ More

    Submitted 2 November, 2020; v1 submitted 27 May, 2019; originally announced May 2019.

    Comments: 19 pages, 4 figures

    Journal ref: Proceedings of the 34th Conference on Neural Information Processing Systems (NeurIPS 2020), Vancouver, Canada

  6. arXiv:1806.09905  [pdf, other

    cs.SD cs.LG eess.AS stat.ML

    Conditioning Deep Generative Raw Audio Models for Structured Automatic Music

    Authors: Rachel Manzelli, Vijay Thakkar, Ali Siahkamari, Brian Kulis

    Abstract: Existing automatic music generation approaches that feature deep learning can be broadly classified into two types: raw audio models and symbolic models. Symbolic models, which train and generate at the note level, are currently the more prevalent approach; these models can capture long-range dependencies of melodic structure, but fail to grasp the nuances and richness of raw audio generations. Ra… ▽ More

    Submitted 26 June, 2018; originally announced June 2018.

    Comments: Presented at the ISMIR 2018 Conference

  7. arXiv:1707.08493  [pdf, other

    stat.ML

    Dynamic Clustering Algorithms via Small-Variance Analysis of Markov Chain Mixture Models

    Authors: Trevor Campbell, Brian Kulis, Jonathan How

    Abstract: Bayesian nonparametrics are a class of probabilistic models in which the model size is inferred from data. A recently developed methodology in this field is small-variance asymptotic analysis, a mathematical technique for deriving learning algorithms that capture much of the flexibility of Bayesian nonparametric inference algorithms, but are simpler to implement and less computationally expensive.… ▽ More

    Submitted 26 July, 2017; originally announced July 2017.

    Comments: 27 pages

  8. arXiv:1604.02027  [pdf, other

    cs.LG cs.CL stat.ML

    Combinatorial Topic Models using Small-Variance Asymptotics

    Authors: Ke Jiang, Suvrit Sra, Brian Kulis

    Abstract: Topic models have emerged as fundamental tools in unsupervised machine learning. Most modern topic modeling algorithms take a probabilistic view and derive inference algorithms based on Latent Dirichlet Allocation (LDA) or its variants. In contrast, we study topic modeling as a combinatorial optimization problem, and propose a new objective function derived from LDA by passing to the small-varianc… ▽ More

    Submitted 26 May, 2016; v1 submitted 7 April, 2016; originally announced April 2016.

    Comments: 19 pages

  9. arXiv:1601.02257  [pdf, other

    cs.LG stat.ML

    A Sufficient Statistics Construction of Bayesian Nonparametric Exponential Family Conjugate Models

    Authors: Robert Finn, Brian Kulis

    Abstract: Conjugate pairs of distributions over infinite dimensional spaces are prominent in statistical learning theory, particularly due to the widespread adoption of Bayesian nonparametric methodologies for a host of models and applications. Much of the existing literature in the learning community focuses on processes possessing some form of computationally tractable conjugacy as is the case for the bet… ▽ More

    Submitted 10 January, 2016; originally announced January 2016.

  10. arXiv:1411.4199  [pdf, ps, other

    cs.CV cs.LG stat.ML

    Revisiting Kernelized Locality-Sensitive Hashing for Improved Large-Scale Image Retrieval

    Authors: Ke Jiang, Qichao Que, Brian Kulis

    Abstract: We present a simple but powerful reinterpretation of kernelized locality-sensitive hashing (KLSH), a general and popular method developed in the vision community for performing approximate nearest-neighbor searches in an arbitrary reproducing kernel Hilbert space (RKHS). Our new perspective is based on viewing the steps of the KLSH algorithm in an appropriately projected space, and has several key… ▽ More

    Submitted 15 November, 2014; originally announced November 2014.

    Comments: 15 pages

  11. arXiv:1411.1971  [pdf, other

    cs.CV cs.LG stat.ML

    Power-Law Graph Cuts

    Authors: Xiangyang Zhou, Jiaxin Zhang, Brian Kulis

    Abstract: Algorithms based on spectral graph cut objectives such as normalized cuts, ratio cuts and ratio association have become popular in recent years because they are widely applicable and simple to implement via standard eigenvector computations. Despite strong performance for a number of clustering tasks, spectral graph cut algorithms still suffer from several limitations: first, they require the numb… ▽ More

    Submitted 25 November, 2014; v1 submitted 29 October, 2014; originally announced November 2014.

  12. arXiv:1410.1068  [pdf, other

    stat.ML cs.AI cs.LG

    Gamma Processes, Stick-Breaking, and Variational Inference

    Authors: Anirban Roychowdhury, Brian Kulis

    Abstract: While most Bayesian nonparametric models in machine learning have focused on the Dirichlet process, the beta process, or their variants, the gamma process has recently emerged as a useful nonparametric prior in its own right. Current inference schemes for models involving the gamma process are restricted to MCMC-based methods, which limits their scalability. In this paper, we present a variational… ▽ More

    Submitted 4 October, 2014; originally announced October 2014.

  13. arXiv:1305.6659  [pdf, other

    cs.LG stat.ML

    Dynamic Clustering via Asymptotics of the Dependent Dirichlet Process Mixture

    Authors: Trevor Campbell, Miao Liu, Brian Kulis, Jonathan P. How, Lawrence Carin

    Abstract: This paper presents a novel algorithm, based upon the dependent Dirichlet process mixture model (DDPMM), for clustering batch-sequential data containing an unknown number of evolving clusters. The algorithm is derived via a low-variance asymptotic analysis of the Gibbs sampling algorithm for the DDPMM, and provides a hard clustering with convergence guarantees similar to those of the k-means algor… ▽ More

    Submitted 1 November, 2013; v1 submitted 28 May, 2013; originally announced May 2013.

    Comments: This paper is from NIPS 2013. Please use the following BibTeX citation: @inproceedings{Campbell13_NIPS, Author = {Trevor Campbell and Miao Liu and Brian Kulis and Jonathan P. How and Lawrence Carin}, Title = {Dynamic Clustering via Asymptotics of the Dependent Dirichlet Process}, Booktitle = {Advances in Neural Information Processing Systems (NIPS)}, Year = {2013}}

  14. arXiv:1212.2126  [pdf, other

    stat.ML

    MAD-Bayes: MAP-based Asymptotic Derivations from Bayes

    Authors: Tamara Broderick, Brian Kulis, Michael I. Jordan

    Abstract: The classical mixture of Gaussians model is related to K-means via small-variance asymptotics: as the covariances of the Gaussians tend to zero, the negative log-likelihood of the mixture of Gaussians model approaches the K-means objective, and the EM algorithm approaches the K-means algorithm. Kulis & Jordan (2012) used this observation to obtain a novel K-means-like algorithm from a Gibbs sample… ▽ More

    Submitted 15 February, 2013; v1 submitted 10 December, 2012; originally announced December 2012.

    Comments: 13 pages, 3 figures

  15. arXiv:1111.0352  [pdf, other

    cs.LG stat.ML

    Revisiting k-means: New Algorithms via Bayesian Nonparametrics

    Authors: Brian Kulis, Michael I. Jordan

    Abstract: Bayesian models offer great flexibility for clustering applications---Bayesian nonparametrics can be used for modeling infinite mixtures, and hierarchical Bayesian models can be utilized for sharing clusters across multiple data sets. For the most part, such flexibility is lacking in classical clustering methods such as k-means. In this paper, we revisit the k-means clustering algorithm from a Bay… ▽ More

    Submitted 14 June, 2012; v1 submitted 1 November, 2011; originally announced November 2011.

    Comments: 14 pages. Updated based on the corresponding ICML paper