Skip to main content

Showing 1–17 of 17 results for author: Jebara, T

Searching in archive stat. Search in all archives.
.
  1. arXiv:1906.06419  [pdf, other

    cs.LG stat.ML

    Learning Correlated Latent Representations with Adaptive Priors

    Authors: Da Tang, Dawen Liang, Nicholas Ruozzi, Tony Jebara

    Abstract: Variational Auto-Encoders (VAEs) have been widely applied for learning compact, low-dimensional latent representations of high-dimensional data. When the correlation structure among data points is available, previous work proposed Correlated Variational Auto-Encoders (CVAEs), which employ a structured mixture model as prior and a structured variational posterior for each mixture component to enfor… ▽ More

    Submitted 18 December, 2019; v1 submitted 14 June, 2019; originally announced June 2019.

    Comments: 16 pages, 1 figure, 5 tables

  2. arXiv:1905.12052  [pdf, other

    cs.LG stat.ML

    A New Distribution on the Simplex with Auto-Encoding Applications

    Authors: Andrew Stirn, Tony Jebara, David A Knowles

    Abstract: We construct a new distribution for the simplex using the Kumaraswamy distribution and an ordered stick-breaking process. We explore and develop the theoretical properties of this new distribution and prove that it exhibits symmetry under the same conditions as the well-known Dirichlet. Like the Dirichlet, the new distribution is adept at capturing sparsity but, unlike the Dirichlet, has an exact… ▽ More

    Submitted 14 December, 2019; v1 submitted 28 May, 2019; originally announced May 2019.

    Comments: 15 pages, 6 figures, 1 tables

  3. arXiv:1905.05335  [pdf, other

    cs.LG stat.ML

    Correlated Variational Auto-Encoders

    Authors: Da Tang, Dawen Liang, Tony Jebara, Nicholas Ruozzi

    Abstract: Variational Auto-Encoders (VAEs) are capable of learning latent representations for high dimensional data. However, due to the i.i.d. assumption, VAEs only optimize the singleton variational distributions and fail to account for the correlations between data points, which might be crucial for learning latent representations from dataset where a priori we know correlations exist. We propose Correla… ▽ More

    Submitted 17 April, 2020; v1 submitted 13 May, 2019; originally announced May 2019.

    Comments: International Conference on Machine Learning (ICML), 2019

  4. arXiv:1905.03818  [pdf, other

    cs.LG stat.ML

    Beta Survival Models

    Authors: David Hubbard, Benoit Rostykus, Yves Raimond, Tony Jebara

    Abstract: This article analyzes the problem of estimating the time until an event occurs, also known as survival modeling. We observe through substantial experiments on large real-world datasets and use-cases that populations are largely heterogeneous. Sub-populations have different mean and variance in their survival rates requiring flexible models that capture heterogeneity. We leverage a classical extens… ▽ More

    Submitted 9 May, 2019; originally announced May 2019.

    Comments: 11 pages, 9 figures

  5. arXiv:1812.00856  [pdf, other

    cs.LG stat.ML

    Thompson Sampling for Noncompliant Bandits

    Authors: Andrew Stirn, Tony Jebara

    Abstract: Thompson sampling, a Bayesian method for balancing exploration and exploitation in bandit problems, has theoretical guarantees and exhibits strong empirical performance in many domains. Traditional Thompson sampling, however, assumes perfect compliance, where an agent's chosen action is treated as the implemented action. This article introduces a stochastic noncompliance model that relaxes this as… ▽ More

    Submitted 3 December, 2018; originally announced December 2018.

    Comments: 21 pages, 5 figures

  6. arXiv:1807.06651  [pdf, other

    stat.ML cs.IR cs.LG

    Item Recommendation with Variational Autoencoders and Heterogenous Priors

    Authors: Giannis Karamanolakis, Kevin Raji Cherian, Ananth Ravi Narayan, Jie Yuan, Da Tang, Tony Jebara

    Abstract: In recent years, Variational Autoencoders (VAEs) have been shown to be highly effective in both standard collaborative filtering applications and extensions such as incorporation of implicit feedback. We extend VAEs to collaborative filtering with side information, for instance when ratings are combined with explicit text feedback from the user. Instead of using a user-agnostic standard Gaussian p… ▽ More

    Submitted 6 October, 2018; v1 submitted 17 July, 2018; originally announced July 2018.

    Comments: Accepted for the 3rd Workshop on Deep Learning for Recommender Systems (DLRS 2018), held in conjunction with the 12th ACM Conference on Recommender Systems (RecSys 2018) in Vancouver, Canada

  7. arXiv:1802.05814  [pdf, other

    stat.ML cs.IR cs.LG

    Variational Autoencoders for Collaborative Filtering

    Authors: Dawen Liang, Rahul G. Krishnan, Matthew D. Hoffman, Tony Jebara

    Abstract: We extend variational autoencoders (VAEs) to collaborative filtering for implicit feedback. This non-linear probabilistic model enables us to go beyond the limited modeling capacity of linear factor models which still largely dominate collaborative filtering research.We introduce a generative model with multinomial likelihood and use Bayesian inference for parameter estimation. Despite widespread… ▽ More

    Submitted 15 February, 2018; originally announced February 2018.

    Comments: 10 pages, 3 figures. WWW 2018

  8. arXiv:1611.00838  [pdf, other

    stat.ML cs.CV cs.LG

    Initialization and Coordinate Optimization for Multi-way Matching

    Authors: Da Tang, Tony Jebara

    Abstract: We consider the problem of consistently matching multiple sets of elements to each other, which is a common task in fields such as computer vision. To solve the underlying NP-hard objective, existing methods often relax or approximate it, but end up with unsatisfying empirical performance due to a misaligned objective. We propose a coordinate update algorithm that directly optimizes the target obj… ▽ More

    Submitted 18 July, 2019; v1 submitted 2 November, 2016; originally announced November 2016.

    Comments: Artificial Intelligence and Statistics (AISTATS), 2017

  9. arXiv:1610.07797  [pdf, other

    math.OC cs.LG stat.ML

    Frank-Wolfe Algorithms for Saddle Point Problems

    Authors: Gauthier Gidel, Tony Jebara, Simon Lacoste-Julien

    Abstract: We extend the Frank-Wolfe (FW) optimization algorithm to solve constrained smooth convex-concave saddle point (SP) problems. Remarkably, the method only requires access to linear minimization oracles. Leveraging recent advances in FW optimization, we provide the first proof of convergence of a FW-type saddle point solver over polytopes, thereby partially answering a 30 year-old conjecture. We also… ▽ More

    Submitted 3 March, 2017; v1 submitted 25 October, 2016; originally announced October 2016.

    Comments: Appears in: Proceedings of the 20th International Conference on Artificial Intelligence and Statistics (AISTATS 2017). 39 pages

    MSC Class: 90C52; 90C90; 68T05 ACM Class: G.1.6; I.2.6

  10. arXiv:1503.01228  [pdf, other

    cs.LG cs.CV stat.ML

    Bethe Learning of Conditional Random Fields via MAP Decoding

    Authors: Kui Tang, Nicholas Ruozzi, David Belanger, Tony Jebara

    Abstract: Many machine learning tasks can be formulated in terms of predicting structured outputs. In frameworks such as the structured support vector machine (SVM-Struct) and the structured perceptron, discriminative functions are learned by iteratively applying efficient maximum a posteriori (MAP) decoding. However, maximum likelihood estimation (MLE) of probabilistic models over these same structured spa… ▽ More

    Submitted 4 March, 2015; originally announced March 2015.

    Comments: 19 pages (9 supplementary), 10 figures (3 supplementary)

  11. arXiv:1402.5902  [pdf, ps, other

    stat.ML cs.LG

    On Learning from Label Proportions

    Authors: Felix X. Yu, Krzysztof Choromanski, Sanjiv Kumar, Tony Jebara, Shih-Fu Chang

    Abstract: Learning from Label Proportions (LLP) is a learning setting, where the training data is provided in groups, or "bags", and only the proportion of each class in each bag is known. The task is to learn a model to predict the class labels of the individual instances. LLP has broad applications in political science, marketing, healthcare, and computer vision. This work answers the fundamental question… ▽ More

    Submitted 11 February, 2015; v1 submitted 24 February, 2014; originally announced February 2014.

  12. arXiv:1309.1369  [pdf, other

    stat.ML cs.LG math.NA stat.CO

    Semistochastic Quadratic Bound Methods

    Authors: Aleksandr Y. Aravkin, Anna Choromanska, Tony Jebara, Dimitri Kanevsky

    Abstract: Partition functions arise in a variety of settings, including conditional random fields, logistic regression, and latent gaussian models. In this paper, we consider semistochastic quadratic bound (SQB) methods for maximum likelihood inference based on partition function optimization. Batch methods based on the quadratic bound were recently proposed for this class of problems, and performed favorab… ▽ More

    Submitted 17 February, 2014; v1 submitted 5 September, 2013; originally announced September 2013.

    Comments: 11 pages, 1 figure

    MSC Class: 90C55; 90C15; 62H30

  13. arXiv:1306.0886  [pdf, other

    cs.LG stat.ML

    $\propto$SVM for learning with label proportions

    Authors: Felix X. Yu, Dong Liu, Sanjiv Kumar, Tony Jebara, Shih-Fu Chang

    Abstract: We study the problem of learning with label proportions in which the training data is provided in groups and only the proportion of each class in each group is known. We propose a new method called proportion-SVM, or $\propto$SVM, which explicitly models the latent unknown instance labels together with the known group label proportions in a large-margin framework. Unlike the existing works, our ap… ▽ More

    Submitted 4 June, 2013; originally announced June 2013.

    Comments: Appears in Proceedings of the 30th International Conference on Machine Learning (ICML 2013)

  14. arXiv:1301.3865  [pdf

    cs.LG stat.ML

    Feature Selection and Dualities in Maximum Entropy Discrimination

    Authors: Tony S. Jebara, Tommi S. Jaakkola

    Abstract: Incorporating feature selection into a classification or regression method often carries a number of advantages. In this paper we formalize feature selection specifically from a discriminative perspective of improving classification/regression accuracy. The feature selection method is developed as an extension to the recently proposed maximum entropy discrimination (MED) framework. We describe MED… ▽ More

    Submitted 16 January, 2013; originally announced January 2013.

    Comments: Appears in Proceedings of the Sixteenth Conference on Uncertainty in Artificial Intelligence (UAI2000)

    Report number: UAI-P-2000-PG-291-300

  15. arXiv:1301.0015  [pdf, ps, other

    cs.LG stat.ML

    Bethe Bounds and Approximating the Global Optimum

    Authors: Adrian Weller, Tony Jebara

    Abstract: Inference in general Markov random fields (MRFs) is NP-hard, though identifying the maximum a posteriori (MAP) configuration of pairwise MRFs with submodular cost functions is efficiently solvable using graph cuts. Marginal inference, however, even for this restricted class, is in #P. We prove new formulations of derivatives of the Bethe free energy, provide bounds on the derivatives and bracket t… ▽ More

    Submitted 31 December, 2012; originally announced January 2013.

  16. arXiv:1207.4148  [pdf

    cs.LG stat.ML

    Dynamical Systems Trees

    Authors: Andrew Howard, Tony S. Jebara

    Abstract: We propose dynamical systems trees (DSTs) as a flexible class of models for describing multiple processes that interact via a hierarchy of aggregating parent chains. DSTs extend Kalman filters, hidden Markov models and nonlinear dynamical systems to an interactive group scenario. Various individual processes interact as communities and sub-communities in a tree structure that is unrolled in time.… ▽ More

    Submitted 11 July, 2012; originally announced July 2012.

    Comments: Appears in Proceedings of the Twentieth Conference on Uncertainty in Artificial Intelligence (UAI2004)

    Report number: UAI-P-2004-PG-260-267

  17. arXiv:1206.3269  [pdf

    cs.LG stat.ML

    Bayesian Out-Trees

    Authors: Tony S. Jebara

    Abstract: A Bayesian treatment of latent directed graph structure for non-iid data is provided where each child datum is sampled with a directed conditional dependence on a single unknown parent datum. The latent graph structure is assumed to lie in the family of directed out-tree graphs which leads to efficient Bayesian inference. The latent likelihood of the data and its gradients are computable in closed… ▽ More

    Submitted 13 June, 2012; originally announced June 2012.

    Comments: Appears in Proceedings of the Twenty-Fourth Conference on Uncertainty in Artificial Intelligence (UAI2008)

    Report number: UAI-P-2008-PG-315-324