Skip to main content

Showing 1–8 of 8 results for author: Khardon, R

Searching in archive stat. Search in all archives.
.
  1. arXiv:2302.02420  [pdf, other

    cs.LG stat.ML

    Variational Inference on the Final-Layer Output of Neural Networks

    Authors: Yadi Wei, Roni Khardon

    Abstract: Traditional neural networks are simple to train but they typically produce overconfident predictions. In contrast, Bayesian neural networks provide good uncertainty quantification but optimizing them is time consuming due to the large parameter space. This paper proposes to combine the advantages of both approaches by performing Variational Inference in the Final layer Output space (VIFO), because… ▽ More

    Submitted 6 November, 2024; v1 submitted 5 February, 2023; originally announced February 2023.

    Comments: Published to TMLR

  2. arXiv:2211.08393  [pdf, other

    cs.LG stat.ML

    On the Performance of Direct Loss Minimization for Bayesian Neural Networks

    Authors: Yadi Wei, Roni Khardon

    Abstract: Direct Loss Minimization (DLM) has been proposed as a pseudo-Bayesian method motivated as regularized loss minimization. Compared to variational inference, it replaces the loss term in the evidence lower bound (ELBO) with the predictive log loss, which is the same loss function used in evaluation. A number of theoretical and empirical results in prior work suggest that DLM can significantly improv… ▽ More

    Submitted 15 November, 2022; originally announced November 2022.

    Comments: I Cant Believe It is Not Better Workshop at NeurIPS 2022

  3. arXiv:2203.12139  [pdf, other

    cs.AI stat.CO

    Approximate Inference for Stochastic Planning in Factored Spaces

    Authors: Zhennan Wu, Roni Khardon

    Abstract: Stochastic planning can be reduced to probabilistic inference in large discrete graphical models, but hardness of inference requires approximation schemes to be used. In this paper we argue that such applications can be disentangled along two dimensions. The first is the direction of information flow in the idealized exact optimization objective, i.e., forward vs backward inference. The second is… ▽ More

    Submitted 1 September, 2022; v1 submitted 22 March, 2022; originally announced March 2022.

  4. arXiv:2004.03083  [pdf, other

    cs.LG stat.ML

    Direct loss minimization algorithms for sparse Gaussian processes

    Authors: Yadi Wei, Rishit Sheth, Roni Khardon

    Abstract: The paper provides a thorough investigation of Direct loss minimization (DLM), which optimizes the posterior to minimize predictive loss, in sparse Gaussian processes. For the conjugate case, we consider DLM for log-loss and DLM for square loss showing a significant performance improvement in both cases. The application of DLM in non-conjugate cases is more complex because the logarithm of expecta… ▽ More

    Submitted 27 October, 2020; v1 submitted 6 April, 2020; originally announced April 2020.

    Comments: 31 pages, 16 figures

  5. arXiv:1612.03957  [pdf, other

    stat.ML

    Monte Carlo Structured SVI for Two-Level Non-Conjugate Models

    Authors: Rishit Sheth, Roni Khardon

    Abstract: The stochastic variational inference (SVI) paradigm, which combines variational inference, natural gradients, and stochastic updates, was recently proposed for large-scale data analysis in conjugate Bayesian models and demonstrated to be effective in several problems. This paper studies a family of Bayesian latent variable models with two levels of hidden variables but without any conjugacy requir… ▽ More

    Submitted 2 February, 2018; v1 submitted 12 December, 2016; originally announced December 2016.

    Comments: Updated w/ mixed effects model

  6. arXiv:1301.5332  [pdf, ps, other

    stat.ML cs.LG

    Online Learning with Pairwise Loss Functions

    Authors: Yuyang Wang, Roni Khardon, Dmitry Pechyony, Rosie Jones

    Abstract: Efficient online learning with pairwise loss functions is a crucial component in building large-scale learning system that maximizes the area under the Receiver Operator Characteristic (ROC) curve. In this paper we investigate the generalization performance of online learning algorithms with pairwise loss functions. We show that the existing proof techniques for generalization bounds of online alg… ▽ More

    Submitted 22 January, 2013; originally announced January 2013.

    Comments: This is an extension of our COLT paper

  7. Nonparametric Bayesian Mixed-effect Model: a Sparse Gaussian Process Approach

    Authors: Yuyang Wang, Roni Khardon

    Abstract: Multi-task learning models using Gaussian processes (GP) have been developed and successfully applied in various applications. The main difficulty with this approach is the computational cost of inference using the union of examples from all tasks. Therefore sparse solutions, that avoid using the entire data directly and instead use a set of informative "representatives" are desirable. The paper i… ▽ More

    Submitted 28 November, 2012; originally announced November 2012.

    Comments: Preliminary version appeared in ECML2012

  8. arXiv:1203.0970  [pdf, other

    cs.LG astro-ph.IM stat.ML

    Infinite Shift-invariant Grouped Multi-task Learning for Gaussian Processes

    Authors: Yuyang Wang, Roni Khardon, Pavlos Protopapas

    Abstract: Multi-task learning leverages shared information among data sets to improve the learning performance of individual tasks. The paper applies this framework for data where each task is a phase-shifted periodic time series. In particular, we develop a novel Bayesian nonparametric model capturing a mixture of Gaussian processes where each task is a sum of a group-specific function and a component capt… ▽ More

    Submitted 20 May, 2013; v1 submitted 5 March, 2012; originally announced March 2012.

    Comments: This is an extended version of our ECML 2010 paper entitled "Shift-invariant Grouped Multi-task Learning for Gaussian Processes"; ECML PKDD'10 Proceedings of the 2010 European conference on Machine learning and knowledge discovery in databases: Part III