Skip to main content

Showing 1–15 of 15 results for author: Bilmes, J A

.
  1. arXiv:2410.22656  [pdf, ps, other

    cs.LG

    Tilted Sharpness-Aware Minimization

    Authors: Tian Li, Tianyi Zhou, Jeffrey A. Bilmes

    Abstract: Sharpness-Aware Minimization (SAM) has been demonstrated to improve the generalization performance of overparameterized models by seeking flat minima on the loss landscape through optimizing model parameters that incur the largest loss within a neighborhood. Nevertheless, such min-max formulations are computationally challenging especially when the problem is highly non-convex. Additionally, focus… ▽ More

    Submitted 8 June, 2025; v1 submitted 29 October, 2024; originally announced October 2024.

    Comments: Accepted by ICML 2025

  2. arXiv:1801.07413  [pdf, other

    cs.DM

    Greed is Still Good: Maximizing Monotone Submodular+Supermodular Functions

    Authors: Wenruo Bai, Jeffrey A. Bilmes

    Abstract: We analyze the performance of the greedy algorithm, and also a discrete semi-gradient based algorithm, for maximizing the sum of a suBmodular and suPermodular (BP) function (both of which are non-negative monotone non-decreasing) under two types of constraints, either a cardinality constraint or $p\geq 1$ matroid independence constraints. These problems occur naturally in several real-world applic… ▽ More

    Submitted 23 January, 2018; originally announced January 2018.

  3. arXiv:1410.7875  [pdf, other

    q-bio.MN stat.ML

    Faster graphical model identification of tandem mass spectra using peptide word lattices

    Authors: Shengjie Wang, John T. Halloran, Jeff A. Bilmes, William S. Noble

    Abstract: Liquid chromatography coupled with tandem mass spectrometry, also known as shotgun proteomics, is a widely-used high-throughput technology for identifying proteins in complex biological samples. Analysis of the tens of thousands of fragmentation spectra produced by a typical shotgun proteomics experiment begins by assigning to each observed spectrum the peptide hypothesized to be responsible for g… ▽ More

    Submitted 29 October, 2014; originally announced October 2014.

  4. arXiv:1408.2062  [pdf

    cs.LG stat.ML

    The Lovasz-Bregman Divergence and connections to rank aggregation, clustering, and web ranking

    Authors: Rishabh Iyer, Jeff A. Bilmes

    Abstract: We extend the recently introduced theory of Lovasz-Bregman (LB) divergences (Iyer & Bilmes 2012) in several ways. We show that they represent a distortion between a "score" and an "ordering", thus providing a new view of rank aggregation and order based clustering with interesting connections to web ranking. We show how the LB divergences have a number of properties akin to many permutation based… ▽ More

    Submitted 9 August, 2014; originally announced August 2014.

    Comments: Appears in Proceedings of the Twenty-Ninth Conference on Uncertainty in Artificial Intelligence (UAI2013)

    Report number: UAI-P-2013-PG-321-330

  5. arXiv:1408.2051  [pdf

    cs.LG stat.ML

    Algorithms for Approximate Minimization of the Difference Between Submodular Functions, with Applications

    Authors: Rishabh Iyer, Jeff A. Bilmes

    Abstract: We extend the work of Narasimhan and Bilmes [30] for minimizing set functions representable as a dierence between submodular functions. Similar to [30], our new algorithms are guaranteed to monotonically reduce the objective function at every step. We empirically and theoretically show that the per-iteration cost of our algorithms is much less than [30], and our algorithms can be used to efficient… ▽ More

    Submitted 9 August, 2014; originally announced August 2014.

    Comments: Appears in Proceedings of the Twenty-Eighth Conference on Uncertainty in Artificial Intelligence (UAI2012)

    Report number: UAI-P-2012-PG-407-417

  6. arXiv:1301.3837  [pdf

    cs.LG cs.AI stat.ML

    Dynamic Bayesian Multinets

    Authors: Jeff A. Bilmes

    Abstract: In this work, dynamic Bayesian multinets are introduced where a Markov chain state at time t determines conditional independence patterns between random variables lying within a local time window surrounding t. It is shown how information-theoretic criterion functions can be used to induce sparse, discriminative, and class-conditional network structures that yield an optimal approximation to the… ▽ More

    Submitted 16 January, 2013; originally announced January 2013.

    Comments: Appears in Proceedings of the Sixteenth Conference on Uncertainty in Artificial Intelligence (UAI2000)

    Report number: UAI-P-2000-PG-38-45

  7. arXiv:1212.2448  [pdf

    cs.AI

    On Triangulating Dynamic Graphical Models

    Authors: Jeff A. Bilmes, Chris Bartels

    Abstract: This paper introduces new methodology to triangulate dynamic Bayesian networks (DBNs) and dynamic graphical models (DGMs). While most methods to triangulate such networks use some form of constrained elimination scheme based on properties of the underlying directed graph, we find it useful to view triangulation and elimination using properties only of the resulting undirected g… ▽ More

    Submitted 19 October, 2012; originally announced December 2012.

    Comments: Appears in Proceedings of the Nineteenth Conference on Uncertainty in Artificial Intelligence (UAI2003)

    Report number: UAI-P-2003-PG-47-56

  8. arXiv:1210.4904  [pdf

    cs.CE q-bio.QM

    Spectrum Identification using a Dynamic Bayesian Network Model of Tandem Mass Spectra

    Authors: Ajit P. Singh, John Halloran, Jeff A. Bilmes, Katrin Kirchoff, William S. Noble

    Abstract: Shotgun proteomics is a high-throughput technology used to identify unknown proteins in a complex mixture. At the heart of this process is a prediction task, the spectrum identification problem, in which each fragmentation spectrum produced by a shotgun proteomics experiment must be mapped to the peptide (protein subsequence) which generated the spectrum. We propose a new algorithm for spectrum id… ▽ More

    Submitted 16 October, 2012; originally announced October 2012.

    Comments: Appears in Proceedings of the Twenty-Eighth Conference on Uncertainty in Artificial Intelligence (UAI2012)

    Report number: UAI-P-2012-PG-775-785

  9. arXiv:1210.4871  [pdf

    cs.LG cs.CL cs.IR stat.ML

    Learning Mixtures of Submodular Shells with Application to Document Summarization

    Authors: Hui Lin, Jeff A. Bilmes

    Abstract: We introduce a method to learn a mixture of submodular "shells" in a large-margin setting. A submodular shell is an abstract submodular function that can be instantiated with a ground set and a set of parameters to produce a submodular function. A mixture of such shells can then also be so instantiated to produce a more complex submodular function. What our algorithm learns are the mixture weights… ▽ More

    Submitted 16 October, 2012; originally announced October 2012.

    Comments: Appears in Proceedings of the Twenty-Eighth Conference on Uncertainty in Artificial Intelligence (UAI2012)

    Report number: UAI-P-2012-PG-479-490

  10. arXiv:1207.4151  [pdf

    cs.LG cs.DS stat.ML

    PAC-learning bounded tree-width Graphical Models

    Authors: Mukund Narasimhan, Jeff A. Bilmes

    Abstract: We show that the class of strongly connected graphical models with treewidth at most k can be properly efficiently PAC-learnt with respect to the Kullback-Leibler Divergence. Previous approaches to this problem, such as those of Chow ([1]), and Ho gen ([7]) have shown that this class is PAC-learnable by reducing it to a combinatorial optimization problem. However, for k > 1, this problem is NP-com… ▽ More

    Submitted 11 July, 2012; originally announced July 2012.

    Comments: Appears in Proceedings of the Twentieth Conference on Uncertainty in Artificial Intelligence (UAI2004)

    Report number: UAI-P-2004-PG-410-417

  11. arXiv:1207.1404  [pdf

    cs.LG cs.DS stat.ML

    A submodular-supermodular procedure with applications to discriminative structure learning

    Authors: Mukund Narasimhan, Jeff A. Bilmes

    Abstract: In this paper, we present an algorithm for minimizing the difference between two submodular functions using a variational framework which is based on (an extension of) the concave-convex procedure [17]. Because several commonly used metrics in machine learning, like mutual information and conditional mutual information, are submodular, the problem of minimizing the difference of two submodular pro… ▽ More

    Submitted 4 July, 2012; originally announced July 2012.

    Comments: Appears in Proceedings of the Twenty-First Conference on Uncertainty in Artificial Intelligence (UAI2005)

    Report number: UAI-P-2005-PG-404-412

  12. arXiv:1206.6869  [pdf

    cs.AI

    Recognizing Activities and Spatial Context Using Wearable Sensors

    Authors: Amarnag Subramanya, Alvin Raj, Jeff A. Bilmes, Dieter Fox

    Abstract: We introduce a new dynamic model with the capability of recognizing both activities that an individual is performing as well as where that ndividual is located. Our model is novel in that it utilizes a dynamic graphical model to jointly estimate both activity and spatial context over time based on the simultaneous use of asynchronous observations consisting of GPS measurements, and measurements fr… ▽ More

    Submitted 27 June, 2012; originally announced June 2012.

    Comments: Appears in Proceedings of the Twenty-Second Conference on Uncertainty in Artificial Intelligence (UAI2006)

    Report number: UAI-P-2006-PG-494-502

  13. arXiv:1206.6825  [pdf

    cs.AI cs.DS

    Non-Minimal Triangulations for Mixed Stochastic/Deterministic Graphical Models

    Authors: Chris Bartels, Jeff A. Bilmes

    Abstract: We observe that certain large-clique graph triangulations can be useful to reduce both computational and space requirements when making queries on mixed stochastic/deterministic graphical models. We demonstrate that many of these large-clique triangulations are non-minimal and are thus unattainable via the variable elimination algorithm. We introduce ancestral pairs as the basis for novel triangul… ▽ More

    Submitted 27 June, 2012; originally announced June 2012.

    Comments: Appears in Proceedings of the Twenty-Second Conference on Uncertainty in Artificial Intelligence (UAI2006)

    Report number: UAI-P-2006-PG-15-22

  14. arXiv:1206.5265  [pdf

    cs.LG cs.AI stat.ML

    Consensus ranking under the exponential model

    Authors: Marina Meila, Kapil Phadnis, Arthur Patterson, Jeff A. Bilmes

    Abstract: We analyze the generalized Mallows model, a popular exponential model over rankings. Estimating the central (or consensus) ranking from data is NP-hard. We obtain the following new results: (1) We show that search methods can estimate both the central ranking pi0 and the model parameters theta exactly. The search is n! in the worst case, but is tractable when the true distribution is concentrated… ▽ More

    Submitted 20 June, 2012; originally announced June 2012.

    Comments: Appears in Proceedings of the Twenty-Third Conference on Uncertainty in Artificial Intelligence (UAI2007)

    Report number: UAI-P-2007-PG-285-294

  15. arXiv:1202.3726  [pdf

    cs.LG stat.ML

    Active Semi-Supervised Learning using Submodular Functions

    Authors: Andrew Guillory, Jeff A. Bilmes

    Abstract: We consider active, semi-supervised learning in an offline transductive setting. We show that a previously proposed error bound for active learning on undirected weighted graphs can be generalized by replacing graph cut with an arbitrary symmetric submodular function. Arbitrary non-symmetric submodular functions can be used via symmetrization. Different choices of submodular functions give differe… ▽ More

    Submitted 14 February, 2012; originally announced February 2012.

    Report number: UAI-P-2011-PG-274-282