Skip to main content

Showing 1–10 of 10 results for author: Akata, Z

Searching in archive stat. Search in all archives.
.
  1. arXiv:2407.07829  [pdf, other

    cs.LG cs.CV stat.ML

    Disentangled Representation Learning with the Gromov-Monge Gap

    Authors: Théo Uscidda, Luca Eyring, Karsten Roth, Fabian Theis, Zeynep Akata, Marco Cuturi

    Abstract: Learning disentangled representations from unlabelled data is a fundamental challenge in machine learning. Solving it may unlock other problems, such as generalization, interpretability, or fairness. Although remarkably challenging to solve in theory, disentanglement is often achieved in practice through prior matching. Furthermore, recent works have shown that prior matching approaches can be enh… ▽ More

    Submitted 24 October, 2024; v1 submitted 10 July, 2024; originally announced July 2024.

  2. arXiv:2307.10865  [pdf, other

    cs.LG stat.ML

    Addressing caveats of neural persistence with deep graph persistence

    Authors: Leander Girrbach, Anders Christensen, Ole Winther, Zeynep Akata, A. Sophia Koepke

    Abstract: Neural Persistence is a prominent measure for quantifying neural network complexity, proposed in the emerging field of topological data analysis in deep learning. In this work, however, we find both theoretically and empirically that the variance of network weights and spatial concentration of large weights are the main factors that impact neural persistence. Whilst this captures useful informatio… ▽ More

    Submitted 20 November, 2023; v1 submitted 20 July, 2023; originally announced July 2023.

    Comments: Transactions on Machine Learning Research (TMLR), 2023

  3. arXiv:2210.07347  [pdf, other

    cs.LG stat.ML

    Disentanglement of Correlated Factors via Hausdorff Factorized Support

    Authors: Karsten Roth, Mark Ibrahim, Zeynep Akata, Pascal Vincent, Diane Bouchacourt

    Abstract: A grand goal in deep learning research is to learn representations capable of generalizing across distribution shifts. Disentanglement is one promising direction aimed at aligning a model's representation with the underlying factors generating the data (e.g. color or background). Existing disentanglement methods, however, rely on an often unrealistic assumption: that factors are statistically inde… ▽ More

    Submitted 25 February, 2023; v1 submitted 13 October, 2022; originally announced October 2022.

    Comments: Accepted to ICLR 2023

  4. arXiv:2207.03784  [pdf, other

    cs.LG stat.ML

    A Non-isotropic Probabilistic Take on Proxy-based Deep Metric Learning

    Authors: Michael Kirchhof, Karsten Roth, Zeynep Akata, Enkelejda Kasneci

    Abstract: Proxy-based Deep Metric Learning (DML) learns deep representations by embedding images close to their class representatives (proxies), commonly with respect to the angle between them. However, this disregards the embedding norm, which can carry additional beneficial context such as class- or image-intrinsic uncertainty. In addition, proxy-based DML struggles to learn class-internal structures. To… ▽ More

    Submitted 8 July, 2022; originally announced July 2022.

    Comments: Accepted as conference paper at ECCV 2022

  5. arXiv:2110.12467  [pdf, other

    cs.CV cs.LG eess.IV stat.ML

    Robustness via Uncertainty-aware Cycle Consistency

    Authors: Uddeshya Upadhyay, Yanbei Chen, Zeynep Akata

    Abstract: Unpaired image-to-image translation refers to learning inter-image-domain mapping without corresponding image pairs. Existing methods learn deterministic mappings without explicitly modelling the robustness to outliers or predictive uncertainty, leading to performance degradation when encountering unseen perturbations at test time. To address this, we propose a novel probabilistic method based on… ▽ More

    Submitted 24 October, 2021; originally announced October 2021.

    Comments: Accepted at NeurIPS 2021. Code is at https://github.com/ExplainableML/UncertaintyAwareCycleConsistency. arXiv admin note: substantial text overlap with arXiv:2102.11747

  6. arXiv:2002.07017  [pdf, other

    cs.LG stat.ML

    Learning Robust Representations via Multi-View Information Bottleneck

    Authors: Marco Federici, Anjan Dutta, Patrick Forré, Nate Kushman, Zeynep Akata

    Abstract: The information bottleneck principle provides an information-theoretic method for representation learning, by training an encoder to retain all information which is relevant for predicting the label while minimizing the amount of other, excess information in the representation. The original formulation, however, requires labeled data to identify the superfluous information. In this work, we extend… ▽ More

    Submitted 18 February, 2020; v1 submitted 17 February, 2020; originally announced February 2020.

  7. arXiv:1907.09557  [pdf, other

    cs.LG stat.ML

    Relational Generalized Few-Shot Learning

    Authors: Xiahan Shi, Leonard Salewski, Martin Schiegg, Zeynep Akata, Max Welling

    Abstract: Transferring learned models to novel tasks is a challenging problem, particularly if only very few labeled examples are available. Although this few-shot learning setup has received a lot of attention recently, most proposed methods focus on discriminating novel classes only. Instead, we consider the extended setup of generalized few-shot learning (GFSL), where the model is required to perform cla… ▽ More

    Submitted 15 September, 2020; v1 submitted 22 July, 2019; originally announced July 2019.

  8. arXiv:1906.02547  [pdf, other

    stat.ML cs.LG

    Combining Generative and Discriminative Models for Hybrid Inference

    Authors: Victor Garcia Satorras, Zeynep Akata, Max Welling

    Abstract: A graphical model is a structured representation of the data generating process. The traditional method to reason over random variables is to perform inference in this graphical model. However, in many cases the generating process is only a poor approximation of the much more complex true data generating process, leading to suboptimal estimation. The subtleties of the generative process are howeve… ▽ More

    Submitted 30 October, 2019; v1 submitted 6 June, 2019; originally announced June 2019.

  9. arXiv:1902.00566  [pdf, other

    cs.LG stat.ML

    Visual Rationalizations in Deep Reinforcement Learning for Atari Games

    Authors: Laurens Weitkamp, Elise van der Pol, Zeynep Akata

    Abstract: Due to the capability of deep learning to perform well in high dimensional problems, deep reinforcement learning agents perform well in challenging tasks such as Atari 2600 games. However, clearly explaining why a certain action is taken by the agent can be as important as the decision itself. Deep reinforcement learning models, as other deep learning models, tend to be opaque in their decision-ma… ▽ More

    Submitted 1 February, 2019; originally announced February 2019.

    Comments: presented as oral talk at BNAIC 2018

  10. arXiv:1805.09575  [pdf, other

    stat.ML cs.LG

    Primal-Dual Wasserstein GAN

    Authors: Mevlana Gemici, Zeynep Akata, Max Welling

    Abstract: We introduce Primal-Dual Wasserstein GAN, a new learning algorithm for building latent variable models of the data distribution based on the primal and the dual formulations of the optimal transport (OT) problem. We utilize the primal formulation to learn a flexible inference mechanism and to create an optimal approximate coupling between the data distribution and the generative model. In order to… ▽ More

    Submitted 24 May, 2018; originally announced May 2018.

    Comments: 14 pages, 16 figures