Showing 1–3 of 3 results for author: Dubatovka, A

  1. arXiv:2206.12444  [pdf, other]

    cs.LG

    Gated Domain Units for Multi-source Domain Generalization

    Authors: Simon Föll, Alina Dubatovka, Eugen Ernst, Siu Lun Chau, Martin Maritsch, Patrik Okanovic, Gudrun Thäter, Joachim M. Buhmann, Felix Wortmann, Krikamol Muandet

    Abstract: The phenomenon of distribution shift (DS) occurs when a dataset at test time differs from the dataset at training time, which can significantly impair the performance of a machine learning model in practical settings due to a lack of knowledge about the data's distribution at test time. To address this problem, we postulate that real-world distributions are composed of latent Invariant Elementary…

    Submitted 16 May, 2023; v1 submitted 24 June, 2022; originally announced June 2022.

  2. arXiv:1911.11481  [pdf, other]

    cs.LG stat.ML

    Ranking architectures using meta-learning

    Authors: Alina Dubatovka, Efi Kokiopoulou, Luciano Sbaiz, Andrea Gesmundo, Gabor Bartok, Jesse Berent

    Abstract: Neural architecture search has recently attracted lots of research efforts as it promises to automate the manual design of neural networks. However, it requires a large amount of computing resources and in order to alleviate this, a performance prediction network has been recently proposed that enables efficient architecture search by forecasting the performance of candidate architectures, instead…

    Submitted 26 November, 2019; originally announced November 2019.

    Comments: NeurIPS 2019 Meta-Learning workshop

  3. arXiv:1806.06362  [pdf, other]

    stat.ML cs.LG

    Initialization of ReLUs for Dynamical Isometry

    Authors: Rebekka Burkholz, Alina Dubatovka

    Abstract: Deep learning relies on good initialization schemes and hyperparameter choices prior to training a neural network. Random weight initializations induce random network ensembles, which give rise to the trainability, training speed, and sometimes also generalization ability of an instance. In addition, such ensembles provide theoretical insights into the space of candidate models of which one is sel…

    Submitted 24 October, 2019; v1 submitted 17 June, 2018; originally announced June 2018.

    Comments: NeurIPS 2019