Skip to main content

Showing 1–9 of 9 results for author: Rusak, E

Searching in archive cs. Search in all archives.
.
  1. arXiv:2410.08258  [pdf, ps, other

    cs.CV

    In Search of Forgotten Domain Generalization

    Authors: Prasanna Mayilvahanan, Roland S. Zimmermann, Thaddäus Wiedemer, Evgenia Rusak, Attila Juhos, Matthias Bethge, Wieland Brendel

    Abstract: Out-of-Domain (OOD) generalization is the ability of a model trained on one or more domains to generalize to unseen domains. In the ImageNet era of computer vision, evaluation sets for measuring a model's OOD performance were designed to be strictly OOD with respect to style. However, the emergence of foundation models and expansive web-scale datasets has obfuscated this evaluation process, as dat… ▽ More

    Submitted 6 June, 2025; v1 submitted 10 October, 2024; originally announced October 2024.

    Comments: ICLR 2025 camera-ready version

  2. arXiv:2408.03591  [pdf, other

    cs.CV cs.AI cs.HC cs.LG eess.SP

    Focal Depth Estimation: A Calibration-Free, Subject- and Daytime Invariant Approach

    Authors: Benedikt W. Hosp, Björn Severitt, Rajat Agarwala, Evgenia Rusak, Yannick Sauer, Siegfried Wahl

    Abstract: In an era where personalized technology is increasingly intertwined with daily life, traditional eye-tracking systems and autofocal glasses face a significant challenge: the need for frequent, user-specific calibration, which impedes their practicality. This study introduces a groundbreaking calibration-free method for estimating focal depth, leveraging machine learning techniques to analyze eye m… ▽ More

    Submitted 7 August, 2024; originally announced August 2024.

  3. arXiv:2407.00143  [pdf, other

    cs.LG cs.CV stat.ML

    InfoNCE: Identifying the Gap Between Theory and Practice

    Authors: Evgenia Rusak, Patrik Reizinger, Attila Juhos, Oliver Bringmann, Roland S. Zimmermann, Wieland Brendel

    Abstract: Prior theory work on Contrastive Learning via the InfoNCE loss showed that, under certain assumptions, the learned representations recover the ground-truth latent factors. We argue that these theories overlook crucial aspects of how CL is deployed in practice. Specifically, they either assume equal variance across all latents or that certain latents are kept invariant. However, in practice, positi… ▽ More

    Submitted 16 April, 2025; v1 submitted 28 June, 2024; originally announced July 2024.

  4. arXiv:2401.04578  [pdf, other

    cs.CV

    Effective pruning of web-scale datasets based on complexity of concept clusters

    Authors: Amro Abbas, Evgenia Rusak, Kushal Tirumala, Wieland Brendel, Kamalika Chaudhuri, Ari S. Morcos

    Abstract: Utilizing massive web-scale datasets has led to unprecedented performance gains in machine learning models, but also imposes outlandish compute requirements for their training. In order to improve training and data efficiency, we here push the limits of pruning large-scale multimodal datasets for training CLIP-style models. Today's most effective pruning method on ImageNet clusters data samples in… ▽ More

    Submitted 12 March, 2024; v1 submitted 9 January, 2024; originally announced January 2024.

    Comments: Accepted at ICLR 2024, code available at https://github.com/amro-kamal/effective_pruning

  5. arXiv:2310.09562  [pdf, other

    cs.CV cs.AI cs.LG

    Does CLIP's Generalization Performance Mainly Stem from High Train-Test Similarity?

    Authors: Prasanna Mayilvahanan, Thaddäus Wiedemer, Evgenia Rusak, Matthias Bethge, Wieland Brendel

    Abstract: Foundation models like CLIP are trained on hundreds of millions of samples and effortlessly generalize to new tasks and inputs. Out of the box, CLIP shows stellar zero-shot and few-shot capabilities on a wide range of out-of-distribution (OOD) benchmarks, which prior works attribute mainly to today's large and comprehensive training dataset (like LAION). However, it is questionable how meaningful… ▽ More

    Submitted 14 March, 2024; v1 submitted 14 October, 2023; originally announced October 2023.

    Comments: ICLR 2024 camera-ready version

  6. arXiv:2104.12928  [pdf, other

    cs.CV cs.LG

    If your data distribution shifts, use self-learning

    Authors: Evgenia Rusak, Steffen Schneider, George Pachitariu, Luisa Eck, Peter Gehler, Oliver Bringmann, Wieland Brendel, Matthias Bethge

    Abstract: We demonstrate that self-learning techniques like entropy minimization and pseudo-labeling are simple and effective at improving performance of a deployed computer vision model under systematic domain shifts. We conduct a wide range of large-scale experiments and show consistent improvements irrespective of the model architecture, the pre-training technique or the type of distribution shift. At th… ▽ More

    Submitted 7 December, 2023; v1 submitted 26 April, 2021; originally announced April 2021.

    Comments: Web: https://domainadaptation.org/selflearning

  7. arXiv:2006.16971  [pdf, other

    cs.LG cs.CV stat.ML

    Improving robustness against common corruptions by covariate shift adaptation

    Authors: Steffen Schneider, Evgenia Rusak, Luisa Eck, Oliver Bringmann, Wieland Brendel, Matthias Bethge

    Abstract: Today's state-of-the-art machine vision models are vulnerable to image corruptions like blurring or compression artefacts, limiting their performance in many real-world applications. We here argue that popular benchmarks to measure model robustness against common corruptions (like ImageNet-C) underestimate model robustness in many (but not all) application scenarios. The key insight is that in man… ▽ More

    Submitted 23 October, 2020; v1 submitted 30 June, 2020; originally announced June 2020.

    Comments: Accepted at the Thirty-fourth Conference on Neural Information Processing Systems. Web: https://domainadaptation.org/batchnorm/

  8. arXiv:2001.06057  [pdf, other

    cs.CV cs.LG stat.ML

    A simple way to make neural networks robust against diverse image corruptions

    Authors: Evgenia Rusak, Lukas Schott, Roland S. Zimmermann, Julian Bitterwolf, Oliver Bringmann, Matthias Bethge, Wieland Brendel

    Abstract: The human visual system is remarkably robust against a wide range of naturally occurring variations and corruptions like rain or snow. In contrast, the performance of modern image recognition models strongly degrades when evaluated on previously unseen corruptions. Here, we demonstrate that a simple but properly tuned training with additive Gaussian and Speckle noise generalizes surprisingly well… ▽ More

    Submitted 22 July, 2020; v1 submitted 16 January, 2020; originally announced January 2020.

    Comments: Oral presentation at the European Conference for Computer Vision (ECCV 2020)

  9. arXiv:1907.07484  [pdf, other

    cs.CV cs.LG stat.ML

    Benchmarking Robustness in Object Detection: Autonomous Driving when Winter is Coming

    Authors: Claudio Michaelis, Benjamin Mitzkus, Robert Geirhos, Evgenia Rusak, Oliver Bringmann, Alexander S. Ecker, Matthias Bethge, Wieland Brendel

    Abstract: The ability to detect objects regardless of image distortions or weather conditions is crucial for real-world applications of deep learning like autonomous driving. We here provide an easy-to-use benchmark to assess how object detection models perform when image quality degrades. The three resulting benchmark datasets, termed Pascal-C, Coco-C and Cityscapes-C, contain a large variety of image corr… ▽ More

    Submitted 31 March, 2020; v1 submitted 17 July, 2019; originally announced July 2019.

    Comments: 21 pages, 10 figures, 1 dragon