Showing 1–18 of 18 results for author: Cygert, S

  1. arXiv:2502.04959

    cs.LG

    No Task Left Behind: Isotropic Model Merging with Common and Task-Specific Subspaces

    Authors: Daniel Marczak, Simone Magistri, Sebastian Cygert, Bartłomiej Twardowski, Andrew D. Bagdanov, Joost van de Weijer

    Abstract: Model merging integrates the weights of multiple task-specific models into a single multi-task model. Despite recent interest in the problem, a significant performance gap between the combined and single-task models remains. In this paper, we investigate the key characteristics of task matrices -- weight update matrices applied to a pre-trained model -- that enable effective merging. We show that…

    Submitted 11 June, 2025; v1 submitted 7 February, 2025; originally announced February 2025.

    Comments: Accepted at ICML 2025

  2. arXiv:2409.18265

    cs.LG cs.CV

    Task-recency bias strikes back: Adapting covariances in Exemplar-Free Class Incremental Learning

    Authors: Grzegorz Rypeść, Sebastian Cygert, Tomasz Trzciński, Bartłomiej Twardowski

    Abstract: Exemplar-Free Class Incremental Learning (EFCIL) tackles the problem of training a model on a sequence of tasks without access to past data. Existing state-of-the-art methods represent classes as Gaussian distributions in the feature extractor's latent space, enabling Bayes classification or training the classifier by replaying pseudo features. However, we identify two critical issues that comprom…

    Submitted 26 October, 2024; v1 submitted 26 September, 2024; originally announced September 2024.

    Comments: Accepted for NeurIPS 2024

  3. arXiv:2407.14231

    cs.LG cs.CV

    Realistic Evaluation of Test-Time Adaptation Algorithms: Unsupervised Hyperparameter Selection

    Authors: Sebastian Cygert, Damian Sójka, Tomasz Trzciński, Bartłomiej Twardowski

    Abstract: Test-Time Adaptation (TTA) has recently emerged as a promising strategy for tackling the problem of machine learning model robustness under distribution shifts by adapting the model during inference without access to any labels. Because of task difficulty, hyperparameters strongly influence the effectiveness of adaptation. However, the literature has provided little exploration into optimal hyperp…

    Submitted 19 July, 2024; originally announced July 2024.

    Comments: preprint

  4. arXiv:2407.06322

    cs.LG cs.AI cs.CV

    MagMax: Leveraging Model Merging for Seamless Continual Learning

    Authors: Daniel Marczak, Bartłomiej Twardowski, Tomasz Trzciński, Sebastian Cygert

    Abstract: This paper introduces a continual learning approach named MagMax, which utilizes model merging to enable large pre-trained models to continuously learn from new data without forgetting previously acquired knowledge. Distinct from traditional continual learning methods that aim to reduce forgetting during task training, MagMax combines sequential fine-tuning with a maximum magnitude weight selectio…

    Submitted 29 July, 2024; v1 submitted 8 July, 2024; originally announced July 2024.

    Comments: Accepted for ECCV 2024

  5. arXiv:2401.10191

    cs.LG cs.CV

    Divide and not forget: Ensemble of selectively trained experts in Continual Learning

    Authors: Grzegorz Rypeść, Sebastian Cygert, Valeriya Khan, Tomasz Trzciński, Bartosz Zieliński, Bartłomiej Twardowski

    Abstract: Class-incremental learning is becoming more popular as it helps models widen their applicability while not forgetting what they already know. A trend in this area is to use a mixture-of-expert technique, where different models work together to solve the task. However, the experts are usually trained all at once using whole task data, which makes them all prone to forgetting and increasing computat…

    Submitted 19 March, 2024; v1 submitted 18 January, 2024; originally announced January 2024.

    Comments: Accepted for ICLR 2024 (main track), code is available at: https://github.com/grypesc/SEED

  6. Creating New Voices using Normalizing Flows

    Authors: Piotr Bilinski, Thomas Merritt, Abdelhamid Ezzerg, Kamil Pokora, Sebastian Cygert, Kayoko Yanagisawa, Roberto Barra-Chicote, Daniel Korzekwa

    Abstract: Creating realistic and natural-sounding synthetic speech remains a big challenge for voice identities unseen during training. As there is growing interest in synthesizing voices of new speakers, here we investigate the ability of normalizing flows in text-to-speech (TTS) and voice conversion (VC) modes to extrapolate from speakers observed during training to create unseen speaker identities. First…

    Submitted 22 December, 2023; originally announced December 2023.

    Comments: Interspeech 2022

    Journal ref: Interspeech 2022, 2958-2962

  7. arXiv:2311.13321

    cs.LG cs.CV

    Revisiting Supervision for Continual Representation Learning

    Authors: Daniel Marczak, Sebastian Cygert, Tomasz Trzciński, Bartłomiej Twardowski

    Abstract: In the field of continual learning, models are designed to learn tasks one after the other. While most research has centered on supervised continual learning, there is a growing interest in unsupervised continual learning, which makes use of the vast amounts of unlabeled data. Recent studies have highlighted the strengths of unsupervised methods, particularly self-supervised learning, in providing…

    Submitted 16 July, 2024; v1 submitted 22 November, 2023; originally announced November 2023.

    Comments: Accepted to ECCV 2024

  8. arXiv:2310.13533

    cs.CV cs.AI cs.LG

    Technical Report for ICCV 2023 Visual Continual Learning Challenge: Continuous Test-time Adaptation for Semantic Segmentation

    Authors: Damian Sójka, Yuyang Liu, Dipam Goswami, Sebastian Cygert, Bartłomiej Twardowski, Joost van de Weijer

    Abstract: The goal of the challenge is to develop a test-time adaptation (TTA) method that can adapt the model to gradually changing domains in video sequences for the semantic segmentation task. It is based on a synthetic driving video dataset - SHIFT. The source model is trained on images taken during daytime in clear weather. Domain changes at test-time are mainly caused by varying weather conditions and…

    Submitted 20 October, 2023; originally announced October 2023.

  9. arXiv:2309.10109

    cs.CV cs.AI cs.LG

    AR-TTA: A Simple Method for Real-World Continual Test-Time Adaptation

    Authors: Damian Sójka, Sebastian Cygert, Bartłomiej Twardowski, Tomasz Trzciński

    Abstract: Test-time adaptation is a promising research direction that allows the source model to adapt itself to changes in data distribution without any supervision. Yet, current methods are usually evaluated on benchmarks that are only a simplification of real-world scenarios. Hence, we propose to validate test-time adaptation methods using the recently introduced datasets for autonomous driving, namely C…

    Submitted 14 October, 2024; v1 submitted 18 September, 2023; originally announced September 2023.

    Comments: Accepted for BMVC 2024

  10. arXiv:2309.10012

    cs.LG cs.AI cs.CV

    Looking through the past: better knowledge retention for generative replay in continual learning

    Authors: Valeriya Khan, Sebastian Cygert, Kamil Deja, Tomasz Trzciński, Bartłomiej Twardowski

    Abstract: In this work, we improve the generative replay in a continual learning setting to perform well on challenging scenarios. Current generative rehearsal methods are usually benchmarked on small and simple datasets as they are not powerful enough to generate more complex data with a greater number of classes. We notice that in VAE-based generative replay, this could be attributed to the fact that the…

    Submitted 18 September, 2023; originally announced September 2023.

  11. arXiv:2309.08255

    eess.AS cs.CL cs.LG cs.SD

    Cross-lingual Knowledge Distillation via Flow-based Voice Conversion for Robust Polyglot Text-To-Speech

    Authors: Dariusz Piotrowski, Renard Korzeniowski, Alessio Falai, Sebastian Cygert, Kamil Pokora, Georgi Tinchev, Ziyao Zhang, Kayoko Yanagisawa

    Abstract: In this work, we introduce a framework for cross-lingual speech synthesis, which involves an upstream Voice Conversion (VC) model and a downstream Text-To-Speech (TTS) model. The proposed framework consists of 4 stages. In the first two stages, we use a VC model to convert utterances in the target locale to the voice of the target speaker. In the third stage, the converted data is combined with th…

    Submitted 15 September, 2023; originally announced September 2023.

    Comments: Accepted at ICONIP 2023

  12. arXiv:2308.12112

    cs.LG cs.CV

    Category Adaptation Meets Projected Distillation in Generalized Continual Category Discovery

    Authors: Grzegorz Rypeść, Daniel Marczak, Sebastian Cygert, Tomasz Trzciński, Bartłomiej Twardowski

    Abstract: Generalized Continual Category Discovery (GCCD) tackles learning from sequentially arriving, partially labeled datasets while uncovering new categories. Traditional methods depend on feature distillation to prevent forgetting the old knowledge. However, this strategy restricts the model's ability to adapt and effectively distinguish new categories. To address this, we introduce a novel technique i…

    Submitted 25 July, 2024; v1 submitted 23 August, 2023; originally announced August 2023.

    Comments: Accepted for ECCV 2024

  13. arXiv:2308.09544

    cs.LG cs.AI cs.CV

    Adapt Your Teacher: Improving Knowledge Distillation for Exemplar-free Continual Learning

    Authors: Filip Szatkowski, Mateusz Pyla, Marcin Przewięźlikowski, Sebastian Cygert, Bartłomiej Twardowski, Tomasz Trzciński

    Abstract: In this work, we investigate exemplar-free class incremental learning (CIL) with knowledge distillation (KD) as a regularization strategy, aiming to prevent forgetting. KD-based methods are successfully used in CIL, but they often struggle to regularize the model without access to exemplars of the training data from previous tasks. Our analysis reveals that this issue originates from substantial r…

    Submitted 4 November, 2023; v1 submitted 18 August, 2023; originally announced August 2023.

    Comments: Accepted at WACV 2024

  14. arXiv:2307.16679

    eess.AS cs.CL cs.LG

    Comparing normalizing flows and diffusion models for prosody and acoustic modelling in text-to-speech

    Authors: Guangyan Zhang, Thomas Merritt, Manuel Sam Ribeiro, Biel Tura-Vecino, Kayoko Yanagisawa, Kamil Pokora, Abdelhamid Ezzerg, Sebastian Cygert, Ammar Abbas, Piotr Bilinski, Roberto Barra-Chicote, Daniel Korzekwa, Jaime Lorenzo-Trueba

    Abstract: Neural text-to-speech systems are often optimized on L1/L2 losses, which make strong assumptions about the distributions of the target data space. Aiming to improve those assumptions, Normalizing Flows and Diffusion Probabilistic Models were recently proposed as alternatives. In this paper, we compare traditional L1/L2-based approaches to diffusion and flow-based approaches for the tasks of prosod…

    Submitted 31 July, 2023; originally announced July 2023.

    Comments: 5 pages, 2 figures, 5 tables. Interspeech 2023

  15. arXiv:2111.13065

    cs.CV

    Robust Object Detection with Multi-input Multi-output Faster R-CNN

    Authors: Sebastian Cygert, Andrzej Czyzewski

    Abstract: Recent years have seen impressive progress in visual recognition on many benchmarks; however, generalization to the real world in out-of-distribution settings remains a significant challenge. A state-of-the-art method for robust visual recognition is model ensembling. However, recently it was shown that similarly competitive results could be achieved with a much smaller cost, by using multi-input m…

    Submitted 25 November, 2021; originally announced November 2021.

  16. Closer Look at the Uncertainty Estimation in Semantic Segmentation under Distributional Shift

    Authors: Sebastian Cygert, Bartłomiej Wróblewski, Karol Woźniak, Radosław Słowiński, Andrzej Czyżewski

    Abstract: While recent computer vision algorithms achieve impressive performance on many benchmarks, they lack robustness - presented with an image from a different distribution (e.g., weather or lighting conditions not considered during training), they may produce an erroneous prediction. Therefore, it is desired that such a model will be able to reliably predict its confidence measure. In this work, uncer…

    Submitted 27 September, 2021; v1 submitted 31 May, 2021; originally announced June 2021.

    Comments: International Joint Conference on Neural Networks 2021, https://ieeexplore.ieee.org/document/9533330

  17. Robustness in Compressed Neural Networks for Object Detection

    Authors: Sebastian Cygert, Andrzej Czyżewski

    Abstract: Model compression techniques can significantly reduce the computational cost associated with data processing by deep neural networks with only a minor decrease in average accuracy. At the same time, reducing the model size may have a large effect on noisy cases or objects belonging to less frequent classes. It is a crucial problem from the perspective of the models' safety, especially for objec…

    Submitted 27 September, 2021; v1 submitted 10 February, 2021; originally announced February 2021.

    Comments: 2021 International Joint Conference on Neural Networks (IJCNN), https://ieeexplore.ieee.org/document/9533773

  18. arXiv:1604.03360

    physics.comp-ph nucl-th

    Relativistic hydrodynamics on graphics processing units

    Authors: Jan Sikorski, Sebastian Cygert, Joanna Porter-Sobieraj, Marcin Słodkowski, Piotr Krzyżanowski, Natalia Książek, Przemysław Duda

    Abstract: Hydrodynamics calculations have been successfully used in studies of the bulk properties of the Quark-Gluon Plasma, particularly of elliptic flow and shear viscosity. However, there are areas (for instance event-by-event simulations for flow fluctuations and higher-order flow harmonics studies) where further advancement is hampered by the lack of an efficient and precise 3+1D program. This problem can be…

    Submitted 12 April, 2016; originally announced April 2016.

    Comments: 3 pages, 3 figures, SQM2015 proceedings submitted to Journal of Physics: Conference Series (JPCS) - IOP Conference Series

    Journal ref: J.Phys.Conf.Ser. 509 (2014) 012059