Skip to main content

Showing 1–12 of 12 results for author: Giampouras, P

Searching in archive cs. Search in all archives.
.
  1. arXiv:2505.22549  [pdf, other

    cs.LG

    DES-LOC: Desynced Low Communication Adaptive Optimizers for Training Foundation Models

    Authors: Alex Iacob, Lorenzo Sani, Mher Safaryan, Paris Giampouras, Samuel Horváth, Andrej Jovanovic, Meghdad Kurmanji, Preslav Aleksandrov, William F. Shen, Xinchi Qiu, Nicholas D. Lane

    Abstract: Scaling foundation model training with Distributed Data Parallel (DDP) methods is bandwidth-limited. Existing infrequent communication methods like Local SGD were designed to synchronize only model parameters and cannot be trivially applied to adaptive optimizers due to additional optimizer states. Current approaches extending Local SGD either lack convergence guarantees or require synchronizing a… ▽ More

    Submitted 28 May, 2025; originally announced May 2025.

    Comments: Keywords: Distributed Training, Foundation Models, Large Language Models, Optimizers, Communication Efficiency, Federated Learning, Distributed Systems, Optimization Theory, Scaling, Robustness. Preprint, under review at NeurIPS

  2. arXiv:2502.00846  [pdf, ps, other

    cs.LG stat.ML

    Federated Generalised Variational Inference: A Robust Probabilistic Federated Learning Framework

    Authors: Terje Mildner, Oliver Hamelijnck, Paris Giampouras, Theodoros Damoulas

    Abstract: We introduce FedGVI, a probabilistic Federated Learning (FL) framework that is robust to both prior and likelihood misspecification. FedGVI addresses limitations in both frequentist and Bayesian FL by providing unbiased predictions under model misspecification, with calibrated uncertainty quantification. Our approach generalises previous FL approaches, specifically Partitioned Variational Inferenc… ▽ More

    Submitted 10 June, 2025; v1 submitted 2 February, 2025; originally announced February 2025.

    Comments: Accepted at ICML 2025

  3. arXiv:2410.16826  [pdf, ps, other

    math.OC cs.LG

    Guarantees of a Preconditioned Subgradient Algorithm for Overparameterized Asymmetric Low-rank Matrix Recovery

    Authors: Paris Giampouras, HanQin Cai, Rene Vidal

    Abstract: In this paper, we focus on a matrix factorization-based approach to recover low-rank {\it asymmetric} matrices from corrupted measurements. We propose an {\it Overparameterized Preconditioned Subgradient Algorithm (OPSA)} and provide, for the first time in the literature, linear convergence rates independent of the rank of the sought asymmetric matrix in the presence of gross corruptions. Our work… ▽ More

    Submitted 29 May, 2025; v1 submitted 22 October, 2024; originally announced October 2024.

    Journal ref: International Conference on Machine Learning, 2025

  4. arXiv:2309.12078  [pdf, other

    cs.LG

    Clustering-based Domain-Incremental Learning

    Authors: Christiaan Lamers, Rene Vidal, Nabil Belbachir, Niki van Stein, Thomas Baeck, Paris Giampouras

    Abstract: We consider the problem of learning multiple tasks in a continual learning setting in which data from different tasks is presented to the learner in a streaming fashion. A key challenge in this setting is the so-called "catastrophic forgetting problem", in which the performance of the learner in an "old task" decreases when subsequently trained on a "new task". Existing continual learning methods,… ▽ More

    Submitted 21 September, 2023; originally announced September 2023.

  5. arXiv:2306.04756  [pdf, other

    cs.LG cs.CR

    A Linearly Convergent GAN Inversion-based Algorithm for Reverse Engineering of Deceptions

    Authors: Darshan Thaker, Paris Giampouras, René Vidal

    Abstract: An important aspect of developing reliable deep learning systems is devising strategies that make these systems robust to adversarial attacks. There is a long line of work that focuses on developing defenses against these attacks, but recently, researchers have began to study ways to reverse engineer the attack process. This allows us to not only defend against several attack models, but also clas… ▽ More

    Submitted 7 June, 2023; originally announced June 2023.

  6. arXiv:2305.00316  [pdf, other

    cs.LG

    The Ideal Continual Learner: An Agent That Never Forgets

    Authors: Liangzu Peng, Paris V. Giampouras, René Vidal

    Abstract: The goal of continual learning is to find a model that solves multiple learning tasks which are presented sequentially to the learner. A key challenge in this setting is that the learner may forget how to solve a previous task when learning a new task, a phenomenon known as catastrophic forgetting. To address this challenge, many practical methods have been proposed, including memory-based, regula… ▽ More

    Submitted 7 June, 2023; v1 submitted 29 April, 2023; originally announced May 2023.

    Comments: Accepted to ICML 2023

  7. arXiv:2203.04886  [pdf, other

    cs.LG

    Reverse Engineering $\ell_p$ attacks: A block-sparse optimization approach with recovery guarantees

    Authors: Darshan Thaker, Paris Giampouras, René Vidal

    Abstract: Deep neural network-based classifiers have been shown to be vulnerable to imperceptible perturbations to their input, such as $\ell_p$-bounded norm adversarial attacks. This has motivated the development of many defense methods, which are then broken by new attacks, and so on. This paper focuses on a different but related problem of reverse engineering adversarial attacks. Specifically, given an a… ▽ More

    Submitted 9 March, 2022; originally announced March 2022.

  8. arXiv:2201.09079  [pdf, other

    cs.CV cs.LG

    Implicit Bias of Projected Subgradient Method Gives Provable Robust Recovery of Subspaces of Unknown Codimension

    Authors: Paris V. Giampouras, Benjamin D. Haeffele, René Vidal

    Abstract: Robust subspace recovery (RSR) is a fundamental problem in robust representation learning. Here we focus on a recently proposed RSR method termed Dual Principal Component Pursuit (DPCP) approach, which aims to recover a basis of the orthogonal complement of the subspace and is amenable to handling subspaces of high relative dimension. Prior work has shown that DPCP can provably recover the correct… ▽ More

    Submitted 22 January, 2022; originally announced January 2022.

  9. arXiv:2101.02931  [pdf, other

    stat.ME cs.LG math.NA

    Block-Term Tensor Decomposition Model Selection and Computation: The Bayesian Way

    Authors: Paris V. Giampouras, Athanasios A. Rontogiannis, Eleftherios Kofidis

    Abstract: The so-called block-term decomposition (BTD) tensor model, especially in its rank-$(L_r,L_r,1)$ version, has been recently receiving increasing attention due to its enhanced ability of representing systems and signals that are composed of \emph{blocks} of rank higher than one, a scenario encountered in numerous and diverse applications. Uniqueness conditions and fitting methods have thus been thor… ▽ More

    Submitted 5 July, 2021; v1 submitted 8 January, 2021; originally announced January 2021.

  10. arXiv:1710.02004  [pdf, other

    cs.LG

    Alternating Iteratively Reweighted Minimization Algorithms for Low-Rank Matrix Factorization

    Authors: Paris V. Giampouras, Athanasios A. Rontogiannis, Konstantinos D. Koutroumbas

    Abstract: Nowadays, the availability of large-scale data in disparate application domains urges the deployment of sophisticated tools for extracting valuable knowledge out of this huge bulk of information. In that vein, low-rank representations (LRRs) which seek low-dimensional embeddings of data have naturally appeared. In an effort to reduce computational complexity and improve estimation performance, LRR… ▽ More

    Submitted 5 October, 2017; originally announced October 2017.

    Comments: 14 pages

  11. arXiv:1703.05785  [pdf, other

    cs.CV stat.ML

    Low-rank and Sparse NMF for Joint Endmembers' Number Estimation and Blind Unmixing of Hyperspectral Images

    Authors: Paris V. Giampouras, Athanasios A. Rontogiannis, Konstantinos D. Koutroumbas

    Abstract: Estimation of the number of endmembers existing in a scene constitutes a critical task in the hyperspectral unmixing process. The accuracy of this estimate plays a crucial role in subsequent unsupervised unmixing steps i.e., the derivation of the spectral signatures of the endmembers (endmembers' extraction) and the estimation of the abundance fractions of the pixels. A common practice amply follo… ▽ More

    Submitted 16 March, 2017; originally announced March 2017.

  12. arXiv:1504.01515  [pdf, other

    cs.CV math.OC stat.ML

    Simultaneously sparse and low-rank abundance matrix estimation for hyperspectral image unmixing

    Authors: Paris Giampouras, Konstantinos Themelis, Athanasios Rontogiannis, Konstantinos Koutroumbas

    Abstract: In a plethora of applications dealing with inverse problems, e.g. in image processing, social networks, compressive sensing, biological data processing etc., the signal of interest is known to be structured in several ways at the same time. This premise has recently guided the research to the innovative and meaningful idea of imposing multiple constraints on the parameters involved in the problem… ▽ More

    Submitted 14 October, 2015; v1 submitted 7 April, 2015; originally announced April 2015.

    Comments: 30 pages, 9 figures