Skip to main content

Showing 1–14 of 14 results for author: Montesuma, E F

.
  1. arXiv:2504.01757  [pdf, other

    stat.ML cs.LG

    KD$^{2}$M: An unifying framework for feature knowledge distillation

    Authors: Eduardo Fernandes Montesuma

    Abstract: Knowledge Distillation (KD) seeks to transfer the knowledge of a teacher, towards a student neural net. This process is often done by matching the networks' predictions (i.e., their output), but, recently several works have proposed to match the distributions of neural nets' activations (i.e., their features), a process known as \emph{distribution matching}. In this paper, we propose an unifying f… ▽ More

    Submitted 2 April, 2025; originally announced April 2025.

    Comments: 8 pages, 2 figures, 1 table, under review

  2. arXiv:2503.17683  [pdf, other

    cs.LG stat.ML

    Decentralized Federated Dataset Dictionary Learning for Multi-Source Domain Adaptation

    Authors: Rebecca Clain, Eduardo Fernandes Montesuma, Fred Ngolè Mboula

    Abstract: Decentralized Multi-Source Domain Adaptation (DMSDA) is a challenging task that aims to transfer knowledge from multiple related and heterogeneous source domains to an unlabeled target domain within a decentralized framework. Our work tackles DMSDA through a fully decentralized federated approach. In particular, we extend the Federated Dataset Dictionary Learning (FedDaDiL) framework by eliminatin… ▽ More

    Submitted 22 March, 2025; originally announced March 2025.

    Comments: Accepted at ICASSP 2025

  3. arXiv:2502.12793  [pdf, other

    stat.ML cs.AI cs.LG

    Unsupervised Anomaly Detection through Mass Repulsing Optimal Transport

    Authors: Eduardo Fernandes Montesuma, Adel El Habazi, Fred Ngole Mboula

    Abstract: Detecting anomalies in datasets is a longstanding problem in machine learning. In this context, anomalies are defined as a sample that significantly deviates from the remaining data. Meanwhile, optimal transport (OT) is a field of mathematics concerned with the transportation, between two probability measures, at least effort. In classical OT, the optimal transportation strategy of a measure to it… ▽ More

    Submitted 18 February, 2025; originally announced February 2025.

    Comments: 15 pages, 9 figures, 1 table, under review

  4. arXiv:2501.13732  [pdf, other

    stat.ML cs.LG

    A dimensionality reduction technique based on the Gromov-Wasserstein distance

    Authors: Rafael P. Eufrazio, Eduardo Fernandes Montesuma, Charles C. Cavalcante

    Abstract: Analyzing relationships between objects is a pivotal problem within data science. In this context, Dimensionality reduction (DR) techniques are employed to generate smaller and more manageable data representations. This paper proposes a new method for dimensionality reduction, based on optimal transportation theory and the Gromov-Wasserstein distance. We offer a new probabilistic view of the class… ▽ More

    Submitted 23 January, 2025; originally announced January 2025.

  5. arXiv:2407.19853  [pdf, other

    cs.LG stat.ML

    Online Multi-Source Domain Adaptation through Gaussian Mixtures and Dataset Dictionary Learning

    Authors: Eduardo Fernandes Montesuma, Stevan Le Stanc, Fred Ngolè Mboula

    Abstract: This paper addresses the challenge of online multi-source domain adaptation (MSDA) in transfer learning, a scenario where one needs to adapt multiple, heterogeneous source domains towards a target domain that comes in a stream. We introduce a novel approach for the online fit of a Gaussian Mixture Model (GMM), based on the Wasserstein geometry of Gaussian measures. We build upon this method and re… ▽ More

    Submitted 29 July, 2024; originally announced July 2024.

    Comments: 6 pages, 3 figures, accepted at the IEEE International Workshop on Machine Learning for Signal Processing 2024

  6. arXiv:2407.11647  [pdf, other

    cs.LG

    Dataset Dictionary Learning in a Wasserstein Space for Federated Domain Adaptation

    Authors: Eduardo Fernandes Montesuma, Fabiola Espinoza Castellon, Fred Ngolè Mboula, Aurélien Mayoue, Antoine Souloumiac, Cédric Gouy-Pailler

    Abstract: Multi-Source Domain Adaptation (MSDA) is a challenging scenario where multiple related and heterogeneous source datasets must be adapted to an unlabeled target dataset. Conventional MSDA methods often overlook that data holders may have privacy concerns, hindering direct data sharing. In response, decentralized MSDA has emerged as a promising strategy to achieve adaptation without centralizing cli… ▽ More

    Submitted 16 July, 2024; originally announced July 2024.

    Comments: 17 pages,7 figures

  7. arXiv:2404.10261  [pdf, other

    stat.ML cs.LG

    Lighter, Better, Faster Multi-Source Domain Adaptation with Gaussian Mixture Models and Optimal Transport

    Authors: Eduardo Fernandes Montesuma, Fred Ngolè Mboula, Antoine Souloumiac

    Abstract: In this paper, we tackle Multi-Source Domain Adaptation (MSDA), a task in transfer learning where one adapts multiple heterogeneous, labeled source probability measures towards a different, unlabeled target measure. We propose a novel framework for MSDA, based on Optimal Transport (OT) and Gaussian Mixture Models (GMMs). Our framework has two key advantages. First, OT between GMMs can be solved ef… ▽ More

    Submitted 21 August, 2024; v1 submitted 15 April, 2024; originally announced April 2024.

    Comments: 13 pages, 6 figures, accepted as a research track paper at the ECML-PKDD 2024 conference

  8. arXiv:2403.13847  [pdf, other

    cs.LG cs.AI stat.ML

    Optimal Transport for Domain Adaptation through Gaussian Mixture Models

    Authors: Eduardo Fernandes Montesuma, Fred Maurice Ngolè Mboula, Antoine Souloumiac

    Abstract: Machine learning systems operate under the assumption that training and test data are sampled from a fixed probability distribution. However, this assumptions is rarely verified in practice, as the conditions upon which data was acquired are likely to change. In this context, the adaptation of the unsupervised domain requires minimal access to the data of the new conditions for learning models rob… ▽ More

    Submitted 22 January, 2025; v1 submitted 18 March, 2024; originally announced March 2024.

    Comments: 29 pages, 9 figures, 8 tables, accepted at Transactions on Machine Learning Research

  9. arXiv:2309.07670  [pdf, other

    cs.LG cs.AI

    Federated Dataset Dictionary Learning for Multi-Source Domain Adaptation

    Authors: Fabiola Espinoza Castellon, Eduardo Fernandes Montesuma, Fred Ngolè Mboula, Aurélien Mayoue, Antoine Souloumiac, Cédric Gouy-Pailler

    Abstract: In this article, we propose an approach for federated domain adaptation, a setting where distributional shift exists among clients and some have unlabeled data. The proposed framework, FedDaDiL, tackles the resulting challenge through dictionary learning of empirical distributions. In our setting, clients' distributions represent particular domains, and FedDaDiL collectively trains a federated dic… ▽ More

    Submitted 8 November, 2023; v1 submitted 14 September, 2023; originally announced September 2023.

    Comments: 7 pages,2 figures; v2: fixed typos

  10. arXiv:2309.07666  [pdf, other

    cs.LG cs.AI stat.ML

    Multi-Source Domain Adaptation meets Dataset Distillation through Dataset Dictionary Learning

    Authors: Eduardo Fernandes Montesuma, Fred Ngolè Mboula, Antoine Souloumiac

    Abstract: In this paper, we consider the intersection of two problems in machine learning: Multi-Source Domain Adaptation (MSDA) and Dataset Distillation (DD). On the one hand, the first considers adapting multiple heterogeneous labeled source domains to an unlabeled target domain. On the other hand, the second attacks the problem of synthesizing a small summary containing all the information about the data… ▽ More

    Submitted 14 September, 2023; originally announced September 2023.

    Comments: 7 pages,4 figures

  11. arXiv:2308.11247  [pdf, other

    cs.LG cs.AI eess.SY

    Benchmarking Domain Adaptation for Chemical Processes on the Tennessee Eastman Process

    Authors: Eduardo Fernandes Montesuma, Michela Mulas, Fred Ngolè Mboula, Francesco Corona, Antoine Souloumiac

    Abstract: In system monitoring, automatic fault diagnosis seeks to infer the systems' state based on sensor readings, e.g., through machine learning models. In this context, it is of key importance that, based on historical data, these systems are able to generalize to incoming data. In parallel, many factors may induce changes in the data probability distribution, hindering the possibility of such models t… ▽ More

    Submitted 29 July, 2024; v1 submitted 22 August, 2023; originally announced August 2023.

    Comments: 16 pages, 9 figures, 5 tables. Accepted as a Workshop paper at the ECML-PKDD 2024 conference

  12. arXiv:2307.14953  [pdf, other

    cs.LG cs.AI stat.ML

    Multi-Source Domain Adaptation through Dataset Dictionary Learning in Wasserstein Space

    Authors: Eduardo Fernandes Montesuma, Fred Ngolè Mboula, Antoine Souloumiac

    Abstract: This paper seeks to solve Multi-Source Domain Adaptation (MSDA), which aims to mitigate data distribution shifts when transferring knowledge from multiple labeled source domains to an unlabeled target domain. We propose a novel MSDA framework based on dictionary learning and optimal transport. We interpret each domain in MSDA as an empirical distribution. As such, we express each domain as a Wasse… ▽ More

    Submitted 8 November, 2023; v1 submitted 27 July, 2023; originally announced July 2023.

    Comments: 13 pages,8 figures,Published as a conference paper at the 26th European Conference on Artificial Intelligence; v2: corrected typos

  13. arXiv:2306.16156  [pdf, other

    cs.LG math.PR stat.ML

    Recent Advances in Optimal Transport for Machine Learning

    Authors: Eduardo Fernandes Montesuma, Fred Ngolè Mboula, Antoine Souloumiac

    Abstract: Recently, Optimal Transport has been proposed as a probabilistic framework in Machine Learning for comparing and manipulating probability distributions. This is rooted in its rich history and theory, and has offered new solutions to different problems in machine learning, such as generative modeling and transfer learning. In this survey we explore contributions of Optimal Transport for Machine Lea… ▽ More

    Submitted 21 August, 2024; v1 submitted 28 June, 2023; originally announced June 2023.

    Comments: 20 pages,15 figures,under review

  14. OpenDenoising: an Extensible Benchmark for Building Comparative Studies of Image Denoisers

    Authors: Florian Lemarchand, Eduardo Fernandes Montesuma, Maxime Pelcat, Erwan Nogues

    Abstract: Image denoising has recently taken a leap forward due to machine learning. However, image denoisers, both expert-based and learning-based, are mostly tested on well-behaved generated noises (usually Gaussian) rather than on real-life noises, making performance comparisons difficult in real-world conditions. This is especially true for learning-based denoisers which performance depends on training… ▽ More

    Submitted 18 October, 2019; originally announced October 2019.