Skip to main content

Showing 1–7 of 7 results for author: Bouveyron, C

Searching in archive cs. Search in all archives.
.
  1. arXiv:2504.07711  [pdf, other

    cs.LG cs.AI

    Merging Embedded Topics with Optimal Transport for Online Topic Modeling on Data Streams

    Authors: Federica Granese, Benjamin Navet, Serena Villata, Charles Bouveyron

    Abstract: Topic modeling is a key component in unsupervised learning, employed to identify topics within a corpus of textual data. The rapid growth of social media generates an ever-growing volume of textual data daily, making online topic modeling methods essential for managing these data streams that continuously arrive over time. This paper introduces a novel approach to online topic modeling named Strea… ▽ More

    Submitted 10 April, 2025; originally announced April 2025.

    Comments: Paper under review

  2. arXiv:2309.02858  [pdf, other

    stat.ML cs.AI cs.IT cs.LG stat.ME

    Generalised Mutual Information: a Framework for Discriminative Clustering

    Authors: Louis Ohl, Pierre-Alexandre Mattei, Charles Bouveyron, Warith Harchaoui, Mickaël Leclercq, Arnaud Droit, Frédéric Precioso

    Abstract: In the last decade, recent successes in deep clustering majorly involved the Mutual Information (MI) as an unsupervised objective for training neural networks with increasing regularisations. While the quality of the regularisations have been largely discussed for improvements, little attention has been dedicated to the relevance of MI as a clustering objective. In this paper, we first highlight h… ▽ More

    Submitted 6 September, 2023; originally announced September 2023.

    Comments: Submitted for review at the IEEE Transactions on Pattern Analysis and Machine Intelligence. This article is an extension of an original NeurIPS 2022 article [arXiv:2210.06300]

    MSC Class: 62H30 ACM Class: G.3

  3. arXiv:2304.08242  [pdf, other

    cs.LG cs.CL cs.SI stat.ME

    The Deep Latent Position Topic Model for Clustering and Representation of Networks with Textual Edges

    Authors: Rémi Boutin, Pierre Latouche, Charles Bouveyron

    Abstract: Numerical interactions leading to users sharing textual content published by others are naturally represented by a network where the individuals are associated with the nodes and the exchanged texts with the edges. To understand those heterogeneous and complex data structures, clustering nodes into homogeneous groups as well as rendering a comprehensible visualisation of the data is mandatory. To… ▽ More

    Submitted 13 February, 2024; v1 submitted 14 April, 2023; originally announced April 2023.

    Comments: 29 pages including the appendix, 13 figures, 6 tables, journal paper

  4. arXiv:2302.03391  [pdf, other

    stat.ML cs.AI cs.LG stat.CO stat.ME

    Sparse and geometry-aware generalisation of the mutual information for joint discriminative clustering and feature selection

    Authors: Louis Ohl, Pierre-Alexandre Mattei, Charles Bouveyron, Mickaël Leclercq, Arnaud Droit, Frédéric Precioso

    Abstract: Feature selection in clustering is a hard task which involves simultaneously the discovery of relevant clusters as well as relevant variables with respect to these clusters. While feature selection algorithms are often model-based through optimised model selection or strong assumptions on the data distribution, we introduce a discriminative clustering model trying to maximise a geometry-aware gene… ▽ More

    Submitted 18 July, 2024; v1 submitted 7 February, 2023; originally announced February 2023.

    Comments: Published in Statistics and Computing, Volume 34, article number 155, (2024), https://doi.org/10.1007/s11222-024-10467-9

    MSC Class: 62H30 ACM Class: G.3

  5. arXiv:2210.06300  [pdf, other

    stat.ML cs.AI cs.IT cs.LG stat.ME

    Generalised Mutual Information for Discriminative Clustering

    Authors: Louis Ohl, Pierre-Alexandre Mattei, Charles Bouveyron, Warith Harchaoui, Mickaël Leclercq, Arnaud Droit, Frederic Precioso

    Abstract: In the last decade, recent successes in deep clustering majorly involved the mutual information (MI) as an unsupervised objective for training neural networks with increasing regularisations. While the quality of the regularisations have been largely discussed for improvements, little attention has been dedicated to the relevance of MI as a clustering objective. In this paper, we first highlight h… ▽ More

    Submitted 14 October, 2022; v1 submitted 12 October, 2022; originally announced October 2022.

    Comments: To be published in Neural Information Processing Systems 2022

    MSC Class: 62H30 ACM Class: G.3

  6. arXiv:2209.10097  [pdf, other

    cs.SI stat.ME

    Embedded Topics in the Stochastic Block Model

    Authors: Rémi Boutin, Charles Bouveyron, Pierre Latouche

    Abstract: Communication networks such as emails or social networks are now ubiquitous and their analysis has become a strategic field. In many applications, the goal is to automatically extract relevant information by looking at the nodes and their connections. Unfortunately, most of the existing methods focus on analysing the presence or absence of edges and textual data is often discarded. However, all co… ▽ More

    Submitted 25 July, 2023; v1 submitted 19 September, 2022; originally announced September 2022.

  7. arXiv:2106.03821  [pdf, other

    cs.SD cs.CL cs.CV eess.AS

    Active Speaker Detection as a Multi-Objective Optimization with Uncertainty-based Multimodal Fusion

    Authors: Baptiste Pouthier, Laurent Pilati, Leela K. Gudupudi, Charles Bouveyron, Frederic Precioso

    Abstract: It is now well established from a variety of studies that there is a significant benefit from combining video and audio data in detecting active speakers. However, either of the modalities can potentially mislead audiovisual fusion by inducing unreliable or deceptive information. This paper outlines active speaker detection as a multi-objective learning problem to leverage best of each modalities… ▽ More

    Submitted 15 September, 2021; v1 submitted 7 June, 2021; originally announced June 2021.

    Comments: In INTERSPEECH 2021

    Journal ref: Proc. Interspeech 2021, 2381-2385