Skip to main content

Showing 1–11 of 11 results for author: Gaussier, E

Searching in archive stat. Search in all archives.
.
  1. arXiv:2406.14033  [pdf, other

    stat.ML cs.LG

    Ensembles of Probabilistic Regression Trees

    Authors: Alexandre Seiller, Éric Gaussier, Emilie Devijver, Marianne Clausel, Sami Alkhoury

    Abstract: Tree-based ensemble methods such as random forests, gradient-boosted trees, and Bayesianadditive regression trees have been successfully used for regression problems in many applicationsand research studies. In this paper, we study ensemble versions of probabilisticregression trees that provide smooth approximations of the objective function by assigningeach observation to each region with respect… ▽ More

    Submitted 20 June, 2024; originally announced June 2024.

  2. arXiv:2307.15678  [pdf, other

    cs.LG cs.AI stat.AP stat.ME

    Case Studies of Causal Discovery from IT Monitoring Time Series

    Authors: Ali Aït-Bachir, Charles K. Assaad, Christophe de Bignicourt, Emilie Devijver, Simon Ferreira, Eric Gaussier, Hosein Mohanna, Lei Zan

    Abstract: Information technology (IT) systems are vital for modern businesses, handling data storage, communication, and process automation. Monitoring these systems is crucial for their proper functioning and efficiency, as it allows collecting extensive observational time series data for analysis. The interest in causal discovery is growing in IT monitoring systems as knowing causal relations between diff… ▽ More

    Submitted 28 July, 2023; originally announced July 2023.

    Comments: Accepted to the UAI 2023 Workshop on The History and Development of Search Methods for Causal Structure

  3. arXiv:2205.09422  [pdf, ps, other

    cs.AI stat.ML

    Inferring extended summary causal graphs from observational time series

    Authors: Charles K. Assaad, Emilie Devijver, Eric Gaussier

    Abstract: This study addresses the problem of learning an extended summary causal graph on time series. The algorithms we propose fit within the well-known constraint-based framework for causal discovery and make use of information-theoretic measures to determine (in)dependencies between time series. We first introduce generalizations of the causation entropy measure to any lagged or instantaneous relations… ▽ More

    Submitted 19 May, 2022; originally announced May 2022.

  4. arXiv:2003.11593  [pdf, other

    stat.ML cs.CL cs.LG

    Heavy-tailed Representations, Text Polarity Classification & Data Augmentation

    Authors: Hamid Jalalzai, Pierre Colombo, Chloé Clavel, Eric Gaussier, Giovanna Varni, Emmanuel Vignon, Anne Sabourin

    Abstract: The dominant approaches to text representation in natural language rely on learning embeddings on massive corpora which have convenient properties such as compositionality and distance preservation. In this paper, we develop a novel method to learn a heavy-tailed embedding with desirable regularity properties regarding the distributional tails, which allows to analyze the points far away from the… ▽ More

    Submitted 25 March, 2021; v1 submitted 25 March, 2020; originally announced March 2020.

    Journal ref: Advances in Neural Information Processing Systems (NeurIPS), Dec 2020

  5. arXiv:2002.11246  [pdf, other

    cs.LG stat.ML

    Supervised Categorical Metric Learning with Schatten p-Norms

    Authors: Xuhui Fan, Eric Gaussier

    Abstract: Metric learning has been successful in learning new metrics adapted to numerical datasets. However, its development on categorical data still needs further exploration. In this paper, we propose a method, called CPML for \emph{categorical projected metric learning}, that tries to efficiently~(i.e. less computational time and better prediction accuracy) address the problem of metric learning in cat… ▽ More

    Submitted 25 February, 2020; originally announced February 2020.

  6. arXiv:1906.01874  [pdf, ps, other

    cs.CL cs.LG stat.ML

    Terminology-based Text Embedding for Computing Document Similarities on Technical Content

    Authors: Hamid Mirisaee, Eric Gaussier, Cedric Lagnier, Agnes Guerraz

    Abstract: We propose in this paper a new, hybrid document embedding approach in order to address the problem of document similarities with respect to the technical content. To do so, we employ a state-of-the-art graph techniques to first extract the keyphrases (composite keywords) of documents and, then, use them to score the sentences. Using the ranked sentences, we propose two approaches to embed document… ▽ More

    Submitted 1 July, 2019; v1 submitted 5 June, 2019; originally announced June 2019.

  7. arXiv:1810.11698  [pdf, ps, other

    cs.LG stat.ML

    Uncertain Trees: Dealing with Uncertain Inputs in Regression Trees

    Authors: Myriam Tami, Marianne Clausel, Emilie Devijver, Adrien Dulac, Eric Gaussier, Stefan Janaqi, Meriam Chebre

    Abstract: Tree-based ensemble methods, as Random Forests and Gradient Boosted Trees, have been successfully used for regression in many applications and research studies. Furthermore, these methods have been extended in order to deal with uncertainty in the output variable, using for example a quantile loss in Random Forests (Meinshausen, 2006). To the best of our knowledge, no extension has been provided y… ▽ More

    Submitted 18 November, 2018; v1 submitted 27 October, 2018; originally announced October 2018.

    Comments: 9 pages

  8. arXiv:1806.10069  [pdf, other

    cs.LG stat.ML

    Deep $k$-Means: Jointly clustering with $k$-Means and learning representations

    Authors: Maziar Moradi Fard, Thibaut Thonet, Eric Gaussier

    Abstract: We study in this paper the problem of jointly clustering and learning representations. As several previous studies have shown, learning representations that are both faithful to the data to be clustered and adapted to the clustering algorithm can lead to better clustering performance, all the more so that the two tasks are performed jointly. We propose here such an approach for $k$-Means clusterin… ▽ More

    Submitted 12 December, 2018; v1 submitted 26 June, 2018; originally announced June 2018.

    Comments: Under consideration at Pattern Recognition Letters

  9. arXiv:1709.05666  [pdf, other

    cs.LG cs.AI stat.ML

    On Inductive Abilities of Latent Factor Models for Relational Learning

    Authors: Théo Trouillon, Éric Gaussier, Christopher R. Dance, Guillaume Bouchard

    Abstract: Latent factor models are increasingly popular for modeling multi-relational knowledge graphs. By their vectorial nature, it is not only hard to interpret why this class of models works so well, but also to understand where they fail and how they might be improved. We conduct an experimental survey of state-of-the-art models, not towards a purely comparative end, but as a means to get insight about… ▽ More

    Submitted 17 September, 2017; originally announced September 2017.

    Comments: 30+3 pages, submitted to the Journal of Artificial Intelligence Research

  10. arXiv:1702.06879  [pdf, other

    cs.AI cs.LG math.SP stat.ML

    Knowledge Graph Completion via Complex Tensor Factorization

    Authors: Théo Trouillon, Christopher R. Dance, Johannes Welbl, Sebastian Riedel, Éric Gaussier, Guillaume Bouchard

    Abstract: In statistical relational learning, knowledge graph completion deals with automatically understanding the structure of large knowledge graphs---labeled directed graphs---and predicting missing relationships---labeled edges. State-of-the-art embedding models propose different trade-offs between modeling expressiveness, and time and space complexity. We reconcile both expressiveness and complexity t… ▽ More

    Submitted 26 November, 2017; v1 submitted 22 February, 2017; originally announced February 2017.

    Comments: 38 pages, accepted in JMLR. This is an extended version of the article "Complex embeddings for simple link prediction" (ICML 2016)

  11. arXiv:1606.06357  [pdf, other

    cs.AI cs.LG stat.ML

    Complex Embeddings for Simple Link Prediction

    Authors: Théo Trouillon, Johannes Welbl, Sebastian Riedel, Éric Gaussier, Guillaume Bouchard

    Abstract: In statistical relational learning, the link prediction problem is key to automatically understand the structure of large knowledge bases. As in previous studies, we propose to solve this problem through latent factorization. However, here we make use of complex valued embeddings. The composition of complex embeddings can handle a large variety of binary relations, among them symmetric and antisym… ▽ More

    Submitted 20 June, 2016; originally announced June 2016.

    Comments: 10+2 pages, accepted at ICML 2016