Skip to main content

Showing 1–19 of 19 results for author: Drouin, A

Searching in archive stat. Search in all archives.
.
  1. arXiv:2502.13132  [pdf, ps, other

    cs.LG cs.AI stat.ML

    Learning to Defer for Causal Discovery with Imperfect Experts

    Authors: Oscar Clivio, Divyat Mahajan, Perouz Taslakian, Sara Magliacane, Ioannis Mitliagkas, Valentina Zantedeschi, Alexandre Drouin

    Abstract: Integrating expert knowledge, e.g. from large language models, into causal discovery algorithms can be challenging when the knowledge is not guaranteed to be correct. Expert recommendations may contradict data-driven results, and their reliability can vary significantly depending on the domain or specific query. Existing methods based on soft constraints or inconsistencies in predicted causal rela… ▽ More

    Submitted 18 February, 2025; originally announced February 2025.

  2. arXiv:2412.01953  [pdf, other

    cs.LG stat.ME

    The Landscape of Causal Discovery Data: Grounding Causal Discovery in Real-World Applications

    Authors: Philippe Brouillard, Chandler Squires, Jonas Wahl, Konrad P. Kording, Karen Sachs, Alexandre Drouin, Dhanya Sridhar

    Abstract: Causal discovery aims to automatically uncover causal relationships from data, a capability with significant potential across many scientific disciplines. However, its real-world applications remain limited. Current methods often rely on unrealistic assumptions and are evaluated only on simple synthetic toy datasets, often with inadequate evaluation metrics. In this paper, we substantiate these cl… ▽ More

    Submitted 2 December, 2024; originally announced December 2024.

    Comments: 39 pages, 8 figures

  3. arXiv:2410.18959  [pdf, ps, other

    cs.LG cs.AI stat.ML

    Context is Key: A Benchmark for Forecasting with Essential Textual Information

    Authors: Andrew Robert Williams, Arjun Ashok, Étienne Marcotte, Valentina Zantedeschi, Jithendaraa Subramanian, Roland Riachi, James Requeima, Alexandre Lacoste, Irina Rish, Nicolas Chapados, Alexandre Drouin

    Abstract: Forecasting is a critical task in decision-making across numerous domains. While historical numerical data provide a start, they fail to convey the complete context for reliable and accurate predictions. Human forecasters frequently rely on additional information, such as background knowledge and constraints, which can efficiently be communicated through natural language. However, in spite of rece… ▽ More

    Submitted 5 June, 2025; v1 submitted 24 October, 2024; originally announced October 2024.

    Comments: ICML 2025. First two authors contributed equally

  4. arXiv:2404.05545  [pdf, other

    cs.LG cs.AI cs.CL stat.ME

    Evaluating Interventional Reasoning Capabilities of Large Language Models

    Authors: Tejas Kasetty, Divyat Mahajan, Gintare Karolina Dziugaite, Alexandre Drouin, Dhanya Sridhar

    Abstract: Numerous decision-making tasks require estimating causal effects under interventions on different parts of a system. As practitioners consider using large language models (LLMs) to automate decisions, studying their causal reasoning capabilities becomes crucial. A recent line of work evaluates LLMs ability to retrieve commonsense causal facts, but these evaluations do not sufficiently assess how L… ▽ More

    Submitted 22 December, 2024; v1 submitted 8 April, 2024; originally announced April 2024.

    Comments: 17 pages

  5. arXiv:2312.13876  [pdf, other

    cs.LG cs.CL stat.ML

    Capture the Flag: Uncovering Data Insights with Large Language Models

    Authors: Issam Laradji, Perouz Taslakian, Sai Rajeswar, Valentina Zantedeschi, Alexandre Lacoste, Nicolas Chapados, David Vazquez, Christopher Pal, Alexandre Drouin

    Abstract: The extraction of a small number of relevant insights from vast amounts of data is a crucial component of data-driven decision-making. However, accomplishing this task requires considerable technical skills, domain expertise, and human labor. This study explores the potential of using Large Language Models (LLMs) to automate the discovery of insights in data, leveraging recent advances in reasonin… ▽ More

    Submitted 21 December, 2023; originally announced December 2023.

    Comments: 14 pages, 1 figure, Foundation Models for Decision Making Workshop at NeurIPS 2023

  6. arXiv:2310.01327  [pdf, other

    cs.LG cs.AI stat.ML

    TACTiS-2: Better, Faster, Simpler Attentional Copulas for Multivariate Time Series

    Authors: Arjun Ashok, Étienne Marcotte, Valentina Zantedeschi, Nicolas Chapados, Alexandre Drouin

    Abstract: We introduce a new model for multivariate probabilistic time series prediction, designed to flexibly address a range of tasks including forecasting, interpolation, and their combinations. Building on copula theory, we propose a simplified objective for the recently-introduced transformer-based attentional copulas (TACTiS), wherein the number of distributional parameters now scales linearly with th… ▽ More

    Submitted 25 March, 2024; v1 submitted 2 October, 2023; originally announced October 2023.

    Comments: 28 pages, 15 figures, The Twelfth International Conference on Learning Representations (ICLR 2024)

  7. arXiv:2307.04988  [pdf, other

    cs.LG stat.ME

    Benchmarking Bayesian Causal Discovery Methods for Downstream Treatment Effect Estimation

    Authors: Chris Chinenye Emezue, Alexandre Drouin, Tristan Deleu, Stefan Bauer, Yoshua Bengio

    Abstract: The practical utility of causality in decision-making is widespread and brought about by the intertwining of causal discovery and causal inference. Nevertheless, a notable gap exists in the evaluation of causal discovery methods, where insufficient emphasis is placed on downstream inference. To address this gap, we evaluate seven established baseline causal discovery methods including a newly prop… ▽ More

    Submitted 30 July, 2023; v1 submitted 10 July, 2023; originally announced July 2023.

    Comments: Peer-reviewed and Accepted to ICML 2023 Workshop on Structured Probabilistic Inference & Generative Modeling

  8. arXiv:2306.04777  [pdf, other

    cs.LG stat.ME stat.ML

    Invariant Causal Set Covering Machines

    Authors: Thibaud Godon, Baptiste Bauvin, Pascal Germain, Jacques Corbeil, Alexandre Drouin

    Abstract: Rule-based models, such as decision trees, appeal to practitioners due to their interpretable nature. However, the learning algorithms that produce such models are often vulnerable to spurious associations and thus, they are not guaranteed to extract causally-relevant insights. In this work, we build on ideas from the invariant causal prediction literature to propose Invariant Causal Set Covering… ▽ More

    Submitted 21 March, 2025; v1 submitted 7 June, 2023; originally announced June 2023.

  9. arXiv:2304.09836  [pdf, other

    cs.LG stat.ML

    Regions of Reliability in the Evaluation of Multivariate Probabilistic Forecasts

    Authors: Étienne Marcotte, Valentina Zantedeschi, Alexandre Drouin, Nicolas Chapados

    Abstract: Multivariate probabilistic time series forecasts are commonly evaluated via proper scoring rules, i.e., functions that are minimal in expectation for the ground-truth distribution. However, this property is not sufficient to guarantee good discrimination in the non-asymptotic regime. In this paper, we provide the first systematic finite-sample study of proper scoring rules for time-series forecast… ▽ More

    Submitted 6 June, 2023; v1 submitted 19 April, 2023; originally announced April 2023.

    Comments: 47 pages, 37 figures, camera-ready version, Fortieth International Conference on Machine Learning (ICML 2023)

  10. arXiv:2202.03528  [pdf, other

    cs.LG stat.ML

    TACTiS: Transformer-Attentional Copulas for Time Series

    Authors: Alexandre Drouin, Étienne Marcotte, Nicolas Chapados

    Abstract: The estimation of time-varying quantities is a fundamental component of decision making in fields such as healthcare and finance. However, the practical utility of such estimates is limited by how accurately they quantify predictive uncertainty. In this work, we address the problem of estimating the joint predictive distribution of high-dimensional multivariate time series. We propose a versatile… ▽ More

    Submitted 27 June, 2022; v1 submitted 7 February, 2022; originally announced February 2022.

    Comments: 47 pages, 33 figures, camera-ready version, Thirty-ninth International Conference on Machine Learning (ICML 2022)

  11. arXiv:2107.10703  [pdf, other

    cs.LG cs.AI stat.ML

    Typing assumptions improve identification in causal discovery

    Authors: Philippe Brouillard, Perouz Taslakian, Alexandre Lacoste, Sebastien Lachapelle, Alexandre Drouin

    Abstract: Causal discovery from observational data is a challenging task that can only be solved up to a set of equivalent solutions, called an equivalence class. Such classes, which are often large in size, encode uncertainties about the orientation of some edges in the causal graph. In this work, we propose a new set of assumptions that constrain possible causal relationships based on the nature of variab… ▽ More

    Submitted 28 February, 2022; v1 submitted 22 July, 2021; originally announced July 2021.

    Comments: 30 pages, 13 figures, accepted for the 1st conference on Causal Learning and Reasoning (CLeaR), 2022

  12. arXiv:2010.11924  [pdf, other

    cs.LG stat.ML

    In Search of Robust Measures of Generalization

    Authors: Gintare Karolina Dziugaite, Alexandre Drouin, Brady Neal, Nitarshan Rajkumar, Ethan Caballero, Linbo Wang, Ioannis Mitliagkas, Daniel M. Roy

    Abstract: One of the principal scientific challenges in deep learning is explaining generalization, i.e., why the particular way the community now trains networks to achieve small training error also leads to small error on held-out data from the same population. It is widely appreciated that some worst-case theories -- such as those based on the VC dimension of the class of predictors induced by modern neu… ▽ More

    Submitted 20 January, 2021; v1 submitted 22 October, 2020; originally announced October 2020.

    Comments: 27 pages, 11 figures, 34th Conference on Neural Information Processing Systems (NeurIPS 2020), Vancouver, Canada

  13. arXiv:2007.01754  [pdf, other

    cs.LG stat.ML

    Differentiable Causal Discovery from Interventional Data

    Authors: Philippe Brouillard, Sébastien Lachapelle, Alexandre Lacoste, Simon Lacoste-Julien, Alexandre Drouin

    Abstract: Learning a causal directed acyclic graph from data is a challenging task that involves solving a combinatorial problem for which the solution is not always identifiable. A new line of work reformulates this problem as a continuous constrained optimization one, which is solved via the augmented Lagrangian method. However, most methods based on this idea do not make use of interventional data, which… ▽ More

    Submitted 3 November, 2020; v1 submitted 3 July, 2020; originally announced July 2020.

    Comments: Appears in: Advances in Neural Information Processing Systems 34 (NeurIPS 2020). 46 pages

    ACM Class: I.2.6; I.5.1

  14. arXiv:1801.07756  [pdf, other

    cs.LG stat.ML

    Deep Learning for Electromyographic Hand Gesture Signal Classification Using Transfer Learning

    Authors: Ulysse Côté-Allard, Cheikh Latyr Fall, Alexandre Drouin, Alexandre Campeau-Lecours, Clément Gosselin, Kyrre Glette, François Laviolette, Benoit Gosselin

    Abstract: In recent years, deep learning algorithms have become increasingly more prominent for their unparalleled ability to automatically learn discriminant features from large amounts of data. However, within the field of electromyography-based gesture recognition, deep learning algorithms are seldom employed as they require an unreasonable amount of effort from a single person, to generate tens of thous… ▽ More

    Submitted 25 January, 2019; v1 submitted 10 January, 2018; originally announced January 2018.

    Comments: Source code and datasets available: https://github.com/Giguelingueling/MyoArmbandDataset

  15. arXiv:1710.04234  [pdf, other

    stat.ML cs.DS cs.LG stat.AP

    Maximum Margin Interval Trees

    Authors: Alexandre Drouin, Toby Dylan Hocking, François Laviolette

    Abstract: Learning a regression function using censored or interval-valued output data is an important problem in fields such as genomics and medicine. The goal is to learn a real-valued prediction function, and the training output labels indicate an interval of possible values. Whereas most existing algorithms for this task are linear models, in this paper we investigate learning nonlinear tree models. We… ▽ More

    Submitted 27 October, 2017; v1 submitted 11 October, 2017; originally announced October 2017.

    Comments: Accepted for presentation at the 31st Conference on Neural Information Processing Systems (NIPS 2017), Long Beach, CA, USA

  16. arXiv:1612.01030  [pdf, other

    q-bio.GN cs.LG stat.ML

    Large scale modeling of antimicrobial resistance with interpretable classifiers

    Authors: Alexandre Drouin, Frédéric Raymond, Gaël Letarte St-Pierre, Mario Marchand, Jacques Corbeil, François Laviolette

    Abstract: Antimicrobial resistance is an important public health concern that has implications in the practice of medicine worldwide. Accurately predicting resistance phenotypes from genome sequences shows great promise in promoting better use of antimicrobial agents, by determining which antibiotics are likely to be effective in specific clinical cases. In healthcare, this would allow for the design of tre… ▽ More

    Submitted 3 December, 2016; originally announced December 2016.

    Comments: Peer-reviewed and accepted for presentation at the Machine Learning for Health Workshop, NIPS 2016, Barcelona, Spain

  17. arXiv:1505.06249  [pdf, other

    q-bio.GN cs.LG stat.ML

    Greedy Biomarker Discovery in the Genome with Applications to Antimicrobial Resistance

    Authors: Alexandre Drouin, Sébastien Giguère, Maxime Déraspe, François Laviolette, Mario Marchand, Jacques Corbeil

    Abstract: The Set Covering Machine (SCM) is a greedy learning algorithm that produces sparse classifiers. We extend the SCM for datasets that contain a huge number of features. The whole genetic material of living organisms is an example of such a case, where the number of feature exceeds 10^7. Three human pathogens were used to evaluate the performance of the SCM at predicting antimicrobial resistance. Our… ▽ More

    Submitted 22 May, 2015; originally announced May 2015.

    Comments: Peer-reviewed and accepted for an oral presentation in the Greed is Great workshop at the International Conference on Machine Learning, Lille, France, 2015

  18. arXiv:1412.1074  [pdf, other

    q-bio.GN cs.CE cs.LG stat.ML

    Learning interpretable models of phenotypes from whole genome sequences with the Set Covering Machine

    Authors: Alexandre Drouin, Sébastien Giguère, Vladana Sagatovich, Maxime Déraspe, François Laviolette, Mario Marchand, Jacques Corbeil

    Abstract: The increased affordability of whole genome sequencing has motivated its use for phenotypic studies. We address the problem of learning interpretable models for discrete phenotypes from whole genomes. We propose a general approach that relies on the Set Covering Machine and a k-mer representation of the genomes. We show results for the problem of predicting the resistance of Pseudomonas Aeruginosa… ▽ More

    Submitted 2 December, 2014; originally announced December 2014.

    Comments: Presented at Machine Learning in Computational Biology 2014, Montréal, Québec, Canada

  19. arXiv:1207.7253  [pdf, other

    q-bio.QM cs.LG q-bio.BM stat.ML

    Learning a peptide-protein binding affinity predictor with kernel ridge regression

    Authors: Sébastien Giguère, Mario Marchand, François Laviolette, Alexandre Drouin, Jacques Corbeil

    Abstract: We propose a specialized string kernel for small bio-molecules, peptides and pseudo-sequences of binding interfaces. The kernel incorporates physico-chemical properties of amino acids and elegantly generalize eight kernels, such as the Oligo, the Weighted Degree, the Blended Spectrum, and the Radial Basis Function. We provide a low complexity dynamic programming algorithm for the exact computation… ▽ More

    Submitted 31 July, 2012; originally announced July 2012.

    Comments: 22 pages, 4 figures, 5 tables

    MSC Class: 92B05 ACM Class: I.2.6; J.3; G.3; G.4; I.5.2

    Journal ref: BMC Bioinformatics 2013, 14:82