Skip to main content

Showing 1–5 of 5 results for author: Doutreligne, M

.
  1. arXiv:2503.18025  [pdf, other

    cs.LG cs.AI stat.ML

    Decision from Suboptimal Classifiers: Excess Risk Pre- and Post-Calibration

    Authors: Alexandre Perez-Lebel, Gael Varoquaux, Sanmi Koyejo, Matthieu Doutreligne, Marine Le Morvan

    Abstract: Probabilistic classifiers are central for making informed decisions under uncertainty. Based on the maximum expected utility principle, optimal decision rules can be derived using the posterior class probabilities and misclassification costs. Yet, in practice only learned approximations of the oracle posterior probabilities are available. In this work, we quantify the excess risk (a.k.a. regret) i… ▽ More

    Submitted 23 March, 2025; originally announced March 2025.

    Journal ref: Proceedings of the 28th International Conference on Artificial Intelligence and Statistics (AISTATS) 2025, Mai Khao, Thailand. PMLR: Volume 258

  2. arXiv:2308.01605  [pdf, other

    stat.ME stat.ML

    Causal thinking for decision making on Electronic Health Records: why and how

    Authors: Matthieu Doutreligne, Tristan Struja, Judith Abecassis, Claire Morgand, Leo Anthony Celi, Gaël Varoquaux

    Abstract: Accurate predictions, as with machine learning, may not suffice to provide optimal healthcare for every patient. Indeed, prediction can be driven by shortcuts in the data, such as racial biases. Causal thinking is needed for data-driven decisions. Here, we give an introduction to the key elements, focusing on routinely-collected data, electronic health records (EHRs) and claims data. Using such da… ▽ More

    Submitted 11 December, 2024; v1 submitted 3 August, 2023; originally announced August 2023.

  3. arXiv:2302.07074  [pdf, other

    cs.CY

    Good practices for clinical data warehouse implementation: a case study in France

    Authors: Matthieu Doutreligne, Adeline Degremont, Pierre-Alain Jachiet, Antoine Lamer, Xavier Tannier

    Abstract: Real World Data (RWD) bears great promises to improve the quality of care. However, specific infrastructures and methodologies are required to derive robust knowledge and brings innovations to the patient. Drawing upon the national case study of the 32 French regional and university hospitals governance, we highlight key aspects of modern Clinical Data Warehouses (CDWs): governance, transparency,… ▽ More

    Submitted 7 March, 2023; v1 submitted 6 February, 2023; originally announced February 2023.

    Comments: 16 pages

  4. arXiv:2302.00370  [pdf, other

    stat.ML cs.LG

    How to select predictive models for causal inference?

    Authors: Matthieu Doutreligne, Gaël Varoquaux

    Abstract: As predictive models -- e.g., from machine learning -- give likely outcomes, they may be used to reason on the effect of an intervention, a causal-inference task. The increasing complexity of health data has opened the door to a plethora of models, but also the Pandora box of model selection: which of these models yield the most valid causal estimates? Here we highlight that classic machine-learni… ▽ More

    Submitted 16 May, 2023; v1 submitted 1 February, 2023; originally announced February 2023.

    Comments: 35 pages

  5. arXiv:1903.07879  [pdf

    cs.CL

    Hybrid Approaches for our Participation to the n2c2 Challenge on Cohort Selection for Clinical Trials

    Authors: Xavier Tannier, Nicolas Paris, Hugo Cisneros, Christel Daniel, Matthieu Doutreligne, Catherine Duclos, Nicolas Griffon, Claire Hassen-Khodja, Ivan Lerner, Adrien Parrot, Éric Sadou, Cyrina Saussol, Pascal Vaillant

    Abstract: Objective: Natural language processing can help minimize human intervention in identifying patients meeting eligibility criteria for clinical trials, but there is still a long way to go to obtain a general and systematic approach that is useful for researchers. We describe two methods taking a step in this direction and present their results obtained during the n2c2 challenge on cohort selection f… ▽ More

    Submitted 9 December, 2020; v1 submitted 19 March, 2019; originally announced March 2019.

    Comments: 15 pages