Skip to main content

Showing 1–8 of 8 results for author: Fawkes, J

.
  1. arXiv:2503.14795  [pdf, other

    stat.ML cs.LG stat.ME

    The Hardness of Validating Observational Studies with Experimental Data

    Authors: Jake Fawkes, Michael O'Riordan, Athanasios Vlontzos, Oriol Corcoll, CiarĂ¡n Mark Gilligan-Lee

    Abstract: Observational data is often readily available in large quantities, but can lead to biased causal effect estimates due to the presence of unobserved confounding. Recent works attempt to remove this bias by supplementing observational data with experimental data, which, when available, is typically on a smaller scale due to the time and cost involved in running a randomised controlled trial. In this… ▽ More

    Submitted 18 March, 2025; originally announced March 2025.

    Comments: Published at AISTATS 2025

  2. arXiv:2410.09600  [pdf, other

    cs.LG cs.CY

    The Fragility of Fairness: Causal Sensitivity Analysis for Fair Machine Learning

    Authors: Jake Fawkes, Nic Fishman, Mel Andrews, Zachary C. Lipton

    Abstract: Fairness metrics are a core tool in the fair machine learning literature (FairML), used to determine that ML models are, in some sense, "fair". Real-world data, however, are typically plagued by various measurement biases and other violated assumptions, which can render fairness assessments meaningless. We adapt tools from causal sensitivity analysis to the FairML context, providing a general fram… ▽ More

    Submitted 15 October, 2024; v1 submitted 12 October, 2024; originally announced October 2024.

    Comments: Published at Neurips 2024 in the Dataset and Benchmarks Track

  3. arXiv:2409.07215  [pdf, other

    stat.ML cs.CR cs.LG

    Is merging worth it? Securely evaluating the information gain for causal dataset acquisition

    Authors: Jake Fawkes, Lucile Ter-Minassian, Desi Ivanova, Uri Shalit, Chris Holmes

    Abstract: Merging datasets across institutions is a lengthy and costly procedure, especially when it involves private information. Data hosts may therefore want to prospectively gauge which datasets are most beneficial to merge with, without revealing sensitive information. For causal estimation this is particularly challenging as the value of a merge will depend not only on the reduction in epistemic uncer… ▽ More

    Submitted 7 March, 2025; v1 submitted 11 September, 2024; originally announced September 2024.

    Comments: Published at AISTATS 2025

  4. arXiv:2405.06582  [pdf, other

    cs.LG cs.CY stat.ML

    The Role of Learning Algorithms in Collective Action

    Authors: Omri Ben-Dov, Jake Fawkes, Samira Samadi, Amartya Sanyal

    Abstract: Collective action in machine learning is the study of the control that a coordinated group can have over machine learning algorithms. While previous research has concentrated on assessing the impact of collectives against Bayes (sub-)optimal classifiers, this perspective is limited in that it does not account for the choice of learning algorithm. Since classifiers seldom behave like Bayes classifi… ▽ More

    Submitted 4 June, 2024; v1 submitted 10 May, 2024; originally announced May 2024.

    Comments: Accepted at the International Conference in Machine Learning (ICML), 2024

  5. arXiv:2307.08519  [pdf, ps, other

    cs.LG stat.ML

    Results on Counterfactual Invariance

    Authors: Jake Fawkes, Robin J. Evans

    Abstract: In this paper we provide a theoretical analysis of counterfactual invariance. We present a variety of existing definitions, study how they relate to each other and what their graphical implications are. We then turn to the current major question surrounding counterfactual invariance, how does it relate to conditional independence? We show that whilst counterfactual invariance implies conditional i… ▽ More

    Submitted 17 July, 2023; originally announced July 2023.

    Comments: 5 pages with 6 pages of supplementary. Accepted at the ICML 2023 workshop on Spurious Correlations, Invariance and Stability

  6. arXiv:2301.11214  [pdf, other

    stat.ML cs.LG

    Returning The Favour: When Regression Benefits From Probabilistic Causal Knowledge

    Authors: Shahine Bouabid, Jake Fawkes, Dino Sejdinovic

    Abstract: A directed acyclic graph (DAG) provides valuable prior knowledge that is often discarded in regression tasks in machine learning. We show that the independences arising from the presence of collider structures in DAGs provide meaningful inductive biases, which constrain the regression hypothesis space and improve predictive performance. We introduce collider regression, a framework to incorporate… ▽ More

    Submitted 21 June, 2023; v1 submitted 26 January, 2023; originally announced January 2023.

  7. arXiv:2212.04922  [pdf, other

    stat.ML cs.LG

    Doubly Robust Kernel Statistics for Testing Distributional Treatment Effects

    Authors: Jake Fawkes, Robert Hu, Robin J. Evans, Dino Sejdinovic

    Abstract: With the widespread application of causal inference, it is increasingly important to have tools which can test for the presence of causal effects in a diverse array of circumstances. In this vein we focus on the problem of testing for \emph{distributional} causal effects, where the treatment affects not just the mean, but also higher order moments of the distribution, as well as multidimensional o… ▽ More

    Submitted 7 November, 2023; v1 submitted 9 December, 2022; originally announced December 2022.

    Comments: 10 pages, Preprint

  8. arXiv:2202.13774  [pdf, ps, other

    stat.ML cs.LG

    Selection, Ignorability and Challenges With Causal Fairness

    Authors: Jake Fawkes, Robin Evans, Dino Sejdinovic

    Abstract: In this paper we look at popular fairness methods that use causal counterfactuals. These methods capture the intuitive notion that a prediction is fair if it coincides with the prediction that would have been made if someone's race, gender or religion were counterfactually different. In order to achieve this, we must have causal models that are able to capture what someone would be like if we were… ▽ More

    Submitted 2 March, 2022; v1 submitted 28 February, 2022; originally announced February 2022.

    Comments: To appear in Causal Learning and Reasoning 2022. 13 pages main text and 8 pages of appendices

    Report number: PMLR 177