Skip to main content

Showing 1–33 of 33 results for author: Flach, P

Searching in archive cs. Search in all archives.
.
  1. Explaining a probabilistic prediction on the simplex with Shapley compositions

    Authors: Paul-Gauthier Noé, Miquel Perelló-Nieto, Jean-François Bonastre, Peter Flach

    Abstract: Originating in game theory, Shapley values are widely used for explaining a machine learning model's prediction by quantifying the contribution of each feature's value to the prediction. This requires a scalar prediction as in binary classification, whereas a multiclass probabilistic prediction is a discrete probability distribution, living on a multidimensional simplex. In such a multiclass setti… ▽ More

    Submitted 12 February, 2025; v1 submitted 2 August, 2024; originally announced August 2024.

    Comments: Published in ECAI2024's proceedings

  2. arXiv:2307.01777  [pdf, other

    cs.LG

    Shapley Sets: Feature Attribution via Recursive Function Decomposition

    Authors: Torty Sivill, Peter Flach

    Abstract: Despite their ubiquitous use, Shapley value feature attributions can be misleading due to feature interaction in both model and data. We propose an alternative attribution approach, Shapley Sets, which awards value to sets of features. Shapley Sets decomposes the underlying model into non-separable variable groups using a recursive function decomposition algorithm with log linear complexity in the… ▽ More

    Submitted 4 July, 2023; originally announced July 2023.

  3. arXiv:2305.11605  [pdf, other

    cs.SD cs.AI cs.LG eess.AS

    MIDI-Draw: Sketching to Control Melody Generation

    Authors: Tashi Namgyal, Peter Flach, Raul Santos-Rodriguez

    Abstract: We describe a proof-of-principle implementation of a system for drawing melodies that abstracts away from a note-level input representation via melodic contours. The aim is to allow users to express their musical intentions without requiring prior knowledge of how notes fit together melodiously. Current approaches to controllable melody generation often require users to choose parameters that are… ▽ More

    Submitted 19 May, 2023; originally announced May 2023.

    Comments: Late-Breaking / Demo Session Extended Abstract, ISMIR 2022 Conference

  4. arXiv:2302.02706  [pdf, other

    cs.LG cs.HC

    When the Ground Truth is not True: Modelling Human Biases in Temporal Annotations

    Authors: Taku Yamagata, Emma L. Tonkin, Benjamin Arana Sanchez, Ian Craddock, Miquel Perello Nieto, Raul Santos-Rodriguez, Weisong Yang, Peter Flach

    Abstract: In supervised learning, low quality annotations lead to poorly performing classification and detection models, while also rendering evaluation unreliable. This is particularly apparent on temporal data, where annotation quality is affected by multiple factors. For example, in the post-hoc self-reporting of daily activities, cognitive biases are one of the most common ingredients. In particular, re… ▽ More

    Submitted 6 February, 2023; originally announced February 2023.

  5. What and How of Machine Learning Transparency: Building Bespoke Explainability Tools with Interoperable Algorithmic Components

    Authors: Kacper Sokol, Alexander Hepburn, Raul Santos-Rodriguez, Peter Flach

    Abstract: Explainability techniques for data-driven predictive models based on artificial intelligence and machine learning algorithms allow us to better understand the operation of such systems and help to hold them accountable. New transparency approaches are developed at breakneck speed, enabling us to peek inside these black boxes and interpret their decisions. Many of these techniques are introduced as… ▽ More

    Submitted 8 September, 2022; originally announced September 2022.

    Comments: Tutorial webpage: https://events.fat-forensics.org/2020_ecml-pkdd

  6. arXiv:2209.03805  [pdf, other

    cs.LG cs.AI cs.CY

    FAT Forensics: A Python Toolbox for Implementing and Deploying Fairness, Accountability and Transparency Algorithms in Predictive Systems

    Authors: Kacper Sokol, Alexander Hepburn, Rafael Poyiadzi, Matthew Clifford, Raul Santos-Rodriguez, Peter Flach

    Abstract: Predictive systems, in particular machine learning algorithms, can take important, and sometimes legally binding, decisions about our everyday life. In most cases, however, these systems and decisions are neither regulated nor certified. Given the potential harm that these algorithms can cause, their qualities such as fairness, accountability and transparency (FAT) are of paramount importance. To… ▽ More

    Submitted 8 September, 2022; originally announced September 2022.

    Journal ref: Journal of Open Source Software, 5(49), 1904 (2020)

  7. Simply Logical -- Intelligent Reasoning by Example (Fully Interactive Online Edition)

    Authors: Peter Flach, Kacper Sokol

    Abstract: "Simply Logical -- Intelligent Reasoning by Example" by Peter Flach was first published by John Wiley in 1994. It could be purchased as book-only or with a 3.5 inch diskette containing the SWI-Prolog programmes printed in the book (for various operating systems). In 2007 the copyright reverted back to the author at which point the book and programmes were made freely available online; the print ve… ▽ More

    Submitted 14 August, 2022; originally announced August 2022.

    Comments: The online edition is available at https://book.simply-logical.space/

  8. arXiv:2203.16282  [pdf, other

    cs.LG

    The Weak Supervision Landscape

    Authors: Rafael Poyiadzi, Daniel Bacaicoa-Barber, Jesus Cid-Sueiro, Miquel Perello-Nieto, Peter Flach, Raul Santos-Rodriguez

    Abstract: Many ways of annotating a dataset for machine learning classification tasks that go beyond the usual class labels exist in practice. These are of interest as they can simplify or facilitate the collection of annotations, while not greatly affecting the resulting machine learning model. Many of these fall under the umbrella term of weak labels or annotations. However, it is not always clear how dif… ▽ More

    Submitted 30 March, 2022; originally announced March 2022.

  9. arXiv:2112.14466  [pdf, other

    cs.AI cs.LG stat.ML

    Explainability Is in the Mind of the Beholder: Establishing the Foundations of Explainable Artificial Intelligence

    Authors: Kacper Sokol, Peter Flach

    Abstract: Explainable artificial intelligence and interpretable machine learning are research domains growing in importance. Yet, the underlying concepts remain somewhat elusive and lack generally agreed definitions. While recent inspiration from social sciences has refocused the work on needs and expectations of human recipients, the field still misses a concrete conceptualisation. We take steps towards ad… ▽ More

    Submitted 8 September, 2022; v1 submitted 29 December, 2021; originally announced December 2021.

  10. Classifier Calibration: A survey on how to assess and improve predicted class probabilities

    Authors: Telmo Silva Filho, Hao Song, Miquel Perello-Nieto, Raul Santos-Rodriguez, Meelis Kull, Peter Flach

    Abstract: This paper provides both an introduction to and a detailed overview of the principles and practice of classifier calibration. A well-calibrated classifier correctly quantifies the level of uncertainty or confidence associated with its instance-wise predictions. This is essential for critical applications, optimal decision making, cost-sensitive classification, and for some types of context change.… ▽ More

    Submitted 16 February, 2023; v1 submitted 19 December, 2021; originally announced December 2021.

    Comments: Machine Learning (2023)

  11. arXiv:2111.04972  [pdf, other

    cs.LG

    Risk Sensitive Model-Based Reinforcement Learning using Uncertainty Guided Planning

    Authors: Stefan Radic Webster, Peter Flach

    Abstract: Identifying uncertainty and taking mitigating actions is crucial for safe and trustworthy reinforcement learning agents, especially when deployed in high-risk environments. In this paper, risk sensitivity is promoted in a model-based reinforcement learning algorithm by exploiting the ability of a bootstrap ensemble of dynamics models to estimate environment epistemic uncertainty. We propose uncert… ▽ More

    Submitted 9 November, 2021; originally announced November 2021.

    Comments: Safe RL Workshop NeurIPS 2021

  12. arXiv:2107.06639  [pdf, other

    cs.PL cs.AI cs.LG

    You Only Write Thrice: Creating Documents, Computational Notebooks and Presentations From a Single Source

    Authors: Kacper Sokol, Peter Flach

    Abstract: Academic trade requires juggling multiple variants of the same content published in different formats: manuscripts, presentations, posters and computational notebooks. The need to track versions to accommodate for the write--review--rebut--revise life-cycle adds another layer of complexity. We propose to significantly reduce this burden by maintaining a single source document in a version-controll… ▽ More

    Submitted 2 July, 2021; originally announced July 2021.

    Comments: Published at Rethinking ML Papers -- ICLR 2021 Workshop. OpenReview: https://openreview.net/forum?id=i4zpuNRiU4G Exhibit: https://so-cool.github.io/you-only-write-thrice/

  13. arXiv:2103.05276  [pdf, other

    stat.ML cs.LG

    Continual Density Ratio Estimation in an Online Setting

    Authors: Yu Chen, Song Liu, Tom Diethe, Peter Flach

    Abstract: In online applications with streaming data, awareness of how far the training or test set has shifted away from the original dataset can be crucial to the performance of the model. However, we may not have access to historical samples in the data stream. To cope with such situations, we propose a novel method, Continual Density Ratio Estimation (CDRE), for estimating density ratios between the ini… ▽ More

    Submitted 9 March, 2021; originally announced March 2021.

  14. arXiv:2010.06266  [pdf, other

    cs.LG

    Model-Based Reinforcement Learning for Type 1Diabetes Blood Glucose Control

    Authors: Taku Yamagata, Aisling O'Kane, Amid Ayobi, Dmitri Katz, Katarzyna Stawarz, Paul Marshall, Peter Flach, Raúl Santos-Rodríguez

    Abstract: In this paper we investigate the use of model-based reinforcement learning to assist people with Type 1 Diabetes with insulin dose decisions. The proposed architecture consists of multiple Echo State Networks to predict blood glucose levels combined with Model Predictive Controller for planning. Echo State Network is a version of recurrent neural networks which allows us to learn long term depende… ▽ More

    Submitted 13 October, 2020; originally announced October 2020.

    Comments: Presented at ECAI 2020 SP4HC Workshop

  15. arXiv:2008.07007  [pdf, other

    cs.LG cs.AI stat.ML

    Interpretable Representations in Explainable AI: From Theory to Practice

    Authors: Kacper Sokol, Peter Flach

    Abstract: Interpretable representations are the backbone of many explainers that target black-box predictive systems based on artificial intelligence and machine learning algorithms. They translate the low-level data representation necessary for good predictive performance into high-level human-intelligible concepts used to convey the explanatory insights. Notably, the explanation type and its cognitive com… ▽ More

    Submitted 26 April, 2024; v1 submitted 16 August, 2020; originally announced August 2020.

    Comments: Published in the *Special Issue on Explainable and Interpretable Machine Learning and Data Mining* of the Springer *Data Mining and Knowledge Discovery* journal

  16. arXiv:2006.11234  [pdf, other

    stat.ML cs.LG

    Semi-Discriminative Representation Loss for Online Continual Learning

    Authors: Yu Chen, Tom Diethe, Peter Flach

    Abstract: The use of episodic memory in continual learning has demonstrated effectiveness for alleviating catastrophic forgetting. In recent studies, gradient-based approaches have been developed to make more efficient use of compact episodic memory. Such approaches refine the gradients resulting from new samples by those from memorized samples, aiming to reduce the diversity of gradients from different tas… ▽ More

    Submitted 14 April, 2022; v1 submitted 19 June, 2020; originally announced June 2020.

  17. arXiv:2005.01427  [pdf, other

    cs.LG cs.AI stat.ML

    LIMEtree: Consistent and Faithful Surrogate Explanations of Multiple Classes

    Authors: Kacper Sokol, Peter Flach

    Abstract: Explainable artificial intelligence provides tools to better understand predictive models and their decisions, but many such methods are limited to producing insights with respect to a single class. When generating explanations for several classes, reasoning over them to obtain a comprehensive view may be difficult since they can present competing or contradictory evidence. To address this challen… ▽ More

    Submitted 26 February, 2025; v1 submitted 4 May, 2020; originally announced May 2020.

  18. arXiv:2001.09734  [pdf, other

    cs.LG cs.AI stat.ML

    One Explanation Does Not Fit All: The Promise of Interactive Explanations for Machine Learning Transparency

    Authors: Kacper Sokol, Peter Flach

    Abstract: The need for transparency of predictive systems based on Machine Learning algorithms arises as a consequence of their ever-increasing proliferation in the industry. Whenever black-box algorithmic predictions influence human affairs, the inner workings of these algorithms should be scrutinised and their decisions explained to the relevant stakeholders, including the system engineers, the system's o… ▽ More

    Submitted 27 January, 2020; originally announced January 2020.

    Comments: Published in the Kunstliche Intelligenz journal, special issue on Challenges in Interactive Machine Learning

  19. arXiv:1912.05100  [pdf

    cs.LG cs.AI stat.ML

    Explainability Fact Sheets: A Framework for Systematic Assessment of Explainable Approaches

    Authors: Kacper Sokol, Peter Flach

    Abstract: Explanations in Machine Learning come in many forms, but a consensus regarding their desired properties is yet to emerge. In this paper we introduce a taxonomy and a set of descriptors that can be used to characterise and systematically assess explainable systems along five key dimensions: functional, operational, usability, safety and validation. In order to design a comprehensive and representat… ▽ More

    Submitted 10 December, 2019; originally announced December 2019.

    Comments: Conference on Fairness, Accountability, and Transparency (FAT* '20), January 27-30, 2020, Barcelona, Spain

  20. arXiv:1910.13016  [pdf, other

    cs.LG stat.ML

    bLIMEy: Surrogate Prediction Explanations Beyond LIME

    Authors: Kacper Sokol, Alexander Hepburn, Raul Santos-Rodriguez, Peter Flach

    Abstract: Surrogate explainers of black-box machine learning predictions are of paramount importance in the field of eXplainable Artificial Intelligence since they can be applied to any type of data (images, text and tabular), are model-agnostic and are post-hoc (i.e., can be retrofitted). The Local Interpretable Model-agnostic Explanations (LIME) algorithm is often mistakenly unified with a more general fr… ▽ More

    Submitted 28 October, 2019; originally announced October 2019.

    Comments: 2019 Workshop on Human-Centric Machine Learning (HCML 2019); 33rd Conference on Neural Information Processing Systems (NeurIPS 2019), Vancouver, Canada

  21. arXiv:1910.12656  [pdf, other

    cs.LG stat.ML

    Beyond temperature scaling: Obtaining well-calibrated multiclass probabilities with Dirichlet calibration

    Authors: Meelis Kull, Miquel Perello-Nieto, Markus Kängsepp, Telmo Silva Filho, Hao Song, Peter Flach

    Abstract: Class probabilities predicted by most multiclass classifiers are uncalibrated, often tending towards over-confidence. With neural networks, calibration can be improved by temperature scaling, a method to learn a single corrective multiplicative factor for inputs to the last softmax layer. On non-neural models the existing methods apply binary calibration in a pairwise or one-vs-rest fashion. We… ▽ More

    Submitted 28 October, 2019; originally announced October 2019.

    Comments: Accepted for presentation at NeurIPS 2019

  22. FACE: Feasible and Actionable Counterfactual Explanations

    Authors: Rafael Poyiadzi, Kacper Sokol, Raul Santos-Rodriguez, Tijl De Bie, Peter Flach

    Abstract: Work in Counterfactual Explanations tends to focus on the principle of "the closest possible world" that identifies small changes leading to the desired outcome. In this paper we argue that while this approach might initially seem intuitively appealing it exhibits shortcomings not addressed in the current literature. First, a counterfactual example generated by the state-of-the-art systems is not… ▽ More

    Submitted 24 February, 2020; v1 submitted 20 September, 2019; originally announced September 2019.

    Comments: Presented at AAAI/ACM Conference on AI, Ethics, and Society 2020

  23. arXiv:1909.05167  [pdf, other

    cs.LG cs.AI cs.CY stat.ML

    FAT Forensics: A Python Toolbox for Algorithmic Fairness, Accountability and Transparency

    Authors: Kacper Sokol, Raul Santos-Rodriguez, Peter Flach

    Abstract: Today, artificial intelligence systems driven by machine learning algorithms can be in a position to take important, and sometimes legally binding, decisions about our everyday lives. In many cases, however, these systems and their actions are neither regulated nor certified. To help counter the potential harm that such algorithms can cause we developed an open source toolbox that can analyse sele… ▽ More

    Submitted 25 August, 2022; v1 submitted 11 September, 2019; originally announced September 2019.

    Comments: Homepage: https://fat-forensics.org/ Source Code: https://github.com/fat-forensics/fat-forensics/

  24. arXiv:1908.02858  [pdf, other

    cs.LG eess.SY stat.ML

    HyperStream: a Workflow Engine for Streaming Data

    Authors: Tom Diethe, Meelis Kull, Niall Twomey, Kacper Sokol, Hao Song, Miquel Perello-Nieto, Emma Tonkin, Peter Flach

    Abstract: This paper describes HyperStream, a large-scale, flexible and robust software package, written in the Python language, for processing streaming data with workflow creation capabilities. HyperStream overcomes the limitations of other computational engines and provides high-level interfaces to execute complex nesting, fusion, and prediction both in online and offline forms in streaming environments.… ▽ More

    Submitted 7 August, 2019; originally announced August 2019.

  25. arXiv:1905.06023  [pdf, other

    stat.ML cs.AI cs.LG

    Distribution Calibration for Regression

    Authors: Hao Song, Tom Diethe, Meelis Kull, Peter Flach

    Abstract: We are concerned with obtaining well-calibrated output distributions from regression models. Such distributions allow us to quantify the uncertainty that the model has regarding the predicted target value. We introduce the novel concept of distribution calibration, and demonstrate its advantages over the existing definition of quantile calibration. We further propose a post-hoc approach to improvi… ▽ More

    Submitted 15 May, 2019; originally announced May 2019.

    Comments: ICML 2019, 10 pages

  26. arXiv:1903.04016  [pdf, other

    stat.ML cs.LG

    $β^3$-IRT: A New Item Response Model and its Applications

    Authors: Yu Chen, Telmo Silva Filho, Ricardo B. C. Prudêncio, Tom Diethe, Peter Flach

    Abstract: Item Response Theory (IRT) aims to assess latent abilities of respondents based on the correctness of their answers in aptitude test items with different difficulty levels. In this paper, we propose the $β^3$-IRT model, which models continuous responses and can generate a much enriched family of Item Characteristic Curve (ICC). In experiments we applied the proposed model to data from an online ex… ▽ More

    Submitted 3 June, 2019; v1 submitted 10 March, 2019; originally announced March 2019.

    Journal ref: AISTATS 2019

  27. arXiv:1806.07690  [pdf, other

    stat.ML cs.LG

    Non-Parametric Calibration of Probabilistic Regression

    Authors: Hao Song, Meelis Kull, Peter Flach

    Abstract: The task of calibration is to retrospectively adjust the outputs from a machine learning model to provide better probability estimates on the target variable. While calibration has been investigated thoroughly in classification, it has not yet been well-established for regression tasks. This paper considers the problem of calibrating a probabilistic regression model to improve the estimated probab… ▽ More

    Submitted 20 June, 2018; originally announced June 2018.

  28. arXiv:1709.09003  [pdf, other

    cs.DB

    CASP-DM: Context Aware Standard Process for Data Mining

    Authors: Fernando Martínez-Plumed, Lidia Contreras-Ochando, Cèsar Ferri, Peter Flach, José Hernández-Orallo, Meelis Kull, Nicolas Lachiche, María José Ramírez-Quintana

    Abstract: We propose an extension of the Cross Industry Standard Process for Data Mining (CRISPDM) which addresses specific challenges of machine learning and data mining for context and model reuse handling. This new general context-aware process model is mapped with CRISP-DM reference model proposing some new or enhanced outputs.

    Submitted 19 September, 2017; originally announced September 2017.

  29. arXiv:1702.01209  [pdf, other

    stat.ML cs.HC

    Probabilistic Sensor Fusion for Ambient Assisted Living

    Authors: Tom Diethe, Niall Twomey, Meelis Kull, Peter Flach, Ian Craddock

    Abstract: There is a widely-accepted need to revise current forms of health-care provision, with particular interest in sensing systems in the home. Given a multiple-modality sensor platform with heterogeneous network connectivity, as is under development in the Sensor Platform for HEalthcare in Residential Environment (SPHERE) Interdisciplinary Research Collaboration (IRC), we face specific challenges rela… ▽ More

    Submitted 3 February, 2017; originally announced February 2017.

    Comments: Journal article. 19 pages; 7 figures

  30. arXiv:1603.00797  [pdf, other

    cs.CY cs.HC

    The SPHERE Challenge: Activity Recognition with Multimodal Sensor Data

    Authors: Niall Twomey, Tom Diethe, Meelis Kull, Hao Song, Massimo Camplani, Sion Hannuna, Xenofon Fafoutis, Ni Zhu, Pete Woznowski, Peter Flach, Ian Craddock

    Abstract: This paper outlines the Sensor Platform for HEalthcare in Residential Environment (SPHERE) project and details the SPHERE challenge that will take place in conjunction with European Conference on Machine Learning and Principles and Practice of Knowledge Discovery (ECML-PKDD) between March and July 2016. The SPHERE challenge is an activity recognition competition where predictions are made from vid… ▽ More

    Submitted 17 March, 2016; v1 submitted 2 March, 2016; originally announced March 2016.

    Comments: Paper describing dataset. 11 pages; 4 figures

  31. arXiv:1112.2640  [pdf, other

    cs.AI

    Threshold Choice Methods: the Missing Link

    Authors: José Hernández-Orallo, Peter Flach, Cèsar Ferri

    Abstract: Many performance metrics have been introduced for the evaluation of classification performance, with different origins and niches of application: accuracy, macro-accuracy, area under the ROC curve, the ROC convex hull, the absolute error, and the Brier score (with its decomposition into refinement and calibration). One way of understanding the relation among some of these metrics is the use of var… ▽ More

    Submitted 28 January, 2012; v1 submitted 12 December, 2011; originally announced December 2011.

  32. arXiv:1111.2111  [pdf, other

    cs.DS cs.LG

    Generic Multiplicative Methods for Implementing Machine Learning Algorithms on MapReduce

    Authors: Song Liu, Peter Flach, Nello Cristianini

    Abstract: In this paper we introduce a generic model for multiplicative algorithms which is suitable for the MapReduce parallel programming paradigm. We implement three typical machine learning algorithms to demonstrate how similarity comparison, gradient descent, power method and other classic learning techniques fit this model well. Two versions of large-scale matrix multiplication are discussed in this p… ▽ More

    Submitted 1 December, 2011; v1 submitted 9 November, 2011; originally announced November 2011.

    ACM Class: D.1; F.1

  33. arXiv:1107.5930  [pdf, other

    cs.AI

    Technical Note: Towards ROC Curves in Cost Space

    Authors: José Hernández-Orallo, Peter Flach, Cèsar Ferri

    Abstract: ROC curves and cost curves are two popular ways of visualising classifier performance, finding appropriate thresholds according to the operating condition, and deriving useful aggregated measures such as the area under the ROC curve (AUC) or the area under the optimal cost curve. In this note we present some new findings and connections between ROC space and cost space, by using the expected loss… ▽ More

    Submitted 29 July, 2011; originally announced July 2011.