Showing 1–3 of 3 results for author: Dox, A

  1. arXiv:2503.06664  [pdf, other]

    cs.LG cs.AI

    Exploring LLM Agents for Cleaning Tabular Machine Learning Datasets

    Authors: Tommaso Bendinelli, Artur Dox, Christian Holz

    Abstract: High-quality, error-free datasets are a key ingredient in building reliable, accurate, and unbiased machine learning (ML) models. However, real world datasets often suffer from errors due to sensor malfunctions, data entry mistakes, or improper data integration across multiple sources that can severely degrade model performance. Detecting and correcting these issues typically require tailor-made s…

    Submitted 9 March, 2025; originally announced March 2025.

    Comments: 14 pages, 1 main figure, 3 plots, Published at ICLR 2025 Workshop on Foundation Models in the Wild

  2. arXiv:2406.11547  [pdf, other]

    cs.LG cs.AI cs.CL cs.CY

    GECOBench: A Gender-Controlled Text Dataset and Benchmark for Quantifying Biases in Explanations

    Authors: Rick Wilming, Artur Dox, Hjalmar Schulz, Marta Oliveira, Benedict Clark, Stefan Haufe

    Abstract: Large pre-trained language models have become popular for many applications and form an important backbone of many downstream tasks in natural language processing (NLP). Applying 'explainable artificial intelligence' (XAI) techniques to enrich such models' outputs is considered crucial for assuring their quality and shedding light on their inner workings. However, large language models are trained…

    Submitted 17 June, 2024; originally announced June 2024.

    Comments: Under review

  3. arXiv:2405.12261  [pdf]

    cs.LG cs.AI

    EXACT: Towards a platform for empirically benchmarking Machine Learning model explanation methods

    Authors: Benedict Clark, Rick Wilming, Artur Dox, Paul Eschenbach, Sami Hached, Daniel Jin Wodke, Michias Taye Zewdie, Uladzislau Bruila, Marta Oliveira, Hjalmar Schulz, Luca Matteo Cornils, Danny Panknin, Ahcène Boubekki, Stefan Haufe

    Abstract: The evolving landscape of explainable artificial intelligence (XAI) aims to improve the interpretability of intricate machine learning (ML) models, yet faces challenges in formalisation and empirical validation, being an inherently unsupervised process. In this paper, we bring together various benchmark datasets and novel performance metrics in an initial benchmarking platform, the Explainable AI…

    Submitted 20 May, 2024; originally announced May 2024.