Skip to main content

Showing 1–50 of 68 results for author: Kasneci, G

.
  1. arXiv:2505.22655  [pdf, ps, other

    cs.LG cs.AI cs.CL

    Position: Uncertainty Quantification Needs Reassessment for Large-language Model Agents

    Authors: Michael Kirchhof, Gjergji Kasneci, Enkelejda Kasneci

    Abstract: Large-language models (LLMs) and chatbot agents are known to provide wrong outputs at times, and it was recently found that this can never be fully prevented. Hence, uncertainty quantification plays a crucial role, aiming to quantify the level of ambiguity in either one overall number or two numbers for aleatoric and epistemic uncertainty. This position paper argues that this traditional dichotomy… ▽ More

    Submitted 28 May, 2025; originally announced May 2025.

    Comments: Accepted at ICML 2025

  2. arXiv:2505.17542  [pdf, ps, other

    cs.LG

    Graph Style Transfer for Counterfactual Explainability

    Authors: Bardh Prenkaj, Efstratios Zaradoukas, Gjergji Kasneci

    Abstract: Counterfactual explainability seeks to uncover model decisions by identifying minimal changes to the input that alter the predicted outcome. This task becomes particularly challenging for graph data due to preserving structural integrity and semantic meaning. Unlike prior approaches that rely on forward perturbation mechanisms, we introduce Graph Inverse Style Transfer (GIST), the first framework… ▽ More

    Submitted 23 May, 2025; originally announced May 2025.

    Comments: Accepted to ICML'25

  3. arXiv:2505.01238  [pdf, other

    cs.CL cs.AI cs.LG

    EvalxNLP: A Framework for Benchmarking Post-Hoc Explainability Methods on NLP Models

    Authors: Mahdi Dhaini, Kafaite Zahra Hussain, Efstratios Zaradoukas, Gjergji Kasneci

    Abstract: As Natural Language Processing (NLP) models continue to evolve and become integral to high-stakes applications, ensuring their interpretability remains a critical challenge. Given the growing variety of explainability methods and diverse stakeholder requirements, frameworks that help stakeholders select appropriate explanations tailored to their specific use cases are increasingly important. To ad… ▽ More

    Submitted 2 May, 2025; originally announced May 2025.

    Comments: Accepted to the xAI World Conference (2025) - System Demonstration

  4. arXiv:2505.01198  [pdf, other

    cs.CL cs.AI cs.LG

    Gender Bias in Explainability: Investigating Performance Disparity in Post-hoc Methods

    Authors: Mahdi Dhaini, Ege Erdogan, Nils Feldhus, Gjergji Kasneci

    Abstract: While research on applications and evaluations of explanation methods continues to expand, fairness of the explanation methods concerning disparities in their performance across subgroups remains an often overlooked aspect. In this paper, we address this gap by showing that, across three tasks and five language models, widely used post-hoc feature attribution methods exhibit significant gender dis… ▽ More

    Submitted 2 May, 2025; originally announced May 2025.

    Comments: Accepted to ACM Conference on Fairness, Accountability, and Transparency (FAccT) 2025

  5. arXiv:2504.20752  [pdf, other

    cs.CL cs.AI cs.LG

    Grokking in the Wild: Data Augmentation for Real-World Multi-Hop Reasoning with Transformers

    Authors: Roman Abramov, Felix Steinbauer, Gjergji Kasneci

    Abstract: Transformers have achieved great success in numerous NLP tasks but continue to exhibit notable gaps in multi-step factual reasoning, especially when real-world knowledge is sparse. Recent advances in grokking have demonstrated that neural networks can transition from memorizing to perfectly generalizing once they detect underlying logical patterns - yet these studies have primarily used small, syn… ▽ More

    Submitted 7 May, 2025; v1 submitted 29 April, 2025; originally announced April 2025.

    Comments: Accepted to the International Conference on Machine Learning (ICML) 2025

    ACM Class: I.2.7; I.2.6; I.2.3; I.7

  6. arXiv:2503.05788  [pdf, other

    cs.LG cs.AI cs.CL

    Emergent Abilities in Large Language Models: A Survey

    Authors: Leonardo Berti, Flavio Giorgi, Gjergji Kasneci

    Abstract: Large Language Models (LLMs) are leading a new technological revolution as one of the most promising research streams toward artificial general intelligence. The scaling of these models, accomplished by increasing the number of parameters and the magnitude of the training datasets, has been linked to various so-called emergent abilities that were previously unobserved. These emergent abilities, ra… ▽ More

    Submitted 14 March, 2025; v1 submitted 27 February, 2025; originally announced March 2025.

  7. arXiv:2502.15757  [pdf, other

    q-fin.ST cs.AI cs.LG q-fin.TR

    TLOB: A Novel Transformer Model with Dual Attention for Price Trend Prediction with Limit Order Book Data

    Authors: Leonardo Berti, Gjergji Kasneci

    Abstract: Price Trend Prediction (PTP) based on Limit Order Book (LOB) data is a fundamental challenge in financial markets. Despite advances in deep learning, existing models fail to generalize across different market conditions and assets. Surprisingly, by adapting a simple MLP-based architecture to LOB, we show that we surpass SoTA performance; thus, challenging the necessity of complex architectures. Un… ▽ More

    Submitted 7 May, 2025; v1 submitted 12 February, 2025; originally announced February 2025.

  8. arXiv:2502.08781  [pdf, ps, other

    cs.CY

    Europe's AI Imperative -- A Pragmatic Blueprint for Global Tech Leadership

    Authors: Gjergji Kasneci, Urs Gasser, Thomas F. Hofmann, Gerhard Kramer, Gerhard Müller, Claudia Peus, Helmut Schönenberger, Enkelejda Kasneci

    Abstract: Europe is at a make-or-break moment in the global AI race, squeezed between the massive venture capital and tech giants in the US and China's scale-oriented, top-down drive. At this tipping point, where the convergence of AI with complementary and synergistic technologies, like quantum computing, biotech, VR/AR, 5G/6G, robotics, advanced materials, and high-performance computing, could upend geopo… ▽ More

    Submitted 15 February, 2025; v1 submitted 12 February, 2025; originally announced February 2025.

  9. arXiv:2412.07675  [pdf, other

    cs.CL cs.LG

    RAZOR: Sharpening Knowledge by Cutting Bias with Unsupervised Text Rewriting

    Authors: Shuo Yang, Bardh Prenkaj, Gjergji Kasneci

    Abstract: Despite the widespread use of LLMs due to their superior performance in various tasks, their high computational costs often lead potential users to opt for the pretraining-finetuning pipeline. However, biases prevalent in manually constructed datasets can introduce spurious correlations between tokens and labels, creating so-called shortcuts and hindering the generalizability of fine-tuned models.… ▽ More

    Submitted 19 December, 2024; v1 submitted 10 December, 2024; originally announced December 2024.

    Comments: Shuo and Bardh contributed equally. Accepted to AAAI'25, Paper #17117

  10. arXiv:2411.01645  [pdf, other

    cs.LG cs.AI

    Enriching Tabular Data with Contextual LLM Embeddings: A Comprehensive Ablation Study for Ensemble Classifiers

    Authors: Gjergji Kasneci, Enkelejda Kasneci

    Abstract: Feature engineering is crucial for optimizing machine learning model performance, particularly in tabular data classification tasks. Leveraging advancements in natural language processing, this study presents a systematic approach to enrich tabular datasets with features derived from large language model embeddings. Through a comprehensive ablation study on diverse datasets, we assess the impact o… ▽ More

    Submitted 5 November, 2024; v1 submitted 3 November, 2024; originally announced November 2024.

  11. Crowd IQ -- Aggregating Opinions to Boost Performance

    Authors: Michal Kosinski, Yoram Bachrach, Thore Graepel, Giergji Kasneci, Jurgen Van Gael

    Abstract: We show how the quality of decisions based on the aggregated opinions of the crowd can be conveniently studied using a sample of individual responses to a standard IQ questionnaire. We aggregated the responses to the IQ questionnaire using simple majority voting and a machine learning approach based on a probabilistic graphical model. The score for the aggregated questionnaire, Crowd IQ, serves as… ▽ More

    Submitted 13 October, 2024; originally announced October 2024.

    Comments: Proceedings of the International Conference on Autonomous Agents and Multiagent Systems (AAMAS) 2012

  12. arXiv:2409.13367  [pdf, other

    cs.LG

    ALPEC: A Comprehensive Evaluation Framework and Dataset for Machine Learning-Based Arousal Detection in Clinical Practice

    Authors: Stefan Kraft, Andreas Theissler, Vera Wienhausen-Wilke, Philipp Walter, Gjergji Kasneci

    Abstract: Detecting arousals in sleep is essential for diagnosing sleep disorders. However, using Machine Learning (ML) in clinical practice is impeded by fundamental issues, primarily due to mismatches between clinical protocols and ML methods. Clinicians typically annotate only the onset of arousals, while ML methods rely on annotations for both the beginning and end. Additionally, there is no standardize… ▽ More

    Submitted 20 September, 2024; originally announced September 2024.

    ACM Class: I.2

  13. arXiv:2409.07085  [pdf, other

    cs.CL cs.LG

    Understanding Knowledge Drift in LLMs through Misinformation

    Authors: Alina Fastowski, Gjergji Kasneci

    Abstract: Large Language Models (LLMs) have revolutionized numerous applications, making them an integral part of our digital ecosystem. However, their reliability becomes critical, especially when these models are exposed to misinformation. We primarily analyze the susceptibility of state-of-the-art LLMs to factual inaccuracies when they encounter false information in a QnA scenario, an issue that can lead… ▽ More

    Submitted 11 September, 2024; originally announced September 2024.

    Comments: 13 pages, 3 figures. Accepted at DELTA workshop at KDD 2024

  14. arXiv:2408.14126  [pdf, other

    cs.LG cs.CY

    Enhancing Fairness through Reweighting: A Path to Attain the Sufficiency Rule

    Authors: Xuan Zhao, Klaus Broelemann, Salvatore Ruggieri, Gjergji Kasneci

    Abstract: We introduce an innovative approach to enhancing the empirical risk minimization (ERM) process in model training through a refined reweighting scheme of the training data to enhance fairness. This scheme aims to uphold the sufficiency rule in fairness by ensuring that optimal predictors maintain consistency across diverse sub-groups. We employ a bilevel formulation to address this challenge, where… ▽ More

    Submitted 1 October, 2024; v1 submitted 26 August, 2024; originally announced August 2024.

    Comments: accepted at ECAI 2024

  15. arXiv:2408.05977  [pdf, other

    cs.CL cs.CY

    The Language of Trauma: Modeling Traumatic Event Descriptions Across Domains with Explainable AI

    Authors: Miriam Schirmer, Tobias Leemann, Gjergji Kasneci, Jürgen Pfeffer, David Jurgens

    Abstract: Psychological trauma can manifest following various distressing events and is captured in diverse online contexts. However, studies traditionally focus on a single aspect of trauma, often neglecting the transferability of findings across different scenarios. We address this gap by training language models with progressing complexity on trauma-related datasets, including genocide-related court data… ▽ More

    Submitted 12 August, 2024; originally announced August 2024.

  16. arXiv:2407.18008  [pdf, other

    cs.CY cs.CL

    GermanPartiesQA: Benchmarking Commercial Large Language Models for Political Bias and Sycophancy

    Authors: Jan Batzner, Volker Stocker, Stefan Schmid, Gjergji Kasneci

    Abstract: LLMs are changing the way humans create and interact with content, potentially affecting citizens' political opinions and voting decisions. As LLMs increasingly shape our digital information ecosystems, auditing to evaluate biases, sycophancy, or steerability has emerged as an active field of research. In this paper, we evaluate and compare the alignment of six LLMs by OpenAI, Anthropic, and Coher… ▽ More

    Submitted 25 July, 2024; originally announced July 2024.

    Comments: 12 pages

    ACM Class: K.4

  17. arXiv:2406.11391  [pdf, other

    cs.LG

    P-TA: Using Proximal Policy Optimization to Enhance Tabular Data Augmentation via Large Language Models

    Authors: Shuo Yang, Chenchen Yuan, Yao Rong, Felix Steinbauer, Gjergji Kasneci

    Abstract: A multitude of industries depend on accurate and reasonable tabular data augmentation for their business processes. Contemporary methodologies in generating tabular data revolve around utilizing Generative Adversarial Networks (GAN) or fine-tuning Large Language Models (LLM). However, GAN-based approaches are documented to produce samples with common-sense errors attributed to the absence of exter… ▽ More

    Submitted 23 February, 2025; v1 submitted 17 June, 2024; originally announced June 2024.

  18. arXiv:2405.13536  [pdf, other

    cs.LG cs.AI cs.CL

    Attention Mechanisms Don't Learn Additive Models: Rethinking Feature Importance for Transformers

    Authors: Tobias Leemann, Alina Fastowski, Felix Pfeiffer, Gjergji Kasneci

    Abstract: We address the critical challenge of applying feature attribution methods to the transformer architecture, which dominates current applications in natural language processing and beyond. Traditional attribution methods to explainable AI (XAI) explicitly or implicitly rely on linear or additive surrogate models to quantify the impact of input features on a model's output. In this work, we formally… ▽ More

    Submitted 9 January, 2025; v1 submitted 22 May, 2024; originally announced May 2024.

    Comments: TMLR Camera-Ready version

  19. arXiv:2404.15435  [pdf, other

    cs.HC

    Introduction to Eye Tracking: A Hands-On Tutorial for Students and Practitioners

    Authors: Enkelejda Kasneci, Hong Gao, Suleyman Ozdel, Virmarie Maquiling, Enkeleda Thaqi, Carrie Lau, Yao Rong, Gjergji Kasneci, Efe Bozkir

    Abstract: Eye-tracking technology is widely used in various application areas such as psychology, neuroscience, marketing, and human-computer interaction, as it is a valuable tool for understanding how people process information and interact with their environment. This tutorial provides a comprehensive introduction to eye tracking, from the basics of eye anatomy and physiology to the principles and applica… ▽ More

    Submitted 23 April, 2024; originally announced April 2024.

  20. arXiv:2403.10330  [pdf, other

    cs.LG

    Towards Non-Adversarial Algorithmic Recourse

    Authors: Tobias Leemann, Martin Pawelczyk, Bardh Prenkaj, Gjergji Kasneci

    Abstract: The streams of research on adversarial examples and counterfactual explanations have largely been growing independently. This has led to several recent works trying to elucidate their similarities and differences. Most prominently, it has been argued that adversarial examples, as opposed to counterfactual explanations, have a unique characteristic in that they lead to a misclassification compared… ▽ More

    Submitted 15 March, 2024; originally announced March 2024.

  21. arXiv:2402.18284  [pdf, other

    cs.CL cs.AI

    Is Crowdsourcing Breaking Your Bank? Cost-Effective Fine-Tuning of Pre-trained Language Models with Proximal Policy Optimization

    Authors: Shuo Yang, Gjergji Kasneci

    Abstract: Wide usage of ChatGPT has highlighted the potential of reinforcement learning from human feedback. However, its training pipeline relies on manual ranking, a resource-intensive process. To reduce labor costs, we propose a self-supervised text ranking approach for applying Proximal-Policy-Optimization to fine-tune language models while eliminating the need for human annotators. Our method begins wi… ▽ More

    Submitted 2 March, 2024; v1 submitted 28 February, 2024; originally announced February 2024.

    Comments: 12 pages, 2 figures

  22. arXiv:2402.02136  [pdf, other

    cs.HC

    User Intent Recognition and Satisfaction with Large Language Models: A User Study with ChatGPT

    Authors: Anna Bodonhelyi, Efe Bozkir, Shuo Yang, Enkelejda Kasneci, Gjergji Kasneci

    Abstract: The rapid evolution of LLMs represents an impactful paradigm shift in digital interaction and content engagement. While they encode vast amounts of human-generated knowledge and excel in processing diverse data types, they often face the challenge of accurately responding to specific user intents, leading to user dissatisfaction. Based on a fine-grained intent taxonomy and intent-based prompt refo… ▽ More

    Submitted 19 November, 2024; v1 submitted 3 February, 2024; originally announced February 2024.

  23. Taking the Next Step with Generative Artificial Intelligence: The Transformative Role of Multimodal Large Language Models in Science Education

    Authors: Arne Bewersdorff, Christian Hartmann, Marie Hornberger, Kathrin Seßler, Maria Bannert, Enkelejda Kasneci, Gjergji Kasneci, Xiaoming Zhai, Claudia Nerdel

    Abstract: The integration of Artificial Intelligence (AI), particularly Large Language Model (LLM)-based systems, in education has shown promise in enhancing teaching and learning experiences. However, the advent of Multimodal Large Language Models (MLLMs) like GPT-4 with vision (GPT-4V), capable of processing multimodal data including text, sound, and visual inputs, opens a new era of enriched, personalize… ▽ More

    Submitted 19 September, 2024; v1 submitted 1 January, 2024; originally announced January 2024.

    Comments: revised version 2. September 2024

  24. arXiv:2311.12684  [pdf, other

    cs.LG

    Adversarial Reweighting Guided by Wasserstein Distance for Bias Mitigation

    Authors: Xuan Zhao, Simone Fabbrizzi, Paula Reyero Lobo, Siamak Ghodsi, Klaus Broelemann, Steffen Staab, Gjergji Kasneci

    Abstract: The unequal representation of different groups in a sample population can lead to discrimination of minority groups when machine learning models make automated decisions. To address these issues, fairness-aware machine learning jointly optimizes two (or more) metrics aiming at predictive effectiveness and low unfairness. However, the inherent under-representation of minorities in the data makes th… ▽ More

    Submitted 21 November, 2023; originally announced November 2023.

  25. arXiv:2311.10512  [pdf, other

    cs.LG

    Causal Fairness-Guided Dataset Reweighting using Neural Networks

    Authors: Xuan Zhao, Klaus Broelemann, Salvatore Ruggieri, Gjergji Kasneci

    Abstract: The importance of achieving fairness in machine learning models cannot be overstated. Recent research has pointed out that fairness should be examined from a causal perspective, and several fairness notions based on the on Pearl's causal framework have been proposed. In this paper, we construct a reweighting scheme of datasets to address causal fairness. Our approach aims at mitigating bias by con… ▽ More

    Submitted 17 November, 2023; originally announced November 2023.

    Comments: To be published in the proceedings of 2023 IEEE International Conference on Big Data (IEEE BigData 2023)

  26. arXiv:2311.08228  [pdf, other

    cs.LG

    Counterfactual Explanation for Regression via Disentanglement in Latent Space

    Authors: Xuan Zhao, Klaus Broelemann, Gjergji Kasneci

    Abstract: Counterfactual Explanations (CEs) help address the question: How can the factors that influence the prediction of a predictive model be changed to achieve a more favorable outcome from a user's perspective? Thus, they bear the potential to guide the user's interaction with AI systems since they represent easy-to-understand explanations. To be applicable, CEs need to be realistic and actionable. In… ▽ More

    Submitted 23 November, 2023; v1 submitted 14 November, 2023; originally announced November 2023.

    Comments: CXAI workshop @ ICDM 2023. arXiv admin note: text overlap with arXiv:2307.13390

  27. arXiv:2308.02353  [pdf, other

    cs.LG cs.AI

    Adapting to Change: Robust Counterfactual Explanations in Dynamic Data Landscapes

    Authors: Bardh Prenkaj, Mario Villaizan-Vallelado, Tobias Leemann, Gjergji Kasneci

    Abstract: We introduce a novel semi-supervised Graph Counterfactual Explainer (GCE) methodology, Dynamic GRAph Counterfactual Explainer (DyGRACE). It leverages initial knowledge about the data distribution to search for valid counterfactuals while avoiding using information from potentially outdated decision functions in subsequent time steps. Employing two graph autoencoders (GAEs), DyGRACE learns the repr… ▽ More

    Submitted 4 August, 2023; originally announced August 2023.

  28. arXiv:2307.13390  [pdf, other

    cs.LG

    Counterfactual Explanation via Search in Gaussian Mixture Distributed Latent Space

    Authors: Xuan Zhao, Klaus Broelemann, Gjergji Kasneci

    Abstract: Counterfactual Explanations (CEs) are an important tool in Algorithmic Recourse for addressing two questions: 1. What are the crucial factors that led to an automated prediction/decision? 2. How can these factors be changed to achieve a more favorable outcome from a user's perspective? Thus, guiding the user's interaction with AI systems by proposing easy-to-understand explanations and easy-to-att… ▽ More

    Submitted 21 November, 2023; v1 submitted 25 July, 2023; originally announced July 2023.

    Comments: XAI workshop of IJCAI 2023

  29. arXiv:2306.07273  [pdf, other

    cs.LG cs.CR stat.ML

    Gaussian Membership Inference Privacy

    Authors: Tobias Leemann, Martin Pawelczyk, Gjergji Kasneci

    Abstract: We propose a novel and practical privacy notion called $f$-Membership Inference Privacy ($f$-MIP), which explicitly considers the capabilities of realistic adversaries under the membership inference attack threat model. Consequently, $f$-MIP offers interpretable privacy guarantees and improved utility (e.g., better classification accuracy). In particular, we derive a parametric family of $f$-MIP g… ▽ More

    Submitted 26 October, 2023; v1 submitted 12 June, 2023; originally announced June 2023.

    Comments: NeurIPS 2023 camera-ready. The first two authors contributed equally

  30. arXiv:2303.08081  [pdf, other

    cs.LG stat.ML

    Explanation Shift: How Did the Distribution Shift Impact the Model?

    Authors: Carlos Mougan, Klaus Broelemann, David Masip, Gjergji Kasneci, Thanassis Thiropanis, Steffen Staab

    Abstract: As input data distributions evolve, the predictive performance of machine learning models tends to deteriorate. In practice, new input data tend to come without target labels. Then, state-of-the-art techniques model input data distributions or model prediction distributions and try to understand issues regarding the interactions between learned models and shifting distributions. We suggest a novel… ▽ More

    Submitted 7 September, 2023; v1 submitted 14 March, 2023; originally announced March 2023.

    Comments: arXiv admin note: text overlap with arXiv:2210.12369

  31. arXiv:2212.12374  [pdf, other

    cs.LG

    Relational Local Explanations

    Authors: Vadim Borisov, Gjergji Kasneci

    Abstract: The majority of existing post-hoc explanation approaches for machine learning models produce independent, per-variable feature attribution scores, ignoring a critical inherent characteristics of homogeneously structured data, such as visual or text data: there exist latent inter-variable relationships between features. In response, we develop a novel model-agnostic and permutation-based feature at… ▽ More

    Submitted 11 February, 2023; v1 submitted 23 December, 2022; originally announced December 2022.

  32. arXiv:2211.09940  [pdf, other

    cs.LG cs.DC

    Entry Dependent Expert Selection in Distributed Gaussian Processes Using Multilabel Classification

    Authors: Hamed Jalali, Gjergji Kasneci

    Abstract: By distributing the training process, local approximation reduces the cost of the standard Gaussian Process. An ensemble technique combines local predictions from Gaussian experts trained on different partitions of the data. Ensemble methods aggregate models' predictions by assuming a perfect diversity of local predictors. Although it keeps the aggregation tractable, this assumption is often viola… ▽ More

    Submitted 8 January, 2024; v1 submitted 17 November, 2022; originally announced November 2022.

    Comments: A condensed version of this work has been accepted at the Gaussian Processes, Spatiotemporal Modeling, and Decision-making Systems workshop during NeurIPS 2022

  33. arXiv:2211.02151  [pdf, other

    cs.LG cs.AI cs.CY

    Decomposing Counterfactual Explanations for Consequential Decision Making

    Authors: Martin Pawelczyk, Lea Tiyavorabun, Gjergji Kasneci

    Abstract: The goal of algorithmic recourse is to reverse unfavorable decisions (e.g., from loan denial to approval) under automated decision making by suggesting actionable feature changes (e.g., reduce the number of credit cards). To generate low-cost recourse the majority of methods work under the assumption that the features are independently manipulable (IMF). To address the feature dependency issue the… ▽ More

    Submitted 3 November, 2022; originally announced November 2022.

  34. arXiv:2210.13954  [pdf, other

    cs.LG cs.AI cs.CY stat.ML

    I Prefer not to Say: Protecting User Consent in Models with Optional Personal Data

    Authors: Tobias Leemann, Martin Pawelczyk, Christian Thomas Eberle, Gjergji Kasneci

    Abstract: We examine machine learning models in a setup where individuals have the choice to share optional personal information with a decision-making system, as seen in modern insurance pricing models. Some users consent to their data being used whereas others object and keep their data undisclosed. In this work, we show that the decision not to share data can be considered as information in itself that s… ▽ More

    Submitted 2 February, 2024; v1 submitted 25 October, 2022; originally announced October 2022.

    Comments: v5: AAAI-24 Camera-Ready Version Including Appendices. v1: NeurIPS 2022 Workshop on Algorithmic Fairness through the Lens of Causality and Privacy (AFCP)

  35. arXiv:2210.12369  [pdf, ps, other

    cs.LG cs.AI stat.ML

    Explanation Shift: Detecting distribution shifts on tabular data via the explanation space

    Authors: Carlos Mougan, Klaus Broelemann, Gjergji Kasneci, Thanassis Tiropanis, Steffen Staab

    Abstract: As input data distributions evolve, the predictive performance of machine learning models tends to deteriorate. In the past, predictive performance was considered the key indicator to monitor. However, explanation aspects have come to attention within the last years. In this work, we investigate how model predictive performance and model explanation characteristics are affected under distribution… ▽ More

    Submitted 22 October, 2022; originally announced October 2022.

    Comments: Neural Information Processing Systems (NeurIPS 2022). Workshop on Distribution Shifts: Connecting Methods and Applications

  36. Towards Human-centered Explainable AI: A Survey of User Studies for Model Explanations

    Authors: Yao Rong, Tobias Leemann, Thai-trang Nguyen, Lisa Fiedler, Peizhu Qian, Vaibhav Unhelkar, Tina Seidel, Gjergji Kasneci, Enkelejda Kasneci

    Abstract: Explainable AI (XAI) is widely viewed as a sine qua non for ever-expanding AI research. A better understanding of the needs of XAI users, as well as human-centered evaluations of explainable models are both a necessity and a challenge. In this paper, we explore how HCI and AI researchers conduct user studies in XAI applications based on a systematic literature review. After identifying and thoroug… ▽ More

    Submitted 15 October, 2024; v1 submitted 20 October, 2022; originally announced October 2022.

    Journal ref: IEEE Transactions on Pattern Analysis and Machine Intelligence (Volume: 46, Issue: 4, April 2024)

  37. arXiv:2210.06280  [pdf, other

    cs.LG

    Language Models are Realistic Tabular Data Generators

    Authors: Vadim Borisov, Kathrin Seßler, Tobias Leemann, Martin Pawelczyk, Gjergji Kasneci

    Abstract: Tabular data is among the oldest and most ubiquitous forms of data. However, the generation of synthetic samples with the original data's characteristics remains a significant challenge for tabular data. While many generative models from the computer vision domain, such as variational autoencoders or generative adversarial networks, have been adapted for tabular data generation, less research has… ▽ More

    Submitted 22 April, 2023; v1 submitted 12 October, 2022; originally announced October 2022.

  38. arXiv:2209.02764  [pdf, other

    cs.LG stat.ML

    Change Detection for Local Explainability in Evolving Data Streams

    Authors: Johannes Haug, Alexander Braun, Stefan Zürn, Gjergji Kasneci

    Abstract: As complex machine learning models are increasingly used in sensitive applications like banking, trading or credit scoring, there is a growing demand for reliable explanation mechanisms. Local feature attribution methods have become a popular technique for post-hoc and model-agnostic explanations. However, attribution methods typically assume a stationary environment in which the predictive model… ▽ More

    Submitted 6 September, 2022; originally announced September 2022.

    Comments: To be published in the proceedings of the 31st ACM International Conference on Information and Knowledge Management (CIKM 2022)

  39. arXiv:2208.14137  [pdf, other

    cs.LG cs.AI cs.CY

    On the Trade-Off between Actionable Explanations and the Right to be Forgotten

    Authors: Martin Pawelczyk, Tobias Leemann, Asia Biega, Gjergji Kasneci

    Abstract: As machine learning (ML) models are increasingly being deployed in high-stakes applications, policymakers have suggested tighter data protection regulations (e.g., GDPR, CCPA). One key principle is the "right to be forgotten" which gives users the right to have their data deleted. Another key principle is the right to an actionable explanation, also known as algorithmic recourse, allowing users to… ▽ More

    Submitted 11 October, 2023; v1 submitted 30 August, 2022; originally announced August 2022.

    Comments: ICLR 2023 camera ready version

    Journal ref: 11th International Conference on Learning Representations (ICLR) 2023

  40. arXiv:2208.03142  [pdf, other

    cs.CV cs.LG

    BoxShrink: From Bounding Boxes to Segmentation Masks

    Authors: Michael Gröger, Vadim Borisov, Gjergji Kasneci

    Abstract: One of the core challenges facing the medical image computing community is fast and efficient data sample labeling. Obtaining fine-grained labels for segmentation is particularly demanding since it is expensive, time-consuming, and requires sophisticated tools. On the contrary, applying bounding boxes is fast and takes significantly less time than fine-grained labeling, but does not produce detail… ▽ More

    Submitted 5 August, 2022; originally announced August 2022.

  41. Fairness in Agreement With European Values: An Interdisciplinary Perspective on AI Regulation

    Authors: Alejandra Bringas Colmenarejo, Luca Nannini, Alisa Rieger, Kristen M. Scott, Xuan Zhao, Gourab K. Patro, Gjergji Kasneci, Katharina Kinder-Kurlanda

    Abstract: With increasing digitalization, Artificial Intelligence (AI) is becoming ubiquitous. AI-based systems to identify, optimize, automate, and scale solutions to complex economic and societal problems are being proposed and implemented. This has motivated regulation efforts, including the Proposal of an EU AI Act. This interdisciplinary position paper considers various concerns surrounding fairness an… ▽ More

    Submitted 8 June, 2022; originally announced July 2022.

    Comments: In proceedings of AAAI/ACM Conference AIES 2022 (https://doi.org/10.1145/3514094.3534158)

  42. arXiv:2206.13872  [pdf, other

    stat.ML cs.AI cs.CV cs.LG

    When are Post-hoc Conceptual Explanations Identifiable?

    Authors: Tobias Leemann, Michael Kirchhof, Yao Rong, Enkelejda Kasneci, Gjergji Kasneci

    Abstract: Interest in understanding and factorizing learned embedding spaces through conceptual explanations is steadily growing. When no human concept labels are available, concept discovery methods search trained embedding spaces for interpretable concepts like object shape or color that can provide post-hoc explanations for decisions. Unlike previous work, we argue that concept discovery should be identi… ▽ More

    Submitted 6 June, 2023; v1 submitted 28 June, 2022; originally announced June 2022.

    Comments: v5: UAI2023 camera-ready including supplementary material. The first two authors contributed equally

  43. arXiv:2204.13625  [pdf, other

    cs.LG stat.ML

    Standardized Evaluation of Machine Learning Methods for Evolving Data Streams

    Authors: Johannes Haug, Effi Tramountani, Gjergji Kasneci

    Abstract: Due to the unspecified and dynamic nature of data streams, online machine learning requires powerful and flexible solutions. However, evaluating online machine learning methods under realistic conditions is difficult. Existing work therefore often draws on different heuristics and simulations that do not necessarily produce meaningful and reliable results. Indeed, in the absence of common evaluati… ▽ More

    Submitted 28 April, 2022; originally announced April 2022.

  44. Dynamic Model Tree for Interpretable Data Stream Learning

    Authors: Johannes Haug, Klaus Broelemann, Gjergji Kasneci

    Abstract: Data streams are ubiquitous in modern business and society. In practice, data streams may evolve over time and cannot be stored indefinitely. Effective and transparent machine learning on data streams is thus often challenging. Hoeffding Trees have emerged as a state-of-the art for online predictive modelling. They are easy to train and provide meaningful convergence guarantees under a stationary… ▽ More

    Submitted 30 March, 2022; originally announced March 2022.

    Comments: To be published in the proceedings of the 38th IEEE International Conference on Data Engineering (ICDE 2022)

  45. arXiv:2203.06768  [pdf, other

    cs.LG cs.CY

    Probabilistically Robust Recourse: Navigating the Trade-offs between Costs and Robustness in Algorithmic Recourse

    Authors: Martin Pawelczyk, Teresa Datta, Johannes van-den-Heuvel, Gjergji Kasneci, Himabindu Lakkaraju

    Abstract: As machine learning models are increasingly being employed to make consequential decisions in real-world settings, it becomes critical to ensure that individuals who are adversely impacted (e.g., loan denied) by the predictions of these models are provided with a means for recourse. While several approaches have been proposed to construct recourses for affected individuals, the recourses output by… ▽ More

    Submitted 11 October, 2023; v1 submitted 13 March, 2022; originally announced March 2022.

    Comments: ICLR 2023, camera ready version

    Journal ref: 11th International Conference on Learning Representations (ICLR) 2023

  46. arXiv:2202.03287  [pdf, other

    cs.LG cs.DC stat.ML

    Gaussian Graphical Models as an Ensemble Method for Distributed Gaussian Processes

    Authors: Hamed Jalali, Gjergji Kasneci

    Abstract: Distributed Gaussian process (DGP) is a popular approach to scale GP to big data which divides the training data into some subsets, performs local inference for each partition, and aggregates the results to acquire global prediction. To combine the local predictions, the conditional independence assumption is used which basically means there is a perfect diversity between the subsets. Although it… ▽ More

    Submitted 7 February, 2022; originally announced February 2022.

    Comments: OPT2021: 13th Annual Workshop on Optimization for Machine Learning

  47. arXiv:2202.00449  [pdf, other

    cs.CV cs.LG

    A Consistent and Efficient Evaluation Strategy for Attribution Methods

    Authors: Yao Rong, Tobias Leemann, Vadim Borisov, Gjergji Kasneci, Enkelejda Kasneci

    Abstract: With a variety of local feature attribution methods being proposed in recent years, follow-up work suggested several evaluation strategies. To assess the attribution quality across different attribution techniques, the most popular among these evaluation strategies in the image domain use pixel perturbations. However, recent advances discovered that different evaluation strategies produce conflict… ▽ More

    Submitted 14 June, 2022; v1 submitted 1 February, 2022; originally announced February 2022.

    Comments: 26 pages. Accepted at ICML 2022

  48. arXiv:2111.07379  [pdf, other

    cs.LG

    A Robust Unsupervised Ensemble of Feature-Based Explanations using Restricted Boltzmann Machines

    Authors: Vadim Borisov, Johannes Meier, Johan van den Heuvel, Hamed Jalali, Gjergji Kasneci

    Abstract: Understanding the results of deep neural networks is an essential step towards wider acceptance of deep learning algorithms. Many approaches address the issue of interpreting artificial neural networks, but often provide divergent explanations. Moreover, different hyperparameters of an explanatory method can lead to conflicting interpretations. In this paper, we propose a technique for aggregating… ▽ More

    Submitted 14 November, 2021; originally announced November 2021.

  49. Deep Neural Networks and Tabular Data: A Survey

    Authors: Vadim Borisov, Tobias Leemann, Kathrin Seßler, Johannes Haug, Martin Pawelczyk, Gjergji Kasneci

    Abstract: Heterogeneous tabular data are the most commonly used form of data and are essential for numerous critical and computationally demanding applications. On homogeneous data sets, deep neural networks have repeatedly shown excellent performance and have therefore been widely adopted. However, their adaptation to tabular data for inference or data generation tasks remains challenging. To facilitate fu… ▽ More

    Submitted 29 June, 2022; v1 submitted 5 October, 2021; originally announced October 2021.

  50. arXiv:2108.00783  [pdf, other

    cs.LG cs.AI

    CARLA: A Python Library to Benchmark Algorithmic Recourse and Counterfactual Explanation Algorithms

    Authors: Martin Pawelczyk, Sascha Bielawski, Johannes van den Heuvel, Tobias Richter, Gjergji Kasneci

    Abstract: Counterfactual explanations provide means for prescriptive model explanations by suggesting actionable feature changes (e.g., increase income) that allow individuals to achieve favorable outcomes in the future (e.g., insurance approval). Choosing an appropriate method is a crucial aspect for meaningful counterfactual explanations. As documented in recent reviews, there exists a quickly growing lit… ▽ More

    Submitted 2 August, 2021; originally announced August 2021.

    Comments: Accepted to NeurIPS Benchmark & Data Set Track

    Journal ref: 35th Conference on Neural Information Processing Systems (NeurIPS 2021) Track on Datasets and Benchmarks