Skip to main content

Showing 1–50 of 94 results for author: Seifert, C

.
  1. arXiv:2505.20819  [pdf, other

    cs.CL

    Tracing and Reversing Rank-One Model Edits

    Authors: Paul Youssef, Zhixue Zhao, Christin Seifert, Jörg Schlötterer

    Abstract: Knowledge editing methods (KEs) are a cost-effective way to update the factual content of large language models (LLMs), but they pose a dual-use risk. While KEs are beneficial for updating outdated or incorrect information, they can be exploited maliciously to implant misinformation or bias. In order to defend against these types of malicious manipulation, we need robust techniques that can reliab… ▽ More

    Submitted 27 May, 2025; originally announced May 2025.

  2. arXiv:2505.13972  [pdf, ps, other

    cs.CL

    Truth or Twist? Optimal Model Selection for Reliable Label Flipping Evaluation in LLM-based Counterfactuals

    Authors: Qianli Wang, Van Bach Nguyen, Nils Feldhus, Luis Felipe Villa-Arenas, Christin Seifert, Sebastian Möller, Vera Schmitt

    Abstract: Counterfactual examples are widely employed to enhance the performance and robustness of large language models (LLMs) through counterfactual data augmentation (CDA). However, the selection of the judge model used to evaluate label flipping, the primary metric for assessing the validity of generated counterfactuals for CDA, yields inconsistent results. To decipher this, we define four types of rela… ▽ More

    Submitted 20 May, 2025; originally announced May 2025.

    Comments: in submission

  3. arXiv:2504.20567  [pdf, other

    cs.HC

    Explanation format does not matter; but explanations do -- An Eggsbert study on explaining Bayesian Optimisation tasks

    Authors: Tanmay Chakraborty, Marion Koelle, Jörg Schlötterer, Nadine Schlicker, Christian Wirth, Christin Seifert

    Abstract: Bayesian Optimisation (BO) is a family of methods for finding optimal parameters when the underlying function to be optimised is unknown. BO is used, for example, for hyperparameter tuning in machine learning and as an expert support tool for tuning cyberphysical systems. For settings where humans are involved in the tuning task, methods have been developed to explain BO (Explainable Bayesian Opti… ▽ More

    Submitted 30 April, 2025; v1 submitted 29 April, 2025; originally announced April 2025.

  4. arXiv:2504.15686  [pdf, other

    cs.LG

    Invariant Learning with Annotation-free Environments

    Authors: Phuong Quynh Le, Christin Seifert, Jörg Schlötterer

    Abstract: Invariant learning is a promising approach to improve domain generalization compared to Empirical Risk Minimization (ERM). However, most invariant learning methods rely on the assumption that training examples are pre-partitioned into different known environments. We instead infer environments without the need for additional annotations, motivated by observations of the properties within the repre… ▽ More

    Submitted 22 April, 2025; originally announced April 2025.

    Comments: Accepted at NeurIPS 2024 Workshop UniReps

  5. arXiv:2504.15664  [pdf, other

    cs.LG cs.CV

    An XAI-based Analysis of Shortcut Learning in Neural Networks

    Authors: Phuong Quynh Le, Jörg Schlötterer, Christin Seifert

    Abstract: Machine learning models tend to learn spurious features - features that strongly correlate with target labels but are not causal. Existing approaches to mitigate models' dependence on spurious features work in some cases, but fail in others. In this paper, we systematically analyze how and where neural networks encode spurious correlations. We introduce the neuron spurious score, an XAI-based diag… ▽ More

    Submitted 22 April, 2025; originally announced April 2025.

    Comments: Accepted at The World Conference on eXplainable Artificial Intelligence 2025 (XAI-2025)

  6. arXiv:2504.03744  [pdf, other

    cs.LG cs.AI

    Comparative Explanations: Explanation Guided Decision Making for Human-in-the-Loop Preference Selection

    Authors: Tanmay Chakraborty, Christian Wirth, Christin Seifert

    Abstract: This paper introduces Multi-Output LOcal Narrative Explanation (MOLONE), a novel comparative explanation method designed to enhance preference selection in human-in-the-loop Preference Bayesian optimization (PBO). The preference elicitation in PBO is a non-trivial task because it involves navigating implicit trade-offs between vector-valued outcomes, subjective priorities of decision-makers, and d… ▽ More

    Submitted 1 April, 2025; originally announced April 2025.

  7. arXiv:2503.04463  [pdf, other

    cs.CL

    Guiding LLMs to Generate High-Fidelity and High-Quality Counterfactual Explanations for Text Classification

    Authors: Van Bach Nguyen, Christin Seifert, Jörg Schlötterer

    Abstract: The need for interpretability in deep learning has driven interest in counterfactual explanations, which identify minimal changes to an instance that change a model's prediction. Current counterfactual (CF) generation methods require task-specific fine-tuning and produce low-quality text. Large Language Models (LLMs), though effective for high-quality text generation, struggle with label-flipping… ▽ More

    Submitted 6 March, 2025; originally announced March 2025.

  8. arXiv:2502.14613  [pdf, other

    cs.CL

    Behavioral Analysis of Information Salience in Large Language Models

    Authors: Jan Trienes, Jörg Schlötterer, Junyi Jessy Li, Christin Seifert

    Abstract: Large Language Models (LLMs) excel at text summarization, a task that requires models to select content based on its importance. However, the exact notion of salience that LLMs have internalized remains unclear. To bridge this gap, we introduce an explainable framework to systematically derive and investigate information salience in LLMs through their summarization behavior. Using length-controlle… ▽ More

    Submitted 27 May, 2025; v1 submitted 20 February, 2025; originally announced February 2025.

    Comments: Accepted at ACL 2025 (Findings)

  9. arXiv:2502.09340  [pdf, other

    cs.LG

    This looks like what? Challenges and Future Research Directions for Part-Prototype Models

    Authors: Khawla Elhadri, Tomasz Michalski, Adam Wróbel, Jörg Schlötterer, Bartosz Zieliński, Christin Seifert

    Abstract: The growing interest in eXplainable Artificial Intelligence (XAI) has prompted research into models with built-in interpretability, the most prominent of which are part-prototype models. Part-Prototype Models (PPMs) make decisions by comparing an input image to a set of learned prototypes, providing human-understandable explanations in the form of ``this looks like that''. Despite their inherent i… ▽ More

    Submitted 13 February, 2025; originally announced February 2025.

  10. arXiv:2502.02958  [pdf, ps, other

    cs.CL

    Position: Editing Large Language Models Poses Serious Safety Risks

    Authors: Paul Youssef, Zhixue Zhao, Daniel Braun, Jörg Schlötterer, Christin Seifert

    Abstract: Large Language Models (LLMs) contain large amounts of facts about the world. These facts can become outdated over time, which has led to the development of knowledge editing methods (KEs) that can change specific facts in LLMs with limited side effects. This position paper argues that editing LLMs poses serious safety risks that have been largely overlooked. First, we note the fact that KEs are wi… ▽ More

    Submitted 10 June, 2025; v1 submitted 5 February, 2025; originally announced February 2025.

    Comments: Accepted at ICML 2025

  11. arXiv:2501.00942  [pdf, other

    cs.LG cs.CV

    Efficient Unsupervised Shortcut Learning Detection and Mitigation in Transformers

    Authors: Lukas Kuhn, Sari Sadiya, Jorg Schlotterer, Christin Seifert, Gemma Roig

    Abstract: Shortcut learning, i.e., a model's reliance on undesired features not directly relevant to the task, is a major challenge that severely limits the applications of machine learning algorithms, particularly when deploying them to assist in making sensitive decisions, such as in medical diagnostics. In this work, we leverage recent advancements in machine learning to create an unsupervised framework… ▽ More

    Submitted 1 January, 2025; originally announced January 2025.

  12. arXiv:2411.14239  [pdf, ps, other

    math.AP math.FA math.OC

    Duality for Evolutionary Equations with Applications to Control Theory

    Authors: Andreas Buchinger, Christian Seifert

    Abstract: We study evolutionary equations in exponentially weighted $\mathrm{L}^{2}$-spaces as introduced by Picard in 2009. First, for a given evolutionary equation, we explicitly describe the $ν$-adjoint system, which turns out to describe a system backwards in time. We prove well-posedness for the $ν$-adjoint system. We then apply the thus obtained duality to introduce and study notions of null-controlla… ▽ More

    Submitted 21 November, 2024; originally announced November 2024.

    Comments: 19 pages

    MSC Class: 35Axx; 35F35; 35M10; 47F05; 47N20

  13. arXiv:2410.13562  [pdf, other

    cs.CL

    Enhancing Fact Retrieval in PLMs through Truthfulness

    Authors: Paul Youssef, Jörg Schlötterer, Christin Seifert

    Abstract: Pre-trained Language Models (PLMs) encode various facts about the world at their pre-training phase as they are trained to predict the next or missing word in a sentence. There has a been an interest in quantifying and improving the amount of facts that can be extracted from PLMs, as they have been envisioned to act as soft knowledge bases, which can be queried in natural language. Different appro… ▽ More

    Submitted 17 October, 2024; originally announced October 2024.

  14. arXiv:2410.12586  [pdf, other

    cs.CL

    How to Make LLMs Forget: On Reversing In-Context Knowledge Edits

    Authors: Paul Youssef, Zhixue Zhao, Jörg Schlötterer, Christin Seifert

    Abstract: In-context knowledge editing (IKE) enables efficient modification of large language model (LLM) outputs without parameter changes and at zero-cost. However, it can be misused to manipulate responses opaquely, e.g., insert misinformation or offensive content. Such malicious interventions could be incorporated into high-level wrapped APIs where the final input prompt is not shown to end-users. To ad… ▽ More

    Submitted 10 April, 2025; v1 submitted 16 October, 2024; originally announced October 2024.

    Comments: Accepted at NAACL Main 2025

  15. arXiv:2410.02828  [pdf, other

    cs.CR cs.AI cs.CL

    PyRIT: A Framework for Security Risk Identification and Red Teaming in Generative AI System

    Authors: Gary D. Lopez Munoz, Amanda J. Minnich, Roman Lutz, Richard Lundeen, Raja Sekhar Rao Dheekonda, Nina Chikanov, Bolor-Erdene Jagdagdorj, Martin Pouliot, Shiven Chawla, Whitney Maxwell, Blake Bullwinkel, Katherine Pratt, Joris de Gruyter, Charlotte Siska, Pete Bryan, Tori Westerhoff, Chang Kawaguchi, Christian Seifert, Ram Shankar Siva Kumar, Yonatan Zunger

    Abstract: Generative Artificial Intelligence (GenAI) is becoming ubiquitous in our daily lives. The increase in computational power and data availability has led to a proliferation of both single- and multi-modal models. As the GenAI ecosystem matures, the need for extensible and model-agnostic risk identification frameworks is growing. To meet this need, we introduce the Python Risk Identification Toolkit… ▽ More

    Submitted 1 October, 2024; originally announced October 2024.

  16. arXiv:2410.02806  [pdf, other

    cs.CV cs.AI

    Investigating the Impact of Randomness on Reproducibility in Computer Vision: A Study on Applications in Civil Engineering and Medicine

    Authors: Bahadır Eryılmaz, Osman Alperen Koraş, Jörg Schlötterer, Christin Seifert

    Abstract: Reproducibility is essential for scientific research. However, in computer vision, achieving consistent results is challenging due to various factors. One influential, yet often unrecognized, factor is CUDA-induced randomness. Despite CUDA's advantages for accelerating algorithm execution on GPUs, if not controlled, its behavior across multiple executions remains non-deterministic. While reproduci… ▽ More

    Submitted 19 September, 2024; originally announced October 2024.

  17. arXiv:2407.14974  [pdf, other

    cs.LG cs.AI

    Out of spuriousity: Improving robustness to spurious correlations without group annotations

    Authors: Phuong Quynh Le, Jörg Schlötterer, Christin Seifert

    Abstract: Machine learning models are known to learn spurious correlations, i.e., features having strong relations with class labels but no causal relation. Relying on those correlations leads to poor performance in the data groups without these correlations and poor generalization ability. To improve the robustness of machine learning models to spurious correlations, we propose an approach to extract a sub… ▽ More

    Submitted 20 July, 2024; originally announced July 2024.

  18. arXiv:2407.14277  [pdf, other

    cs.CV

    Patch-based Intuitive Multimodal Prototypes Network (PIMPNet) for Alzheimer's Disease classification

    Authors: Lisa Anita De Santi, Jörg Schlötterer, Meike Nauta, Vincenzo Positano, Christin Seifert

    Abstract: Volumetric neuroimaging examinations like structural Magnetic Resonance Imaging (sMRI) are routinely applied to support the clinical diagnosis of dementia like Alzheimer's Disease (AD). Neuroradiologists examine 3D sMRI to detect and monitor abnormalities in brain morphology due to AD, like global and/or local brain atrophy and shape alteration of characteristic structures. There is a strong resea… ▽ More

    Submitted 22 July, 2024; v1 submitted 19 July, 2024; originally announced July 2024.

    Comments: Accepted "late-breaking work" at XAI-2024

  19. arXiv:2407.13833  [pdf, other

    cs.CL cs.AI

    Phi-3 Safety Post-Training: Aligning Language Models with a "Break-Fix" Cycle

    Authors: Emman Haider, Daniel Perez-Becker, Thomas Portet, Piyush Madan, Amit Garg, Atabak Ashfaq, David Majercak, Wen Wen, Dongwoo Kim, Ziyi Yang, Jianwen Zhang, Hiteshi Sharma, Blake Bullwinkel, Martin Pouliot, Amanda Minnich, Shiven Chawla, Solianna Herrera, Shahed Warreth, Maggie Engler, Gary Lopez, Nina Chikanov, Raja Sekhar Rao Dheekonda, Bolor-Erdene Jagdagdorj, Roman Lutz, Richard Lundeen , et al. (6 additional authors not shown)

    Abstract: Recent innovations in language model training have demonstrated that it is possible to create highly performant models that are small enough to run on a smartphone. As these models are deployed in an increasing number of domains, it is critical to ensure that they are aligned with human preferences and safety considerations. In this report, we present our methodology for safety aligning the Phi-3… ▽ More

    Submitted 22 August, 2024; v1 submitted 18 July, 2024; originally announced July 2024.

  20. arXiv:2405.02765  [pdf, other

    cs.CL cs.AI

    Has this Fact been Edited? Detecting Knowledge Edits in Language Models

    Authors: Paul Youssef, Zhixue Zhao, Christin Seifert, Jörg Schlötterer

    Abstract: Knowledge editing methods (KEs) can update language models' obsolete or inaccurate knowledge learned from pre-training. However, KEs can be used for malicious applications, e.g., inserting misinformation and toxic content. Knowing whether a generated output is based on edited knowledge or first-hand knowledge from pre-training can increase users' trust in generative models and provide more transpa… ▽ More

    Submitted 10 February, 2025; v1 submitted 4 May, 2024; originally announced May 2024.

    Comments: Accepted at NAACL Main 2025

  21. arXiv:2405.00722  [pdf, other

    cs.CL cs.AI

    LLMs for Generating and Evaluating Counterfactuals: A Comprehensive Study

    Authors: Van Bach Nguyen, Paul Youssef, Christin Seifert, Jörg Schlötterer

    Abstract: As NLP models become more complex, understanding their decisions becomes more crucial. Counterfactuals (CFs), where minimal changes to inputs flip a model's prediction, offer a way to explain these models. While Large Language Models (LLMs) have shown remarkable performance in NLP tasks, their efficacy in generating high-quality CFs remains uncertain. This work fills this gap by investigating how… ▽ More

    Submitted 12 November, 2024; v1 submitted 26 April, 2024; originally announced May 2024.

    Comments: Accepted to EMNLP Findings 2024

  22. arXiv:2404.18631  [pdf, other

    cs.LG

    Feature importance to explain multimodal prediction models. A clinical use case

    Authors: Jorn-Jan van de Beld, Shreyasi Pathak, Jeroen Geerdink, Johannes H. Hegeman, Christin Seifert

    Abstract: Surgery to treat elderly hip fracture patients may cause complications that can lead to early mortality. An early warning system for complications could provoke clinicians to monitor high-risk patients more carefully and address potential complications early, or inform the patient. In this work, we develop a multimodal deep-learning model for post-operative mortality prediction using pre-operative… ▽ More

    Submitted 29 April, 2024; originally announced April 2024.

    Comments: Accepted at World Conference on Explainable Artificial Intelligence; 19 pages, 2 figures, 7 tables

  23. arXiv:2404.17475  [pdf, other

    cs.CL cs.AI

    CEval: A Benchmark for Evaluating Counterfactual Text Generation

    Authors: Van Bach Nguyen, Jörg Schlötterer, Christin Seifert

    Abstract: Counterfactual text generation aims to minimally change a text, such that it is classified differently. Judging advancements in method development for counterfactual text generation is hindered by a non-uniform usage of data sets and metrics in related work. We propose CEval, a benchmark for comparing counterfactual text generation methods. CEval unifies counterfactual and text quality metrics, in… ▽ More

    Submitted 13 August, 2024; v1 submitted 26 April, 2024; originally announced April 2024.

    Journal ref: INLG 2024

  24. arXiv:2404.07660  [pdf, ps, other

    math.FA math.AP math.NA math.PR

    Approximation of Random Evolution Equations of Parabolic type

    Authors: Katharina Klioba, Christian Seifert

    Abstract: In this paper, we present an abstract framework to obtain convergence rates for the approximation of random evolution equations corresponding to a random family of forms determined by finite-dimensional noise. The full discretization error in space, time, and randomness is considered, where polynomial chaos expansion (PCE) is used for the semi-discretization in randomness. The main result are regu… ▽ More

    Submitted 18 December, 2024; v1 submitted 11 April, 2024; originally announced April 2024.

    Comments: 39 pages

    MSC Class: 47D06; 47N40; 65J08; 35K90; 41A25

  25. arXiv:2404.05694  [pdf, other

    cs.CL cs.AI cs.LG

    Comprehensive Study on German Language Models for Clinical and Biomedical Text Understanding

    Authors: Ahmad Idrissi-Yaghir, Amin Dada, Henning Schäfer, Kamyar Arzideh, Giulia Baldini, Jan Trienes, Max Hasin, Jeanette Bewersdorff, Cynthia S. Schmidt, Marie Bauer, Kaleb E. Smith, Jiang Bian, Yonghui Wu, Jörg Schlötterer, Torsten Zesch, Peter A. Horn, Christin Seifert, Felix Nensa, Jens Kleesiek, Christoph M. Friedrich

    Abstract: Recent advances in natural language processing (NLP) can be largely attributed to the advent of pre-trained language models such as BERT and RoBERTa. While these models demonstrate remarkable performance on general datasets, they can struggle in specialized domains such as medicine, where unique domain-specific terminologies, domain-specific abbreviations, and varying document structures are commo… ▽ More

    Submitted 8 May, 2024; v1 submitted 8 April, 2024; originally announced April 2024.

    Comments: Accepted at LREC-COLING 2024

  26. arXiv:2403.20260  [pdf, other

    cs.CV

    Prototype-based Interpretable Breast Cancer Prediction Models: Analysis and Challenges

    Authors: Shreyasi Pathak, Jörg Schlötterer, Jeroen Veltman, Jeroen Geerdink, Maurice van Keulen, Christin Seifert

    Abstract: Deep learning models have achieved high performance in medical applications, however, their adoption in clinical practice is hindered due to their black-box nature. Self-explainable models, like prototype-based models, can be especially beneficial as they are interpretable by design. However, if the learnt prototypes are of low quality then the prototype-based models are as good as black-box. Havi… ▽ More

    Submitted 19 July, 2024; v1 submitted 29 March, 2024; originally announced March 2024.

    Comments: Accepted at World Conference on Explainable Artificial Intelligence. Cham: Springer Nature Switzerland, 2024; 21 pages, 5 figures, 3 tables

  27. arXiv:2403.18328  [pdf, other

    cs.CV

    PIPNet3D: Interpretable Detection of Alzheimer in MRI Scans

    Authors: Lisa Anita De Santi, Jörg Schlötterer, Michael Scheschenja, Joel Wessendorf, Meike Nauta, Vincenzo Positano, Christin Seifert

    Abstract: Information from neuroimaging examinations is increasingly used to support diagnoses of dementia, e.g., Alzheimer's disease. While current clinical practice is mainly based on visual inspection and feature engineering, Deep Learning approaches can be used to automate the analysis and to discover new image-biomarkers. Part-prototype neural networks (PP-NN) are an alternative to standard blackbox mo… ▽ More

    Submitted 22 July, 2024; v1 submitted 27 March, 2024; originally announced March 2024.

    Comments: Accepted at iMIMIC workshop @MICCAI 2024

  28. arXiv:2403.02930  [pdf, other

    cs.CL cs.LG

    A Second Look on BASS -- Boosting Abstractive Summarization with Unified Semantic Graphs -- A Replication Study

    Authors: Osman Alperen Koraş, Jörg Schlötterer, Christin Seifert

    Abstract: We present a detailed replication study of the BASS framework, an abstractive summarization system based on the notion of Unified Semantic Graphs. Our investigation includes challenges in replicating key components and an ablation study to systematically isolate error sources rooted in replicating novel components. Our findings reveal discrepancies in performance compared to the original work. We… ▽ More

    Submitted 25 March, 2024; v1 submitted 5 March, 2024; originally announced March 2024.

    Comments: This preprint has not undergone peer review or any post-submission improvements or corrections. The Version of Record of this contribution is published in Advances in Information Retrieval, 46th European Conference on Information Retrieval, ECIR 2024. 16 pages, 4 figures

  29. arXiv:2402.08368  [pdf, ps, other

    math.AP

    On solitary waves for the Korteweg--de Vries equation on metric star graphs

    Authors: Delio Mugnolo, Diego Noja, Christian Seifert

    Abstract: We study the Korteweg--de Vries equation on a metric star graph and investigate existence of solitary waves on the metric graph in terms of the coefficients of the equation on each edge, the coupling condition at the central vertex of the star and the speeds of the travelling wave. We show that, with a continuity condition at the vertex, solitary waves can occur exactly when the parameters are cho… ▽ More

    Submitted 13 February, 2024; originally announced February 2024.

  30. arXiv:2402.01453  [pdf, other

    cs.CL

    The Queen of England is not England's Queen: On the Lack of Factual Coherency in PLMs

    Authors: Paul Youssef, Jörg Schlötterer, Christin Seifert

    Abstract: Factual knowledge encoded in Pre-trained Language Models (PLMs) enriches their representations and justifies their use as knowledge bases. Previous work has focused on probing PLMs for factual knowledge by measuring how often they can correctly predict an object entity given a subject and a relation, and improving fact retrieval by optimizing the prompts used for querying PLMs. In this work, we co… ▽ More

    Submitted 2 February, 2024; originally announced February 2024.

    Comments: Accepted to EACL Findings 2024

  31. arXiv:2401.16475  [pdf, other

    cs.CL

    InfoLossQA: Characterizing and Recovering Information Loss in Text Simplification

    Authors: Jan Trienes, Sebastian Joseph, Jörg Schlötterer, Christin Seifert, Kyle Lo, Wei Xu, Byron C. Wallace, Junyi Jessy Li

    Abstract: Text simplification aims to make technical texts more accessible to laypeople but often results in deletion of information and vagueness. This work proposes InfoLossQA, a framework to characterize and recover simplification-induced information loss in form of question-and-answer (QA) pairs. Building on the theory of Question Under Discussion, the QA pairs are designed to help readers deepen their… ▽ More

    Submitted 4 June, 2024; v1 submitted 29 January, 2024; originally announced January 2024.

    Comments: Accepted at ACL 2024 (main conference)

  32. arXiv:2401.13334  [pdf, other

    cs.LG cs.AI

    Explainable Bayesian Optimization

    Authors: Tanmay Chakraborty, Christian Wirth, Christin Seifert

    Abstract: Manual parameter tuning of cyber-physical systems is a common practice, but it is labor-intensive. Bayesian Optimization (BO) offers an automated alternative, yet its black-box nature reduces trust and limits human-BO collaborative system tuning. Experts struggle to interpret BO recommendations due to the lack of explanations. This paper addresses the post-hoc BO explainability problem for cyber-p… ▽ More

    Submitted 1 April, 2025; v1 submitted 24 January, 2024; originally announced January 2024.

  33. arXiv:2311.05498  [pdf, other

    cs.CR cs.NI cs.SE eess.SY

    Trust your BMS: Designing a Lightweight Authentication Architecture for Industrial Networks

    Authors: Fikret Basic, Christian Steger, Christian Seifert, Robert Kofler

    Abstract: With the advent of clean energy awareness and systems that rely on extensive battery usage, the community has seen an increased interest in the development of more complex and secure Battery Management Systems (BMS). In particular, the inclusion of BMS in modern complex systems like electric vehicles and power grids has presented a new set of security-related challenges. A concern is shown when BM… ▽ More

    Submitted 9 November, 2023; originally announced November 2023.

    Comments: Accepted copy for Publication at the 23rd International Conference on Industrial Technology (ICIT), IEEE, 2022

  34. arXiv:2311.02110  [pdf, other

    cs.NE cs.LG

    Feature Attribution Explanations for Spiking Neural Networks

    Authors: Elisa Nguyen, Meike Nauta, Gwenn Englebienne, Christin Seifert

    Abstract: Third-generation artificial neural networks, Spiking Neural Networks (SNNs), can be efficiently implemented on hardware. Their implementation on neuromorphic chips opens a broad range of applications, such as machine learning-based autonomous control and intelligent biomedical devices. In critical applications, however, insight into the reasoning of SNNs is important, thus SNNs need to be equipped… ▽ More

    Submitted 2 November, 2023; originally announced November 2023.

    Comments: Accepted at IEEE CogMI 2023, copyright final version IEEE

  35. arXiv:2310.16570  [pdf, other

    cs.CL

    Give Me the Facts! A Survey on Factual Knowledge Probing in Pre-trained Language Models

    Authors: Paul Youssef, Osman Alperen Koraş, Meijie Li, Jörg Schlötterer, Christin Seifert

    Abstract: Pre-trained Language Models (PLMs) are trained on vast unlabeled data, rich in world knowledge. This fact has sparked the interest of the community in quantifying the amount of factual knowledge present in PLMs, as this explains their performance on downstream tasks, and potentially justifies their use as knowledge bases. In this work, we survey methods and datasets that are used to probe PLMs for… ▽ More

    Submitted 4 December, 2023; v1 submitted 25 October, 2023; originally announced October 2023.

    Comments: Accepted at EMNLP Findings 2023

  36. arXiv:2310.12677  [pdf, other

    cs.CV

    Case-level Breast Cancer Prediction for Real Hospital Settings

    Authors: Shreyasi Pathak, Jörg Schlötterer, Jeroen Geerdink, Jeroen Veltman, Maurice van Keulen, Nicola Strisciuglio, Christin Seifert

    Abstract: Breast cancer prediction models for mammography assume that annotations are available for individual images or regions of interest (ROIs), and that there is a fixed number of images per patient. These assumptions do not hold in real hospital settings, where clinicians provide only a final diagnosis for the entire mammography exam (case). Since data in real hospital settings scales with continuous… ▽ More

    Submitted 19 October, 2024; v1 submitted 19 October, 2023; originally announced October 2023.

    Comments: 31 pages, 15 figures, 12 tables

  37. arXiv:2309.14459  [pdf, other

    cs.HC

    Bridging the Gulf of Envisioning: Cognitive Design Challenges in LLM Interfaces

    Authors: Hariharan Subramonyam, Roy Pea, Christopher Lawrence Pondoc, Maneesh Agrawala, Colleen Seifert

    Abstract: Large language models (LLMs) exhibit dynamic capabilities and appear to comprehend complex and ambiguous natural language prompts. However, calibrating LLM interactions is challenging for interface designers and end-users alike. A central issue is our limited grasp of how human cognitive processes begin with a goal and form intentions for executing actions, a blindspot even in established interact… ▽ More

    Submitted 18 March, 2024; v1 submitted 25 September, 2023; originally announced September 2023.

  38. Know What Not To Know: Users' Perception of Abstaining Classifiers

    Authors: Andrea Papenmeier, Daniel Hienert, Yvonne Kammerer, Christin Seifert, Dagmar Kern

    Abstract: Machine learning systems can help humans to make decisions by providing decision suggestions (i.e., a label for a datapoint). However, individual datapoints do not always provide enough clear evidence to make confident suggestions. Although methods exist that enable systems to identify those datapoints and subsequently abstain from suggesting a label, it remains unclear how users would react to su… ▽ More

    Submitted 11 September, 2023; originally announced September 2023.

    Journal ref: In Companion Publication of the 2023 ACM Designing Interactive Systems Conference (DIS 2023 Companion)

  39. arXiv:2308.00473  [pdf, other

    cs.LG cs.CV

    Is Last Layer Re-Training Truly Sufficient for Robustness to Spurious Correlations?

    Authors: Phuong Quynh Le, Jörg Schlötterer, Christin Seifert

    Abstract: Models trained with empirical risk minimization (ERM) are known to learn to rely on spurious features, i.e., their prediction is based on undesired auxiliary features which are strongly correlated with class labels but lack causal reasoning. This behavior particularly degrades accuracy in groups of samples of the correlated class that are missing the spurious feature or samples of the opposite cla… ▽ More

    Submitted 9 January, 2024; v1 submitted 1 August, 2023; originally announced August 2023.

    Comments: Accepted at IJCAI Workshop on XAI 2023

  40. arXiv:2307.14517  [pdf, other

    cs.CV cs.AI

    The Co-12 Recipe for Evaluating Interpretable Part-Prototype Image Classifiers

    Authors: Meike Nauta, Christin Seifert

    Abstract: Interpretable part-prototype models are computer vision models that are explainable by design. The models learn prototypical parts and recognise these components in an image, thereby combining classification and explanation. Despite the recent attention for intrinsically interpretable models, there is no comprehensive overview on evaluating the explanation quality of interpretable part-prototype m… ▽ More

    Submitted 26 July, 2023; originally announced July 2023.

    Comments: 24 pages, 1 image, accepted at the 1st World Conference on eXplainable Artificial Intelligence (xAI 2023)

  41. arXiv:2307.12803  [pdf, other

    cs.CL

    Guidance in Radiology Report Summarization: An Empirical Evaluation and Error Analysis

    Authors: Jan Trienes, Paul Youssef, Jörg Schlötterer, Christin Seifert

    Abstract: Automatically summarizing radiology reports into a concise impression can reduce the manual burden of clinicians and improve the consistency of reporting. Previous work aimed to enhance content selection and factuality through guided abstractive summarization. However, two key issues persist. First, current methods heavily rely on domain-specific resources to extract the guidance signal, limiting… ▽ More

    Submitted 24 July, 2023; originally announced July 2023.

    Comments: Accepted at INLG2023

  42. arXiv:2307.10404  [pdf, other

    cs.CV cs.AI cs.LG

    Interpreting and Correcting Medical Image Classification with PIP-Net

    Authors: Meike Nauta, Johannes H. Hegeman, Jeroen Geerdink, Jörg Schlötterer, Maurice van Keulen, Christin Seifert

    Abstract: Part-prototype models are explainable-by-design image classifiers, and a promising alternative to black box AI. This paper explores the applicability and potential of interpretable machine learning, in particular PIP-Net, for automated diagnosis support on real-world medical imaging data. PIP-Net learns human-understandable prototypical image parts and we evaluate its accuracy and interpretability… ▽ More

    Submitted 11 September, 2023; v1 submitted 19 July, 2023; originally announced July 2023.

    Comments: Accepted to the International Workshop on Explainable and Interpretable Machine Learning (XI-ML), co-located with ECAI 2023

  43. arXiv:2303.01924  [pdf, ps, other

    math.SP math.AP math.FA

    Spectral Theory for Schrödinger operators on compact metric graphs with $δ$ and $δ'$ couplings: a survey

    Authors: Jonathan Rohleder, Christian Seifert

    Abstract: Spectral properties of Schrödinger operators on compact metric graphs are studied and special emphasis is put on differences in the spectral behavior between different classes of vertex conditions. We survey recent results especially for $δ$ and $δ'$ couplings and demonstrate the spectral properties on many examples. Amongst other things, properties of the ground state eigenvalue and eigenfunction… ▽ More

    Submitted 3 July, 2023; v1 submitted 3 March, 2023; originally announced March 2023.

    Comments: 36 pages

  44. arXiv:2302.11270  [pdf, ps, other

    math.FA math.AP

    Perturbations of non-autonomous second-order abstract Cauchy problems

    Authors: Christian Budde, Christian Seifert

    Abstract: In this paper we present time-dependent perturbations of second-order non-autonomous abstract Cauchy problems associated to a family of operators with constant domain. We make use of the equivalence to a first-order non-autonomous abstract Cauchy problem in a product space, which we elaborate in full detail. As an application we provide a perturbed non-autonomous wave equation.

    Submitted 22 February, 2023; originally announced February 2023.

    Comments: 15 pages

  45. How Accurate Does It Feel? -- Human Perception of Different Types of Classification Mistakes

    Authors: Andrea Papenmeier, Dagmar Kern, Daniel Hienert, Yvonne Kammerer, Christin Seifert

    Abstract: Supervised machine learning utilizes large datasets, often with ground truth labels annotated by humans. While some data points are easy to classify, others are hard to classify, which reduces the inter-annotator agreement. This causes noise for the classifier and might affect the user's perception of the classifier's performance. In our research, we investigated whether the classification difficu… ▽ More

    Submitted 13 February, 2023; originally announced February 2023.

    Comments: In CHI '22: Proceedings of the 2021 CHI Conference on Human Factors in Computing Systems

  46. From Black Boxes to Conversations: Incorporating XAI in a Conversational Agent

    Authors: Van Bach Nguyen, Jörg Schlötterer, Christin Seifert

    Abstract: The goal of Explainable AI (XAI) is to design methods to provide insights into the reasoning process of black-box models, such as deep neural networks, in order to explain them to humans. Social science research states that such explanations should be conversational, similar to human-to-human explanations. In this work, we show how to incorporate XAI in a conversational agent, using a standard des… ▽ More

    Submitted 22 July, 2024; v1 submitted 6 September, 2022; originally announced September 2022.

    Comments: Accepted at The World Conference on eXplainable Artificial Intelligence 2023 (XAI-2023)

    Journal ref: World Conference on Explainable Artificial Intelligence 2023

  47. arXiv:2207.01749  [pdf, other

    cs.SE cs.HC

    Human-AI Guidelines in Practice: Leaky Abstractions as an Enabler in Collaborative Software Teams

    Authors: Hariharan Subramonyam, Jane Im, Colleen Seifert, Eytan Adar

    Abstract: In conventional software development, user experience (UX) designers and engineers collaborate through separation of concerns (SoC): designers create human interface specifications, and engineers build to those specifications. However, we argue that Human-AI systems thwart SoC because human needs must shape the design of the AI interface, the underlying AI sub-components, and training data. How do… ▽ More

    Submitted 4 July, 2022; originally announced July 2022.

  48. A note on the Lumer--Phillips theorem for bi-continuous semigroups

    Authors: Karsten Kruse, Christian Seifert

    Abstract: Given a Banach space $X$ and an additional coarser Hausdorff locally convex topology $τ$ on $X$ we characterise the generators of $τ$-bi-continuous semigroups in the spirit of the Lumer--Phillips theorem, i.e. by means of dissipativity w.r.t.~a directed system of seminorms and a range condition.

    Submitted 8 November, 2022; v1 submitted 2 June, 2022; originally announced June 2022.

    MSC Class: Primary 47B44 Secondary 47D06; 46A70

    Journal ref: Zeitschrift für Analysis und ihre Anwendungen 41 (2023), 417-437

  49. Final state observability estimates and cost-uniform approximate null-controllability for bi-continuous semigroups

    Authors: Karsten Kruse, Christian Seifert

    Abstract: We consider final state observability estimates for bi-continuous semigroups on Banach spaces, i.e. for every initial value, estimating the state at a final time $T>0$ by taking into account the orbit of the initial value under the semigroup for $t\in [0,T]$, measured in a suitable norm. We state a sufficient criterion based on an uncertainty relation and a dissipation estimate and provide two exa… ▽ More

    Submitted 31 March, 2023; v1 submitted 1 June, 2022; originally announced June 2022.

    MSC Class: Primary 93C20; 93C25; 47N70; Secondary 47D06; 46A70

    Journal ref: Semigroup Forum 106 (2023), 421-443

  50. arXiv:2204.03503  [pdf, other

    cs.CL cs.AI cs.LG

    Survey on Automated Short Answer Grading with Deep Learning: from Word Embeddings to Transformers

    Authors: Stefan Haller, Adina Aldea, Christin Seifert, Nicola Strisciuglio

    Abstract: Automated short answer grading (ASAG) has gained attention in education as a means to scale educational tasks to the growing number of students. Recent progress in Natural Language Processing and Machine Learning has largely influenced the field of ASAG, of which we survey the recent research advancements. We complement previous surveys by providing a comprehensive analysis of recently published m… ▽ More

    Submitted 11 March, 2022; originally announced April 2022.

    Comments: Under review