Skip to main content

Showing 1–50 of 254 results for author: Müller, K

Searching in archive cs. Search in all archives.
.
  1. arXiv:2505.06744  [pdf, other

    cs.LG stat.ML

    LineFlow: A Framework to Learn Active Control of Production Lines

    Authors: Kai Müller, Martin Wenzel, Tobias Windisch

    Abstract: Many production lines require active control mechanisms, such as adaptive routing, worker reallocation, and rescheduling, to maintain optimal performance. However, designing these control systems is challenging for various reasons, and while reinforcement learning (RL) has shown promise in addressing these challenges, a standardized and general framework is still lacking. In this work, we introduc… ▽ More

    Submitted 10 May, 2025; originally announced May 2025.

    Comments: Accepted at ICML 2025

  2. Smart Starts: Accelerating Convergence through Uncommon Region Exploration

    Authors: Xinyu Zhang, Mário Antunes, Tyler Estro, Erez Zadok, Klaus Mueller

    Abstract: Initialization profoundly affects evolutionary algorithm (EA) efficacy by dictating search trajectories and convergence. This study introduces a hybrid initialization strategy combining empty-space search algorithm (ESA) and opposition-based learning (OBL). OBL initially generates a diverse population, subsequently augmented by ESA, which identifies under-explored regions. This synergy enhances po… ▽ More

    Submitted 8 May, 2025; originally announced May 2025.

  3. arXiv:2504.16255  [pdf, other

    cs.LG cs.CY cs.HC

    FairPlay: A Collaborative Approach to Mitigate Bias in Datasets for Improved AI Fairness

    Authors: Tina Behzad, Mithilesh Kumar Singh, Anthony J. Ripa, Klaus Mueller

    Abstract: The issue of fairness in decision-making is a critical one, especially given the variety of stakeholder demands for differing and mutually incompatible versions of fairness. Adopting a strategic interaction of perspectives provides an alternative to enforcing a singular standard of fairness. We present a web-based software application, FairPlay, that enables multiple stakeholders to debias dataset… ▽ More

    Submitted 22 April, 2025; originally announced April 2025.

    Comments: Accepted at ACM CSCW 2025. 30 pages total (including references and supplementary material). Contains 10 figures

    ACM Class: H.5.2; H.5.3; I.2.6

  4. arXiv:2504.14582  [pdf, other

    cs.CV

    NTIRE 2025 Challenge on Image Super-Resolution ($\times$4): Methods and Results

    Authors: Zheng Chen, Kai Liu, Jue Gong, Jingkai Wang, Lei Sun, Zongwei Wu, Radu Timofte, Yulun Zhang, Xiangyu Kong, Xiaoxuan Yu, Hyunhee Park, Suejin Han, Hakjae Jeon, Dafeng Zhang, Hyung-Ju Chun, Donghun Ryou, Inju Ha, Bohyung Han, Lu Zhao, Yuyi Zhang, Pengyu Yan, Jiawei Hu, Pengwei Liu, Fengjun Guo, Hongyuan Yu , et al. (86 additional authors not shown)

    Abstract: This paper presents the NTIRE 2025 image super-resolution ($\times$4) challenge, one of the associated competitions of the 10th NTIRE Workshop at CVPR 2025. The challenge aims to recover high-resolution (HR) images from low-resolution (LR) counterparts generated through bicubic downsampling with a $\times$4 scaling factor. The objective is to develop effective network designs or solutions that ach… ▽ More

    Submitted 28 April, 2025; v1 submitted 20 April, 2025; originally announced April 2025.

    Comments: NTIRE 2025 webpage: https://www.cvlai.net/ntire/2025. Code: https://github.com/zhengchen1999/NTIRE2025_ImageSR_x4

  5. arXiv:2504.08553  [pdf, other

    cs.LG cs.AI

    Uncovering the Structure of Explanation Quality with Spectral Analysis

    Authors: Johannes Maeß, Grégoire Montavon, Shinichi Nakajima, Klaus-Robert Müller, Thomas Schnake

    Abstract: As machine learning models are increasingly considered for high-stakes domains, effective explanation methods are crucial to ensure that their prediction strategies are transparent to the user. Over the years, numerous metrics have been proposed to assess quality of explanations. However, their practical applicability remains unclear, in particular due to a limited understanding of which specific… ▽ More

    Submitted 11 April, 2025; originally announced April 2025.

    Comments: 14 pages, 5 figures, Accepted at XAI World Conference 2025

  6. arXiv:2504.01947  [pdf, other

    cs.LG cs.AI cs.DC eess.SP

    Efficient Federated Learning Tiny Language Models for Mobile Network Feature Prediction

    Authors: Daniel Becking, Ingo Friese, Karsten Müller, Thomas Buchholz, Mandy Galkow-Schneider, Wojciech Samek, Detlev Marpe

    Abstract: In telecommunications, Autonomous Networks (ANs) automatically adjust configurations based on specific requirements (e.g., bandwidth) and available resources. These networks rely on continuous monitoring and intelligent mechanisms for self-optimization, self-repair, and self-protection, nowadays enhanced by Neural Networks (NNs) to enable predictive modeling and pattern recognition. Here, Federate… ▽ More

    Submitted 2 April, 2025; originally announced April 2025.

    Comments: Accepted at 2025 EuCNC & 6G Summit Poster Session

  7. arXiv:2503.15964  [pdf, other

    cs.CR

    Are We There Yet? A Study of Decentralized Identity Applications

    Authors: Daria Schumm, Katharina O. E. Müller, Burkhard Stiller

    Abstract: The development of Decentralized Identities (DI) and Self-Sovereign Identities (SSI) has seen significant growth in recent years. This is accompanied by a numerous academic and commercial contributions to the development of principles, standards, and systems. While several comprehensive reviews have been produced, they predominantly focus on academic literature, with few considering grey literatur… ▽ More

    Submitted 20 March, 2025; originally announced March 2025.

    Comments: 27 pages

  8. arXiv:2503.01431  [pdf, other

    cs.LG

    How simple can you go? An off-the-shelf transformer approach to molecular dynamics

    Authors: Max Eissler, Tim Korjakow, Stefan Ganscha, Oliver T. Unke, Klaus-Robert Müller, Stefan Gugler

    Abstract: Most current neural networks for molecular dynamics (MD) include physical inductive biases, resulting in specialized and complex architectures. This is in contrast to most other machine learning domains, where specialist approaches are increasingly replaced by general-purpose architectures trained on vast datasets. In line with this trend, several recent studies have questioned the necessity of ar… ▽ More

    Submitted 5 March, 2025; v1 submitted 3 March, 2025; originally announced March 2025.

    Comments: 21 pages, code at https://github.com/mx-e/simple-md

  9. arXiv:2502.08598  [pdf, other

    cs.LG stat.ML

    Enhancing Diffusion Models Efficiency by Disentangling Total-Variance and Signal-to-Noise Ratio

    Authors: Khaled Kahouli, Winfried Ripken, Stefan Gugler, Oliver T. Unke, Klaus-Robert Müller, Shinichi Nakajima

    Abstract: The long sampling time of diffusion models remains a significant bottleneck, which can be mitigated by reducing the number of diffusion time steps. However, the quality of samples with fewer steps is highly dependent on the noise schedule, i.e., the specific manner in which noise is introduced and the signal is reduced at each step. Although prior work has improved upon the original variance-prese… ▽ More

    Submitted 12 February, 2025; originally announced February 2025.

  10. arXiv:2502.01685  [pdf, other

    cs.AI cs.CL cs.CV cs.SD eess.AS

    Automated Extraction of Spatio-Semantic Graphs for Identifying Cognitive Impairment

    Authors: Si-Ioi Ng, Pranav S. Ambadi, Kimberly D. Mueller, Julie Liss, Visar Berisha

    Abstract: Existing methods for analyzing linguistic content from picture descriptions for assessment of cognitive-linguistic impairment often overlook the participant's visual narrative path, which typically requires eye tracking to assess. Spatio-semantic graphs are a useful tool for analyzing this narrative path from transcripts alone, however they are limited by the need for manual tagging of content inf… ▽ More

    Submitted 2 February, 2025; originally announced February 2025.

    Comments: To appear in ICASSP 2025

  11. arXiv:2501.15273  [pdf, other

    cs.LG cs.HC

    Into the Void: Mapping the Unseen Gaps in High Dimensional Data

    Authors: Xinyu Zhang, Tyler Estro, Geoff Kuenning, Erez Zadok, Klaus Mueller

    Abstract: We present a comprehensive pipeline, augmented by a visual analytics system named ``GapMiner'', that is aimed at exploring and exploiting untapped opportunities within the empty areas of high-dimensional datasets. Our approach begins with an initial dataset and then uses a novel Empty Space Search Algorithm (ESA) to identify the center points of these uncharted voids, which are regarded as reservo… ▽ More

    Submitted 25 January, 2025; originally announced January 2025.

  12. Explainable XR: Understanding User Behaviors of XR Environments using LLM-assisted Analytics Framework

    Authors: Yoonsang Kim, Zainab Aamir, Mithilesh Singh, Saeed Boorboor, Klaus Mueller, Arie E. Kaufman

    Abstract: We present Explainable XR, an end-to-end framework for analyzing user behavior in diverse eXtended Reality (XR) environments by leveraging Large Language Models (LLMs) for data interpretation assistance. Existing XR user analytics frameworks face challenges in handling cross-virtuality - AR, VR, MR - transitions, multi-user collaborative application scenarios, and the complexity of multimodal data… ▽ More

    Submitted 10 March, 2025; v1 submitted 23 January, 2025; originally announced January 2025.

    Comments: 11 pages, 8 figures. This is the author's version of the article that has been accepted for publication in IEEE Transactions on Visualization and Computer Graphics

  13. arXiv:2501.05409  [pdf, other

    cs.CV cs.AI cs.LG

    Atlas: A Novel Pathology Foundation Model by Mayo Clinic, Charité, and Aignostics

    Authors: Maximilian Alber, Stephan Tietz, Jonas Dippel, Timo Milbich, Timothée Lesort, Panos Korfiatis, Moritz Krügener, Beatriz Perez Cancer, Neelay Shah, Alexander Möllers, Philipp Seegerer, Alexandra Carpen-Amarie, Kai Standvoss, Gabriel Dernbach, Edwin de Jong, Simon Schallenberg, Andreas Kunft, Helmut Hoffer von Ankershoffen, Gavin Schaeferle, Patrick Duffy, Matt Redlon, Philipp Jurmeister, David Horst, Lukas Ruff, Klaus-Robert Müller , et al. (2 additional authors not shown)

    Abstract: Recent advances in digital pathology have demonstrated the effectiveness of foundation models across diverse applications. In this report, we present Atlas, a novel vision foundation model based on the RudolfV approach. Our model was trained on a dataset comprising 1.2 million histopathology whole slide images, collected from two medical institutions: Mayo Clinic and Charité - Universtätsmedizin B… ▽ More

    Submitted 10 January, 2025; v1 submitted 9 January, 2025; originally announced January 2025.

  14. arXiv:2412.08541  [pdf, other

    cs.LG

    Euclidean Fast Attention: Machine Learning Global Atomic Representations at Linear Cost

    Authors: J. Thorben Frank, Stefan Chmiela, Klaus-Robert Müller, Oliver T. Unke

    Abstract: Long-range correlations are essential across numerous machine learning tasks, especially for data embedded in Euclidean space, where the relative positions and orientations of distant components are often critical for accurate predictions. Self-attention offers a compelling mechanism for capturing these global effects, but its quadratic complexity presents a significant practical limitation. This… ▽ More

    Submitted 11 December, 2024; originally announced December 2024.

  15. arXiv:2412.03118  [pdf, other

    cs.HC cs.CV

    ObjectFinder: An Open-Vocabulary Assistive System for Interactive Object Search by Blind People

    Authors: Ruiping Liu, Jiaming Zhang, Angela Schön, Karin Müller, Junwei Zheng, Kailun Yang, Anhong Guo, Kathrin Gerling, Rainer Stiefelhagen

    Abstract: Searching for objects in unfamiliar scenarios is a challenging task for blind people. It involves specifying the target object, detecting it, and then gathering detailed information according to the user's intent. However, existing description- and detection-based assistive technologies do not sufficiently support the multifaceted nature of interactive object search tasks. We present ObjectFinder,… ▽ More

    Submitted 30 April, 2025; v1 submitted 4 December, 2024; originally announced December 2024.

  16. arXiv:2411.12759  [pdf

    cs.CL cs.AI

    A Novel Approach to Eliminating Hallucinations in Large Language Model-Assisted Causal Discovery

    Authors: Grace Sng, Yanming Zhang, Klaus Mueller

    Abstract: The increasing use of large language models (LLMs) in causal discovery as a substitute for human domain experts highlights the need for optimal model selection. This paper presents the first hallucination survey of popular LLMs for causal discovery. We show that hallucinations exist when using LLMs in causal discovery so the choice of LLM is important. We propose using Retrieval Augmented Generati… ▽ More

    Submitted 15 November, 2024; originally announced November 2024.

  17. arXiv:2411.07643  [pdf, other

    cs.CV cs.LG

    xCG: Explainable Cell Graphs for Survival Prediction in Non-Small Cell Lung Cancer

    Authors: Marvin Sextro, Gabriel Dernbach, Kai Standvoss, Simon Schallenberg, Frederick Klauschen, Klaus-Robert Müller, Maximilian Alber, Lukas Ruff

    Abstract: Understanding how deep learning models predict oncology patient risk can provide critical insights into disease progression, support clinical decision-making, and pave the way for trustworthy and data-driven precision medicine. Building on recent advances in the spatial modeling of the tumor microenvironment using graph neural networks, we present an explainable cell graph (xCG) approach for survi… ▽ More

    Submitted 12 November, 2024; originally announced November 2024.

    Comments: Findings paper presented at Machine Learning for Health (ML4H) symposium 2024, December 15-16, 2024, Vancouver, Canada, 11 pages

  18. arXiv:2411.05894  [pdf, other

    cs.CL cs.AI cs.LG

    SSSD: Simply-Scalable Speculative Decoding

    Authors: Michele Marzollo, Jiawei Zhuang, Niklas Roemer, Lorenz K. Müller, Lukas Cavigelli

    Abstract: Over the past year, Speculative Decoding has gained popularity as a technique for accelerating Large Language Model inference. While several methods have been introduced, most struggle to deliver satisfactory performance at batch sizes typical for data centers ($\geq 8$) and often involve significant deployment complexities. In this work, we offer a theoretical explanation of how Speculative Decod… ▽ More

    Submitted 8 November, 2024; originally announced November 2024.

    Comments: 14 pages, 7 figures

    ACM Class: I.2.7

  19. arXiv:2411.01240  [pdf, other

    cs.LG cs.AI cs.DC

    Boosting Federated Learning with FedEntOpt: Mitigating Label Skew by Entropy-Based Client Selection

    Authors: Andreas Lutz, Gabriele Steidl, Karsten Müller, Wojciech Samek

    Abstract: Deep learning is an emerging field revolutionizing various industries, including natural language processing, computer vision, and many more. These domains typically require an extensive amount of data for optimal performance, potentially utilizing huge centralized data repositories. However, such centralization could raise privacy issues concerning the storage of sensitive data. To address this i… ▽ More

    Submitted 29 January, 2025; v1 submitted 2 November, 2024; originally announced November 2024.

  20. arXiv:2411.00143  [pdf, other

    eess.IV cs.LG

    Enhancing Brain Source Reconstruction through Physics-Informed 3D Neural Networks

    Authors: Marco Morik, Ali Hashemi, Klaus-Robert Müller, Stefan Haufe, Shinichi Nakajima

    Abstract: Reconstructing brain sources is a fundamental challenge in neuroscience, crucial for understanding brain function and dysfunction. Electroencephalography (EEG) signals have a high temporal resolution. However, identifying the correct spatial location of brain sources from these signals remains difficult due to the ill-posed structure of the problem. Traditional methods predominantly rely on manual… ▽ More

    Submitted 31 October, 2024; originally announced November 2024.

    Comments: Under Review in IEEE Transactions on Medical Imaging

  21. arXiv:2410.22568  [pdf, other

    q-fin.RM cs.LG q-fin.CP

    Fast Deep Hedging with Second-Order Optimization

    Authors: Konrad Mueller, Amira Akkari, Lukas Gonon, Ben Wood

    Abstract: Hedging exotic options in presence of market frictions is an important risk management task. Deep hedging can solve such hedging problems by training neural network policies in realistic simulated markets. Training these neural networks may be delicate and suffer from slow convergence, particularly for options with long maturities and complex sensitivities to market parameters. To address this, we… ▽ More

    Submitted 29 October, 2024; originally announced October 2024.

  22. arXiv:2410.14146  [pdf, other

    cs.AI cs.HC cs.LG cs.SI

    CausalChat: Interactive Causal Model Development and Refinement Using Large Language Models

    Authors: Yanming Zhang, Akshith Kota, Eric Papenhausen, Klaus Mueller

    Abstract: Causal networks are widely used in many fields to model the complex relationships between variables. A recent approach has sought to construct causal networks by leveraging the wisdom of crowds through the collective participation of humans. While this can yield detailed causal networks that model the underlying phenomena quite well, it requires a large number of individuals with domain understand… ▽ More

    Submitted 17 October, 2024; originally announced October 2024.

  23. arXiv:2409.16821  [pdf, other

    cs.CV cs.AI

    XAI-guided Insulator Anomaly Detection for Imbalanced Datasets

    Authors: Maximilian Andreas Hoefler, Karsten Mueller, Wojciech Samek

    Abstract: Power grids serve as a vital component in numerous industries, seamlessly delivering electrical energy to industrial processes and technologies, making their safe and reliable operation indispensable. However, powerlines can be hard to inspect due to difficult terrain or harsh climatic conditions. Therefore, unmanned aerial vehicles are increasingly deployed to inspect powerlines, resulting in a s… ▽ More

    Submitted 25 September, 2024; originally announced September 2024.

    Comments: Accepted as a workshop paper at ECCV 2024

  24. arXiv:2409.12965  [pdf, other

    cs.ET cond-mat.dis-nn cs.LG physics.app-ph physics.optics

    Streamlined optical training of large-scale modern deep learning architectures with direct feedback alignment

    Authors: Ziao Wang, Kilian Müller, Matthew Filipovich, Julien Launay, Ruben Ohana, Gustave Pariente, Safa Mokaadi, Charles Brossollet, Fabien Moreau, Alessandro Cappelli, Iacopo Poli, Igor Carron, Laurent Daudet, Florent Krzakala, Sylvain Gigan

    Abstract: Modern deep learning relies nearly exclusively on dedicated electronic hardware accelerators. Photonic approaches, with low consumption and high operation speed, are increasingly considered for inference but, to date, remain mostly limited to relatively basic tasks. Simultaneously, the problem of training deep and complex neural networks, overwhelmingly performed through backpropagation, remains a… ▽ More

    Submitted 2 April, 2025; v1 submitted 1 September, 2024; originally announced September 2024.

    Comments: 20 pages, 4 figures; Additional experiments conducted;

  25. arXiv:2409.06509  [pdf, other

    cs.CV cs.AI cs.LG

    Aligning Machine and Human Visual Representations across Abstraction Levels

    Authors: Lukas Muttenthaler, Klaus Greff, Frieda Born, Bernhard Spitzer, Simon Kornblith, Michael C. Mozer, Klaus-Robert Müller, Thomas Unterthiner, Andrew K. Lampinen

    Abstract: Deep neural networks have achieved success across a wide range of applications, including as models of human behavior in vision tasks. However, neural network training and human learning differ in fundamental ways, and neural networks often fail to generalize as robustly as humans do, raising questions regarding the similarity of their underlying representations. What is missing for modern learnin… ▽ More

    Submitted 29 October, 2024; v1 submitted 10 September, 2024; originally announced September 2024.

    Comments: 54 pages

  26. arXiv:2409.04670  [pdf, other

    cs.CV

    Multi-Conditioned Denoising Diffusion Probabilistic Model (mDDPM) for Medical Image Synthesis

    Authors: Arjun Krishna, Ge Wang, Klaus Mueller

    Abstract: Medical imaging applications are highly specialized in terms of human anatomy, pathology, and imaging domains. Therefore, annotated training datasets for training deep learning applications in medical imaging not only need to be highly accurate but also diverse and large enough to encompass almost all plausible examples with respect to those specifications. We argue that achieving this goal can be… ▽ More

    Submitted 6 September, 2024; originally announced September 2024.

  27. arXiv:2409.02730  [pdf, other

    cs.LG physics.chem-ph

    Complete and Efficient Covariants for 3D Point Configurations with Application to Learning Molecular Quantum Properties

    Authors: Hartmut Maennel, Oliver T. Unke, Klaus-Robert Müller

    Abstract: When modeling physical properties of molecules with machine learning, it is desirable to incorporate $SO(3)$-covariance. While such models based on low body order features are not complete, we formulate and prove general completeness properties for higher order methods, and show that $6k-5$ of these features are enough for up to $k$ atoms. We also find that the Clebsch--Gordan operations commonly… ▽ More

    Submitted 4 September, 2024; originally announced September 2024.

  28. arXiv:2408.17198  [pdf, other

    cs.AI cs.LG

    Towards Symbolic XAI -- Explanation Through Human Understandable Logical Relationships Between Features

    Authors: Thomas Schnake, Farnoush Rezaei Jafari, Jonas Lederer, Ping Xiong, Shinichi Nakajima, Stefan Gugler, Grégoire Montavon, Klaus-Robert Müller

    Abstract: Explainable Artificial Intelligence (XAI) plays a crucial role in fostering transparency and trust in AI systems, where traditional XAI approaches typically offer one level of abstraction for explanations, often in the form of heatmaps highlighting single or multiple input features. However, we ask whether abstract reasoning or problem-solving strategies of a model may also be relevant, as these a… ▽ More

    Submitted 1 October, 2024; v1 submitted 30 August, 2024; originally announced August 2024.

  29. arXiv:2408.08041  [pdf, other

    cs.LG cs.AI stat.ML

    The Clever Hans Effect in Unsupervised Learning

    Authors: Jacob Kauffmann, Jonas Dippel, Lukas Ruff, Wojciech Samek, Klaus-Robert Müller, Grégoire Montavon

    Abstract: Unsupervised learning has become an essential building block of AI systems. The representations it produces, e.g. in foundation models, are critical to a wide variety of downstream applications. It is therefore important to carefully examine unsupervised models to ensure not only that they produce accurate predictions, but also that these predictions are not "right for the wrong reasons", the so-c… ▽ More

    Submitted 15 August, 2024; originally announced August 2024.

    Comments: 12 pages + supplement

  30. arXiv:2407.18935  [pdf, other

    physics.chem-ph cs.LG

    A Machine Learning and Explainable AI Framework Tailored for Unbalanced Experimental Catalyst Discovery

    Authors: Parastoo Semnani, Mihail Bogojeski, Florian Bley, Zizheng Zhang, Qiong Wu, Thomas Kneib, Jan Herrmann, Christoph Weisser, Florina Patcas, Klaus-Robert Müller

    Abstract: The successful application of machine learning (ML) in catalyst design relies on high-quality and diverse data to ensure effective generalization to novel compositions, thereby aiding in catalyst discovery. However, due to complex interactions, catalyst design has long relied on trial-and-error, a costly and labor-intensive process leading to scarce data that is heavily biased towards undesired, l… ▽ More

    Submitted 10 July, 2024; originally announced July 2024.

  31. Early Explorations of Lightweight Models for Wound Segmentation on Mobile Devices

    Authors: Vanessa Borst, Timo Dittus, Konstantin Müller, Samuel Kounev

    Abstract: The aging population poses numerous challenges to healthcare, including the increase in chronic wounds in the elderly. The current approach to wound assessment by therapists based on photographic documentation is subjective, highlighting the need for computer-aided wound recognition from smartphone photos. This offers objective and convenient therapy monitoring, while being accessible to patients… ▽ More

    Submitted 30 August, 2024; v1 submitted 10 July, 2024; originally announced July 2024.

    Comments: Extended version of our paper that was published in the "47th German Conference on Artificial Intelligence (KI 2024)"

  32. arXiv:2406.17805  [pdf, other

    cs.CL cs.AI cs.HC

    Can LLMs Generate Visualizations with Dataless Prompts?

    Authors: Darius Coelho, Harshit Barot, Naitik Rathod, Klaus Mueller

    Abstract: Recent advancements in large language models have revolutionized information access, as these models harness data available on the web to address complex queries, becoming the preferred information source for many users. In certain cases, queries are about publicly available data, which can be effectively answered with data visualizations. In this paper, we investigate the ability of large languag… ▽ More

    Submitted 22 June, 2024; originally announced June 2024.

  33. arXiv:2406.14866  [pdf, other

    cs.AI eess.IV

    AI-based Anomaly Detection for Clinical-Grade Histopathological Diagnostics

    Authors: Jonas Dippel, Niklas Prenißl, Julius Hense, Philipp Liznerski, Tobias Winterhoff, Simon Schallenberg, Marius Kloft, Oliver Buchstab, David Horst, Maximilian Alber, Lukas Ruff, Klaus-Robert Müller, Frederick Klauschen

    Abstract: While previous studies have demonstrated the potential of AI to diagnose diseases in imaging data, clinical implementation is still lagging behind. This is partly because AI models require training with large numbers of examples only available for common diseases. In clinical reality, however, only few diseases are common, whereas the majority of diseases are less frequent (long-tail distribution)… ▽ More

    Submitted 21 June, 2024; originally announced June 2024.

  34. arXiv:2406.07592  [pdf, other

    cs.LG cs.AI stat.ML

    MambaLRP: Explaining Selective State Space Sequence Models

    Authors: Farnoush Rezaei Jafari, Grégoire Montavon, Klaus-Robert Müller, Oliver Eberle

    Abstract: Recent sequence modeling approaches using selective state space sequence models, referred to as Mamba models, have seen a surge of interest. These models allow efficient processing of long sequences in linear time and are rapidly being adopted in a wide range of applications such as language modeling, demonstrating promising performance. To foster their reliable use in real-world scenarios, it is… ▽ More

    Submitted 15 January, 2025; v1 submitted 11 June, 2024; originally announced June 2024.

  35. arXiv:2406.06150  [pdf, other

    cs.LG quant-ph

    Physics-Informed Bayesian Optimization of Variational Quantum Circuits

    Authors: Kim A. Nicoli, Christopher J. Anders, Lena Funcke, Tobias Hartung, Karl Jansen, Stefan Kühn, Klaus-Robert Müller, Paolo Stornati, Pan Kessel, Shinichi Nakajima

    Abstract: In this paper, we propose a novel and powerful method to harness Bayesian optimization for Variational Quantum Eigensolvers (VQEs) -- a hybrid quantum-classical protocol used to approximate the ground state of a quantum Hamiltonian. Specifically, we derive a VQE-kernel which incorporates important prior information about quantum circuits: the kernel feature map of the VQE-kernel exactly matches th… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: 36 pages, 17 figures, 37th Conference on Neural Information Processing Systems (NeurIPS 2023)

  36. arXiv:2406.04280  [pdf, other

    cs.LG cs.CV

    xMIL: Insightful Explanations for Multiple Instance Learning in Histopathology

    Authors: Julius Hense, Mina Jamshidi Idaji, Oliver Eberle, Thomas Schnake, Jonas Dippel, Laure Ciernik, Oliver Buchstab, Andreas Mock, Frederick Klauschen, Klaus-Robert Müller

    Abstract: Multiple instance learning (MIL) is an effective and widely used approach for weakly supervised machine learning. In histopathology, MIL models have achieved remarkable success in tasks like tumor detection, biomarker prediction, and outcome prognostication. However, MIL explanation methods are still lagging behind, as they are limited to small bag sizes or disregard instance interactions. We revi… ▽ More

    Submitted 7 January, 2025; v1 submitted 6 June, 2024; originally announced June 2024.

  37. arXiv:2405.19124  [pdf, other

    cs.CV

    ACCSAMS: Automatic Conversion of Exam Documents to Accessible Learning Material for Blind and Visually Impaired

    Authors: David Wilkening, Omar Moured, Thorsten Schwarz, Karin Muller, Rainer Stiefelhagen

    Abstract: Exam documents are essential educational materials for exam preparation. However, they pose a significant academic barrier for blind and visually impaired students, as they are often created without accessibility considerations. Typically, these documents are incompatible with screen readers, contain excessive white space, and lack alternative text for visual elements. This situation frequently re… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

    Comments: Accepted at ICCHP 2024

  38. arXiv:2405.19117  [pdf, other

    cs.CV

    ChartFormer: A Large Vision Language Model for Converting Chart Images into Tactile Accessible SVGs

    Authors: Omar Moured, Sara Alzalabny, Anas Osman, Thorsten Schwarz, Karin Muller, Rainer Stiefelhagen

    Abstract: Visualizations, such as charts, are crucial for interpreting complex data. However, they are often provided as raster images, which are not compatible with assistive technologies for people with blindness and visual impairments, such as embossed papers or tactile displays. At the same time, creating accessible vector graphics requires a skilled sighted person and is time-intensive. In this work, w… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

    Comments: Accepted at ICCHP 2024. Codes will be available at https://github.com/nsothman/ChartFormer

  39. arXiv:2405.19111  [pdf, other

    cs.CV cs.HC

    Alt4Blind: A User Interface to Simplify Charts Alt-Text Creation

    Authors: Omar Moured, Shahid Ali Farooqui, Karin Muller, Sharifeh Fadaeijouybari, Thorsten Schwarz, Mohammed Javed, Rainer Stiefelhagen

    Abstract: Alternative Texts (Alt-Text) for chart images are essential for making graphics accessible to people with blindness and visual impairments. Traditionally, Alt-Text is manually written by authors but often encounters issues such as oversimplification or complication. Recent trends have seen the use of AI for Alt-Text generation. However, existing models are susceptible to producing inaccurate or mi… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

    Comments: Accepted at ICCHP 2024. Codes will be available at https://moured.github.io/alt4blind/

  40. arXiv:2404.10935  [pdf, other

    physics.chem-ph cs.LG physics.comp-ph stat.ML

    Molecular relaxation by reverse diffusion with time step prediction

    Authors: Khaled Kahouli, Stefaan Simon Pierre Hessmann, Klaus-Robert Müller, Shinichi Nakajima, Stefan Gugler, Niklas Wolf Andreas Gebauer

    Abstract: Molecular relaxation, finding the equilibrium state of a non-equilibrium structure, is an essential component of computational chemistry to understand reactivity. Classical force field (FF) methods often rely on insufficient local energy minimization, while neural network FF models require large labeled datasets encompassing both equilibrium and non-equilibrium structures. As a remedy, we propose… ▽ More

    Submitted 3 August, 2024; v1 submitted 16 April, 2024; originally announced April 2024.

  41. arXiv:2403.13321  [pdf, other

    cs.RO

    Robotics meets Fluid Dynamics: A Characterization of the Induced Airflow below a Quadrotor as a Turbulent Jet

    Authors: Leonard Bauersfeld, Koen Muller, Dominic Ziegler, Filippo Coletti, Davide Scaramuzza

    Abstract: The widespread adoption of quadrotors for diverse applications, from agriculture to public safety, necessitates an understanding of the aerodynamic disturbances they create. This paper introduces a computationally lightweight model for estimating the time-averaged magnitude of the induced flow below quadrotors in hover. Unlike related approaches that rely on expensive computational fluid dynamics… ▽ More

    Submitted 12 December, 2024; v1 submitted 20 March, 2024; originally announced March 2024.

    Comments: 7+1 pages

    Journal ref: IEEE Robotics and Automation Letters (RA-L), 2024

  42. arXiv:2403.07486  [pdf, other

    cs.LG

    XpertAI: uncovering regression model strategies for sub-manifolds

    Authors: Simon Letzgus, Klaus-Robert Müller, Grégoire Montavon

    Abstract: In recent years, Explainable AI (XAI) methods have facilitated profound validation and knowledge extraction from ML models. While extensively studied for classification, few XAI solutions have addressed the challenges specific to regression models. In regression, explanations need to be precisely formulated to address specific user queries (e.g.\ distinguishing between `Why is the output above 0?'… ▽ More

    Submitted 4 April, 2025; v1 submitted 12 March, 2024; originally announced March 2024.

  43. Chart4Blind: An Intelligent Interface for Chart Accessibility Conversion

    Authors: Omar Moured, Morris Baumgarten-Egemole, Alina Roitberg, Karin Muller, Thorsten Schwarz, Rainer Stiefelhagen

    Abstract: In a world driven by data visualization, ensuring the inclusive accessibility of charts for Blind and Visually Impaired (BVI) individuals remains a significant challenge. Charts are usually presented as raster graphics without textual and visual metadata needed for an equivalent exploration experience for BVI people. Additionally, converting these charts into accessible formats requires considerab… ▽ More

    Submitted 25 March, 2024; v1 submitted 11 March, 2024; originally announced March 2024.

    Comments: Accepted to IUI 2024. 19 pages, 7 figures, 2 table. For a demo video, see this https://moured.github.io/chart4blind/ . The source code is available at https://github.com/moured/chart4blind_code/

  44. Belief Miner: A Methodology for Discovering Causal Beliefs and Causal Illusions from General Populations

    Authors: Shahreen Salim, Md Naimul Hoque, Klaus Mueller

    Abstract: Causal belief is a cognitive practice that humans apply everyday to reason about cause and effect relations between factors, phenomena, or events. Like optical illusions, humans are prone to drawing causal relations between events that are only coincidental (i.e., causal illusions). Researchers in domains such as cognitive psychology and healthcare often use logistically expensive experiments to u… ▽ More

    Submitted 15 January, 2024; originally announced January 2024.

  45. arXiv:2401.06122  [pdf, other

    cs.LG cs.AI cs.CV

    Manipulating Feature Visualizations with Gradient Slingshots

    Authors: Dilyara Bareeva, Marina M. -C. Höhne, Alexander Warnecke, Lukas Pirch, Klaus-Robert Müller, Konrad Rieck, Kirill Bykov

    Abstract: Deep Neural Networks (DNNs) are capable of learning complex and versatile representations, however, the semantic nature of the learned concepts remains unknown. A common method used to explain the concepts learned by DNNs is Feature Visualization (FV), which generates a synthetic input signal that maximally activates a particular neuron in the network. In this paper, we investigate the vulnerabili… ▽ More

    Submitted 10 July, 2024; v1 submitted 11 January, 2024; originally announced January 2024.

  46. arXiv:2401.04079  [pdf, ps, other

    eess.IV cs.CV cs.LG

    RudolfV: A Foundation Model by Pathologists for Pathologists

    Authors: Jonas Dippel, Barbara Feulner, Tobias Winterhoff, Timo Milbich, Stephan Tietz, Simon Schallenberg, Gabriel Dernbach, Andreas Kunft, Simon Heinke, Marie-Lisa Eich, Julika Ribbat-Idel, Rosemarie Krupar, Philipp Anders, Niklas Prenißl, Philipp Jurmeister, David Horst, Lukas Ruff, Klaus-Robert Müller, Frederick Klauschen, Maximilian Alber

    Abstract: Artificial intelligence has started to transform histopathology impacting clinical diagnostics and biomedical research. However, while many computational pathology approaches have been proposed, most current AI models are limited with respect to generalization, application variety, and handling rare diseases. Recent efforts introduced self-supervised foundation models to address these challenges,… ▽ More

    Submitted 11 June, 2024; v1 submitted 8 January, 2024; originally announced January 2024.

  47. arXiv:2312.16211  [pdf, other

    cs.AI cs.HC cs.LG

    An Explainable AI Approach to Large Language Model Assisted Causal Model Auditing and Development

    Authors: Yanming Zhang, Brette Fitzgibbon, Dino Garofolo, Akshith Kota, Eric Papenhausen, Klaus Mueller

    Abstract: Causal networks are widely used in many fields, including epidemiology, social science, medicine, and engineering, to model the complex relationships between variables. While it can be convenient to algorithmically infer these models directly from observational data, the resulting networks are often plagued with erroneous edges. Auditing and correcting these networks may require domain expertise f… ▽ More

    Submitted 23 December, 2023; originally announced December 2023.

  48. arXiv:2312.15306  [pdf, other

    cs.LG cs.HC

    Reconstructing High-Dimensional Datasets From Their Bivariate Projections

    Authors: Eli Dugan, Klaus Mueller

    Abstract: This paper deals with developing techniques for the reconstruction of high-dimensional datasets given each bivariate projection, as would be found in a matrix scatterplot. A graph-based solution is introduced, involving clique-finding, providing a set of possible rows that might make up the original dataset. Complications are discussed, including cases where phantom cliques are found, as well as c… ▽ More

    Submitted 23 December, 2023; originally announced December 2023.

  49. arXiv:2312.08479  [pdf

    cs.CV

    Vision Transformer-Based Deep Learning for Histologic Classification of Endometrial Cancer

    Authors: Manu Goyal, Laura J. Tafe, James X. Feng, Kristen E. Muller, Liesbeth Hondelink, Jessica L. Bentz, Saeed Hassanpour

    Abstract: Endometrial cancer, the fourth most common cancer in females in the United States, with the lifetime risk for developing this disease is approximately 2.8% in women. Precise histologic evaluation and molecular classification of endometrial cancer is important for effective patient management and determining the best treatment modalities. This study introduces EndoNet, which uses convolutional neur… ▽ More

    Submitted 27 March, 2024; v1 submitted 13 December, 2023; originally announced December 2023.

    Comments: 4 Tables and 3 Figures

  50. Auralization based on multi-perspective ambisonic room impulse responses

    Authors: Kaspar Müller, Franz Zotter

    Abstract: Most often, virtual acoustic rendering employs real-time updated room acoustic simulations to accomplish auralization for a variable listener perspective. As an alternative, we propose and test a technique to interpolate room impulse responses, specifically Ambisonic room impulse responses (ARIRs) available at a grid of spatially distributed receiver perspectives, measured or simulated in a desire… ▽ More

    Submitted 6 December, 2023; v1 submitted 5 December, 2023; originally announced December 2023.

    Comments: 18 pages, published in Acta Acustica (Open Access), datasets are available via https://paperswithcode.com/dataset/cube-b-format-ambisonic-rir-dataset and https://paperswithcode.com/dataset/variable-perspective-arir-rendering-listening

    Journal ref: Acta Acustica, Volume 4, Number 6, Article Number 25, 2020