Skip to main content

Showing 1–50 of 93 results for author: Weber, T

Searching in archive cs. Search in all archives.
.
  1. arXiv:2507.05127  [pdf, ps, other

    cs.LG

    Kronecker-factored Approximate Curvature (KFAC) From Scratch

    Authors: Felix Dangel, Bálint Mucsányi, Tobias Weber, Runa Eschenhagen

    Abstract: Kronecker-factored approximate curvature (KFAC) is arguably one of the most prominent curvature approximations in deep learning. Its applications range from optimization to Bayesian deep learning, training data attribution with influence functions, and model compression or merging. While the intuition behind KFAC is easy to understand, its implementation is tedious: It comes in many flavours, has… ▽ More

    Submitted 1 July, 2025; originally announced July 2025.

  2. arXiv:2505.01754  [pdf, other

    cs.AI cs.CL cs.IR cs.LG cs.MA

    Unraveling Media Perspectives: A Comprehensive Methodology Combining Large Language Models, Topic Modeling, Sentiment Analysis, and Ontology Learning to Analyse Media Bias

    Authors: Orlando Jähde, Thorsten Weber, Rüdiger Buchkremer

    Abstract: Biased news reporting poses a significant threat to informed decision-making and the functioning of democracies. This study introduces a novel methodology for scalable, minimally biased analysis of media bias in political news. The proposed approach examines event selection, labeling, word choice, and commission and omission biases across news sources by leveraging natural language processing tech… ▽ More

    Submitted 3 May, 2025; originally announced May 2025.

    MSC Class: 68T09; 68T50; 68T05; 62R07; 68U15; 68T27; 68T20 68T09; 68T50; 68T05; 62R07; 68U15; 68T27; 68T20 68T09; 68T50; 68T05; 62R07; 68U15; 68T27; 68T20 ACM Class: I.2; H.3; I.5; I.7; H.5; H.1

    Journal ref: J Comput Soc Sc 8, 41 (2025)

  3. arXiv:2504.15844  [pdf, other

    cs.LO

    Sound and Complete Invariant-Based Heap Encodings (Technical Report)

    Authors: Zafer Esen, Philipp Rümmer, Tjark Weber

    Abstract: Verification of programs operating on mutable, heap-allocated data structures poses significant challenges due to potentially unbounded structures like linked lists and trees. In this paper, we present a novel relational heap encoding leveraging uninterpreted predicates and prophecy variables, reducing heap verification tasks to satisfiability checks over integers in constrained Horn clauses (CHCs… ▽ More

    Submitted 23 April, 2025; v1 submitted 22 April, 2025; originally announced April 2025.

    Comments: Added acknowledgements

  4. arXiv:2504.14631  [pdf, ps, other

    cs.HC cs.SE

    Explainability for Embedding AI: Aspirations and Actuality

    Authors: Thomas Weber

    Abstract: With artificial intelligence (AI) embedded in many everyday software systems, effectively and reliably developing and maintaining AI systems becomes an essential skill for software developers. However, the complexity inherent to AI poses new challenges. Explainable AI (XAI) may allow developers to understand better the systems they build, which, in turn, can help with tasks like debugging. In this… ▽ More

    Submitted 20 April, 2025; originally announced April 2025.

    Comments: Second Workshop on Engineering Interactive Systems Embedding AI Technologies at EICS 2024, Tuesday June 25th, 2024 - Cagliary, Sardinia, Italy

  5. arXiv:2503.09919  [pdf, ps, other

    math.CO cs.CG math.MG

    Drums of high width

    Authors: Alex Davies, Prateek Gupta, Sebastien Racaniere, Grzegorz Swirszcz, Adam Zsolt Wagner, Theophane Weber, Geordie Williamson

    Abstract: We provide a family of $5$-dimensional prismatoids whose width grows linearly in the number of vertices. This provides a new infinite family of counter-examples to the Hirsch conjecture whose excess width grows linearly in the number of vertices, and answers a question of Matschke, Santos and Weibel.

    Submitted 12 March, 2025; originally announced March 2025.

    Comments: 31 pages

  6. NVP-HRI: Zero Shot Natural Voice and Posture-based Human-Robot Interaction via Large Language Model

    Authors: Yuzhi Lai, Shenghai Yuan, Youssef Nassar, Mingyu Fan, Thomas Weber, Matthias Rätsch

    Abstract: Effective Human-Robot Interaction (HRI) is crucial for future service robots in aging societies. Existing solutions are biased toward only well-trained objects, creating a gap when dealing with new objects. Currently, HRI systems using predefined gestures or language tokens for pretrained objects pose challenges for all individuals, especially elderly ones. These challenges include difficulties in… ▽ More

    Submitted 12 March, 2025; originally announced March 2025.

    Comments: This work has been accepted for publication in ESWA @ 2025 Elsevier. Personal use of this material is permitted. Permission from Elsevier must be obtained for all other uses, including reprinting/redistribution, creating new works, or reuse of any copyrighted components of this work in other media

  7. arXiv:2502.05458  [pdf, other

    cs.CV cs.LG stat.ML

    Block Graph Neural Networks for tumor heterogeneity prediction

    Authors: Marianne Abémgnigni Njifon, Tobias Weber, Viktor Bezborodov, Tyll Krueger, Dominic Schuhmacher

    Abstract: Accurate tumor classification is essential for selecting effective treatments, but current methods have limitations. Standard tumor grading, which categorizes tumors based on cell differentiation, is not recommended as a stand-alone procedure, as some well-differentiated tumors can be malignant. Tumor heterogeneity assessment via single-cell sequencing offers profound insights but can be costly an… ▽ More

    Submitted 8 February, 2025; originally announced February 2025.

    Comments: 27 pages, 8 figures

  8. arXiv:2502.05199  [pdf, other

    math.CO cs.CG math.MG

    Advancing Geometry with AI: Multi-agent Generation of Polytopes

    Authors: Grzegorz Swirszcz, Adam Zsolt Wagner, Geordie Williamson, Sam Blackwell, Bogdan Georgiev, Alex Davies, Ali Eslami, Sebastien Racaniere, Theophane Weber, Pushmeet Kohli

    Abstract: Polytopes are one of the most primitive concepts underlying geometry. Discovery and study of polytopes with complex structures provides a means of advancing scientific knowledge. Construction of polytopes with specific extremal structure is very difficult and time-consuming. Having an automated tool for the generation of such extremal examples is therefore of great value. We present an Artificial… ▽ More

    Submitted 30 January, 2025; originally announced February 2025.

    Comments: 18 pages, 5 figures

    MSC Class: 52B05; 52B55; 68T20

  9. arXiv:2502.02496  [pdf, other

    cs.LG stat.ML

    Deep Weight Factorization: Sparse Learning Through the Lens of Artificial Symmetries

    Authors: Chris Kolb, Tobias Weber, Bernd Bischl, David Rügamer

    Abstract: Sparse regularization techniques are well-established in machine learning, yet their application in neural networks remains challenging due to the non-differentiability of penalties like the $L_1$ norm, which is incompatible with stochastic gradient descent. A promising alternative is shallow weight factorization, where weights are decomposed into two factors, allowing for smooth optimization of… ▽ More

    Submitted 7 February, 2025; v1 submitted 4 February, 2025; originally announced February 2025.

    Comments: accepted at ICLR 2025

  10. One Does Not Simply Meme Alone: Evaluating Co-Creativity Between LLMs and Humans in the Generation of Humor

    Authors: Zhikun Wu, Thomas Weber, Florian Müller

    Abstract: Collaboration has been shown to enhance creativity, leading to more innovative and effective outcomes. While previous research has explored the abilities of Large Language Models (LLMs) to serve as co-creative partners in tasks like writing poetry or creating narratives, the collaborative potential of LLMs in humor-rich and culturally nuanced domains remains an open question. To address this gap,… ▽ More

    Submitted 23 January, 2025; v1 submitted 20 January, 2025; originally announced January 2025.

    Comments: to appear in: 30th International Conference on Intelligent User Interfaces IUI 25 March 2427 2025 Cagliari Italy

  11. arXiv:2411.10308  [pdf, other

    cs.CV cs.AI physics.med-ph

    A Realistic Collimated X-Ray Image Simulation Pipeline

    Authors: Benjamin El-Zein, Dominik Eckert, Thomas Weber, Maximilian Rohleder, Ludwig Ritschl, Steffen Kappler, Andreas Maier

    Abstract: Collimator detection remains a challenging task in X-ray systems with unreliable or non-available information about the detectors position relative to the source. This paper presents a physically motivated image processing pipeline for simulating the characteristics of collimator shadows in X-ray images. By generating randomized labels for collimator shapes and locations, incorporating scattered r… ▽ More

    Submitted 15 November, 2024; originally announced November 2024.

  12. arXiv:2409.15183  [pdf, other

    cs.AI cs.AR eess.SP

    Chattronics: using GPTs to assist in the design of data acquisition systems

    Authors: Jonathan Paul Driemeyer Brown, Tiago Oliveira Weber

    Abstract: The usefulness of Large Language Models (LLM) is being continuously tested in various fields. However, their intrinsic linguistic characteristic is still one of the limiting factors when applying these models to exact sciences. In this article, a novel approach to use General Pre-Trained Transformers to assist in the design phase of data acquisition systems will be presented. The solution is packa… ▽ More

    Submitted 23 September, 2024; originally announced September 2024.

    Comments: 8 pages

    MSC Class: 68T35 ACM Class: I.2.1; J.6

  13. A novel fusion of Sentinel-1 and Sentinel-2 with climate data for crop phenology estimation using Machine Learning

    Authors: Shahab Aldin Shojaeezadeh, Abdelrazek Elnashar, Tobias Karl David Weber

    Abstract: Crop phenology describes the physiological development stages of crops from planting to harvest which is valuable information for decision makers to plan and adapt agricultural management strategies. In the era of big Earth observation data ubiquity, attempts have been made to accurately detect crop phenology using Remote Sensing (RS) and high resolution weather data. However, most studies have fo… ▽ More

    Submitted 12 May, 2025; v1 submitted 16 August, 2024; originally announced September 2024.

    Journal ref: Science of Remote Sensing, 100227 (2025)

  14. arXiv:2407.13760  [pdf, other

    eess.SY cs.AI

    Neural Network Tire Force Modeling for Automated Drifting

    Authors: Nicholas Drake Broadbent, Trey Weber, Daiki Mori, J. Christian Gerdes

    Abstract: Automated drifting presents a challenge problem for vehicle control, requiring models and control algorithms that can precisely handle nonlinear, coupled tire forces at the friction limits. We present a neural network architecture for predicting front tire lateral force as a drop-in replacement for physics-based approaches. With a full-scale automated vehicle purpose-built for the drifting applica… ▽ More

    Submitted 18 July, 2024; originally announced July 2024.

    Comments: 16th International Symposium on Advanced Vehicle Control (AVEC). September 2nd-6th, 2024. Milan, Italy

  15. arXiv:2406.10636  [pdf, other

    cs.HC

    From Computational to Conversational Notebooks

    Authors: Thomas Weber, Sven Mayer

    Abstract: Today, we see a drastic increase in LLM-based user interfaces to support users in various tasks. Also, in programming, we witness a productivity boost with features like LLM-supported code completion and conversational agents to generate code. In this work, we look at the future of computational notebooks by enriching them with LLM support. We propose a spectrum of support, from simple inline code… ▽ More

    Submitted 15 June, 2024; originally announced June 2024.

    Comments: 1st ACM CHI Workshop on Human-Notebook Interactions

  16. arXiv:2406.05072  [pdf, other

    cs.LG stat.ML

    Linearization Turns Neural Operators into Function-Valued Gaussian Processes

    Authors: Emilia Magnani, Marvin Pförtner, Tobias Weber, Philipp Hennig

    Abstract: Neural operators generalize neural networks to learn mappings between function spaces from data. They are commonly used to learn solution operators of parametric partial differential equations (PDEs) or propagators of time-dependent PDEs. However, to make them useful in high-stakes simulation scenarios, their inherent predictive error must be quantified reliably. We introduce LUNO, a novel framewo… ▽ More

    Submitted 31 January, 2025; v1 submitted 7 June, 2024; originally announced June 2024.

    MSC Class: G.1.0; I.2.6; G.3; G.1.8

  17. arXiv:2405.02475  [pdf, other

    cs.LG cs.AI stat.CO stat.ME

    Generalizing Orthogonalization for Models with Non-Linearities

    Authors: David Rügamer, Chris Kolb, Tobias Weber, Lucas Kook, Thomas Nagler

    Abstract: The complexity of black-box algorithms can lead to various challenges, including the introduction of biases. These biases present immediate risks in the algorithms' application. It was, for instance, shown that neural networks can deduce racial information solely from a patient's X-ray scan, a task beyond the capability of medical experts. If this fact is not known to the medical expert, automatic… ▽ More

    Submitted 2 June, 2024; v1 submitted 3 May, 2024; originally announced May 2024.

  18. arXiv:2404.09683  [pdf, other

    eess.IV cs.CV cs.LG

    Post-Training Network Compression for 3D Medical Image Segmentation: Reducing Computational Efforts via Tucker Decomposition

    Authors: Tobias Weber, Jakob Dexl, David Rügamer, Michael Ingrisch

    Abstract: We address the computational barrier of deploying advanced deep learning segmentation models in clinical settings by studying the efficacy of network compression through tensor decomposition. We propose a post-training Tucker factorization that enables the decomposition of pre-existing models to reduce computational requirements without impeding segmentation accuracy. We applied Tucker decompositi… ▽ More

    Submitted 18 April, 2024; v1 submitted 15 April, 2024; originally announced April 2024.

  19. arXiv:2311.05540  [pdf, other

    cs.HC

    Usability and Adoption of Graphical Data-Driven Development Tools

    Authors: Thomas Weber, Sven Mayer

    Abstract: Software development of modern, data-driven applications still relies on tools that use interaction paradigms that have remained mostly unchanged for decades. While rich forms of interactions exist as an alternative to textual command input, they find little adoption in professional software creation. In this work, we compare graphical programming using direct manipulation to the traditional, text… ▽ More

    Submitted 9 November, 2023; originally announced November 2023.

  20. arXiv:2311.01349  [pdf, other

    cs.LG cs.CY stat.ML

    Post-hoc Orthogonalization for Mitigation of Protected Feature Bias in CXR Embeddings

    Authors: Tobias Weber, Michael Ingrisch, Bernd Bischl, David Rügamer

    Abstract: Purpose: To analyze and remove protected feature effects in chest radiograph embeddings of deep learning models. Methods: An orthogonalization is utilized to remove the influence of protected features (e.g., age, sex, race) in CXR embeddings, ensuring feature-independent results. To validate the efficacy of the approach, we retrospectively study the MIMIC and CheXpert datasets using three pre-trai… ▽ More

    Submitted 11 June, 2024; v1 submitted 2 November, 2023; originally announced November 2023.

  21. arXiv:2310.18091  [pdf, other

    cs.LG stat.ML

    Adversarial Anomaly Detection using Gaussian Priors and Nonlinear Anomaly Scores

    Authors: Fiete Lüer, Tobias Weber, Maxim Dolgich, Christian Böhm

    Abstract: Anomaly detection in imbalanced datasets is a frequent and crucial problem, especially in the medical domain where retrieving and labeling irregularities is often expensive. By combining the generative stability of a $β$-variational autoencoder (VAE) with the discriminative strengths of generative adversarial networks (GANs), we propose a novel model, $β$-VAEGAN. We investigate methods for composi… ▽ More

    Submitted 27 October, 2023; originally announced October 2023.

    Comments: accepted at AI4TS @ ICDMW 2023

  22. arXiv:2307.02924  [pdf, other

    cs.RO cs.HC

    The Emotional Dilemma: Influence of a Human-like Robot on Trust and Cooperation

    Authors: Dennis Becker, Diana Rueda, Felix Beese, Brenda Scarleth Gutierrez Torres, Myriem Lafdili, Kyra Ahrens, Di Fu, Erik Strahl, Tom Weber, Stefan Wermter

    Abstract: Increasing anthropomorphic robot behavioral design could affect trust and cooperation positively. However, studies have shown contradicting results and suggest a task-dependent relationship between robots that display emotions and trust. Therefore, this study analyzes the effect of robots that display human-like emotions on trust, cooperation, and participants' emotions. In the between-group study… ▽ More

    Submitted 6 July, 2023; originally announced July 2023.

    Comments: Accepted at 2023 32nd IEEE International Conference on Robot and Human Interactive Communication (RO-MAN)

  23. arXiv:2305.16376  [pdf, other

    eess.IV cs.CV cs.LG

    Constrained Probabilistic Mask Learning for Task-specific Undersampled MRI Reconstruction

    Authors: Tobias Weber, Michael Ingrisch, Bernd Bischl, David Rügamer

    Abstract: Undersampling is a common method in Magnetic Resonance Imaging (MRI) to subsample the number of data points in k-space, reducing acquisition times at the cost of decreased image quality. A popular approach is to employ undersampling patterns following various strategies, e.g., variable density sampling or radial trajectories. In this work, we propose a method that directly learns the undersampling… ▽ More

    Submitted 22 August, 2023; v1 submitted 25 May, 2023; originally announced May 2023.

    Comments: accepted at WACV 2024

  24. arXiv:2305.02054  [pdf

    cs.LG cs.AI cs.RO

    Map-based Experience Replay: A Memory-Efficient Solution to Catastrophic Forgetting in Reinforcement Learning

    Authors: Muhammad Burhan Hafez, Tilman Immisch, Tom Weber, Stefan Wermter

    Abstract: Deep Reinforcement Learning agents often suffer from catastrophic forgetting, forgetting previously found solutions in parts of the input space when training on new data. Replay Memories are a common solution to the problem, decorrelating and shuffling old and new training samples. They naively store state transitions as they come in, without regard for redundancy. We introduce a novel cognitive-i… ▽ More

    Submitted 28 August, 2023; v1 submitted 3 May, 2023; originally announced May 2023.

    Journal ref: Frontiers in Neurorobotics 17:1127642 (2023)

  25. arXiv:2304.05823  [pdf, other

    q-bio.MN cs.LG q-bio.GN

    DiscoGen: Learning to Discover Gene Regulatory Networks

    Authors: Nan Rosemary Ke, Sara-Jane Dunn, Jorg Bornschein, Silvia Chiappa, Melanie Rey, Jean-Baptiste Lespiau, Albin Cassirer, Jane Wang, Theophane Weber, David Barrett, Matthew Botvinick, Anirudh Goyal, Mike Mozer, Danilo Rezende

    Abstract: Accurately inferring Gene Regulatory Networks (GRNs) is a critical and challenging task in biology. GRNs model the activatory and inhibitory interactions between genes and are inherently causal in nature. To accurately identify GRNs, perturbational data is required. However, most GRN discovery methods only operate on observational data. Recent advances in neural network-based causal discovery meth… ▽ More

    Submitted 12 April, 2023; originally announced April 2023.

  26. Automated wildlife image classification: An active learning tool for ecological applications

    Authors: Ludwig Bothmann, Lisa Wimmer, Omid Charrakh, Tobias Weber, Hendrik Edelhoff, Wibke Peters, Hien Nguyen, Caryl Benjamin, Annette Menzel

    Abstract: Wildlife camera trap images are being used extensively to investigate animal abundance, habitat associations, and behavior, which is complicated by the fact that experts must first classify the images manually. Artificial intelligence systems can take over this task but usually need a large number of already-labeled training images to achieve sufficient performance. This requirement necessitates h… ▽ More

    Submitted 2 August, 2023; v1 submitted 28 March, 2023; originally announced March 2023.

    Journal ref: Ecological Informatics (2023) 102231

  27. arXiv:2303.14041  [pdf, other

    physics.ins-det cs.CG

    Motion Planning for Triple-Axis Spectrometers

    Authors: Tobias Weber

    Abstract: We present the free and open source software TAS-Paths, a novel system which calculates optimal, collision-free paths for the movement of triple-axis spectrometers. The software features an easy to use graphical user interface, but can also be scripted and used as a library. It allows the user to plan and visualise the motion of the instrument before the experiment and can be used during measureme… ▽ More

    Submitted 24 March, 2023; originally announced March 2023.

    Comments: 6 pages, 4 figures

  28. arXiv:2303.11224  [pdf, other

    eess.IV cs.CV cs.LG

    Cascaded Latent Diffusion Models for High-Resolution Chest X-ray Synthesis

    Authors: Tobias Weber, Michael Ingrisch, Bernd Bischl, David Rügamer

    Abstract: While recent advances in large-scale foundational models show promising results, their application to the medical domain has not yet been explored in detail. In this paper, we progress into the realms of large-scale modeling in medical synthesis by proposing Cheff - a foundational cascaded latent diffusion model, which generates highly-realistic chest radiographs providing state-of-the-art quality… ▽ More

    Submitted 20 March, 2023; originally announced March 2023.

    Comments: accepted at PAKDD 2023

  29. arXiv:2302.04798  [pdf, other

    cs.LG cs.AI stat.ML

    Equivariant MuZero

    Authors: Andreea Deac, Théophane Weber, George Papamakarios

    Abstract: Deep reinforcement learning repeatedly succeeds in closed, well-defined domains such as games (Chess, Go, StarCraft). The next frontier is real-world scenarios, where setups are numerous and varied. For this, agents need to learn the underlying rules governing the environment, so as to robustly generalise to conditions that differ from those they were trained on. Model-based reinforcement learning… ▽ More

    Submitted 9 February, 2023; originally announced February 2023.

    Comments: 9 pages, 3 figures

  30. arXiv:2302.04009  [pdf, other

    cs.LG

    Investigating the role of model-based learning in exploration and transfer

    Authors: Jacob Walker, Eszter Vértes, Yazhe Li, Gabriel Dulac-Arnold, Ankesh Anand, Théophane Weber, Jessica B. Hamrick

    Abstract: State of the art reinforcement learning has enabled training agents on tasks of ever increasing complexity. However, the current paradigm tends to favor training agents from scratch on every new task or on collections of tasks with a view towards generalizing to novel task configurations. The former suffers from poor data efficiency while the latter is difficult when test tasks are out-of-distribu… ▽ More

    Submitted 8 February, 2023; originally announced February 2023.

  31. arXiv:2301.05747  [pdf, other

    cs.CV cs.AI

    Laser: Latent Set Representations for 3D Generative Modeling

    Authors: Pol Moreno, Adam R. Kosiorek, Heiko Strathmann, Daniel Zoran, Rosalia G. Schneider, Björn Winckler, Larisa Markeeva, Théophane Weber, Danilo J. Rezende

    Abstract: NeRF provides unparalleled fidelity of novel view synthesis: rendering a 3D scene from an arbitrary viewpoint. NeRF requires training on a large number of views that fully cover a scene, which limits its applicability. While these issues can be addressed by learning a prior over scenes in various forms, previous approaches have been either applied to overly simple scenes or struggling to render un… ▽ More

    Submitted 13 January, 2023; originally announced January 2023.

    Comments: See https://laser-nv-paper.github.io/ for video results

  32. arXiv:2212.14882  [pdf, other

    cs.CL cs.LG

    ChatGPT Makes Medicine Easy to Swallow: An Exploratory Case Study on Simplified Radiology Reports

    Authors: Katharina Jeblick, Balthasar Schachtner, Jakob Dexl, Andreas Mittermeier, Anna Theresa Stüber, Johanna Topalis, Tobias Weber, Philipp Wesp, Bastian Sabel, Jens Ricke, Michael Ingrisch

    Abstract: The release of ChatGPT, a language model capable of generating text that appears human-like and authentic, has gained significant attention beyond the research community. We expect that the convincing performance of ChatGPT incentivizes users to apply it to a variety of downstream tasks, including prompting the model to simplify their own medical reports. To investigate this phenomenon, we conduct… ▽ More

    Submitted 30 December, 2022; originally announced December 2022.

  33. arXiv:2206.05314  [pdf, other

    cs.LG cs.AI

    Large-Scale Retrieval for Reinforcement Learning

    Authors: Peter C. Humphreys, Arthur Guez, Olivier Tieleman, Laurent Sifre, Théophane Weber, Timothy Lillicrap

    Abstract: Effective decision making involves flexibly relating past experiences and relevant contextual information to a novel situation. In deep reinforcement learning (RL), the dominant paradigm is for an agent to amortise information that helps decision making into its network weights via gradient descent on training losses. Here, we pursue an alternative approach in which agents can utilise large-scale… ▽ More

    Submitted 16 December, 2022; v1 submitted 10 June, 2022; originally announced June 2022.

    Comments: Thirty-sixth Annual Conference on Neural Information Processing Systems (NeurIPS 2022), 16 pages

  34. GASP: Gated Attention For Saliency Prediction

    Authors: Fares Abawi, Tom Weber, Stefan Wermter

    Abstract: Saliency prediction refers to the computational task of modeling overt attention. Social cues greatly influence our attention, consequently altering our eye movements and behavior. To emphasize the efficacy of such features, we present a neural model for integrating social cues and weighting their influences. Our model consists of two stages. During the first stage, we detect two social cues by fo… ▽ More

    Submitted 9 June, 2022; originally announced June 2022.

    Comments: International Joint Conference on Artificial Intelligence (IJCAI-21)

    Journal ref: Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence (2021) 584-591

  35. arXiv:2204.04875  [pdf, other

    stat.ML cs.LG

    Learning to Induce Causal Structure

    Authors: Nan Rosemary Ke, Silvia Chiappa, Jane Wang, Anirudh Goyal, Jorg Bornschein, Melanie Rey, Theophane Weber, Matthew Botvinic, Michael Mozer, Danilo Jimenez Rezende

    Abstract: The fundamental challenge in causal induction is to infer the underlying graph structure given observational and/or interventional data. Most existing causal induction algorithms operate by generating candidate graphs and evaluating them using either score-based methods (including continuous optimization) or independence tests. In our work, we instead treat the inference process as a black box and… ▽ More

    Submitted 7 October, 2022; v1 submitted 11 April, 2022; originally announced April 2022.

  36. arXiv:2204.04501  [pdf, other

    cs.RO cs.LG

    Explain yourself! Effects of Explanations in Human-Robot Interaction

    Authors: Jakob Ambsdorf, Alina Munir, Yiyao Wei, Klaas Degkwitz, Harm Matthias Harms, Susanne Stannek, Kyra Ahrens, Dennis Becker, Erik Strahl, Tom Weber, Stefan Wermter

    Abstract: Recent developments in explainable artificial intelligence promise the potential to transform human-robot interaction: Explanations of robot decisions could affect user perceptions, justify their reliability, and increase trust. However, the effects on human perceptions of robots that explain their decisions have not been studied thoroughly. To analyze the effect of explainable robots, we conduct… ▽ More

    Submitted 14 June, 2022; v1 submitted 9 April, 2022; originally announced April 2022.

    Comments: Accepted at 2022 31st IEEE International Conference on Robot and Human Interactive Communication (RO-MAN)

  37. arXiv:2202.08417  [pdf, other

    cs.LG

    Retrieval-Augmented Reinforcement Learning

    Authors: Anirudh Goyal, Abram L. Friesen, Andrea Banino, Theophane Weber, Nan Rosemary Ke, Adria Puigdomenech Badia, Arthur Guez, Mehdi Mirza, Peter C. Humphreys, Ksenia Konyushkova, Laurent Sifre, Michal Valko, Simon Osindero, Timothy Lillicrap, Nicolas Heess, Charles Blundell

    Abstract: Most deep reinforcement learning (RL) algorithms distill experience into parametric behavior policies or value functions via gradient updates. While effective, this approach has several disadvantages: (1) it is computationally expensive, (2) it can take many updates to integrate experiences into the parametric model, (3) experiences that are not fully integrated do not appropriately influence the… ▽ More

    Submitted 24 May, 2022; v1 submitted 16 February, 2022; originally announced February 2022.

  38. arXiv:2111.05149  [pdf, other

    cs.CV cs.LG

    Ethically aligned Deep Learning: Unbiased Facial Aesthetic Prediction

    Authors: Michael Danner, Thomas Weber, Leping Peng, Tobias Gerlach, Xueping Su, Matthias Rätsch

    Abstract: Facial beauty prediction (FBP) aims to develop a machine that automatically makes facial attractiveness assessment. In the past those results were highly correlated with human ratings, therefore also with their bias in annotating. As artificial intelligence can have racist and discriminatory tendencies, the cause of skews in the data must be identified. Development of training data and AI algorith… ▽ More

    Submitted 9 November, 2021; originally announced November 2021.

    Comments: Peer reviewed and accepted at CEPE/IACAP 2021 as Extended Abstract

  39. arXiv:2111.01587  [pdf, other

    cs.LG cs.AI

    Procedural Generalization by Planning with Self-Supervised World Models

    Authors: Ankesh Anand, Jacob Walker, Yazhe Li, Eszter Vértes, Julian Schrittwieser, Sherjil Ozair, Théophane Weber, Jessica B. Hamrick

    Abstract: One of the key promises of model-based reinforcement learning is the ability to generalize using an internal model of the world to make predictions in novel environments and tasks. However, the generalization ability of model-based agents is not well understood because existing work has focused on model-free agents when benchmarking generalization. Here, we explicitly measure the generalization ab… ▽ More

    Submitted 2 November, 2021; originally announced November 2021.

  40. arXiv:2110.11312  [pdf, other

    cs.LG

    Towards modelling hazard factors in unstructured data spaces using gradient-based latent interpolation

    Authors: Tobias Weber, Michael Ingrisch, Bernd Bischl, David Rügamer

    Abstract: The application of deep learning in survival analysis (SA) allows utilizing unstructured and high-dimensional data types uncommon in traditional survival methods. This allows to advance methods in fields such as digital health, predictive maintenance, and churn analysis, but often yields less interpretable and intuitively understandable models due to the black-box character of deep learning-based… ▽ More

    Submitted 17 November, 2021; v1 submitted 21 October, 2021; originally announced October 2021.

    Comments: NeurIPS 2021 Workshop, Deep Generative Models and Downstream Applications

  41. arXiv:2110.11303  [pdf, other

    cs.LG

    Survival-oriented embeddings for improving accessibility to complex data structures

    Authors: Tobias Weber, Michael Ingrisch, Matthias Fabritius, Bernd Bischl, David Rügamer

    Abstract: Deep learning excels in the analysis of unstructured data and recent advancements allow to extend these techniques to survival analysis. In the context of clinical radiology, this enables, e.g., to relate unstructured volumetric images to a risk score or a prognosis of life expectancy and support clinical decision making. Medical applications are, however, associated with high criticality and cons… ▽ More

    Submitted 3 November, 2021; v1 submitted 21 October, 2021; originally announced October 2021.

    Comments: NeurIPS 2021 Workshop, Bridging the Gap: From Machine Learning Research to Clinical Practice

  42. arXiv:2105.03354  [pdf

    cs.AI cs.HC

    The future of human-AI collaboration: a taxonomy of design knowledge for hybrid intelligence systems

    Authors: Dominik Dellermann, Adrian Calma, Nikolaus Lipusch, Thorsten Weber, Sascha Weigel, Philipp Ebel

    Abstract: Recent technological advances, especially in the field of machine learning, provide astonishing progress on the road towards artificial general intelligence. However, tasks in current real-world business applications cannot yet be solved by machines alone. We, therefore, identify the need for developing socio-technological ensembles of humans and machines. Such systems possess the ability to accom… ▽ More

    Submitted 7 May, 2021; originally announced May 2021.

  43. arXiv:2104.06159  [pdf, other

    cs.LG cs.AI

    Muesli: Combining Improvements in Policy Optimization

    Authors: Matteo Hessel, Ivo Danihelka, Fabio Viola, Arthur Guez, Simon Schmitt, Laurent Sifre, Theophane Weber, David Silver, Hado van Hasselt

    Abstract: We propose a novel policy update that combines regularized policy optimization with model learning as an auxiliary loss. The update (henceforth Muesli) matches MuZero's state-of-the-art performance on Atari. Notably, Muesli does so without using deep search: it acts directly with a policy network and has computation speed comparable to model-free baselines. The Atari results are complemented by ex… ▽ More

    Submitted 31 March, 2022; v1 submitted 13 April, 2021; originally announced April 2021.

  44. arXiv:2102.12425  [pdf, other

    cs.LG

    Synthetic Returns for Long-Term Credit Assignment

    Authors: David Raposo, Sam Ritter, Adam Santoro, Greg Wayne, Theophane Weber, Matt Botvinick, Hado van Hasselt, Francis Song

    Abstract: Since the earliest days of reinforcement learning, the workhorse method for assigning credit to actions over time has been temporal-difference (TD) learning, which propagates credit backward timestep-by-timestep. This approach suffers when delays between actions and rewards are long and when intervening unrelated events contribute variance to long-term returns. We propose state-associative (SA) le… ▽ More

    Submitted 24 February, 2021; originally announced February 2021.

  45. Hierarchical Learning Using Deep Optimum-Path Forest

    Authors: Luis C. S. Afonso, Clayton R. Pereira, Silke A. T. Weber, Christian Hook, Alexandre X. Falcão, João P. Papa

    Abstract: Bag-of-Visual Words (BoVW) and deep learning techniques have been widely used in several domains, which include computer-assisted medical diagnoses. In this work, we are interested in developing tools for the automatic identification of Parkinson's disease using machine learning and the concept of BoVW. The proposed approach concerns a hierarchical-based learning technique to design visual diction… ▽ More

    Submitted 18 February, 2021; originally announced February 2021.

  46. arXiv:2102.02274  [pdf, other

    cs.LG cs.AI cs.MA

    Neural Recursive Belief States in Multi-Agent Reinforcement Learning

    Authors: Pol Moreno, Edward Hughes, Kevin R. McKee, Bernardo Avila Pires, Théophane Weber

    Abstract: In multi-agent reinforcement learning, the problem of learning to act is particularly difficult because the policies of co-players may be heavily conditioned on information only observed by them. On the other hand, humans readily form beliefs about the knowledge possessed by their peers and leverage beliefs to inform decision-making. Such abilities underlie individual success in a wide range of Ma… ▽ More

    Submitted 3 February, 2021; originally announced February 2021.

  47. Protecting Privacy and Transforming COVID-19 Case Surveillance Datasets for Public Use

    Authors: Brian Lee, Brandi Dupervil, Nicholas P. Deputy, Wil Duck, Stephen Soroka, Lyndsay Bottichio, Benjamin Silk, Jason Price, Patricia Sweeney, Jennifer Fuld, Todd Weber, Dan Pollock

    Abstract: Objectives: Federal open data initiatives that promote increased sharing of federally collected data are important for transparency, data quality, trust, and relationships with the public and state, tribal, local, and territorial (STLT) partners. These initiatives advance understanding of health conditions and diseases by providing data to more researchers, scientists, and policymakers for analysi… ▽ More

    Submitted 13 January, 2021; originally announced January 2021.

    Comments: 19 pages, 4 figures, 1 table, 5 supplements

  48. Mechanisation of Model-theoretic Conservative Extension for HOL with Ad-hoc Overloading

    Authors: Arve Gengelbach, Johannes Åman Pohjola, Tjark Weber

    Abstract: Definitions of new symbols merely abbreviate expressions in logical frameworks, and no new facts (regarding previously defined symbols) should hold because of a new definition. In Isabelle/HOL, definable symbols are types and constants. The latter may be ad-hoc overloaded, i.e. have different definitions for non-overlapping types. We prove that symbols that are independent of a new definition may… ▽ More

    Submitted 11 January, 2021; originally announced January 2021.

    Comments: In Proceedings LFMTP 2020, arXiv:2101.02835

    ACM Class: F.3.1; F.3.2

    Journal ref: EPTCS 332, 2021, pp. 1-17

  49. arXiv:2012.07969  [pdf, other

    stat.ML cs.LG

    A case for new neural network smoothness constraints

    Authors: Mihaela Rosca, Theophane Weber, Arthur Gretton, Shakir Mohamed

    Abstract: How sensitive should machine learning models be to input changes? We tackle the question of model smoothness and show that it is a useful inductive bias which aids generalization, adversarial robustness, generative modeling and reinforcement learning. We explore current methods of imposing smoothness constraints and observe they lack the flexibility to adapt to new tasks, they don't account for da… ▽ More

    Submitted 7 July, 2021; v1 submitted 14 December, 2020; originally announced December 2020.

  50. Discovering key topics from short, real-world medical inquiries via natural language processing and unsupervised learning

    Authors: Angelo Ziletti, Christoph Berns, Oliver Treichel, Thomas Weber, Jennifer Liang, Stephanie Kammerath, Marion Schwaerzler, Jagatheswari Virayah, David Ruau, Xin Ma, Andreas Mattern

    Abstract: Millions of unsolicited medical inquiries are received by pharmaceutical companies every year. It has been hypothesized that these inquiries represent a treasure trove of information, potentially giving insight into matters regarding medicinal products and the associated medical treatments. However, due to the large volume and specialized nature of the inquiries, it is difficult to perform timely,… ▽ More

    Submitted 8 December, 2020; originally announced December 2020.

    Journal ref: Front. Comput. Sci 88 (3) (2021)