Skip to main content

Showing 1–22 of 22 results for author: Lüdtke, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2505.11023  [pdf, ps, other

    cs.LG

    Informed, but Not Always Improved: Challenging the Benefit of Background Knowledge in GNNs

    Authors: Kutalmış Coşkun, Ivo Kavisanczki, Amin Mirzaei, Tom Siegl, Bjarne C. Hiller, Stefan Lüdtke, Martin Becker

    Abstract: In complex and low-data domains such as biomedical research, incorporating background knowledge (BK) graphs, such as protein-protein interaction (PPI) networks, into graph-based machine learning pipelines is a promising research direction. However, while BK is often assumed to improve model performance, its actual contribution and the impact of imperfect knowledge remain poorly understood. In this… ▽ More

    Submitted 16 May, 2025; originally announced May 2025.

    Comments: 10 pages, 7 figures

  2. arXiv:2503.09159  [pdf, other

    cs.LG

    Unreflected Use of Tabular Data Repositories Can Undermine Research Quality

    Authors: Andrej Tschalzev, Lennart Purucker, Stefan Lüdtke, Frank Hutter, Christian Bartelt, Heiner Stuckenschmidt

    Abstract: Data repositories have accumulated a large number of tabular datasets from various domains. Machine Learning researchers are actively using these datasets to evaluate novel approaches. Consequently, data repositories have an important standing in tabular data research. They not only host datasets but also provide information on how to use them in supervised learning tasks. In this paper, we argue… ▽ More

    Submitted 12 March, 2025; originally announced March 2025.

  3. arXiv:2408.08761  [pdf, other

    cs.LG

    Mitigating Information Loss in Tree-Based Reinforcement Learning via Direct Optimization

    Authors: Sascha Marton, Tim Grams, Florian Vogt, Stefan Lüdtke, Christian Bartelt, Heiner Stuckenschmidt

    Abstract: Reinforcement learning (RL) has seen significant success across various domains, but its adoption is often limited by the black-box nature of neural network policies, making them difficult to interpret. In contrast, symbolic policies allow representing decision-making strategies in a compact and interpretable way. However, learning symbolic policies directly within on-policy methods remains challe… ▽ More

    Submitted 11 March, 2025; v1 submitted 16 August, 2024; originally announced August 2024.

  4. arXiv:2407.02112  [pdf, other

    cs.LG cs.AI

    A Data-Centric Perspective on Evaluating Machine Learning Models for Tabular Data

    Authors: Andrej Tschalzev, Sascha Marton, Stefan Lüdtke, Christian Bartelt, Heiner Stuckenschmidt

    Abstract: Tabular data is prevalent in real-world machine learning applications, and new models for supervised learning of tabular data are frequently proposed. Comparative studies assessing the performance of models typically consist of model-centric evaluation setups with overly standardized data preprocessing. This paper demonstrates that such model-centric evaluations are biased, as real-world modeling… ▽ More

    Submitted 18 December, 2024; v1 submitted 2 July, 2024; originally announced July 2024.

  5. arXiv:2407.01115  [pdf, other

    cs.LG stat.ML

    Enabling Mixed Effects Neural Networks for Diverse, Clustered Data Using Monte Carlo Methods

    Authors: Andrej Tschalzev, Paul Nitschke, Lukas Kirchdorfer, Stefan Lüdtke, Christian Bartelt, Heiner Stuckenschmidt

    Abstract: Neural networks often assume independence among input data samples, disregarding correlations arising from inherent clustering patterns in real-world datasets (e.g., due to different sites or repeated measurements). Recently, mixed effects neural networks (MENNs) which separate cluster-specific 'random effects' from cluster-invariant 'fixed effects' have been proposed to improve generalization and… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

  6. arXiv:2309.17130  [pdf, other

    cs.LG

    GRANDE: Gradient-Based Decision Tree Ensembles for Tabular Data

    Authors: Sascha Marton, Stefan Lüdtke, Christian Bartelt, Heiner Stuckenschmidt

    Abstract: Despite the success of deep learning for text and image data, tree-based ensemble models are still state-of-the-art for machine learning with heterogeneous tabular data. However, there is a significant need for tabular-specific gradient-based methods due to their high flexibility. In this paper, we propose $\text{GRANDE}$, $\text{GRA}$die$\text{N}$t-Based $\text{D}$ecision Tree $\text{E}$nsembles,… ▽ More

    Submitted 12 March, 2024; v1 submitted 29 September, 2023; originally announced September 2023.

  7. arXiv:2309.00306  [pdf, ps, other

    cs.AI

    On the Aggregation of Rules for Knowledge Graph Completion

    Authors: Patrick Betz, Stefan Lüdtke, Christian Meilicke, Heiner Stuckenschmidt

    Abstract: Rule learning approaches for knowledge graph completion are efficient, interpretable and competitive to purely neural models. The rule aggregation problem is concerned with finding one plausibility score for a candidate fact which was simultaneously predicted by multiple rules. Although the problem is ubiquitous, as data-driven rule learning can result in noisy and large rulesets, it is underrepre… ▽ More

    Submitted 1 September, 2023; originally announced September 2023.

    Comments: KLR Workshop@ICML2023

  8. arXiv:2308.03403  [pdf, other

    cs.LG

    Towards Machine Learning-based Fish Stock Assessment

    Authors: Stefan Lüdtke, Maria E. Pierce

    Abstract: The accurate assessment of fish stocks is crucial for sustainable fisheries management. However, existing statistical stock assessment models can have low forecast performance of relevant stock parameters like recruitment or spawning stock biomass, especially in ecosystems that are changing due to global warming and other anthropogenic stressors. In this paper, we investigate the use of machine le… ▽ More

    Submitted 7 August, 2023; originally announced August 2023.

    Comments: Accepted at Fragile Earth Workshop 2023

  9. arXiv:2305.03515  [pdf, other

    cs.LG cs.AI

    GradTree: Learning Axis-Aligned Decision Trees with Gradient Descent

    Authors: Sascha Marton, Stefan Lüdtke, Christian Bartelt, Heiner Stuckenschmidt

    Abstract: Decision Trees (DTs) are commonly used for many machine learning tasks due to their high degree of interpretability. However, learning a DT from data is a difficult optimization problem, as it is non-convex and non-differentiable. Therefore, common approaches learn DTs using a greedy growth algorithm that minimizes the impurity locally at each internal node. Unfortunately, this greedy procedure ca… ▽ More

    Submitted 19 August, 2024; v1 submitted 5 May, 2023; originally announced May 2023.

  10. arXiv:2301.10571  [pdf, other

    cs.AI

    Leveraging Planning Landmarks for Hybrid Online Goal Recognition

    Authors: Nils Wilken, Lea Cohausz, Johannes Schaum, Stefan Lüdtke, Christian Bartelt, Heiner Stuckenschmidt

    Abstract: Goal recognition is an important problem in many application domains (e.g., pervasive computing, intrusion detection, computer games, etc.). In many application scenarios it is important that goal recognition algorithms can recognize goals of an observed agent as fast as possible and with minimal domain knowledge. Hence, in this paper, we propose a hybrid method for online goal recognition that co… ▽ More

    Submitted 25 January, 2023; originally announced January 2023.

    Comments: 9 pages. Presented at SPARK 2022 (https://icaps22.icaps-conference.org/workshops/SPARK/)

  11. arXiv:2301.05608  [pdf, other

    cs.AI

    Investigating the Combination of Planning-Based and Data-Driven Methods for Goal Recognition

    Authors: Nils Wilken, Lea Cohausz, Johannes Schaum, Stefan Lüdtke, Heiner Stuckenschmidt

    Abstract: An important feature of pervasive, intelligent assistance systems is the ability to dynamically adapt to the current needs of their users. Hence, it is critical for such systems to be able to recognize those goals and needs based on observations of the user's actions and state of the environment. In this work, we investigate the application of two state-of-the-art, planning-based plan recognition… ▽ More

    Submitted 13 January, 2023; originally announced January 2023.

  12. Discovering Behavioral Predispositions in Data to Improve Human Activity Recognition

    Authors: Maximilian Popko, Sebastian Bader, Stefan Lüdtke, Thomas Kirste

    Abstract: The automatic, sensor-based assessment of challenging behavior of persons with dementia is an important task to support the selection of interventions. However, predicting behaviors like apathy and agitation is challenging due to the large inter- and intra-patient variability. Goal of this paper is to improve the recognition performance by making use of the observation that patients tend to show s… ▽ More

    Submitted 18 July, 2022; originally announced July 2022.

    Comments: Submitted to iWOAR 2022 - 7th international Workshop on Sensor-Based Activity Recognition and Artificial Intelligence

    Journal ref: 2022. Proceedings of the 7th International Workshop on Sensor-based Activity Recognition and Artificial Intelligence. Association for Computing Machinery, New York, NY, USA

  13. arXiv:2207.08414  [pdf, other

    cs.LG

    Outlier Explanation via Sum-Product Networks

    Authors: Stefan Lüdtke, Christian Bartelt, Heiner Stuckenschmidt

    Abstract: Outlier explanation is the task of identifying a set of features that distinguish a sample from normal data, which is important for downstream (human) decision-making. Existing methods are based on beam search in the space of feature subsets. They quickly becomes computationally expensive, as they require to run an outlier detection algorithm from scratch for each feature subset. To alleviate this… ▽ More

    Submitted 18 July, 2022; originally announced July 2022.

  14. Explaining Neural Networks without Access to Training Data

    Authors: Sascha Marton, Stefan Lüdtke, Christian Bartelt, Andrej Tschalzev, Heiner Stuckenschmidt

    Abstract: We consider generating explanations for neural networks in cases where the network's training data is not accessible, for instance due to privacy or safety issues. Recently, $\mathcal{I}$-Nets have been proposed as a sample-free approach to post-hoc, global model interpretability that does not require access to training data. They formulate interpretation as a machine learning task that maps netwo… ▽ More

    Submitted 10 June, 2022; originally announced June 2022.

    Journal ref: Machine Learning (2024)

  15. arXiv:2202.00332  [pdf, other

    cs.AI

    Activity Recognition in Assembly Tasks by Bayesian Filtering in Multi-Hypergraphs

    Authors: Timon Felske, Stefan Lüdtke, Sebastian Bader, Thomas Kirste

    Abstract: We study sensor-based human activity recognition in manual work processes like assembly tasks. In such processes, the system states often have a rich structure, involving object properties and relations. Thus, estimating the hidden system state from sensor observations by recursive Bayesian filtering can be very challenging, due to the combinatorial explosion in the number of system states. To all… ▽ More

    Submitted 1 February, 2022; originally announced February 2022.

    Comments: Accepted for presentation at the 2nd GCLR workshop in conjunction with AAAI 2022

  16. arXiv:2111.04564  [pdf, other

    eess.SP cs.AI cs.LG

    Human Activity Recognition using Attribute-Based Neural Networks and Context Information

    Authors: Stefan Lüdtke, Fernando Moya Rueda, Waqas Ahmed, Gernot A. Fink, Thomas Kirste

    Abstract: We consider human activity recognition (HAR) from wearable sensor data in manual-work processes, like warehouse order-picking. Such structured domains can often be partitioned into distinct process steps, e.g., packaging or transporting. Each process step can have a different prior distribution over activity classes, e.g., standing or walking, and different system dynamics. Here, we show how such… ▽ More

    Submitted 28 October, 2021; originally announced November 2021.

    Comments: 3rd International Workshop on Deep Learning for Human Activity Recognition

  17. arXiv:2110.05165  [pdf, other

    cs.LG cs.AI stat.ML

    Exchangeability-Aware Sum-Product Networks

    Authors: Stefan Lüdtke, Christian Bartelt, Heiner Stuckenschmidt

    Abstract: Sum-Product Networks (SPNs) are expressive probabilistic models that provide exact, tractable inference. They achieve this efficiency by making use of local independence. On the other hand, mixtures of exchangeable variable models (MEVMs) are a class of tractable probabilistic models that make use of exchangeability of discrete random variables to render inference tractable. Exchangeability, which… ▽ More

    Submitted 28 April, 2022; v1 submitted 11 October, 2021; originally announced October 2021.

    Comments: accepted at IJCAI 2022

  18. VPIC 2.0: Next Generation Particle-in-Cell Simulations

    Authors: Robert Bird, Nigel Tan, Scott V. Luedtke, Stephen Lien Harrell, Michela Taufer, Brian Albright

    Abstract: VPIC is a general purpose Particle-in-Cell simulation code for modeling plasma phenomena such as magnetic reconnection, fusion, solar weather, and laser-plasma interaction in three dimensions using large numbers of particles. VPIC's capacity in both fidelity and scale makes it particularly well-suited for plasma research on pre-exascale and exascale platforms. In this paper we demonstrate the uniq… ▽ More

    Submitted 25 February, 2021; originally announced February 2021.

  19. Deep learning based mixed-dimensional GMM for characterizing variability in CryoEM

    Authors: Muyuan Chen, Steven Ludtke

    Abstract: Structural flexibility and/or dynamic interactions with other molecules is a critical aspect of protein function. CryoEM provides direct visualization of individual macromolecules sampling different conformational and compositional states. While numerous methods are available for computational classification of discrete states, characterization of continuous conformational changes or large numbers… ▽ More

    Submitted 23 May, 2021; v1 submitted 25 January, 2021; originally announced January 2021.

    Comments: 31 pages, 5 main figures and 8 supplementary figures

    Journal ref: Nature Methods 18, 930-936 (2021)

  20. State-Space Abstractions for Probabilistic Inference: A Systematic Review

    Authors: Stefan Lüdtke, Max Schröder, Frank Krüger, Sebastian Bader, Thomas Kirste

    Abstract: Tasks such as social network analysis, human behavior recognition, or modeling biochemical reactions, can be solved elegantly by using the probabilistic inference framework. However, standard probabilistic inference algorithms work at a propositional level, and thus cannot capture the symmetries and redundancies that are present in these tasks. Algorithms that exploit those symmetries have been de… ▽ More

    Submitted 4 December, 2018; v1 submitted 18 April, 2018; originally announced April 2018.

  21. arXiv:1801.10495  [pdf, other

    cs.AI

    Lifted Filtering via Exchangeable Decomposition

    Authors: Stefan Lüdtke, Max Schröder, Sebastian Bader, Kristian Kersting, Thomas Kirste

    Abstract: We present a model for exact recursive Bayesian filtering based on lifted multiset states. Combining multisets with lifting makes it possible to simultaneously exploit multiple strategies for reducing inference complexity when compared to list-based grounded state representations. The core idea is to borrow the concept of Maximally Parallel Multiset Rewriting Systems and to enhance it by concepts… ▽ More

    Submitted 7 May, 2018; v1 submitted 31 January, 2018; originally announced January 2018.

  22. arXiv:1707.06446  [pdf, other

    cs.AI

    Sequential Lifted Bayesian Filtering in Multiset Rewriting Systems

    Authors: Max Schröder, Stefan Lüdtke, Sebastian Bader, Frank Krüger, Thomas Kirste

    Abstract: Bayesian Filtering for plan and activity recognition is challenging for scenarios that contain many observation equivalent entities (i.e. entities that produce the same observations). This is due to the combinatorial explosion in the number of hypotheses that need to be tracked. However, this class of problems exhibits a certain symmetry that can be exploited for state space representation and inf… ▽ More

    Submitted 14 August, 2017; v1 submitted 20 July, 2017; originally announced July 2017.

    Comments: 7 pages, 3 figures, accepted at UAI-17 Statistical Relational AI (StarAI) workshop