Skip to main content

Showing 1–32 of 32 results for author: Kersting, K

Searching in archive stat. Search in all archives.
.
  1. arXiv:2502.12707  [pdf, other

    cs.LG stat.ML

    CausalMan: A physics-based simulator for large-scale causality

    Authors: Nicholas Tagliapietra, Juergen Luettin, Lavdim Halilaj, Moritz Willig, Tim Pychynski, Kristian Kersting

    Abstract: A comprehensive understanding of causality is critical for navigating and operating within today's complex real-world systems. The absence of realistic causal models with known data generating processes complicates fair benchmarking. In this paper, we present the CausalMan simulator, modeled after a real-world production line. The simulator features a diverse range of linear and non-linear mechani… ▽ More

    Submitted 18 February, 2025; originally announced February 2025.

  2. arXiv:2411.05791  [pdf, ps, other

    q-fin.ST cs.LG econ.GN stat.AP

    Forecasting Company Fundamentals

    Authors: Felix Divo, Eric Endress, Kevin Endler, Kristian Kersting, Devendra Singh Dhami

    Abstract: Company fundamentals are key to assessing companies' financial and overall success and stability. Forecasting them is important in multiple fields, including investing and econometrics. While statistical and contemporary machine learning methods have been applied to many time series tasks, there is a lack of comparison of these approaches on this particularly challenging data regime. To this end,… ▽ More

    Submitted 3 June, 2025; v1 submitted 21 October, 2024; originally announced November 2024.

    Comments: See https://openreview.net/forum?id=haf78jerSt

    ACM Class: I.2.6

    Journal ref: Transactions on Machine Learning Research (2025)

  3. arXiv:2410.13054  [pdf, other

    cs.LG cs.AI stat.ML

    Systems with Switching Causal Relations: A Meta-Causal Perspective

    Authors: Moritz Willig, Tim Nelson Tobiasch, Florian Peter Busch, Jonas Seng, Devendra Singh Dhami, Kristian Kersting

    Abstract: Most work on causality in machine learning assumes that causal relationships are driven by a constant underlying process. However, the flexibility of agents' actions or tipping points in the environmental process can change the qualitative dynamics of the system. As a result, new causal relationships may emerge, while existing ones change or disappear, resulting in an altered causal graph. To anal… ▽ More

    Submitted 17 April, 2025; v1 submitted 16 October, 2024; originally announced October 2024.

    Comments: 21 pages, 3 figures, 4 tables, ICLR 2025 Camera Ready Version

  4. arXiv:2402.06434  [pdf, ps, other

    cs.LG stat.ML

    Where is the Truth? The Risk of Getting Confounded in a Continual World

    Authors: Florian Peter Busch, Roshni Kamath, Rupert Mitchell, Wolfgang Stammer, Kristian Kersting, Martin Mundt

    Abstract: A dataset is confounded if it is most easily solved via a spurious correlation, which fails to generalize to new data. In this work, we show that, in a continual learning setting where confounders may vary in time across tasks, the challenge of mitigating the effect of confounders far exceeds the standard forgetting problem normally considered. In particular, we provide a formal description of suc… ▽ More

    Submitted 12 June, 2025; v1 submitted 9 February, 2024; originally announced February 2024.

  5. arXiv:2312.07790  [pdf, ps, other

    cs.LG stat.ML

    Characteristic Circuits

    Authors: Zhongjie Yu, Martin Trapp, Kristian Kersting

    Abstract: In many real-world scenarios, it is crucial to be able to reliably and efficiently reason under uncertainty while capturing complex relationships in data. Probabilistic circuits (PCs), a prominent family of tractable probabilistic models, offer a remedy to this challenge by composing simple, tractable distributions into a high-dimensional probability distribution. However, learning PCs on heteroge… ▽ More

    Submitted 12 December, 2023; originally announced December 2023.

    Comments: Published at NeurIPS 2023

  6. arXiv:2110.12066  [pdf, other

    cs.LG stat.ML

    The Causal Loss: Driving Correlation to Imply Causation

    Authors: Moritz Willig, Matej Zečević, Devendra Singh Dhami, Kristian Kersting

    Abstract: Most algorithms in classical and contemporary machine learning focus on correlation-based dependence between features to drive performance. Although success has been observed in many relevant problems, these algorithms fail when the underlying causality is inconsistent with the assumed relations. We propose a novel model-agnostic loss function called Causal Loss that improves the interventional qu… ▽ More

    Submitted 22 October, 2021; originally announced October 2021.

    Comments: Main paper: 8 pages, References: 2 pages, Appendix: 3 pages. Figures: 4 main, 4 appendix. Tables: 2 main

  7. arXiv:2109.04173  [pdf, other

    cs.LG stat.ML

    Relating Graph Neural Networks to Structural Causal Models

    Authors: Matej Zečević, Devendra Singh Dhami, Petar Veličković, Kristian Kersting

    Abstract: Causality can be described in terms of a structural causal model (SCM) that carries information on the variables of interest and their mechanistic relations. For most processes of interest the underlying SCM will only be partially observable, thus causal inference tries leveraging the exposed. Graph neural networks (GNN) as universal approximators on structured input pose a viable candidate for ca… ▽ More

    Submitted 22 October, 2021; v1 submitted 9 September, 2021; originally announced September 2021.

    Comments: Main paper: 12 pages, References: 2 pages, Appendix: 13 pages; Main paper: 4 figures, Appendix: 2 figures

  8. arXiv:2106.08687  [pdf, other

    cs.LG stat.ML

    Leveraging Probabilistic Circuits for Nonparametric Multi-Output Regression

    Authors: Zhongjie Yu, Mingye Zhu, Martin Trapp, Arseny Skryagin, Kristian Kersting

    Abstract: Inspired by recent advances in the field of expert-based approximations of Gaussian processes (GPs), we present an expert-based approach to large-scale multi-output regression using single-output GP experts. Employing a deeply structured mixture of single-output GPs encoded via a probabilistic circuit allows us to capture correlations between multiple output dimensions accurately. By recursively p… ▽ More

    Submitted 1 August, 2021; v1 submitted 16 June, 2021; originally announced June 2021.

    Comments: Accepted for the 37th Conference on Uncertainty in Artificial Intelligence (UAI 2021)

  9. arXiv:2104.01148  [pdf, other

    cs.CV cs.LG stat.ML

    Decomposing 3D Scenes into Objects via Unsupervised Volume Segmentation

    Authors: Karl Stelzner, Kristian Kersting, Adam R. Kosiorek

    Abstract: We present ObSuRF, a method which turns a single image of a scene into a 3D model represented as a set of Neural Radiance Fields (NeRFs), with each NeRF corresponding to a different object. A single forward pass of an encoder network outputs a set of latent vectors describing the objects in the scene. These vectors are used independently to condition a NeRF decoder, defining the geometry and appea… ▽ More

    Submitted 2 April, 2021; originally announced April 2021.

    Comments: 15 pages, 3 figures. For project page with videos, see http://stelzner.github.io/obsurf/

  10. arXiv:2007.08663  [pdf, ps, other

    cs.LG cs.NE stat.ML

    TUDataset: A collection of benchmark datasets for learning with graphs

    Authors: Christopher Morris, Nils M. Kriege, Franka Bause, Kristian Kersting, Petra Mutzel, Marion Neumann

    Abstract: Recently, there has been an increasing interest in (supervised) learning with graph data, especially using graph neural networks. However, the development of meaningful benchmark datasets and standardized evaluation procedures is lagging, consequently hindering advancements in this area. To address this, we introduce the TUDataset for graph classification and regression. The collection consists of… ▽ More

    Submitted 16 July, 2020; originally announced July 2020.

    Comments: ICML 2020 workshop "Graph Representation Learning and Beyond"

  11. arXiv:2004.06231  [pdf, other

    cs.LG stat.ML

    Einsum Networks: Fast and Scalable Learning of Tractable Probabilistic Circuits

    Authors: Robert Peharz, Steven Lang, Antonio Vergari, Karl Stelzner, Alejandro Molina, Martin Trapp, Guy Van den Broeck, Kristian Kersting, Zoubin Ghahramani

    Abstract: Probabilistic circuits (PCs) are a promising avenue for probabilistic modeling, as they permit a wide range of exact and efficient inference routines. Recent ``deep-learning-style'' implementations of PCs strive for a better scalability, but are still difficult to train on real-world data, due to their sparsely connected computational graphs. In this paper, we propose Einsum Networks (EiNets), a n… ▽ More

    Submitted 13 April, 2020; originally announced April 2020.

  12. arXiv:2001.05371  [pdf, other

    cs.LG cs.AI stat.ML

    Making deep neural networks right for the right scientific reasons by interacting with their explanations

    Authors: Patrick Schramowski, Wolfgang Stammer, Stefano Teso, Anna Brugger, Xiaoting Shao, Hans-Georg Luigs, Anne-Katrin Mahlein, Kristian Kersting

    Abstract: Deep neural networks have shown excellent performances in many real-world applications. Unfortunately, they may show "Clever Hans"-like behavior -- making use of confounding factors within datasets -- to achieve high performance. In this work, we introduce the novel learning setting of "explanatory interactive learning" (XIL) and illustrate its benefits on a plant phenotyping research task. XIL ad… ▽ More

    Submitted 5 March, 2024; v1 submitted 15 January, 2020; originally announced January 2020.

    Comments: arXiv admin note: text overlap with arXiv:1805.08578

  13. arXiv:1912.05238  [pdf, other

    cs.CL cs.AI cs.LG stat.ML

    BERT has a Moral Compass: Improvements of ethical and moral values of machines

    Authors: Patrick Schramowski, Cigdem Turan, Sophie Jentzsch, Constantin Rothkopf, Kristian Kersting

    Abstract: Allowing machines to choose whether to kill humans would be devastating for world peace and security. But how do we equip machines with the ability to learn ethical or even moral choices? Jentzsch et al.(2019) showed that applying machine learning to human texts can extract deontological ethical reasoning about "right" and "wrong" conduct by calculating a moral bias score on a sentence level using… ▽ More

    Submitted 11 December, 2019; originally announced December 2019.

  14. arXiv:1910.02425  [pdf, other

    cs.LG cs.CV stat.ML

    Structured Object-Aware Physics Prediction for Video Modeling and Planning

    Authors: Jannik Kossen, Karl Stelzner, Marcel Hussing, Claas Voelcker, Kristian Kersting

    Abstract: When humans observe a physical system, they can easily locate objects, understand their interactions, and anticipate future behavior, even in settings with complicated and previously unseen interactions. For computers, however, learning such models from videos in an unsupervised fashion is an unsolved research problem. In this paper, we present STOVE, a novel state-space model for videos, which ex… ▽ More

    Submitted 12 February, 2020; v1 submitted 6 October, 2019; originally announced October 2019.

    Comments: Published as a conference paper at 2020 International Conference for Learning Representations

  15. arXiv:1908.03250  [pdf, other

    cs.LG cs.AI stat.ML

    Random Sum-Product Forests with Residual Links

    Authors: Fabrizio Ventola, Karl Stelzner, Alejandro Molina, Kristian Kersting

    Abstract: Tractable yet expressive density estimators are a key building block of probabilistic machine learning. While sum-product networks (SPNs) offer attractive inference capabilities, obtaining structures large enough to fit complex, high-dimensional data has proven challenging. In this paper, we present random sum-product forests (RSPFs), an ensemble approach for mixing multiple randomly generated SPN… ▽ More

    Submitted 8 August, 2019; originally announced August 2019.

  16. arXiv:1905.08550  [pdf, other

    cs.LG stat.ML

    Conditional Sum-Product Networks: Imposing Structure on Deep Probabilistic Architectures

    Authors: Xiaoting Shao, Alejandro Molina, Antonio Vergari, Karl Stelzner, Robert Peharz, Thomas Liebig, Kristian Kersting

    Abstract: Probabilistic graphical models are a central tool in AI; however, they are generally not as expressive as deep neural models, and inference is notoriously hard and slow. In contrast, deep probabilistic models such as sum-product networks (SPNs) capture joint distributions in a tractable fashion, but still lack the expressive power of intractable models based on deep neural networks. Therefore, we… ▽ More

    Submitted 29 September, 2019; v1 submitted 21 May, 2019; originally announced May 2019.

    Comments: 13 pages, 6 figures

  17. arXiv:1901.03704  [pdf, other

    cs.LG stat.ML

    SPFlow: An Easy and Extensible Library for Deep Probabilistic Learning using Sum-Product Networks

    Authors: Alejandro Molina, Antonio Vergari, Karl Stelzner, Robert Peharz, Pranav Subramani, Nicola Di Mauro, Pascal Poupart, Kristian Kersting

    Abstract: We introduce SPFlow, an open-source Python library providing a simple interface to inference, learning and manipulation routines for deep and tractable probabilistic models called Sum-Product Networks (SPNs). The library allows one to quickly create SPNs both from data and through a domain specific language (DSL). It efficiently implements several probabilistic inference routines like computing ma… ▽ More

    Submitted 11 January, 2019; originally announced January 2019.

    Comments: 4 pages, 1 figure, code

  18. arXiv:1808.02123  [pdf, other

    cs.LG cs.AI stat.ML

    Structure Learning for Relational Logistic Regression: An Ensemble Approach

    Authors: Nandini Ramanan, Gautam Kunapuli, Tushar Khot, Bahare Fatemi, Seyed Mehran Kazemi, David Poole, Kristian Kersting, Sriraam Natarajan

    Abstract: We consider the problem of learning Relational Logistic Regression (RLR). Unlike standard logistic regression, the features of RLRs are first-order formulae with associated weight vectors instead of scalar weights. We turn the problem of learning RLR to learning these vector-weighted formulae and develop a learning algorithm based on the recently successful functional-gradient boosting methods for… ▽ More

    Submitted 6 August, 2018; originally announced August 2018.

  19. arXiv:1807.09306  [pdf, other

    stat.ML cs.LG

    Automatic Bayesian Density Analysis

    Authors: Antonio Vergari, Alejandro Molina, Robert Peharz, Zoubin Ghahramani, Kristian Kersting, Isabel Valera

    Abstract: Making sense of a dataset in an automatic and unsupervised fashion is a challenging problem in statistics and AI. Classical approaches for {exploratory data analysis} are usually not flexible enough to deal with the uncertainty inherent to real-world data: they are often restricted to fixed latent interaction models and homogeneous likelihoods; they are sensitive to missing, corrupt and anomalous… ▽ More

    Submitted 10 February, 2019; v1 submitted 24 July, 2018; originally announced July 2018.

    Comments: In proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence (AAAI-19)

  20. arXiv:1806.01910  [pdf, other

    cs.LG cs.AI stat.ML

    Probabilistic Deep Learning using Random Sum-Product Networks

    Authors: Robert Peharz, Antonio Vergari, Karl Stelzner, Alejandro Molina, Martin Trapp, Kristian Kersting, Zoubin Ghahramani

    Abstract: The need for consistent treatment of uncertainty has recently triggered increased interest in probabilistic deep learning methods. However, most current approaches have severe limitations when it comes to inference, since many of these models do not even permit to evaluate exact data likelihoods. Sum-product networks (SPNs), on the other hand, are an excellent architecture in that regard, as they… ▽ More

    Submitted 22 June, 2018; v1 submitted 5 June, 2018; originally announced June 2018.

  21. arXiv:1805.08578  [pdf, other

    stat.ML cs.LG

    "Why Should I Trust Interactive Learners?" Explaining Interactive Queries of Classifiers to Users

    Authors: Stefano Teso, Kristian Kersting

    Abstract: Although interactive learning puts the user into the loop, the learner remains mostly a black box for the user. Understanding the reasons behind queries and predictions is important when assessing how the learner works and, in turn, trust. Consequently, we propose the novel framework of explanatory interactive learning: in each step, the learner explains its interactive query to the user, and she… ▽ More

    Submitted 22 May, 2018; originally announced May 2018.

    Comments: Submitted to NIPS 2018

  22. arXiv:1803.04300  [pdf, other

    cs.LG stat.ML

    Neural Conditional Gradients

    Authors: Patrick Schramowski, Christian Bauckhage, Kristian Kersting

    Abstract: The move from hand-designed to learned optimizers in machine learning has been quite successful for gradient-based and -free optimizers. When facing a constrained problem, however, maintaining feasibility typically requires a projection step, which might be computationally expensive and not differentiable. We show how the design of projection-free convex optimization algorithms can be cast as a le… ▽ More

    Submitted 30 July, 2018; v1 submitted 12 March, 2018; originally announced March 2018.

    Comments: arXiv admin note: text overlap with arXiv:1610.05120 by other authors

  23. arXiv:1710.03297  [pdf, other

    cs.LG stat.ML

    Sum-Product Networks for Hybrid Domains

    Authors: Alejandro Molina, Antonio Vergari, Nicola Di Mauro, Sriraam Natarajan, Floriana Esposito, Kristian Kersting

    Abstract: While all kinds of mixed data -from personal data, over panel and scientific data, to public and commercial data- are collected and stored, building probabilistic graphical models for these hybrid domains becomes more difficult. Users spend significant amounts of time in identifying the parametric form of the random variables (Gaussian, Poisson, Logit, etc.) involved and learning the mixed models.… ▽ More

    Submitted 6 November, 2017; v1 submitted 9 October, 2017; originally announced October 2017.

    Comments: 16 Pages, 5 Figures

  24. arXiv:1710.03285  [pdf, other

    cs.AI cs.LG stat.ML

    Coresets for Dependency Networks

    Authors: Alejandro Molina, Alexander Munteanu, Kristian Kersting

    Abstract: Many applications infer the structure of a probabilistic graphical model from data to elucidate the relationships between variables. But how can we train graphical models on a massive data set? In this paper, we show how to construct coresets -compressed data sets which can be used as proxy for the original data and have provably bounded worst case error- for Gaussian dependency networks (DNs), i.… ▽ More

    Submitted 16 October, 2017; v1 submitted 9 October, 2017; originally announced October 2017.

    Comments: 16 pages, 3 figures

  25. arXiv:1703.02379  [pdf, other

    cs.LG stat.ML

    Global Weisfeiler-Lehman Graph Kernels

    Authors: Christopher Morris, Kristian Kersting, Petra Mutzel

    Abstract: Most state-of-the-art graph kernels only take local graph properties into account, i.e., the kernel is computed with regard to properties of the neighborhood of vertices or other small substructures. On the other hand, kernels that do take global graph propertiesinto account may not scale well to large graph databases. Here we propose to start exploring the space between local and global graph ker… ▽ More

    Submitted 22 September, 2017; v1 submitted 7 March, 2017; originally announced March 2017.

    Comments: 10 pages, accepted at IEEE ICDM 2017 ("Glocalized Weisfeiler-Lehman Graph Kernels: Global-Local Feature Maps of Graphs")

  26. A Unifying View of Explicit and Implicit Feature Maps of Graph Kernels

    Authors: Nils M. Kriege, Marion Neumann, Christopher Morris, Kristian Kersting, Petra Mutzel

    Abstract: Non-linear kernel methods can be approximated by fast linear ones using suitable explicit feature maps allowing their application to large scale problems. We investigate how convolution kernels for structured data are composed from base kernels and construct corresponding feature maps. On this basis we propose exact and approximative feature maps for widely used graph kernels based on the kernel t… ▽ More

    Submitted 3 September, 2019; v1 submitted 2 March, 2017; originally announced March 2017.

    Journal ref: Data Mining and Knowledge Discovery 33 (2019) 1505-1547

  27. arXiv:1610.00064  [pdf, other

    cs.LG stat.ML

    Faster Kernels for Graphs with Continuous Attributes via Hashing

    Authors: Christopher Morris, Nils M. Kriege, Kristian Kersting, Petra Mutzel

    Abstract: While state-of-the-art kernels for graphs with discrete labels scale well to graphs with thousands of nodes, the few existing kernels for graphs with continuous attributes, unfortunately, do not scale well. To overcome this limitation, we present hash graph kernels, a general framework to derive kernels for graphs with continuous attributes from discrete ones. The idea is to iteratively turn conti… ▽ More

    Submitted 30 September, 2016; originally announced October 2016.

    Comments: IEEE ICDM 2016

  28. arXiv:1606.05110  [pdf, other

    stat.ML cs.CY

    Machine Learning meets Data-Driven Journalism: Boosting International Understanding and Transparency in News Coverage

    Authors: Elena Erdmann, Karin Boczek, Lars Koppers, Gerret von Nordheim, Christian Pölitz, Alejandro Molina, Katharina Morik, Henrik Müller, Jörg Rahnenführer, Kristian Kersting

    Abstract: Migration crisis, climate change or tax havens: Global challenges need global solutions. But agreeing on a joint approach is difficult without a common ground for discussion. Public spheres are highly segmented because news are mainly produced and received on a national level. Gain- ing a global view on international debates about important issues is hindered by the enormous quantity of news and b… ▽ More

    Submitted 16 June, 2016; originally announced June 2016.

    Comments: presented at 2016 ICML Workshop on #Data4Good: Machine Learning in Social Good Applications, New York, NY

  29. arXiv:1606.02346  [pdf, other

    cs.LG cs.PF cs.SI stat.ML

    How is a data-driven approach better than random choice in label space division for multi-label classification?

    Authors: Piotr Szymański, Tomasz Kajdanowicz, Kristian Kersting

    Abstract: We propose using five data-driven community detection approaches from social networks to partition the label space for the task of multi-label classification as an alternative to random partitioning into equal subsets as performed by RAkELd: modularity-maximizing fastgreedy and leading eigenvector, infomap, walktrap and label propagation algorithms. We construct a label co-occurence graph (both we… ▽ More

    Submitted 7 June, 2016; originally announced June 2016.

  30. arXiv:1410.3314  [pdf, other

    stat.ML cs.LG

    Propagation Kernels

    Authors: Marion Neumann, Roman Garnett, Christian Bauckhage, Kristian Kersting

    Abstract: We introduce propagation kernels, a general graph-kernel framework for efficiently measuring the similarity of structured data. Propagation kernels are based on monitoring how information spreads through a set of given graphs. They leverage early-stage distributions from propagation schemes such as random walks to capture structural information encoded in node labels, attributes, and edge informat… ▽ More

    Submitted 13 October, 2014; originally announced October 2014.

  31. arXiv:1407.0179  [pdf, other

    stat.ML cs.LG

    Mind the Nuisance: Gaussian Process Classification using Privileged Noise

    Authors: Daniel Hernández-Lobato, Viktoriia Sharmanska, Kristian Kersting, Christoph H. Lampert, Novi Quadrianto

    Abstract: The learning with privileged information setting has recently attracted a lot of attention within the machine learning community, as it allows the integration of additional knowledge into the training process of a classifier, even when this comes in the form of a data modality that is not available at test time. Here, we show that privileged information can naturally be treated as noise in the lat… ▽ More

    Submitted 1 July, 2014; originally announced July 2014.

    Comments: 14 pages with figures

  32. arXiv:1210.4919  [pdf

    cs.LG cs.CE stat.ML

    Latent Dirichlet Allocation Uncovers Spectral Characteristics of Drought Stressed Plants

    Authors: Mirwaes Wahabzada, Kristian Kersting, Christian Bauckhage, Christoph Roemer, Agim Ballvora, Francisco Pinto, Uwe Rascher, Jens Leon, Lutz Ploemer

    Abstract: Understanding the adaptation process of plants to drought stress is essential in improving management practices, breeding strategies as well as engineering viable crops for a sustainable agriculture in the coming decades. Hyper-spectral imaging provides a particularly promising approach to gain such understanding since it allows to discover non-destructively spectral characteristics of plants gove… ▽ More

    Submitted 16 October, 2012; originally announced October 2012.

    Comments: Appears in Proceedings of the Twenty-Eighth Conference on Uncertainty in Artificial Intelligence (UAI2012)

    Report number: UAI-P-2012-PG-852-862