Skip to main content

Showing 1–50 of 52 results for author: Dhami, D S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2506.03915  [pdf, ps, other

    cs.AI

    Causal Explanations Over Time: Articulated Reasoning for Interactive Environments

    Authors: Sebastian Rödling, Matej Zečević, Devendra Singh Dhami, Kristian Kersting

    Abstract: Structural Causal Explanations (SCEs) can be used to automatically generate explanations in natural language to questions about given data that are grounded in a (possibly learned) causal model. Unfortunately they work for small data only. In turn they are not attractive to offer reasons for events, e.g., tracking causal changes over multiple time steps, or a behavioral component that involves fee… ▽ More

    Submitted 4 June, 2025; originally announced June 2025.

    Comments: Main paper: 9 pages, References: 2 pages, Supplementary: 9 pages. Number of figures: 10, number of tables: 3

  2. arXiv:2503.08141  [pdf, other

    cs.LG

    Scaling Probabilistic Circuits via Data Partitioning

    Authors: Jonas Seng, Florian Peter Busch, Pooja Prasad, Devendra Singh Dhami, Martin Mundt, Kristian Kersting

    Abstract: Probabilistic circuits (PCs) enable us to learn joint distributions over a set of random variables and to perform various probabilistic queries in a tractable fashion. Though the tractability property allows PCs to scale beyond non-tractable models such as Bayesian Networks, scaling training and inference of PCs to larger, real-world datasets remains challenging. To remedy the situation, we show h… ▽ More

    Submitted 11 March, 2025; originally announced March 2025.

  3. arXiv:2502.09981  [pdf, other

    cs.LG

    Exploring Neural Granger Causality with xLSTMs: Unveiling Temporal Dependencies in Complex Data

    Authors: Harsh Poonia, Felix Divo, Kristian Kersting, Devendra Singh Dhami

    Abstract: Causality in time series can be difficult to determine, especially in the presence of non-linear dependencies. The concept of Granger causality helps analyze potential relationships between variables, thereby offering a method to determine whether one time series can predict - Granger cause - future values of another. Although successful, Granger causal methods still struggle with capturing long-r… ▽ More

    Submitted 21 May, 2025; v1 submitted 14 February, 2025; originally announced February 2025.

    ACM Class: I.2.6

  4. arXiv:2501.01439  [pdf, other

    cs.AI cs.RO

    Probabilistic Mission Design in Neuro-Symbolic Systems

    Authors: Simon Kohaut, Benedict Flade, Daniel Ochs, Devendra Singh Dhami, Julian Eggert, Kristian Kersting

    Abstract: Advanced Air Mobility (AAM) is a growing field that demands accurate modeling of legal concepts and restrictions in navigating intelligent vehicles. In addition, any implementation of AAM needs to face the challenges posed by inherently dynamic and uncertain human-inhabited spaces robustly. Nevertheless, the employment of Unmanned Aircraft Systems (UAS) beyond visual line of sight (BVLOS) is an en… ▽ More

    Submitted 25 December, 2024; originally announced January 2025.

    Comments: arXiv admin note: text overlap with arXiv:2406.03454

  5. arXiv:2412.18514  [pdf, other

    cs.RO

    Hybrid Many-Objective Optimization in Probabilistic Mission Design for Compliant and Effective UAV Routing

    Authors: Simon Kohaut, Nikolas Hohmann, Sebastian Brulin, Benedict Flade, Julian Eggert, Markus Olhofer, Jürgen Adamy, Devendra Singh Dhami, Kristian Kersting

    Abstract: Advanced Aerial Mobility encompasses many outstanding applications that promise to revolutionize modern logistics and pave the way for various public services and industry uses. However, throughout its history, the development of such systems has been impeded by the complexity of legal restrictions and physical constraints. While airspaces are often tightly shaped by various legal requirements, Un… ▽ More

    Submitted 24 December, 2024; originally announced December 2024.

  6. arXiv:2412.18356  [pdf, other

    cs.RO

    StaR Maps: Unveiling Uncertainty in Geospatial Relations

    Authors: Simon Kohaut, Benedict Flade, Julian Eggert, Devendra Singh Dhami, Kristian Kersting

    Abstract: The growing complexity of intelligent transportation systems and their applications in public spaces has increased the demand for expressive and versatile knowledge representation. While various mapping efforts have achieved widespread coverage, including detailed annotation of features with semantic labels, it is essential to understand their inherent uncertainties, which are commonly underrepres… ▽ More

    Submitted 24 December, 2024; originally announced December 2024.

  7. arXiv:2412.18347  [pdf, other

    cs.RO

    The Constitutional Filter: Bayesian Estimation of Compliant Agents

    Authors: Simon Kohaut, Felix Divo, Benedict Flade, Devendra Singh Dhami, Julian Eggert, Kristian Kersting

    Abstract: Predicting agents impacted by legal policies, physical limitations, and operational preferences is inherently difficult. In recent years, neuro-symbolic methods have emerged, integrating machine learning and symbolic reasoning models into end-to-end learnable systems. Hereby, a promising avenue for expressing high-level constraints over multi-modal input data in robotics has opened up. This work i… ▽ More

    Submitted 4 March, 2025; v1 submitted 24 December, 2024; originally announced December 2024.

  8. arXiv:2412.14814  [pdf, other

    cs.AI cs.LG cs.SC

    Answer Set Networks: Casting Answer Set Programming into Deep Learning

    Authors: Arseny Skryagin, Daniel Ochs, Phillip Deibert, Simon Kohaut, Devendra Singh Dhami, Kristian Kersting

    Abstract: Although Answer Set Programming (ASP) allows constraining neural-symbolic (NeSy) systems, its employment is hindered by the prohibitive costs of computing stable models and the CPU-bound nature of state-of-the-art solvers. To this end, we propose Answer Set Networks (ASN), a NeSy solver. Based on Graph Neural Networks (GNN), ASNs are a scalable approach to ASP-based Deep Probabilistic Logic Progra… ▽ More

    Submitted 19 December, 2024; originally announced December 2024.

    Comments: 16 pages, 9 figures

    MSC Class: 68T37; 68T30; 68T27 ACM Class: I.2.4; I.2.5

  9. arXiv:2412.04064  [pdf, other

    cs.LG cs.AI

    Graph Neural Networks Need Cluster-Normalize-Activate Modules

    Authors: Arseny Skryagin, Felix Divo, Mohammad Amin Ali, Devendra Singh Dhami, Kristian Kersting

    Abstract: Graph Neural Networks (GNNs) are non-Euclidean deep learning models for graph-structured data. Despite their successful and diverse applications, oversmoothing prohibits deep architectures due to node features converging to a single fixed point. This severely limits their potential to solve complex tasks. To counteract this tendency, we propose a plug-and-play module consisting of three steps: Clu… ▽ More

    Submitted 5 December, 2024; originally announced December 2024.

    Comments: 17 pages, 6 figures, 6 tables, accepted at NeurIPS 2024

    MSC Class: 68T07 ACM Class: I.2.0

  10. arXiv:2411.05791  [pdf, ps, other

    q-fin.ST cs.LG econ.GN stat.AP

    Forecasting Company Fundamentals

    Authors: Felix Divo, Eric Endress, Kevin Endler, Kristian Kersting, Devendra Singh Dhami

    Abstract: Company fundamentals are key to assessing companies' financial and overall success and stability. Forecasting them is important in multiple fields, including investing and econometrics. While statistical and contemporary machine learning methods have been applied to many time series tasks, there is a lack of comparison of these approaches on this particularly challenging data regime. To this end,… ▽ More

    Submitted 3 June, 2025; v1 submitted 21 October, 2024; originally announced November 2024.

    Comments: See https://openreview.net/forum?id=haf78jerSt

    ACM Class: I.2.6

    Journal ref: Transactions on Machine Learning Research (2025)

  11. arXiv:2410.19546  [pdf, other

    cs.AI cs.LG

    Bongard in Wonderland: Visual Puzzles that Still Make AI Go Mad?

    Authors: Antonia Wüst, Tim Tobiasch, Lukas Helff, Inga Ibs, Wolfgang Stammer, Devendra S. Dhami, Constantin A. Rothkopf, Kristian Kersting

    Abstract: Recently, newly developed Vision-Language Models (VLMs), such as OpenAI's o1, have emerged, seemingly demonstrating advanced reasoning capabilities across text and image modalities. However, the depth of these advances in language-guided perception and abstract reasoning remains underexplored, and it is unclear whether these models can truly live up to their ambitious promises. To assess the progr… ▽ More

    Submitted 25 February, 2025; v1 submitted 25 October, 2024; originally announced October 2024.

  12. arXiv:2410.16928  [pdf, other

    cs.LG

    xLSTM-Mixer: Multivariate Time Series Forecasting by Mixing via Scalar Memories

    Authors: Maurice Kraus, Felix Divo, Devendra Singh Dhami, Kristian Kersting

    Abstract: Time series data is prevalent across numerous fields, necessitating the development of robust and accurate forecasting models. Capturing patterns both within and between temporal and multivariate components is crucial for reliable predictions. We introduce xLSTM-Mixer, a model designed to effectively integrate temporal sequences, joint time-variate information, and multiple perspectives for robust… ▽ More

    Submitted 23 October, 2024; v1 submitted 22 October, 2024; originally announced October 2024.

  13. arXiv:2410.13054  [pdf, other

    cs.LG cs.AI stat.ML

    Systems with Switching Causal Relations: A Meta-Causal Perspective

    Authors: Moritz Willig, Tim Nelson Tobiasch, Florian Peter Busch, Jonas Seng, Devendra Singh Dhami, Kristian Kersting

    Abstract: Most work on causality in machine learning assumes that causal relationships are driven by a constant underlying process. However, the flexibility of agents' actions or tipping points in the environmental process can change the qualitative dynamics of the system. As a result, new causal relationships may emerge, while existing ones change or disappear, resulting in an altered causal graph. To anal… ▽ More

    Submitted 17 April, 2025; v1 submitted 16 October, 2024; originally announced October 2024.

    Comments: 21 pages, 3 figures, 4 tables, ICLR 2025 Camera Ready Version

  14. arXiv:2410.11689  [pdf, other

    cs.LG cs.AI

    BlendRL: A Framework for Merging Symbolic and Neural Policy Learning

    Authors: Hikaru Shindo, Quentin Delfosse, Devendra Singh Dhami, Kristian Kersting

    Abstract: Humans can leverage both symbolic reasoning and intuitive reactions. In contrast, reinforcement learning policies are typically encoded in either opaque systems like neural networks or symbolic systems that rely on predefined symbols and rules. This disjointed approach severely limits the agents' capabilities, as they often lack either the flexible low-level reaction characteristic of neural agent… ▽ More

    Submitted 21 April, 2025; v1 submitted 15 October, 2024; originally announced October 2024.

    Comments: ICLR 2025 (Spotlight)

  15. arXiv:2408.07545  [pdf, other

    cs.LG cs.AI

    $χ$SPN: Characteristic Interventional Sum-Product Networks for Causal Inference in Hybrid Domains

    Authors: Harsh Poonia, Moritz Willig, Zhongjie Yu, Matej Zečević, Kristian Kersting, Devendra Singh Dhami

    Abstract: Causal inference in hybrid domains, characterized by a mixture of discrete and continuous variables, presents a formidable challenge. We take a step towards this direction and propose Characteristic Interventional Sum-Product Network ($χ$SPN) that is capable of estimating interventional distributions in presence of random variables drawn from mixed distributions. $χ$SPN uses characteristic functio… ▽ More

    Submitted 14 August, 2024; originally announced August 2024.

    Comments: 17 pages, 11 figures. Accepted as poster at UAI (Uncertainty in Artificial Intelligence) 2024

  16. Towards Probabilistic Clearance, Explanation and Optimization

    Authors: Simon Kohaut, Benedict Flade, Devendra Singh Dhami, Julian Eggert, Kristian Kersting

    Abstract: Employing Unmanned Aircraft Systems (UAS) beyond visual line of sight (BVLOS) is an endearing and challenging task. While UAS have the potential to significantly enhance today's logistics and emergency response capabilities, unmanned flying objects above the heads of unprotected pedestrians induce similarly significant safety risks. In this work, we make strides towards improved safety and legal c… ▽ More

    Submitted 21 June, 2024; originally announced June 2024.

  17. arXiv:2406.06107  [pdf, other

    cs.AI

    EXPIL: Explanatory Predicate Invention for Learning in Games

    Authors: Jingyuan Sha, Hikaru Shindo, Quentin Delfosse, Kristian Kersting, Devendra Singh Dhami

    Abstract: Reinforcement learning (RL) has proven to be a powerful tool for training agents that excel in various games. However, the black-box nature of neural network models often hinders our ability to understand the reasoning behind the agent's actions. Recent research has attempted to address this issue by using the guidance of pretrained neural agents to encode logic-based policies, allowing for interp… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: 9 pages, 2 pages references, 8 figures, 3 tables

  18. Mission Design for Unmanned Aerial Vehicles using Hybrid Probabilistic Logic Programs

    Authors: Simon Kohaut, Benedict Flade, Devendra Singh Dhami, Julian Eggert, Kristian Kersting

    Abstract: Advanced Air Mobility (AAM) is a growing field that demands a deep understanding of legal, spatial and temporal concepts in navigation. Hence, any implementation of AAM is forced to deal with the inherent uncertainties of human-inhabited spaces. Enabling growth and innovation requires the creation of a system for safe and robust mission design, i.e., the way we formalize intentions and decide thei… ▽ More

    Submitted 18 October, 2024; v1 submitted 5 June, 2024; originally announced June 2024.

  19. arXiv:2402.15404  [pdf, other

    cs.LG

    United We Pretrain, Divided We Fail! Representation Learning for Time Series by Pretraining on 75 Datasets at Once

    Authors: Maurice Kraus, Felix Divo, David Steinmann, Devendra Singh Dhami, Kristian Kersting

    Abstract: In natural language processing and vision, pretraining is utilized to learn effective representations. Unfortunately, the success of pretraining does not easily carry over to time series due to potential mismatch between sources and target. Actually, common belief is that multi-dataset pretraining does not work for time series! Au contraire, we introduce a new self-supervised contrastive pretraini… ▽ More

    Submitted 23 February, 2024; originally announced February 2024.

  20. arXiv:2402.14123  [pdf, other

    cs.LG cs.AI cs.CV

    DeiSAM: Segment Anything with Deictic Prompting

    Authors: Hikaru Shindo, Manuel Brack, Gopika Sudhakaran, Devendra Singh Dhami, Patrick Schramowski, Kristian Kersting

    Abstract: Large-scale, pre-trained neural networks have demonstrated strong capabilities in various tasks, including zero-shot image segmentation. To identify concrete objects in complex scenes, humans instinctively rely on deictic descriptions in natural language, i.e., referring to something depending on the context such as "The object that is on the desk and behind the cup.". However, deep learning appro… ▽ More

    Submitted 5 December, 2024; v1 submitted 21 February, 2024; originally announced February 2024.

    Comments: Published as a conference paper at NeurIPS 2024

  21. arXiv:2402.08280  [pdf, other

    cs.AI cs.CV cs.LG

    Pix2Code: Learning to Compose Neural Visual Concepts as Programs

    Authors: Antonia Wüst, Wolfgang Stammer, Quentin Delfosse, Devendra Singh Dhami, Kristian Kersting

    Abstract: The challenge in learning abstract concepts from images in an unsupervised fashion lies in the required integration of visual perception and generalizable relational reasoning. Moreover, the unsupervised nature of this task makes it necessary for human users to be able to understand a model's learnt concepts and potentially revise false behaviours. To tackle both the generalizability and interpret… ▽ More

    Submitted 6 July, 2024; v1 submitted 13 February, 2024; originally announced February 2024.

  22. arXiv:2310.08377  [pdf, other

    cs.AI

    Do Not Marginalize Mechanisms, Rather Consolidate!

    Authors: Moritz Willig, Matej Zečević, Devendra Singh Dhami, Kristian Kersting

    Abstract: Structural causal models (SCMs) are a powerful tool for understanding the complex causal relationships that underlie many real-world systems. As these systems grow in size, the number of variables and complexity of interactions between them does, too. Thus, becoming convoluted and difficult to analyze. This is particularly true in the context of machine learning and artificial intelligence, where… ▽ More

    Submitted 12 October, 2023; originally announced October 2023.

    Comments: 19 pages, 8 figures

  23. arXiv:2308.13067  [pdf, other

    cs.AI cs.CL

    Causal Parrots: Large Language Models May Talk Causality But Are Not Causal

    Authors: Matej Zečević, Moritz Willig, Devendra Singh Dhami, Kristian Kersting

    Abstract: Some argue scale is all what is needed to achieve AI, covering even causal models. We make it clear that large language models (LLMs) cannot be causal and give reason onto why sometimes we might feel otherwise. To this end, we define and exemplify a new subgroup of Structural Causal Model (SCM) that we call meta SCM which encode causal facts about other SCM within their variables. We conjecture th… ▽ More

    Submitted 24 August, 2023; originally announced August 2023.

    Comments: Published in Transactions in Machine Learning Research (TMLR) (08/2023). Main paper: 17 pages, References: 3 pages, Appendix: 7 pages. Figures: 5 main, 3 appendix. Tables: 3 main

    Journal ref: Transactions in Machine Learning Research (08/2023)

  24. arXiv:2308.09472  [pdf, other

    cs.CV cs.AI

    Vision Relation Transformer for Unbiased Scene Graph Generation

    Authors: Gopika Sudhakaran, Devendra Singh Dhami, Kristian Kersting, Stefan Roth

    Abstract: Recent years have seen a growing interest in Scene Graph Generation (SGG), a comprehensive visual scene understanding task that aims to predict entity relationships using a relation encoder-decoder pipeline stacked on top of an object encoder-decoder backbone. Unfortunately, current SGG methods suffer from an information loss regarding the entities local-level cues during the relation encoding pro… ▽ More

    Submitted 18 August, 2023; originally announced August 2023.

    Comments: Accepted for publication in ICCV 2023

  25. arXiv:2307.00928  [pdf, other

    cs.LG cs.AI cs.CV

    Learning Differentiable Logic Programs for Abstract Visual Reasoning

    Authors: Hikaru Shindo, Viktor Pfanschilling, Devendra Singh Dhami, Kristian Kersting

    Abstract: Visual reasoning is essential for building intelligent agents that understand the world and perform problem-solving beyond perception. Differentiable forward reasoning has been developed to integrate reasoning with gradient-based machine learning paradigms. However, due to the memory intensity, most existing approaches do not bring the best of the expressivity of first-order logic, excluding a cru… ▽ More

    Submitted 3 July, 2023; originally announced July 2023.

    Comments: under review

  26. arXiv:2306.08397  [pdf, other

    cs.AI

    Scalable Neural-Probabilistic Answer Set Programming

    Authors: Arseny Skryagin, Daniel Ochs, Devendra Singh Dhami, Kristian Kersting

    Abstract: The goal of combining the robustness of neural networks and the expressiveness of symbolic methods has rekindled the interest in Neuro-Symbolic AI. Deep Probabilistic Programming Languages (DPPLs) have been developed for probabilistic logic programming to be carried out via the probability estimations of deep neural networks. However, recent SOTA DPPL approaches allow only for limited conditional… ▽ More

    Submitted 14 June, 2023; originally announced June 2023.

    Comments: 37 pages, 14 figures

  27. arXiv:2306.07743  [pdf, other

    cs.AI cs.CV cs.LG

    V-LoL: A Diagnostic Dataset for Visual Logical Learning

    Authors: Lukas Helff, Wolfgang Stammer, Hikaru Shindo, Devendra Singh Dhami, Kristian Kersting

    Abstract: Despite the successes of recent developments in visual AI, different shortcomings still exist; from missing exact logical reasoning, to abstract generalization abilities, to understanding complex and noisy scenes. Unfortunately, existing benchmarks, were not designed to capture more than a few of these aspects. Whereas deep learning datasets focus on visually complex data but simple visual reasoni… ▽ More

    Submitted 13 November, 2024; v1 submitted 13 June, 2023; originally announced June 2023.

  28. arXiv:2212.12570  [pdf, other

    cs.AI cs.CV

    Pearl Causal Hierarchy on Image Data: Intricacies & Challenges

    Authors: Matej Zečević, Moritz Willig, Devendra Singh Dhami, Kristian Kersting

    Abstract: Many researchers have voiced their support towards Pearl's counterfactual theory of causation as a stepping stone for AI/ML research's ultimate goal of intelligent systems. As in any other growing subfield, patience seems to be a virtue since significant progress on integrating notions from both fields takes time, yet, major challenges such as the lack of ground truth benchmarks or a unified persp… ▽ More

    Submitted 23 December, 2022; originally announced December 2022.

    Comments: Main paper: 9 pages, References: 2 pages. Main paper: 7 figures

  29. arXiv:2211.11650  [pdf, other

    cs.AI

    Neural Meta-Symbolic Reasoning and Learning

    Authors: Zihan Ye, Hikaru Shindo, Devendra Singh Dhami, Kristian Kersting

    Abstract: Deep neural learning uses an increasing amount of computation and data to solve very specific problems. By stark contrast, human minds solve a wide range of problems using a fixed amount of computation and limited experience. One ability that seems crucial to this kind of general intelligence is meta-reasoning, i.e., our ability to reason about reasoning. To make deep learning do more from less, w… ▽ More

    Submitted 15 December, 2023; v1 submitted 21 November, 2022; originally announced November 2022.

  30. arXiv:2208.13518  [pdf, other

    cs.AI cs.CL cs.CV cs.LO cs.SC

    LogicRank: Logic Induced Reranking for Generative Text-to-Image Systems

    Authors: Björn Deiseroth, Patrick Schramowski, Hikaru Shindo, Devendra Singh Dhami, Kristian Kersting

    Abstract: Text-to-image models have recently achieved remarkable success with seemingly accurate samples in photo-realistic quality. However as state-of-the-art language models still struggle evaluating precise statements consistently, so do language model based image generation processes. In this work we showcase problems of state-of-the-art text-to-image models like DALL-E with generating accurate samples… ▽ More

    Submitted 29 August, 2022; originally announced August 2022.

  31. arXiv:2206.12342  [pdf, other

    cs.LG

    FEATHERS: Federated Architecture and Hyperparameter Search

    Authors: Jonas Seng, Pooja Prasad, Martin Mundt, Devendra Singh Dhami, Kristian Kersting

    Abstract: Deep neural architectures have profound impact on achieved performance in many of today's AI tasks, yet, their design still heavily relies on human prior knowledge and experience. Neural architecture search (NAS) together with hyperparameter optimization (HO) helps to reduce this dependence. However, state of the art NAS and HO rapidly become infeasible with increasing amount of data being stored… ▽ More

    Submitted 27 March, 2023; v1 submitted 24 June, 2022; originally announced June 2022.

    Comments: Main paper: 8 pages, References: 2 pages, Supplement: 4.5 pages, Main paper: 3 figures, 2 tables, 1 algorithm, Supplement: 2 figure, 4 algorithms, extended previous version by Differential Privacy, theoretical results and more experiments. Updated author list as it was incomplete

  32. arXiv:2206.10591  [pdf, other

    cs.AI cs.CL cs.LG

    Can Foundation Models Talk Causality?

    Authors: Moritz Willig, Matej Zečević, Devendra Singh Dhami, Kristian Kersting

    Abstract: Foundation models are subject to an ongoing heated debate, leaving open the question of progress towards AGI and dividing the community into two camps: the ones who see the arguably impressive results as evidence to the scaling hypothesis, and the others who are worried about the lack of interpretability and reasoning capabilities. By investigating to which extent causal representations might be c… ▽ More

    Submitted 23 December, 2022; v1 submitted 14 June, 2022; originally announced June 2022.

    Comments: Main paper: 6 pages, References: 1.5 pages, Supplement: 11.5 pages. Main paper: 4 figures, Supplement: 3 figures, 8 tables

  33. arXiv:2206.07203  [pdf, other

    cs.LG

    Attributions Beyond Neural Networks: The Linear Program Case

    Authors: Florian Peter Busch, Matej Zečević, Kristian Kersting, Devendra Singh Dhami

    Abstract: Linear Programs (LPs) have been one of the building blocks in machine learning and have championed recent strides in differentiable optimizers for learning systems. While there exist solvers for even high-dimensional LPs, understanding said high-dimensional solutions poses an orthogonal and unresolved problem. We introduce an approach where we consider neural encodings for LPs that justify the app… ▽ More

    Submitted 14 June, 2022; originally announced June 2022.

    Comments: Main paper: 9.5 pages, References: 2 pages, Supplement: 2.5 pages. Main paper: 5 figures, 2 tables, Supplement: 1 figure

  34. arXiv:2206.07196  [pdf, other

    cs.LG

    Towards a Solution to Bongard Problems: A Causal Approach

    Authors: Salahedine Youssef, Matej Zečević, Devendra Singh Dhami, Kristian Kersting

    Abstract: Even though AI has advanced rapidly in recent years displaying success in solving highly complex problems, the class of Bongard Problems (BPs) yet remain largely unsolved by modern ML techniques. In this paper, we propose a new approach in an attempt to not only solve BPs but also extract meaning out of learned representations. This includes the reformulation of the classical BP into a reinforceme… ▽ More

    Submitted 23 December, 2022; v1 submitted 14 June, 2022; originally announced June 2022.

    Comments: Main paper: 12 pages, References: 2 pages, Supplement: 3 pages. Main paper: 9 figures, Supplement: 3 figures

  35. arXiv:2206.07195  [pdf, other

    cs.LG

    Tearing Apart NOTEARS: Controlling the Graph Prediction via Variance Manipulation

    Authors: Jonas Seng, Matej Zečević, Devendra Singh Dhami, Kristian Kersting

    Abstract: Simulations are ubiquitous in machine learning. Especially in graph learning, simulations of Directed Acyclic Graphs (DAG) are being deployed for evaluating new algorithms. In the literature, it was recently argued that continuous-optimization approaches to structure discovery such as NOTEARS might be exploiting the sortability of the variable's variances in the available data due to their use of… ▽ More

    Submitted 14 June, 2022; originally announced June 2022.

    Comments: Main paper: 5.5 pages, References: 1 page, Supplement: 2 pages. Main paper: 3 figures, Supplement: 1 figure, 1 table

  36. arXiv:2206.07194  [pdf, other

    cs.LG

    Machines Explaining Linear Programs

    Authors: David Steinmann, Matej Zečević, Devendra Singh Dhami, Kristian Kersting

    Abstract: There has been a recent push in making machine learning models more interpretable so that their performance can be trusted. Although successful, these methods have mostly focused on the deep learning methods while the fundamental optimization methods in machine learning such as linear programs (LP) have been left out. Even if LPs can be considered as whitebox or clearbox models, they are not easy… ▽ More

    Submitted 14 June, 2022; originally announced June 2022.

    Comments: Main paper: 9.5 pages, References: 2.5 pages, Supplement: 6 pages. Main paper: 5 figures, 4 tables, Supplement: 3 figures, 6 tables

  37. arXiv:2203.15274  [pdf, other

    cs.AI

    Finding Structure and Causality in Linear Programs

    Authors: Matej Zečević, Florian Peter Busch, Devendra Singh Dhami, Kristian Kersting

    Abstract: Linear Programs (LP) are celebrated widely, particularly so in machine learning where they have allowed for effectively solving probabilistic inference tasks or imposing structure on end-to-end learning systems. Their potential might seem depleted but we propose a foundational, causal perspective that reveals intriguing intra- and inter-structure relations for LP components. We conduct a systemati… ▽ More

    Submitted 29 March, 2022; originally announced March 2022.

    Comments: Main paper: 5 pages, References: 2 pages, Appendix: 1 page. Figures: 8 main, 1 appendix. Tables: 1 appendix

  38. arXiv:2110.12066  [pdf, other

    cs.LG stat.ML

    The Causal Loss: Driving Correlation to Imply Causation

    Authors: Moritz Willig, Matej Zečević, Devendra Singh Dhami, Kristian Kersting

    Abstract: Most algorithms in classical and contemporary machine learning focus on correlation-based dependence between features to drive performance. Although success has been observed in many relevant problems, these algorithms fail when the underlying causality is inconsistent with the assumed relations. We propose a novel model-agnostic loss function called Causal Loss that improves the interventional qu… ▽ More

    Submitted 22 October, 2021; originally announced October 2021.

    Comments: Main paper: 8 pages, References: 2 pages, Appendix: 3 pages. Figures: 4 main, 4 appendix. Tables: 2 main

  39. arXiv:2110.12052  [pdf, other

    cs.LG cs.AI

    A Taxonomy for Inference in Causal Model Families

    Authors: Matej Zečević, Devendra Singh Dhami, Kristian Kersting

    Abstract: Neurally-parameterized Structural Causal Models in the Pearlian notion to causality, referred to as NCM, were recently introduced as a step towards next-generation learning systems. However, said NCM are only concerned with the learning aspect of causal inference but totally miss out on the architecture aspect. That is, actual causal inference within NCM is intractable in that the NCM won't return… ▽ More

    Submitted 23 December, 2022; v1 submitted 22 October, 2021; originally announced October 2021.

    Comments: Main paper: 12 pages, References: 3 pages, Appendix: 4 pages. Figures: 3 main, 2 appendix

  40. arXiv:2110.09383  [pdf, other

    cs.AI cs.CV cs.LG

    Neuro-Symbolic Forward Reasoning

    Authors: Hikaru Shindo, Devendra Singh Dhami, Kristian Kersting

    Abstract: Reasoning is an essential part of human intelligence and thus has been a long-standing goal in artificial intelligence research. With the recent success of deep learning, incorporating reasoning with deep learning systems, i.e., neuro-symbolic AI has become a major field of interest. We propose the Neuro-Symbolic Forward Reasoner (NSFR), a new approach for reasoning tasks taking advantage of diffe… ▽ More

    Submitted 18 October, 2021; originally announced October 2021.

    Comments: Preprint

  41. arXiv:2110.03395  [pdf, other

    cs.AI

    SLASH: Embracing Probabilistic Circuits into Neural Answer Set Programming

    Authors: Arseny Skryagin, Wolfgang Stammer, Daniel Ochs, Devendra Singh Dhami, Kristian Kersting

    Abstract: The goal of combining the robustness of neural networks and the expressivity of symbolic methods has rekindled the interest in neuro-symbolic AI. Recent advancements in neuro-symbolic AI often consider specifically-tailored architectures consisting of disjoint neural and symbolic components, and thus do not exhibit desired gains that can be achieved by integrating them into a unifying framework. W… ▽ More

    Submitted 23 November, 2021; v1 submitted 7 October, 2021; originally announced October 2021.

    Comments: 18 pages, 7 figures and 6 tables

    ACM Class: I.2.5; D.3.2

  42. arXiv:2110.02395  [pdf, other

    cs.LG

    Causal Explanations of Structural Causal Models

    Authors: Matej Zečević, Devendra Singh Dhami, Constantin A. Rothkopf, Kristian Kersting

    Abstract: In explanatory interactive learning (XIL) the user queries the learner, then the learner explains its answer to the user and finally the loop repeats. XIL is attractive for two reasons, (1) the learner becomes better and (2) the user's trust increases. For both reasons to hold, the learner's explanations must be useful to the user and the user must be allowed to ask useful questions. Ideally, both… ▽ More

    Submitted 23 December, 2022; v1 submitted 5 October, 2021; originally announced October 2021.

    Comments: Main paper: 9 pages, References: 2.5 pages, Supplement: 12 pages. Main paper: 4 figures, Supplement: 6 figures, 2 tables

  43. arXiv:2109.06587  [pdf, other

    cs.LG

    Sum-Product-Attention Networks: Leveraging Self-Attention in Probabilistic Circuits

    Authors: Zhongjie Yu, Devendra Singh Dhami, Kristian Kersting

    Abstract: Probabilistic circuits (PCs) have become the de-facto standard for learning and inference in probabilistic modeling. We introduce Sum-Product-Attention Networks (SPAN), a new generative model that integrates probabilistic circuits with Transformers. SPAN uses self-attention to select the most relevant parts of a probabilistic circuit, here sum-product networks, to improve the modeling capability o… ▽ More

    Submitted 14 September, 2021; originally announced September 2021.

  44. arXiv:2109.04173  [pdf, other

    cs.LG stat.ML

    Relating Graph Neural Networks to Structural Causal Models

    Authors: Matej Zečević, Devendra Singh Dhami, Petar Veličković, Kristian Kersting

    Abstract: Causality can be described in terms of a structural causal model (SCM) that carries information on the variables of interest and their mechanistic relations. For most processes of interest the underlying SCM will only be partially observable, thus causal inference tries leveraging the exposed. Graph neural networks (GNN) as universal approximators on structured input pose a viable candidate for ca… ▽ More

    Submitted 22 October, 2021; v1 submitted 9 September, 2021; originally announced September 2021.

    Comments: Main paper: 12 pages, References: 2 pages, Appendix: 13 pages; Main paper: 4 figures, Appendix: 2 figures

  45. arXiv:2105.12697  [pdf, other

    cs.LG cs.CR

    Structural Causal Models Reveal Confounder Bias in Linear Program Modelling

    Authors: Matej Zečević, Devendra Singh Dhami, Kristian Kersting

    Abstract: The recent years have been marked by extended research on adversarial attacks, especially on deep neural networks. With this work we intend on posing and investigating the question of whether the phenomenon might be more general in nature, that is, adversarial-style attacks outside classical classification tasks. Specifically, we investigate optimization problems as they constitute a fundamental p… ▽ More

    Submitted 7 November, 2023; v1 submitted 26 May, 2021; originally announced May 2021.

    Comments: Published at the 15th Asian Conference on Machine Learning (ACML 2023) Journal Track. Main paper: 19 pages, References: 2 pages, Supplement: .5 page. Main paper: 3 figures, 3 tables, Supplement: 1 table

  46. arXiv:2103.10916  [pdf, other

    cs.LG

    Predicting Drug-Drug Interactions from Heterogeneous Data: An Embedding Approach

    Authors: Devendra Singh Dhami, Siwen Yan, Gautam Kunapuli, David Page, Sriraam Natarajan

    Abstract: Predicting and discovering drug-drug interactions (DDIs) using machine learning has been studied extensively. However, most of the approaches have focused on text data or textual representation of the drug structures. We present the first work that uses multiple data sources such as drug structure images, drug structure string representation and relational representation of drug relationships as t… ▽ More

    Submitted 19 March, 2021; originally announced March 2021.

    Comments: 10 pages, 6 figures, Accepted as a short paper to 'Artificial Intelligence in Medicine 2021'

  47. arXiv:2102.10440  [pdf, other

    cs.LG

    Interventional Sum-Product Networks: Causal Inference with Tractable Probabilistic Models

    Authors: Matej Zečević, Devendra Singh Dhami, Athresh Karanam, Sriraam Natarajan, Kristian Kersting

    Abstract: While probabilistic models are an important tool for studying causality, doing so suffers from the intractability of inference. As a step towards tractable causal models, we consider the problem of learning interventional distributions using sum-product networks (SPNs) that are over-parameterized by gate functions, e.g., neural networks. Providing an arbitrarily intervened causal graph as input, e… ▽ More

    Submitted 25 October, 2021; v1 submitted 20 February, 2021; originally announced February 2021.

    Comments: Main paper: 10 pages, References: 3 pages, Appendix: 8 pages. Main paper: 6 figures, Appendix: 5 figures

  48. arXiv:2102.07007  [pdf, other

    cs.LG

    A Statistical Relational Approach to Learning Distance-based GCNs

    Authors: Devendra Singh Dhami, Siwen Yan, Sriraam Natarajan

    Abstract: We consider the problem of learning distance-based Graph Convolutional Networks (GCNs) for relational data. Specifically, we first embed the original graph into the Euclidean space $\mathbb{R}^m$ using a relational density estimation technique thereby constructing a secondary Euclidean graph. The graph vertices correspond to the target triples and edges denote the Euclidean distances between the t… ▽ More

    Submitted 12 October, 2021; v1 submitted 13 February, 2021; originally announced February 2021.

    Comments: 8 pages, 5 figures, 4 tables; accepted to STARAI workshop

  49. arXiv:2001.00528  [pdf, other

    cs.LG stat.ML

    Non-Parametric Learning of Gaifman Models

    Authors: Devendra Singh Dhami, Siwen Yan, Gautam Kunapuli, Sriraam Natarajan

    Abstract: We consider the problem of structure learning for Gaifman models and learn relational features that can be used to derive feature representations from a knowledge base. These relational features are first-order rules that are then partially grounded and counted over local neighborhoods of a Gaifman model to obtain the feature representations. We propose a method for learning these relational featu… ▽ More

    Submitted 15 January, 2020; v1 submitted 2 January, 2020; originally announced January 2020.

    Comments: 8 pages, 6 figures

  50. arXiv:1911.06356  [pdf, other

    cs.LG stat.ML

    Beyond Textual Data: Predicting Drug-Drug Interactions from Molecular Structure Images using Siamese Neural Networks

    Authors: Devendra Singh Dhami, Siwen Yan, Gautam Kunapuli, David Page, Sriraam Natarajan

    Abstract: Predicting and discovering drug-drug interactions (DDIs) is an important problem and has been studied extensively both from medical and machine learning point of view. Almost all of the machine learning approaches have focused on text data or textual representation of the structural data of drugs. We present the first work that uses drug structure images as the input and utilizes a Siamese convolu… ▽ More

    Submitted 29 June, 2020; v1 submitted 14 November, 2019; originally announced November 2019.

    Comments: 9 pages, 9 figures