Showing 1–27 of 27 results for author: Peharz, R

  1. arXiv:2501.13748  [pdf, other]

    cs.LG cs.CR

    Exact Soft Analytical Side-Channel Attacks using Tractable Circuits

    Authors: Thomas Wedenig, Rishub Nagpal, Gaëtan Cassiers, Stefan Mangard, Robert Peharz

    Abstract: Detecting weaknesses in cryptographic algorithms is of utmost importance for designing secure information systems. The state-of-the-art soft analytical side-channel attack (SASCA) uses physical leakage information to make probabilistic predictions about intermediate computations and combines these "guesses" with the known algorithmic logic to compute the posterior distribution over the key. This a…

    Submitted 23 January, 2025; originally announced January 2025.

    Comments: ICML 2024 Conference Paper

  2. arXiv:2409.07953  [pdf, other]

    cs.LG

    What is the Relationship between Tensor Factorizations and Circuits (and How Can We Exploit it)?

    Authors: Lorenzo Loconte, Antonio Mari, Gennaro Gala, Robert Peharz, Cassio de Campos, Erik Quaeghebeur, Gennaro Vessio, Antonio Vergari

    Abstract: This paper establishes a rigorous connection between circuit representations and tensor factorizations, two seemingly distinct yet fundamentally related areas. By connecting these fields, we highlight a series of opportunities that can benefit both communities. Our work generalizes popular tensor factorizations within the circuit language, and unifies various circuit learning algorithms under a si…

    Submitted 3 February, 2025; v1 submitted 12 September, 2024; originally announced September 2024.

    Journal ref: Transactions on Machine Learning Research, 2025

  3. arXiv:2406.02177  [pdf, other]

    cs.LG cs.AI cs.DC stat.ML

    One-Shot Federated Learning with Bayesian Pseudocoresets

    Authors: Tim d'Hondt, Mykola Pechenizkiy, Robert Peharz

    Abstract: Optimization-based techniques for federated learning (FL) often come with prohibitive communication cost, as high dimensional model parameters need to be communicated repeatedly between server and clients. In this paper, we follow a Bayesian approach allowing to perform FL with one-shot communication, by solving the global inference problem as a product of local client posteriors. For models with…

    Submitted 4 June, 2024; originally announced June 2024.

    Comments: 10 pages

  4. arXiv:2402.14781  [pdf, other]

    cs.LG cs.AI stat.ME stat.ML

    Effective Bayesian Causal Inference via Structural Marginalisation and Autoregressive Orders

    Authors: Christian Toth, Christian Knoll, Franz Pernkopf, Robert Peharz

    Abstract: The traditional two-stage approach to causal inference first identifies a single causal model (or equivalence class of models), which is then used to answer causal queries. However, this neglects any epistemic model uncertainty. In contrast, Bayesian causal inference does incorporate epistemic uncertainty into query estimates via Bayesian marginalisation (posterior averaging) over all causal model…

    Submitted 23 April, 2025; v1 submitted 22 February, 2024; originally announced February 2024.

    Comments: 9 pages + references + appendices (37 pages total)

  5. arXiv:2310.16986  [pdf, other]

    cs.LG

    Probabilistic Integral Circuits

    Authors: Gennaro Gala, Cassio de Campos, Robert Peharz, Antonio Vergari, Erik Quaeghebeur

    Abstract: Continuous latent variables (LVs) are a key ingredient of many generative models, as they allow modelling expressive mixtures with an uncountable number of components. In contrast, probabilistic circuits (PCs) are hierarchical discrete mixtures represented as computational graphs composed of input, sum and product units. Unlike continuous LV models, PCs provide tractable inference but are limited…

    Submitted 25 October, 2023; originally announced October 2023.

  6. arXiv:2305.15944  [pdf, other]

    cs.LG cs.AI

    How to Turn Your Knowledge Graph Embeddings into Generative Models

    Authors: Lorenzo Loconte, Nicola Di Mauro, Robert Peharz, Antonio Vergari

    Abstract: Some of the most successful knowledge graph embedding (KGE) models for link prediction -- CP, RESCAL, TuckER, ComplEx -- can be interpreted as energy-based models. Under this perspective they are not amenable for exact maximum-likelihood estimation (MLE), sampling and struggle to integrate logical constraints. This work re-interprets the score functions of these KGEs as circuits -- constrained com…

    Submitted 16 January, 2024; v1 submitted 25 May, 2023; originally announced May 2023.

  7. arXiv:2302.12130  [pdf, other]

    cs.LG cs.AI stat.ML

    Bayesian Structure Scores for Probabilistic Circuits

    Authors: Yang Yang, Gennaro Gala, Robert Peharz

    Abstract: Probabilistic circuits (PCs) are a prominent representation of probability distributions with tractable inference. While parameter learning in PCs is rigorously studied, structure learning is often more based on heuristics than on principled objectives. In this paper, we develop Bayesian structure scores for deterministic PCs, i.e., the structure likelihood with parameters marginalized out, which…

    Submitted 23 February, 2023; originally announced February 2023.

  8. arXiv:2209.10584  [pdf, other]

    cs.LG cs.AI stat.ML

    Continuous Mixtures of Tractable Probabilistic Models

    Authors: Alvaro H. C. Correia, Gennaro Gala, Erik Quaeghebeur, Cassio de Campos, Robert Peharz

    Abstract: Probabilistic models based on continuous latent spaces, such as variational autoencoders, can be understood as uncountable mixture models where components depend continuously on the latent code. They have proven to be expressive tools for generative and probabilistic modelling, but are at odds with tractable probabilistic inference, that is, computing marginals and conditionals of the represented…

    Submitted 24 March, 2023; v1 submitted 21 September, 2022; originally announced September 2022.

  9. arXiv:2206.02063  [pdf, other]

    cs.LG cs.AI stat.ME stat.ML

    Active Bayesian Causal Inference

    Authors: Christian Toth, Lars Lorch, Christian Knoll, Andreas Krause, Franz Pernkopf, Robert Peharz, Julius von Kügelgen

    Abstract: Causal discovery and causal reasoning are classically treated as separate and consecutive tasks: one first infers the causal graph, and then uses it to estimate causal effects of interventions. However, such a two-stage approach is uneconomical, especially in terms of actively collected interventional data, since the causal query of interest may not require a fully-specified causal model. From a B…

    Submitted 15 October, 2022; v1 submitted 4 June, 2022; originally announced June 2022.

    Comments: NeurIPS 2022 camera-ready version. RP & JvK are shared last authors. 10 pages + Bibliography + Appendix (34 pages total)

  10. arXiv:2007.05721  [pdf, other]

    stat.ML cs.LG

    Towards Robust Classification with Deep Generative Forests

    Authors: Alvaro H. C. Correia, Robert Peharz, Cassio de Campos

    Abstract: Decision Trees and Random Forests are among the most widely used machine learning models, and often achieve state-of-the-art performance in tabular, domain-agnostic datasets. Nonetheless, being primarily discriminative models they lack principled methods to manipulate the uncertainty of predictions. In this paper, we exploit Generative Forests (GeFs), a recent class of deep probabilistic models th…

    Submitted 11 July, 2020; originally announced July 2020.

    Comments: Presented at the ICML 2020 Workshop on Uncertainty and Robustness in Deep Learning

  11. arXiv:2006.14937  [pdf, other]

    cs.LG cs.AI stat.ML

    Joints in Random Forests

    Authors: Alvaro H. C. Correia, Robert Peharz, Cassio de Campos

    Abstract: Decision Trees (DTs) and Random Forests (RFs) are powerful discriminative learners and tools of central importance to the everyday machine learning practitioner and data scientist. Due to their discriminative nature, however, they lack principled methods to process inputs with missing features or to detect outliers, which requires pairing them with imputation techniques or a separate generative mo…

    Submitted 19 November, 2020; v1 submitted 25 June, 2020; originally announced June 2020.

    Journal ref: Advances in Neural Information Processing Systems 33 (2020)

  12. arXiv:2004.06231  [pdf, other]

    cs.LG stat.ML

    Einsum Networks: Fast and Scalable Learning of Tractable Probabilistic Circuits

    Authors: Robert Peharz, Steven Lang, Antonio Vergari, Karl Stelzner, Alejandro Molina, Martin Trapp, Guy Van den Broeck, Kristian Kersting, Zoubin Ghahramani

    Abstract: Probabilistic circuits (PCs) are a promising avenue for probabilistic modeling, as they permit a wide range of exact and efficient inference routines. Recent "deep-learning-style" implementations of PCs strive for a better scalability, but are still difficult to train on real-world data, due to their sparsely connected computational graphs. In this paper, we propose Einsum Networks (EiNets), a n…

    Submitted 13 April, 2020; originally announced April 2020.

  13. arXiv:2001.03048  [pdf, other]

    stat.ML cs.LG

    Resource-Efficient Neural Networks for Embedded Systems

    Authors: Wolfgang Roth, Günther Schindler, Bernhard Klein, Robert Peharz, Sebastian Tschiatschek, Holger Fröning, Franz Pernkopf, Zoubin Ghahramani

    Abstract: While machine learning is traditionally a resource intensive task, embedded systems, autonomous navigation, and the vision of the Internet of Things fuel the interest in resource-efficient approaches. These approaches aim for a carefully chosen trade-off between performance and resource consumption in terms of computation and energy. The development of such approaches is among the major challenges…

    Submitted 7 April, 2024; v1 submitted 7 January, 2020; originally announced January 2020.

    Comments: arXiv admin note: text overlap with arXiv:1812.02240; accepted at JMLR

  14. arXiv:1912.10092  [pdf, other]

    cs.AI cs.LG

    Sum-Product Network Decompilation

    Authors: Cory J. Butz, Jhonatan S. Oliveira, Robert Peharz

    Abstract: There exists a dichotomy between classical probabilistic graphical models, such as Bayesian networks (BNs), and modern tractable models, such as sum-product networks (SPNs). The former generally have intractable inference, but provide a high level of interpretability, while the latter admits a wide range of tractable inference routines, but are typically harder to interpret. Due to this dichotomy,…

    Submitted 19 May, 2020; v1 submitted 20 December, 2019; originally announced December 2019.

  15. arXiv:1910.04536  [pdf, other]

    cs.LG stat.ML

    Deep Structured Mixtures of Gaussian Processes

    Authors: Martin Trapp, Robert Peharz, Franz Pernkopf, Carl E. Rasmussen

    Abstract: Gaussian Processes (GPs) are powerful non-parametric Bayesian regression models that allow exact posterior inference, but exhibit high computational and memory costs. In order to improve scalability of GPs, approximate posterior inference is frequently employed, where a prominent class of approximation techniques is based on local GP experts. However, local-expert techniques proposed so far are ei…

    Submitted 26 April, 2020; v1 submitted 10 October, 2019; originally announced October 2019.

    Comments: AISTATS 2020

  16. arXiv:1905.10884  [pdf, other]

    cs.LG stat.ML

    Bayesian Learning of Sum-Product Networks

    Authors: Martin Trapp, Robert Peharz, Hong Ge, Franz Pernkopf, Zoubin Ghahramani

    Abstract: Sum-product networks (SPNs) are flexible density estimators and have received significant attention due to their attractive inference properties. While parameter learning in SPNs is well developed, structure learning leaves something to be desired: Even though there is a plethora of SPN structure learners, most of them are somewhat ad-hoc and based on intuition rather than a clear learning princip…

    Submitted 4 November, 2019; v1 submitted 26 May, 2019; originally announced May 2019.

    Comments: NeurIPS 2019; See conference page for supplement

  17. arXiv:1905.08550  [pdf, other]

    cs.LG stat.ML

    Conditional Sum-Product Networks: Imposing Structure on Deep Probabilistic Architectures

    Authors: Xiaoting Shao, Alejandro Molina, Antonio Vergari, Karl Stelzner, Robert Peharz, Thomas Liebig, Kristian Kersting

    Abstract: Probabilistic graphical models are a central tool in AI; however, they are generally not as expressive as deep neural models, and inference is notoriously hard and slow. In contrast, deep probabilistic models such as sum-product networks (SPNs) capture joint distributions in a tractable fashion, but still lack the expressive power of intractable models based on deep neural networks. Therefore, we…

    Submitted 29 September, 2019; v1 submitted 21 May, 2019; originally announced May 2019.

    Comments: 13 pages, 6 figures

  18. arXiv:1905.08196  [pdf, other]

    cs.LG stat.ML

    Optimisation of Overparametrized Sum-Product Networks

    Authors: Martin Trapp, Robert Peharz, Franz Pernkopf

    Abstract: It seems to be a pearl of conventional wisdom that parameter learning in deep sum-product networks is surprisingly fast compared to shallow mixture models. This paper examines the effects of overparameterization in sum-product networks on the speed of parameter optimisation. Using theoretical analysis and empirical experiments, we show that deep sum-product networks exhibit an implicit acceleratio…

    Submitted 29 May, 2019; v1 submitted 20 May, 2019; originally announced May 2019.

    Comments: Workshop on Tractable Probabilistic Models (TPM) at ICML 2019

  19. arXiv:1901.03704  [pdf, other]

    cs.LG stat.ML

    SPFlow: An Easy and Extensible Library for Deep Probabilistic Learning using Sum-Product Networks

    Authors: Alejandro Molina, Antonio Vergari, Karl Stelzner, Robert Peharz, Pranav Subramani, Nicola Di Mauro, Pascal Poupart, Kristian Kersting

    Abstract: We introduce SPFlow, an open-source Python library providing a simple interface to inference, learning and manipulation routines for deep and tractable probabilistic models called Sum-Product Networks (SPNs). The library allows one to quickly create SPNs both from data and through a domain specific language (DSL). It efficiently implements several probabilistic inference routines like computing ma…

    Submitted 11 January, 2019; originally announced January 2019.

    Comments: 4 pages, 1 figure, code

  20. arXiv:1812.02240  [pdf, other]

    cs.LG stat.ML

    Efficient and Robust Machine Learning for Real-World Systems

    Authors: Franz Pernkopf, Wolfgang Roth, Matthias Zoehrer, Lukas Pfeifenberger, Guenther Schindler, Holger Froening, Sebastian Tschiatschek, Robert Peharz, Matthew Mattina, Zoubin Ghahramani

    Abstract: While machine learning is traditionally a resource intensive task, embedded systems, autonomous navigation and the vision of the Internet-of-Things fuel the interest in resource efficient approaches. These approaches require a carefully chosen trade-off between performance and resource consumption in terms of computation and energy. On top of this, it is crucial to treat uncertainty in a consisten…

    Submitted 5 December, 2018; originally announced December 2018.

  21. arXiv:1810.00440  [pdf, other]

    stat.ML cs.LG

    Minimal Random Code Learning: Getting Bits Back from Compressed Model Parameters

    Authors: Marton Havasi, Robert Peharz, José Miguel Hernández-Lobato

    Abstract: While deep neural networks are a highly successful model class, their large memory footprint puts considerable strain on energy consumption, communication bandwidth, and storage requirements. Consequently, model size reduction has become an utmost goal in deep learning. A typical approach is to train a set of deterministic weights, while applying certain techniques such as pruning and quantization…

    Submitted 30 September, 2018; originally announced October 2018.

    Comments: Under review as a conference paper at ICLR 2019

  22. arXiv:1809.04400  [pdf, other]

    cs.LG stat.ML

    Learning Deep Mixtures of Gaussian Process Experts Using Sum-Product Networks

    Authors: Martin Trapp, Robert Peharz, Carl E. Rasmussen, Franz Pernkopf

    Abstract: While Gaussian processes (GPs) are the method of choice for regression tasks, they also come with practical difficulties, as inference cost scales cubic in time and quadratic in memory. In this paper, we introduce a natural and expressive way to tackle these problems, by incorporating GPs in sum-product networks (SPNs), a recently proposed tractable probabilistic model allowing exact and efficient…

    Submitted 12 September, 2018; originally announced September 2018.

    Comments: Presented at the Workshop on Tractable Probabilistic Models (TPM 2018), ICML 2018

  23. arXiv:1807.09306  [pdf, other]

    stat.ML cs.LG

    Automatic Bayesian Density Analysis

    Authors: Antonio Vergari, Alejandro Molina, Robert Peharz, Zoubin Ghahramani, Kristian Kersting, Isabel Valera

    Abstract: Making sense of a dataset in an automatic and unsupervised fashion is a challenging problem in statistics and AI. Classical approaches for exploratory data analysis are usually not flexible enough to deal with the uncertainty inherent to real-world data: they are often restricted to fixed latent interaction models and homogeneous likelihoods; they are sensitive to missing, corrupt and anomalous…

    Submitted 10 February, 2019; v1 submitted 24 July, 2018; originally announced July 2018.

    Comments: In proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence (AAAI-19)

  24. arXiv:1806.01910  [pdf, other]

    cs.LG cs.AI stat.ML

    Probabilistic Deep Learning using Random Sum-Product Networks

    Authors: Robert Peharz, Antonio Vergari, Karl Stelzner, Alejandro Molina, Martin Trapp, Kristian Kersting, Zoubin Ghahramani

    Abstract: The need for consistent treatment of uncertainty has recently triggered increased interest in probabilistic deep learning methods. However, most current approaches have severe limitations when it comes to inference, since many of these models do not even permit to evaluate exact data likelihoods. Sum-product networks (SPNs), on the other hand, are an excellent architecture in that regard, as they…

    Submitted 22 June, 2018; v1 submitted 5 June, 2018; originally announced June 2018.

  25. arXiv:1710.03444  [pdf, other]

    stat.ML cs.LG

    Safe Semi-Supervised Learning of Sum-Product Networks

    Authors: Martin Trapp, Tamas Madl, Robert Peharz, Franz Pernkopf, Robert Trappl

    Abstract: In several domains obtaining class annotations is expensive while at the same time unlabelled data are abundant. While most semi-supervised approaches enforce restrictive assumptions on the data distribution, recent work has managed to learn semi-supervised models in a non-restrictive regime. However, so far such approaches have only been proposed for linear models. In this work, we introduce semi…

    Submitted 10 October, 2017; originally announced October 2017.

    Comments: Conference on Uncertainty in Artificial Intelligence (UAI), 2017

  26. arXiv:1601.06180  [pdf, ps, other]

    cs.AI cs.LG

    On the Latent Variable Interpretation in Sum-Product Networks

    Authors: Robert Peharz, Robert Gens, Franz Pernkopf, Pedro Domingos

    Abstract: One of the central themes in Sum-Product networks (SPNs) is the interpretation of sum nodes as marginalized latent variables (LVs). This interpretation yields an increased syntactic or semantic structure, allows the application of the EM algorithm and to efficiently perform MPE inference. In literature, the LV interpretation was justified by explicitly introducing the indicator variables correspon…

    Submitted 28 October, 2016; v1 submitted 22 January, 2016; originally announced January 2016.

    Comments: Revised version, accepted for publication in IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI). Shortened and revised Section 4, thanks to our reviewers, who pointed out that Theorem 2 holds for selective SPNs. Added a paragraph in Section 2.1 relating the sizes of original and augmented SPNs. Fixed typos, rephrased sentences, revised references

    MSC Class: 62

  27. arXiv:1206.6431  [pdf]

    cs.LG stat.ML

    Exact Maximum Margin Structure Learning of Bayesian Networks

    Authors: Robert Peharz, Franz Pernkopf

    Abstract: Recently, there has been much interest in finding globally optimal Bayesian network structures. These techniques were developed for generative scores and can not be directly extended to discriminative scores, as desired for classification. In this paper, we propose an exact method for finding network structures maximizing the probabilistic soft margin, a successfully applied discriminative score.…

    Submitted 27 June, 2012; originally announced June 2012.

    Comments: ICML