Skip to main content

Showing 1–18 of 18 results for author: Zaffalon, M

Searching in archive stat. Search in all archives.
.
  1. arXiv:2502.05620  [pdf, other

    stat.ML cs.LG

    dynoGP: Deep Gaussian Processes for dynamic system identification

    Authors: Alessio Benavoli, Dario Piga, Marco Forgione, Marco Zaffalon

    Abstract: In this work, we present a novel approach to system identification for dynamical systems, based on a specific class of Deep Gaussian Processes (Deep GPs). These models are constructed by interconnecting linear dynamic GPs (equivalent to stochastic linear time-invariant dynamical systems) and static GPs (to model static nonlinearities). Our approach combines the strengths of data-driven methods, su… ▽ More

    Submitted 8 February, 2025; originally announced February 2025.

  2. arXiv:2402.17087  [pdf, ps, other

    stat.ML cs.AI cs.LG

    A Note on Bayesian Networks with Latent Root Variables

    Authors: Marco Zaffalon, Alessandro Antonucci

    Abstract: We characterise the likelihood function computed from a Bayesian network with latent variables as root nodes. We show that the marginal distribution over the remaining, manifest, variables also factorises as a Bayesian network, which we call empirical. A dataset of observations of the manifest variables allows us to quantify the parameters of the empirical Bayesian net. We prove that (i) the likel… ▽ More

    Submitted 26 February, 2024; originally announced February 2024.

  3. arXiv:2307.16577  [pdf, ps, other

    stat.ME cs.AI

    Approximating Counterfactual Bounds while Fusing Observational, Biased and Randomised Data Sources

    Authors: Marco Zaffalon, Alessandro Antonucci, Rafael Cabañas, David Huber

    Abstract: We address the problem of integrating data from multiple, possibly biased, observational and interventional studies, to eventually compute counterfactuals in structural causal models. We start from the case of a single observational dataset affected by a selection bias. We show that the likelihood of the available data has no local maxima. This enables us to use the causal expectation-maximisation… ▽ More

    Submitted 31 July, 2023; originally announced July 2023.

  4. arXiv:2212.02932  [pdf, ps, other

    cs.AI stat.ME

    Learning to Bound Counterfactual Inference from Observational, Biased and Randomised Data

    Authors: Marco Zaffalon, Alessandro Antonucci, David Huber, Rafael Cabañas

    Abstract: We address the problem of integrating data from multiple, possibly biased, observational and interventional studies, to eventually compute counterfactuals in structural causal models. We start from the case of a single observational dataset affected by a selection bias. We show that the likelihood of the available data has no local maxima. This enables us to use the causal expectation-maximisation… ▽ More

    Submitted 16 March, 2023; v1 submitted 6 December, 2022; originally announced December 2022.

  5. arXiv:2208.01417  [pdf, ps, other

    stat.ML cs.AI stat.ME

    Bounding Counterfactuals under Selection Bias

    Authors: Marco Zaffalon, Alessandro Antonucci, Rafael Cabañas, David Huber, Dario Azzimonti

    Abstract: Causal analysis may be affected by selection bias, which is defined as the systematic exclusion of data from a certain subpopulation. Previous work in this area focused on the derivation of identifiability conditions. We propose instead a first algorithm to address both identifiable and unidentifiable queries. We prove that, in spite of the missingness induced by the selection bias, the likelihood… ▽ More

    Submitted 26 July, 2022; originally announced August 2022.

    Comments: Eleventh International Conference on Probabilistic Graphical Models (PGM 2022)

  6. arXiv:2112.09519  [pdf, other

    stat.ML cs.LG

    Correlated Product of Experts for Sparse Gaussian Process Regression

    Authors: Manuel Schürch, Dario Azzimonti, Alessio Benavoli, Marco Zaffalon

    Abstract: Gaussian processes (GPs) are an important tool in machine learning and statistics with applications ranging from social and natural science through engineering. They constitute a powerful kernelized non-parametric method with well-calibrated uncertainty estimates, however, off-the-shelf GP inference procedures are limited to datasets with several thousand data points because of their cubic computa… ▽ More

    Submitted 17 December, 2021; originally announced December 2021.

  7. Time series forecasting with Gaussian Processes needs priors

    Authors: Giorgio Corani, Alessio Benavoli, Marco Zaffalon

    Abstract: Automatic forecasting is the task of receiving a time series and returning a forecast for the next time steps without any human intervention. Gaussian Processes (GPs) are a powerful tool for modeling time series, but so far there are no competitive approaches for automatic forecasting based on GPs. We propose practical solutions to two problems: automatic selection of the optimal kernel and reliab… ▽ More

    Submitted 21 June, 2021; v1 submitted 17 September, 2020; originally announced September 2020.

  8. arXiv:2007.06363  [pdf, other

    stat.ML cs.LG

    Orthogonally Decoupled Variational Fourier Features

    Authors: Dario Azzimonti, Manuel Schürch, Alessio Benavoli, Marco Zaffalon

    Abstract: Sparse inducing points have long been a standard method to fit Gaussian processes to big data. In the last few years, spectral methods that exploit approximations of the covariance kernel have shown to be competitive. In this work we exploit a recently introduced orthogonally decoupled variational basis to combine spectral methods and sparse inducing points methods. We show that the method is comp… ▽ More

    Submitted 13 July, 2020; originally announced July 2020.

  9. Reconciling Hierarchical Forecasts via Bayes' Rule

    Authors: Giorgio Corani, Dario Azzimonti, João P. S. C. Augusto, Marco Zaffalon

    Abstract: We present a novel approach for reconciling hierarchical forecasts, based on Bayes rule. We define a prior distribution for the bottom time series of the hierarchy, based on the bottom base forecasts. Then we update their distribution via Bayes rule, based on the base forecasts for the upper time series. Under the Gaussian assumption, we derive the updating in closed-form. We derive two algorithms… ▽ More

    Submitted 22 June, 2020; v1 submitted 7 June, 2019; originally announced June 2019.

    Journal ref: ECML PKDD 2020: Proc. Machine Learning and Knowledge Discovery in Databases, 211 - 226

  10. Recursive Estimation for Sparse Gaussian Process Regression

    Authors: Manuel Schürch, Dario Azzimonti, Alessio Benavoli, Marco Zaffalon

    Abstract: Gaussian Processes (GPs) are powerful kernelized methods for non-parameteric regression used in many applications. However, their use is limited to a few thousand of training samples due to their cubic time complexity. In order to scale GPs to larger datasets, several sparse approximations based on so-called inducing points have been proposed in the literature. In this work we investigate the conn… ▽ More

    Submitted 22 June, 2020; v1 submitted 28 May, 2019; originally announced May 2019.

  11. Hierarchical Multinomial-Dirichlet model for the estimation of conditional probability tables

    Authors: L. Azzimonti, G. Corani, M. Zaffalon

    Abstract: We present a novel approach for estimating conditional probability tables, based on a joint, rather than independent, estimate of the conditional distributions belonging to the same table. We derive exact analytical expressions for the estimators and we analyse their properties both analytically and via simulation. We then apply this method to the estimation of parameters in a Bayesian network. Gi… ▽ More

    Submitted 23 August, 2017; originally announced August 2017.

  12. arXiv:1707.06194  [pdf, ps, other

    cs.AI stat.ML

    Entropy-based Pruning for Learning Bayesian Networks using BIC

    Authors: Cassio P. de Campos, Mauro Scanagatta, Giorgio Corani, Marco Zaffalon

    Abstract: For decomposable score-based structure learning of Bayesian networks, existing approaches first compute a collection of candidate parent sets for each variable and then optimize over this collection by choosing one parent set for each variable without creating directed cycles while maximizing the total score. We target the task of constructing the collection of candidate parent sets when the score… ▽ More

    Submitted 19 July, 2017; originally announced July 2017.

  13. arXiv:1609.08905  [pdf, other

    cs.LG stat.ME stat.ML

    Statistical comparison of classifiers through Bayesian hierarchical modelling

    Authors: Giorgio Corani, Alessio Benavoli, Janez Demšar, Francesca Mangili, Marco Zaffalon

    Abstract: Usually one compares the accuracy of two competing classifiers via null hypothesis significance tests (nhst). Yet the nhst tests suffer from important shortcomings, which can be overcome by switching to Bayesian hypothesis testing. We propose a Bayesian hierarchical model which jointly analyzes the cross-validation results obtained by two classifiers on multiple data sets. It returns the posterior… ▽ More

    Submitted 22 November, 2016; v1 submitted 28 September, 2016; originally announced September 2016.

  14. arXiv:1606.04316  [pdf, other

    stat.ML cs.LG

    Time for a change: a tutorial for comparing multiple classifiers through Bayesian analysis

    Authors: Alessio Benavoli, Giorgio Corani, Janez Demsar, Marco Zaffalon

    Abstract: The machine learning community adopted the use of null hypothesis significance testing (NHST) in order to ensure the statistical validity of results. Many scientific fields however realized the shortcomings of frequentist reasoning and in the most radical cases even banned its use in publications. We should do the same: just as we have embraced the Bayesian paradigm in the development of new machi… ▽ More

    Submitted 15 July, 2017; v1 submitted 14 June, 2016; originally announced June 2016.

    Comments: This paper has been published in the Journal of Machine Learning Research (JMLR) vol.18, 2017

  15. arXiv:1601.01544  [pdf, other

    cs.LG stat.ML

    State Space representation of non-stationary Gaussian Processes

    Authors: Alessio Benavoli, Marco Zaffalon

    Abstract: The state space (SS) representation of Gaussian processes (GP) has recently gained a lot of interest. The main reason is that it allows to compute GPs based inferences in O(n), where $n$ is the number of observations. This implementation makes GPs suitable for Big Data. For this reason, it is important to provide a SS representation of the most important kernels used in machine learning. The aim o… ▽ More

    Submitted 7 January, 2016; originally announced January 2016.

  16. arXiv:1402.2755  [pdf, ps, other

    math.ST stat.ME

    Imprecise Dirichlet Process with application to the hypothesis test on the probability that X< Y

    Authors: Alessio Benavoli, Francesca Mangili, Fabrizio Ruggeri, Marco Zaffalon

    Abstract: The Dirichlet process (DP) is one of the most popular Bayesian nonparametric models. An open problem with the DP is how to choose its infinite dimensional parameter (base measure) in case of lack of prior information. In this work we present the Imprecise DP (IDP) -- a prior near-ignorance DP-based model that does not require any choice of this probability measure. It consists of a class of DPs ob… ▽ More

    Submitted 20 February, 2014; v1 submitted 12 February, 2014; originally announced February 2014.

  17. arXiv:1109.1754  [pdf, ps, other

    cs.AI cs.CC stat.ML

    Solving Limited Memory Influence Diagrams

    Authors: Denis Deratani Mauá, Cassio Polpo de Campos, Marco Zaffalon

    Abstract: We present a new algorithm for exactly solving decision making problems represented as influence diagrams. We do not require the usual assumptions of no forgetting and regularity; this allows us to solve problems with simultaneous decisions and limited information. The algorithm is empirically shown to outperform a state-of-the-art algorithm on randomly generated problems of up to 150 variables an… ▽ More

    Submitted 9 September, 2011; v1 submitted 8 September, 2011; originally announced September 2011.

    Comments: 43 pages, 8 figures

    MSC Class: 68T37 ACM Class: I.2.1; I.2.8; F.2

  18. arXiv:1008.2514  [pdf, ps, other

    cs.AI math.PR stat.ML

    Epistemic irrelevance in credal nets: the case of imprecise Markov trees

    Authors: Gert de Cooman, Filip Hermans, Alessandro Antonucci, Marco Zaffalon

    Abstract: We focus on credal nets, which are graphical models that generalise Bayesian nets to imprecise probability. We replace the notion of strong independence commonly used in credal nets with the weaker notion of epistemic irrelevance, which is arguably more suited for a behavioural theory of probability. Focusing on directed trees, we show how to combine the given local uncertainty models in the nodes… ▽ More

    Submitted 15 August, 2010; originally announced August 2010.

    Comments: 29 pages, 5 figures, 1 table