-
SBAMDT: Bayesian Additive Decision Trees with Adaptive Soft Semi-multivariate Split Rules
Authors:
Stamatina Lamprinakou,
Huiyan Sang,
Bledar A. Konomi,
Ligang Lu
Abstract:
Bayesian Additive Regression Trees [BART, Chipman et al., 2010] have gained significant popularity due to their remarkable predictive performance and ability to quantify uncertainty. However, standard decision tree models rely on recursive data splits at each decision node, using deterministic decision rules based on a single univariate feature. This approach limits their ability to effectively ca…
▽ More
Bayesian Additive Regression Trees [BART, Chipman et al., 2010] have gained significant popularity due to their remarkable predictive performance and ability to quantify uncertainty. However, standard decision tree models rely on recursive data splits at each decision node, using deterministic decision rules based on a single univariate feature. This approach limits their ability to effectively capture complex decision boundaries, particularly in scenarios involving multiple features, such as spatial domains, or when transitions are either sharp or smoothly varying. In this paper, we introduce a novel probabilistic additive decision tree model that employs a soft split rule. This method enables highly flexible splits that leverage both univariate and multivariate features, while also respecting the geometric properties of the feature domain. Notably, the probabilistic split rule adapts dynamically across decision nodes, allowing the model to account for varying levels of smoothness in the regression function. We demonstrate the utility of the proposed model through comparisons with existing tree-based models on synthetic datasets and a New York City education dataset.
△ Less
Submitted 16 January, 2025;
originally announced January 2025.
-
Age-stratified epidemic model using a latent marked Hawkes process
Authors:
Stamatina Lamprinakou,
Axel Gandy
Abstract:
We extend the unstructured homogeneously mixing epidemic model introduced by Lamprinakou et al. [arXiv:2208.07340] considering a finite population stratified by age bands. We model the actual unobserved infections using a latent marked Hawkes process and the reported aggregated infections as random quantities driven by the underlying Hawkes process. We apply a Kernel Density Particle Filter (KDPF)…
▽ More
We extend the unstructured homogeneously mixing epidemic model introduced by Lamprinakou et al. [arXiv:2208.07340] considering a finite population stratified by age bands. We model the actual unobserved infections using a latent marked Hawkes process and the reported aggregated infections as random quantities driven by the underlying Hawkes process. We apply a Kernel Density Particle Filter (KDPF) to infer the marked counting process, the instantaneous reproduction number for each age group and forecast the epidemic's future trajectory in the near future; considering the age bands and the population size does not increase the computational effort. We demonstrate the performance of the proposed inference algorithm on synthetic data sets and COVID-19 reported cases in various local authorities in the UK. We illustrate that taking into account the individual heterogeneity in age decreases the uncertainty of estimates and provides a real-time measurement of interventions and behavioural changes.
△ Less
Submitted 19 August, 2022;
originally announced August 2022.
-
BART-based inference for Poisson processes
Authors:
Stamatina Lamprinakou,
Mauricio Barahona,
Seth Flaxman,
Sarah Filippi,
Axel Gandy,
Emma McCoy
Abstract:
The effectiveness of Bayesian Additive Regression Trees (BART) has been demonstrated in a variety of contexts including non-parametric regression and classification. A BART scheme for estimating the intensity of inhomogeneous Poisson processes is introduced. Poisson intensity estimation is a vital task in various applications including medical imaging, astrophysics and network traffic analysis. Th…
▽ More
The effectiveness of Bayesian Additive Regression Trees (BART) has been demonstrated in a variety of contexts including non-parametric regression and classification. A BART scheme for estimating the intensity of inhomogeneous Poisson processes is introduced. Poisson intensity estimation is a vital task in various applications including medical imaging, astrophysics and network traffic analysis. The new approach enables full posterior inference of the intensity in a non-parametric regression setting. The performance of the novel scheme is demonstrated through simulation studies on synthetic and real datasets up to five dimensions, and the new scheme is compared with alternative approaches.
△ Less
Submitted 12 November, 2022; v1 submitted 16 May, 2020;
originally announced May 2020.