
Showing 1–20 of 20 results for author: Parnell, A C

  1. arXiv:2501.13879  [pdf, other]

    stat.ME math.ST

    Finite mixture representations of zero-&-$N$-inflated distributions for count-compositional data

    Authors: André F. B. Menezes, Andrew C. Parnell, Keefe Murphy

    Abstract: We provide novel probabilistic portrayals of two multivariate models designed to handle zero-inflation in count-compositional data. We develop a new unifying framework that represents both as finite mixture distributions. One of these distributions, based on Dirichlet-multinomial components, has been studied before, but has not yet been properly characterised as a sampling distribution of the coun…

    Submitted 23 January, 2025; originally announced January 2025.

  2. arXiv:2412.14946  [pdf, other]

    stat.ME stat.AP stat.ML

    Joint Models for Handling Non-Ignorable Missing Data using Bayesian Additive Regression Trees: Application to Leaf Photosynthetic Traits Data

    Authors: Yong Chen Goh, Wuu Kuang Soh, Andrew C. Parnell, Keefe Murphy

    Abstract: Dealing with missing data poses significant challenges in predictive analysis, often leading to biased conclusions when oversimplified assumptions about the missing data process are made. In cases where the data are missing not at random (MNAR), jointly modeling the data and missing data indicators is essential. Motivated by a real data application with partially missing multivariate outcomes rela…

    Submitted 19 December, 2024; originally announced December 2024.

  3. arXiv:2408.17230  [pdf, other]

    stat.AP stat.ME

    cosimmr: an R package for fast fitting of Stable Isotope Mixing Models with covariates

    Authors: Emma Govan, Andrew L Jackson, Stuart Bearhop, Richard Inger, Brian C Stock, Brice X Semmens, Eric J Ward, Andrew C Parnell

    Abstract: The study of animal diets and the proportional contribution that different foods make to their diets is an important task in ecology. Stable Isotope Mixing Models (SIMMs) are an important tool for studying an animal's diet and understanding how the animal interacts with its environment. We present cosimmr, a new R package designed to include covariates when estimating diet proportions in SIMMs, wi…

    Submitted 30 August, 2024; originally announced August 2024.

  4. arXiv:2404.02228  [pdf, other]

    stat.ME econ.EM stat.AP

    Seemingly unrelated Bayesian additive regression trees for cost-effectiveness analyses in healthcare

    Authors: Jonas Esser, Mateus Maia, Andrew C. Parnell, Judith Bosmans, Hanneke van Dongen, Thomas Klausch, Keefe Murphy

    Abstract: In recent years, theoretical results and simulation evidence have shown Bayesian additive regression trees to be a highly-effective method for nonparametric regression. Motivated by cost-effectiveness analyses in health economics, where interest lies in jointly modelling the costs of healthcare treatments and the associated health-related quality of life experienced by a patient, we propose a mult…

    Submitted 26 February, 2025; v1 submitted 2 April, 2024; originally announced April 2024.

  5. arXiv:2306.07817  [pdf, other]

    stat.AP

    simmr: A package for fitting Stable Isotope Mixing Models in R

    Authors: Emma Govan, Andrew L. Jackson, Richard Inger, Stuart Bearhop, Andrew C. Parnell

    Abstract: We introduce an R package for fitting Stable Isotope Mixing Models (SIMMs) via both Markov chain Monte Carlo and Variational Bayes. The package is mainly used for estimating dietary contributions from food sources taken via measurements of stable isotope ratios from animals. It can also be used to estimate proportional contributions of a mixture from known sources, for example apportionment of riv…

    Submitted 13 June, 2023; originally announced June 2023.

    Comments: 27 pages, 9 figures

  6. arXiv:2301.03655  [pdf, other]

    stat.ML cs.LG

    Bayesian Additive Main Effects and Multiplicative Interaction Models using Tensor Regression for Multi-environmental Trials

    Authors: Antonia A. L. Dos Santos, Danilo A. Sarti, Rafael A. Moral, Andrew C. Parnell

    Abstract: We propose a Bayesian tensor regression model to accommodate the effect of multiple factors on phenotype prediction. We adopt a set of prior distributions that resolve identifiability issues that may arise between the parameters in the model. Simulation experiments show that our method out-performs previous related models and machine learning algorithms under different sample sizes and degrees of…

    Submitted 9 January, 2023; originally announced January 2023.

  7. arXiv:2207.00011  [pdf, other]

    stat.ML cs.LG stat.ME

    Variational Inference for Additive Main and Multiplicative Interaction Effects Models

    Authors: Antônia A. L. Dos Santos, Rafael A. Moral, Danilo A. Sarti, Andrew C. Parnell

    Abstract: In plant breeding the presence of a genotype by environment (GxE) interaction has a strong impact on cultivation decision making and the introduction of new crop cultivars. The combination of linear and bilinear terms has been shown to be very useful in modelling this type of data. A widely-used approach to identify GxE is the Additive Main Effects and Multiplicative Interaction Effects (AMMI) mod…

    Submitted 29 June, 2022; originally announced July 2022.

  8. arXiv:2204.02112  [pdf, other]

    stat.ME cs.LG stat.ML

    GP-BART: a novel Bayesian additive regression trees approach using Gaussian processes

    Authors: Mateus Maia, Keefe Murphy, Andrew C. Parnell

    Abstract: The Bayesian additive regression trees (BART) model is an ensemble method extensively and successfully used in regression tasks due to its consistently strong predictive performance and its ability to quantify uncertainty. BART combines "weak" tree models through a set of shrinkage priors, whereby each tree explains a small portion of the variability in the data. However, the lack of smoothness an…

    Submitted 14 September, 2023; v1 submitted 5 April, 2022; originally announced April 2022.

  9. arXiv:2108.07636  [pdf, other]

    stat.ML cs.LG

    Accounting for shared covariates in semi-parametric Bayesian additive regression trees

    Authors: Estevão B. Prado, Andrew C. Parnell, Keefe Murphy, Nathan McJames, Ann O'Shea, Rafael A. Moral

    Abstract: We propose some extensions to semi-parametric models based on Bayesian additive regression trees (BART). In the semi-parametric BART paradigm, the response variable is approximated by a linear predictor and a BART model, where the linear component is responsible for estimating the main effects and BART accounts for non-specified interactions and non-linearities. Previous semi-parametric models bas…

    Submitted 30 July, 2024; v1 submitted 17 August, 2021; originally announced August 2021.

    Comments: 48 pages, 8 tables, 10 figures

  10. arXiv:2007.04177  [pdf, other]

    stat.ME

    Modelling excess zeros in count data: A new perspective on modelling approaches

    Authors: John Haslett, Andrew C. Parnell, John Hinde, Rafael A. Moral

    Abstract: We consider the analysis of count data in which the observed frequency of zero counts is unusually large, typically with respect to the Poisson distribution. We focus on two alternative modelling approaches: Over-Dispersion (OD) models, and Zero-Inflation (ZI) models, both of which can be seen as generalisations of the Poisson distribution; we refer to these as Implicit and Explicit ZI models, res…

    Submitted 29 July, 2021; v1 submitted 8 July, 2020; originally announced July 2020.

    Comments: 41 pages, 3 figures, 1 table

  11. Bayesian Additive Regression Trees with Model Trees

    Authors: Estevão B. Prado, Rafael A. Moral, Andrew C. Parnell

    Abstract: Bayesian Additive Regression Trees (BART) is a tree-based machine learning method that has been successfully applied to regression and classification problems. BART assumes regularisation priors on a set of trees that work as weak learners and is very flexible for predicting in the presence of non-linearity and high-order interactions. In this paper, we introduce an extension of BART, called Model…

    Submitted 10 March, 2021; v1 submitted 12 June, 2020; originally announced June 2020.

    Journal ref: Statistics and Computing 31, 20 (2021)

  12. arXiv:1906.06744  [pdf, other]

    stat.AP

    Bayesian spatial extreme value analysis of maximum temperatures in County Dublin, Ireland

    Authors: John O'Sullivan, Conor Sweeney, Andrew C. Parnell

    Abstract: In this study, we begin a comprehensive characterisation of temperature extremes in Ireland for the period 1981-2010. We produce return levels of anomalies of daily maximum temperature extremes for an area over Ireland, for the 30-year period 1981-2010. We employ extreme value theory (EVT) to model the data using the generalised Pareto distribution (GPD) as part of a three-level Bayesian hierarchi…

    Submitted 16 June, 2019; originally announced June 2019.

  13. arXiv:1508.02010  [pdf, other]

    stat.AP physics.ao-ph

    A Bayesian Hierarchical Model for Reconstructing Sea Levels: From Raw Data to Rates of Change

    Authors: Niamh Cahill, Andrew C. Kemp, Benjamin P. Horton, Andrew C. Parnell

    Abstract: We present a holistic Bayesian hierarchical model for reconstructing the continuous and dynamic evolution of relative sea-level (RSL) change with fully quantified uncertainty. The reconstruction is produced from biological (foraminifera) and geochemical (δ13C) sea-level indicators preserved in dated cores of salt-marsh sediment. Our model is comprised of three modules: (1) A Bayesian transfer func…

    Submitted 9 August, 2015; originally announced August 2015.

    Comments: 27 pages, 7 figures

  14. arXiv:1507.00181  [pdf, other]

    stat.CO stat.ME

    Bayesian Additive Regression Trees using Bayesian Model Averaging

    Authors: Belinda Hernández, Adrian E. Raftery, Stephen R. Pennington, Andrew C. Parnell

    Abstract: Bayesian Additive Regression Trees (BART) is a statistical sum of trees model. It can be considered a Bayesian version of machine learning tree ensemble methods where the individual trees are the base learners. However for data sets where the number of variables $p$ is large (e.g. $p>5,000$) the algorithm can become prohibitively expensive, computationally. Another method which is popular for hi…

    Submitted 8 July, 2015; v1 submitted 1 July, 2015; originally announced July 2015.

  15. arXiv:1407.6242  [pdf, ps, other]

    stat.AP

    Frequency behaviour for multinomial counts of fisheries discards via a nested wavelet zero and N inflated binomial model

    Authors: Andrew C. Parnell, Norman Graham, Andrew L. Jackson, Mafalda Viana

    Abstract: In this paper we identify the changing frequency behaviour of multinomial counts of fish species discarded by vessels in the Irish Sea. We use a Bayesian hierarchical model which captures dynamic frequency changes via a shrinkage model applied to wavelet basis functions. Wavelets are known for capturing data features at different temporal scales; we use a recently-proposed shrinkage prior from the…

    Submitted 23 July, 2014; originally announced July 2014.

    Comments: 24 pages, 9 figures

  16. arXiv:1407.0064  [pdf, ps, other]

    stat.ME

    The zero & $N$-inflated binomial distribution with applications

    Authors: James Sweeney, John Haslett, Andrew C. Parnell

    Abstract: In this article we consider the distribution arising when two zero-inflated Poisson count processes are constrained by their sum total, resulting in a novel zero & $N$-inflated binomial distribution. This result motivates a general class of model for applications in which a sum-constrained count response is subject to multiple sources of heterogeneity, principally an excess of zeroes and $N$'s in…

    Submitted 17 February, 2016; v1 submitted 30 June, 2014; originally announced July 2014.

  17. arXiv:1402.3014  [pdf, other]

    stat.AP

    Joint Inference of Misaligned Irregular Time Series with Application to Greenland Ice Core Data

    Authors: Thinh K. Doan, Andrew C. Parnell, John Haslett

    Abstract: Ice cores provide insight into the past climate over many millennia. Due to ice compaction, the raw data for any single core are irregular in time. Multiple cores have different irregularities; jointly these series are misaligned. After processing, such data are made available to researchers as regular time series: a data product. Typically, these cores are independently processed. In this paper,…

    Submitted 22 September, 2014; v1 submitted 12 February, 2014; originally announced February 2014.

    Comments: 14 pages, 8 figures

  18. Modeling sea-level change using errors-in-variables integrated Gaussian processes

    Authors: Niamh Cahill, Andrew C. Kemp, Benjamin P. Horton, Andrew C. Parnell

    Abstract: We perform Bayesian inference on historical and late Holocene (last 2000 years) rates of sea-level change. The input data to our model are tide-gauge measurements and proxy reconstructions from cores of coastal sediment. These data are complicated by multiple sources of uncertainty, some of which arise as part of the data collection exercise. Notably, the proxy reconstructions include temporal unc…

    Submitted 11 September, 2015; v1 submitted 24 December, 2013; originally announced December 2013.

    Comments: Published at http://dx.doi.org/10.1214/15-AOAS824 in the Annals of Applied Statistics (http://www.imstat.org/aoas/) by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-AOAS-AOAS824

    Journal ref: Annals of Applied Statistics 2015, Vol. 9, No. 2, 547-571

  19. arXiv:1209.6457  [pdf, other]

    stat.AP

    Bayesian Stable Isotope Mixing Models

    Authors: Andrew C. Parnell, Donald L. Phillips, Stuart Bearhop, Brice X. Semmens, Eric J. Ward, Jonathan W. Moore, Andrew L. Jackson, Richard Inger

    Abstract: In this paper we review recent advances in Stable Isotope Mixing Models (SIMMs) and place them into an over-arching Bayesian statistical framework which allows for several useful extensions. SIMMs are used to quantify the proportional contributions of various sources to a mixture. The most widely used application is quantifying the diet of organisms based on the food sources they have been observe…

    Submitted 28 September, 2012; originally announced September 2012.

    Comments: 16 pages, 9 Figures, 1 Table

  20. arXiv:1206.5009  [pdf, other]

    stat.AP

    On Bayesian Modelling of the Uncertainties in Palaeoclimate Reconstruction

    Authors: Andrew C. Parnell, James Sweeney, Thinh K. Doan, Michael Salter-Townshend, Judy R. M. Allen, Brian Huntley, John Haslett

    Abstract: We outline a model and algorithm to perform inference on the palaeoclimate and palaeoclimate volatility from pollen proxy data. We use a novel multivariate non-linear non-Gaussian state space model consisting of an observation equation linking climate to proxy data and an evolution equation driving climate change over time. The link from climate to proxy data is defined by a pre-calibrated forward…

    Submitted 21 June, 2012; originally announced June 2012.

    Comments: 25 pages, 7 figures