-
Market-based insurance ratemaking: application to pet insurance
Authors:
Pierre-Olivier Goffard,
Pierrick Piette,
Gareth W. Peters
Abstract:
This paper introduces a method for pricing insurance policies using market data. The approach is designed for scenarios in which the insurance company seeks to enter a new market, in our case: pet insurance, lacking historical data. The methodology involves an iterative two-step process. First, a suitable parameter is proposed to characterize the underlying risk. Second, the resulting pure premium…
▽ More
This paper introduces a method for pricing insurance policies using market data. The approach is designed for scenarios in which the insurance company seeks to enter a new market, in our case: pet insurance, lacking historical data. The methodology involves an iterative two-step process. First, a suitable parameter is proposed to characterize the underlying risk. Second, the resulting pure premium is linked to the observed commercial premium using an isotonic regression model. To validate the method, comprehensive testing is conducted on synthetic data, followed by its application to a dataset of actual pet insurance rates. To facilitate practical implementation, we have developed an R package called IsoPriceR. By addressing the challenge of pricing insurance policies in the absence of historical data, this method helps enhance pricing strategies in emerging markets.
△ Less
Submitted 6 February, 2025;
originally announced February 2025.
-
Signature Isolation Forest
Authors:
Marta Campi,
Guillaume Staerman,
Gareth W. Peters,
Tomoko Matsui
Abstract:
Functional Isolation Forest (FIF) is a recent state-of-the-art Anomaly Detection (AD) algorithm designed for functional data. It relies on a tree partition procedure where an abnormality score is computed by projecting each curve observation on a drawn dictionary through a linear inner product. Such linear inner product and the dictionary are a priori choices that highly influence the algorithm's…
▽ More
Functional Isolation Forest (FIF) is a recent state-of-the-art Anomaly Detection (AD) algorithm designed for functional data. It relies on a tree partition procedure where an abnormality score is computed by projecting each curve observation on a drawn dictionary through a linear inner product. Such linear inner product and the dictionary are a priori choices that highly influence the algorithm's performances and might lead to unreliable results, particularly with complex datasets. This work addresses these challenges by introducing \textit{Signature Isolation Forest}, a novel AD algorithm class leveraging the rough path theory's signature transform. Our objective is to remove the constraints imposed by FIF through the proposition of two algorithms which specifically target the linearity of the FIF inner product and the choice of the dictionary. We provide several numerical experiments, including a real-world applications benchmark showing the relevance of our methods.
△ Less
Submitted 25 February, 2025; v1 submitted 7 March, 2024;
originally announced March 2024.
-
Parsimonious Feature Extraction Methods: Extending Robust Probabilistic Projections with Generalized Skew-t
Authors:
Dorota Toczydlowska,
Gareth W. Peters,
Pavel V. Shevchenko
Abstract:
We propose a novel generalisation to the Student-t Probabilistic Principal Component methodology which: (1) accounts for an asymmetric distribution of the observation data; (2) is a framework for grouped and generalised multiple-degree-of-freedom structures, which provides a more flexible approach to modelling groups of marginal tail dependence in the observation data; and (3) separates the tail e…
▽ More
We propose a novel generalisation to the Student-t Probabilistic Principal Component methodology which: (1) accounts for an asymmetric distribution of the observation data; (2) is a framework for grouped and generalised multiple-degree-of-freedom structures, which provides a more flexible approach to modelling groups of marginal tail dependence in the observation data; and (3) separates the tail effect of the error terms and factors. The new feature extraction methods are derived in an incomplete data setting to efficiently handle the presence of missing values in the observation vector. We discuss various special cases of the algorithm being a result of simplified assumptions on the process generating the data. The applicability of the new framework is illustrated on a data set that consists of crypto currencies with the highest market capitalisation.
△ Less
Submitted 24 September, 2020;
originally announced September 2020.
-
Spatiotemporal analysis of urban heatwaves using Tukey g-and-h random field models
Authors:
Daisuke Murakami,
Gareth W. Peters,
Tomoko Matsui,
Yoshiki Yamagata
Abstract:
The statistical quantification of temperature processes for the analysis of urban heat island (UHI) effects and local heat-waves is an increasingly important application domain in smart city dynamic modelling. This leads to the increased importance of real-time heatwave risk management on a fine-grained spatial resolution. This study attempts to analyze and develop new methods for modelling the sp…
▽ More
The statistical quantification of temperature processes for the analysis of urban heat island (UHI) effects and local heat-waves is an increasingly important application domain in smart city dynamic modelling. This leads to the increased importance of real-time heatwave risk management on a fine-grained spatial resolution. This study attempts to analyze and develop new methods for modelling the spatio-temporal behavior of ground temperatures. The developed models consider higher-order stochastic spatial properties such as skewness and kurtosis, which are key components for understanding and describing local temperature fluctuations and UHI's. The developed models are applied to the greater Tokyo metropolitan area for a detailed real-world data case study. The analysis also demonstrates how to statistically incorporate a variety of real data sets. This includes remotely sensed imagery and a variety of ground-based monitoring site data to build models linking city and urban covariates to air temperature. The air temperature models are then used to capture high-resolution spatial emulator outputs for ground surface temperature modelling. The main class of processes studied includes the Tukey g-and-h processes for capturing spatial and temporal aspects of heat processes in urban environments.
△ Less
Submitted 2 April, 2020;
originally announced April 2020.
-
Quantile Diffusions for Risk Analysis
Authors:
Holly Brannelly,
Andrea Macrina,
Gareth W. Peters
Abstract:
We develop a novel approach for the construction of quantile processes governing the stochastic dynamics of quantiles in continuous time. Two classes of quantile diffusions are identified: the first, which we largely focus on, features a dynamic random quantile level and allows for direct interpretation of the resulting quantile process characteristics such as location, scale, skewness and kurtosi…
▽ More
We develop a novel approach for the construction of quantile processes governing the stochastic dynamics of quantiles in continuous time. Two classes of quantile diffusions are identified: the first, which we largely focus on, features a dynamic random quantile level and allows for direct interpretation of the resulting quantile process characteristics such as location, scale, skewness and kurtosis, in terms of the model parameters. The second type are function-valued quantile diffusions and are driven by stochastic parameter processes, which determine the entire quantile function at each point in time. By the proposed innovative and simple -- yet powerful -- construction method, quantile processes are obtained by transforming the marginals of a diffusion process under a composite map consisting of a distribution and a quantile function. Such maps, analogous to rank transmutation maps, produce the marginals of the resulting quantile process. We discuss the relationship and differences between our approach and existing methods and characterisations of quantile processes in discrete and continuous time. As an example of an application of quantile diffusions, we show how probability measure distortions, a form of dynamic tilting, can be induced. Though particularly useful in financial mathematics and actuarial science, examples of which are given in this work, measure distortions feature prominently across multiple research areas. For instance, dynamic distributional approximations (statistics), non-parametric and asymptotic analysis (mathematical statistics), dynamic risk measures (econometrics), behavioural economics, decision making (operations research), signal processing (information theory), and not least in general risk theory including applications thereof, for example in the context of climate change.
△ Less
Submitted 11 September, 2021; v1 submitted 23 December, 2019;
originally announced December 2019.
-
Multimodal Data Fusion of Non-Gaussian Spatial Fields in Sensor Networks
Authors:
Pengfei Zhang,
Gareth W. Peters,
Ido Nevat,
Keng Boon Teo,
Yixin Wang
Abstract:
We develop a robust data fusion algorithm for field reconstruction of multiple physical phenomena. The contribution of this paper is twofold: First, we demonstrate how multi-spatial fields which can have any marginal distributions and exhibit complex dependence structures can be constructed. To this end we develop a model where a latent process of these physical phenomena is modelled as Multiple G…
▽ More
We develop a robust data fusion algorithm for field reconstruction of multiple physical phenomena. The contribution of this paper is twofold: First, we demonstrate how multi-spatial fields which can have any marginal distributions and exhibit complex dependence structures can be constructed. To this end we develop a model where a latent process of these physical phenomena is modelled as Multiple Gaussian Process (MGP), and the dependence structure between these phenomena is captured through a Copula process. This model has the advantage of allowing one to choose any marginal distributions for the physical phenomenon. Second, we develop an efficient and robust linear estimation algorithm to predict the mean behaviour of the physical phenomena using rank correlation instead of the conventional linear Pearson correlation. Our approach has the advantage of avoiding the need to derive intractable predictive posterior distribution and also has a tractable solution for the rank correlation values. We show that our model outperforms the model which uses the conventional linear Pearson correlation metric in terms of the prediction mean-squared-errors (MSE). This provides the motivation for using our models for multimodal data fusion.
△ Less
Submitted 9 June, 2019;
originally announced June 2019.
-
Riemannian tangent space mapping and elastic net regularization for cost-effective EEG markers of brain atrophy in Alzheimer's disease
Authors:
Wolfgang Fruehwirt,
Matthias Gerstgrasser,
Pengfei Zhang,
Leonard Weydemann,
Markus Waser,
Reinhold Schmidt,
Thomas Benke,
Peter Dal-Bianco,
Gerhard Ransmayr,
Dieter Grossegger,
Heinrich Garn,
Gareth W. Peters,
Stephen Roberts,
Georg Dorffner
Abstract:
The diagnosis of Alzheimer's disease (AD) in routine clinical practice is most commonly based on subjective clinical interpretations. Quantitative electroencephalography (QEEG) measures have been shown to reflect neurodegenerative processes in AD and might qualify as affordable and thereby widely available markers to facilitate the objectivization of AD assessment. Here, we present a novel framewo…
▽ More
The diagnosis of Alzheimer's disease (AD) in routine clinical practice is most commonly based on subjective clinical interpretations. Quantitative electroencephalography (QEEG) measures have been shown to reflect neurodegenerative processes in AD and might qualify as affordable and thereby widely available markers to facilitate the objectivization of AD assessment. Here, we present a novel framework combining Riemannian tangent space mapping and elastic net regression for the development of brain atrophy markers. While most AD QEEG studies are based on small sample sizes and psychological test scores as outcome measures, here we train and test our models using data of one of the largest prospective EEG AD trials ever conducted, including MRI biomarkers of brain atrophy.
△ Less
Submitted 22 November, 2017;
originally announced November 2017.
-
Sensor Selection and Random Field Reconstruction for Robust and Cost-effective Heterogeneous Weather Sensor Networks for the Developing World
Authors:
Pengfei Zhang,
Ido Nevat,
Gareth W. Peters,
Wolfgang Fruehwirt,
Yongchao Huang,
Ivonne Anders,
Michael Osborne
Abstract:
We address the two fundamental problems of spatial field reconstruction and sensor selection in heterogeneous sensor networks: (i) how to efficiently perform spatial field reconstruction based on measurements obtained simultaneously from networks with both high and low quality sensors; and (ii) how to perform query based sensor set selection with predictive MSE performance guarantee. For the first…
▽ More
We address the two fundamental problems of spatial field reconstruction and sensor selection in heterogeneous sensor networks: (i) how to efficiently perform spatial field reconstruction based on measurements obtained simultaneously from networks with both high and low quality sensors; and (ii) how to perform query based sensor set selection with predictive MSE performance guarantee. For the first problem, we developed a low complexity algorithm based on the spatial best linear unbiased estimator (S-BLUE). Next, building on the S-BLUE, we address the second problem, and develop an efficient algorithm for query based sensor set selection with performance guarantee. Our algorithm is based on the Cross Entropy method which solves the combinatorial optimization problem in an efficient manner.
△ Less
Submitted 23 November, 2017; v1 submitted 12 November, 2017;
originally announced November 2017.
-
Dynamic Quantile Function Models
Authors:
Wilson Ye Chen,
Gareth W. Peters,
Richard H. Gerlach,
Scott A. Sisson
Abstract:
Motivated by the need for effectively summarising, modelling, and forecasting the distributional characteristics of intra-daily returns, as well as the recent work on forecasting histogram-valued time-series in the area of symbolic data analysis, we develop a time-series model for forecasting quantile-function-valued (QF-valued) daily summaries for intra-daily returns. We call this model the dynam…
▽ More
Motivated by the need for effectively summarising, modelling, and forecasting the distributional characteristics of intra-daily returns, as well as the recent work on forecasting histogram-valued time-series in the area of symbolic data analysis, we develop a time-series model for forecasting quantile-function-valued (QF-valued) daily summaries for intra-daily returns. We call this model the dynamic quantile function (DQF) model. Instead of a histogram, we propose to use a $g$-and-$h$ quantile function to summarise the distribution of intra-daily returns. We work with a Bayesian formulation of the DQF model in order to make statistical inference while accounting for parameter uncertainty; an efficient MCMC algorithm is developed for sampling-based posterior inference. Using ten international market indices and approximately 2,000 days of out-of-sample data from each market, the performance of the DQF model compares favourably, in terms of forecasting VaR of intra-daily returns, against the interval-valued and histogram-valued time-series models. Additionally, we demonstrate that the QF-valued forecasts can be used to forecast VaR measures at the daily timescale via a simple quantile regression model on daily returns (QR-DQF). In certain markets, the resulting QR-DQF model is able to provide competitive VaR forecasts for daily returns.
△ Less
Submitted 4 May, 2021; v1 submitted 9 July, 2017;
originally announced July 2017.
-
A unified approach to mortality modelling using state-space framework: characterisation, identification, estimation and forecasting
Authors:
Man Chung Fung,
Gareth W. Peters,
Pavel V. Shevchenko
Abstract:
This paper explores and develops alternative statistical representations and estimation approaches for dynamic mortality models. The framework we adopt is to reinterpret popular mortality models such as the Lee-Carter class of models in a general state-space modelling methodology, which allows modelling, estimation and forecasting of mortality under a unified framework. Furthermore, we propose an…
▽ More
This paper explores and develops alternative statistical representations and estimation approaches for dynamic mortality models. The framework we adopt is to reinterpret popular mortality models such as the Lee-Carter class of models in a general state-space modelling methodology, which allows modelling, estimation and forecasting of mortality under a unified framework. Furthermore, we propose an alternative class of model identification constraints which is more suited to statistical inference in filtering and parameter estimation settings based on maximization of the marginalized likelihood or in Bayesian inference. We then develop a novel class of Bayesian state-space models which incorporate apriori beliefs about the mortality model characteristics as well as for more flexible and appropriate assumptions relating to heteroscedasticity that present in observed mortality data. We show that multiple period and cohort effect can be cast under a state-space structure. To study long term mortality dynamics, we introduce stochastic volatility to the period effect. The estimation of the resulting stochastic volatility model of mortality is performed using a recent class of Monte Carlo procedure specifically designed for state and parameter estimation in Bayesian state-space models, known as the class of particle Markov chain Monte Carlo methods. We illustrate the framework we have developed using Danish male mortality data, and show that incorporating heteroscedasticity and stochastic volatility markedly improves model fit despite an increase of model complexity. Forecasting properties of the enhanced models are examined with long term and short term calibration periods on the reconstruction of life tables.
△ Less
Submitted 30 May, 2016;
originally announced May 2016.
-
Estimating Quantile Families of Loss Distributions for Non-Life Insurance Modelling via L-moments
Authors:
Gareth W. Peters,
Wilson Y. Chen,
Richard H. Gerlach
Abstract:
This paper discusses different classes of loss models in non-life insurance settings. It then overviews the class Tukey transform loss models that have not yet been widely considered in non-life insurance modelling, but offer opportunities to produce flexible skewness and kurtosis features often required in loss modelling. In addition, these loss models admit explicit quantile specifications which…
▽ More
This paper discusses different classes of loss models in non-life insurance settings. It then overviews the class Tukey transform loss models that have not yet been widely considered in non-life insurance modelling, but offer opportunities to produce flexible skewness and kurtosis features often required in loss modelling. In addition, these loss models admit explicit quantile specifications which make them directly relevant for quantile based risk measure calculations. We detail various parameterizations and sub-families of the Tukey transform based models, such as the g-and-h, g-and-k and g-and-j models, including their properties of relevance to loss modelling.
One of the challenges with such models is to perform robust estimation for the loss model parameters that will be amenable to practitioners when fitting such models. In this paper we develop a novel, efficient and robust estimation procedure for estimation of model parameters in this family Tukey transform models, based on L-moments. It is shown to be more robust and efficient than current state of the art methods of estimation for such families of loss models and is simple to implement for practical purposes.
△ Less
Submitted 3 March, 2016;
originally announced March 2016.
-
A spatiotemporal analysis of participatory sensing data "tweets" and extreme climate events toward real-time urban risk management
Authors:
Yoshiki Yamagata,
Daisuke Murakami,
Gareth W. Peters,
Tomoko Matsui
Abstract:
Real-time urban climate monitoring provides useful information that can be utilized to help monitor and adapt to extreme events, including urban heatwaves. Typical approaches to the monitoring of climate data include weather station monitoring and remote sensing. However, climate monitoring stations are very often distributed spatially in a sparse manner, and consequently, this has a significant i…
▽ More
Real-time urban climate monitoring provides useful information that can be utilized to help monitor and adapt to extreme events, including urban heatwaves. Typical approaches to the monitoring of climate data include weather station monitoring and remote sensing. However, climate monitoring stations are very often distributed spatially in a sparse manner, and consequently, this has a significant impact on the ability to reveal exposure risks due to extreme climates at an intra-urban scale. Additionally, traditional remote sensing data sources are typically not received and analyzed in real-time which is often required for adaptive urban management of climate extremes, such as sudden heatwaves. Fortunately, recent social media, such as Twitter, furnishes real-time and high-resolution spatial information that might be useful for climate condition estimation. The objective of this study is utilizing geo-tagged tweets (participatory sensing data) for urban temperature analysis. We first detect tweets relating hotness (hot-tweets). Then, we study relationships between monitored temperatures and hot-tweets via a statistical model framework based on copula modelling methods. We demonstrate that there are strong relationships between "hot-tweets" and temperatures recorded at an intra-urban scale. Subsequently, we then investigate the application of "hot-tweets" informing spatio-temporal Gaussian process interpolation of temperatures as an application example of "hot-tweets". We utilize a combination of spatially sparse weather monitoring sensor data and spatially and temporally dense lower quality twitter data. Here, a spatial best linear unbiased estimation technique is applied. The result suggests that tweets provide some useful auxiliary information for urban climate assessment. Lastly, effectiveness of tweets toward a real-time urban risk management is discussed based on the results.
△ Less
Submitted 17 September, 2015; v1 submitted 22 May, 2015;
originally announced May 2015.
-
New Perspectives on Multiple Source Localization in Wireless Sensor Networks
Authors:
Thi Le Thu Nguyen,
Francois Septier,
Harizo Rajaona,
Gareth W. Peters,
Ido Nevat,
Yves Delignon
Abstract:
In this paper we address the challenging problem of multiple source localization in Wireless Sensor Networks (WSN). We develop an efficient statistical algorithm, based on the novel application of Sequential Monte Carlo (SMC) sampler methodology, that is able to deal with an unknown number of sources given quantized data obtained at the fusion center from different sensors with imperfect wireless…
▽ More
In this paper we address the challenging problem of multiple source localization in Wireless Sensor Networks (WSN). We develop an efficient statistical algorithm, based on the novel application of Sequential Monte Carlo (SMC) sampler methodology, that is able to deal with an unknown number of sources given quantized data obtained at the fusion center from different sensors with imperfect wireless channels. We also derive the Posterior Cramér-Rao Bound (PCRB) of the source location estimate. The PCRB is used to analyze the accuracy of the proposed SMC sampler algorithm and the impact that quantization has on the accuracy of location estimates of the sources. Extensive experiments show that the benefits of the proposed scheme in terms of the accuracy of the estimation method that are required for model selection (i.e., the number of sources) and the estimation of the source characteristics compared to the classical importance sampling method.
△ Less
Submitted 22 April, 2015;
originally announced April 2015.
-
SMC-ABC methods for the estimation of stochastic simulation models of the limit order book
Authors:
Gareth W. Peters,
Efstathios Panayi,
Francois Septier
Abstract:
In this paper we consider classes of models that have been recently developed for quantitative finance that involve modelling a highly complex multivariate, multi-attribute stochastic process known as the Limit Order Book (LOB). The LOB is the primary data structure recorded each day intra-daily for all assets on every electronic exchange in the world in which trading takes place. As such, it repr…
▽ More
In this paper we consider classes of models that have been recently developed for quantitative finance that involve modelling a highly complex multivariate, multi-attribute stochastic process known as the Limit Order Book (LOB). The LOB is the primary data structure recorded each day intra-daily for all assets on every electronic exchange in the world in which trading takes place. As such, it represents one of the most important fundamental structures to study from a stochastic process perspective if one wishes to characterize features of stochastic dynamics for price, volume, liquidity and other important attributes for a traded asset. In this paper we aim to adopt the model structure which develops a stochastic model framework for the LOB of a given asset and to explain how to perform calibration of this stochastic model to real observed LOB data for a range of different assets.
△ Less
Submitted 22 April, 2015;
originally announced April 2015.
-
Efficient Sequential Monte-Carlo Samplers for Bayesian Inference
Authors:
Thi Le Thu Nguyen,
Francois Septier,
Gareth W. Peters,
Yves Delignon
Abstract:
In many problems, complex non-Gaussian and/or nonlinear models are required to accurately describe a physical system of interest. In such cases, Monte Carlo algorithms are remarkably flexible and extremely powerful approaches to solve such inference problems. However, in the presence of a high-dimensional and/or multimodal posterior distribution, it is widely documented that standard Monte-Carlo t…
▽ More
In many problems, complex non-Gaussian and/or nonlinear models are required to accurately describe a physical system of interest. In such cases, Monte Carlo algorithms are remarkably flexible and extremely powerful approaches to solve such inference problems. However, in the presence of a high-dimensional and/or multimodal posterior distribution, it is widely documented that standard Monte-Carlo techniques could lead to poor performance. In this paper, the study is focused on a Sequential Monte-Carlo (SMC) sampler framework, a more robust and efficient Monte Carlo algorithm. Although this approach presents many advantages over traditional Monte-Carlo methods, the potential of this emergent technique is however largely underexploited in signal processing. In this work, we aim at proposing some novel strategies that will improve the efficiency and facilitate practical implementation of the SMC sampler specifically for signal processing applications. Firstly, we propose an automatic and adaptive strategy that selects the sequence of distributions within the SMC sampler that minimizes the asymptotic variance of the estimator of the posterior normalization constant. This is critical for performing model selection in modelling applications in Bayesian signal processing. The second original contribution we present improves the global efficiency of the SMC sampler by introducing a novel correction mechanism that allows the use of the particles generated through all the iterations of the algorithm (instead of only particles from the last iteration). This is a significant contribution as it removes the need to discard a large portion of the samples obtained, as is standard in standard SMC methods. This will improve estimation performance in practical settings where computational budget is important to consider.
△ Less
Submitted 22 April, 2015;
originally announced April 2015.
-
Langevin and Hamiltonian based Sequential MCMC for Efficient Bayesian Filtering in High-dimensional Spaces
Authors:
Francois Septier,
Gareth W. Peters
Abstract:
Nonlinear non-Gaussian state-space models arise in numerous applications in statistics and signal processing. In this context, one of the most successful and popular approximation techniques is the Sequential Monte Carlo (SMC) algorithm, also known as particle filtering. Nevertheless, this method tends to be inefficient when applied to high dimensional problems. In this paper, we focus on another…
▽ More
Nonlinear non-Gaussian state-space models arise in numerous applications in statistics and signal processing. In this context, one of the most successful and popular approximation techniques is the Sequential Monte Carlo (SMC) algorithm, also known as particle filtering. Nevertheless, this method tends to be inefficient when applied to high dimensional problems. In this paper, we focus on another class of sequential inference methods, namely the Sequential Markov Chain Monte Carlo (SMCMC) techniques, which represent a promising alternative to SMC methods. After providing a unifying framework for the class of SMCMC approaches, we propose novel efficient strategies based on the principle of Langevin diffusion and Hamiltonian dynamics in order to cope with the increasing number of high-dimensional applications. Simulation results show that the proposed algorithms achieve significantly better performance compared to existing algorithms.
△ Less
Submitted 29 October, 2015; v1 submitted 22 April, 2015;
originally announced April 2015.
-
Tensor Approximation of Generalized Correlated Diffusions and Functional Copula Operators
Authors:
Antonio Dalessandro,
Gareth W. Peters
Abstract:
We investigate aspects of semimartingale decompositions, approximation and the martingale representation for multidimensional correlated Markov processes. A new interpretation of the dependence among processes is given using the martingale approach. We show that it is possible to represent, in both continuous and discrete space, that a multidimensional correlated generalized diffusion is a linear…
▽ More
We investigate aspects of semimartingale decompositions, approximation and the martingale representation for multidimensional correlated Markov processes. A new interpretation of the dependence among processes is given using the martingale approach. We show that it is possible to represent, in both continuous and discrete space, that a multidimensional correlated generalized diffusion is a linear combination of processes that originate from the decomposition of the starting multidimensional semimartingale. This result not only reconciles with the existing theory of diffusion approximations and decompositions, but defines the general representation of infinitesimal generators for both multidimensional generalized diffusions and as we will demonstrate also for the specification of copula density dependence structures. This new result provides immediate representation of the approximate solution for correlated stochastic differential equations. We demonstrate desirable convergence results for the proposed multidimensional semimartingales decomposition approximations.
△ Less
Submitted 23 February, 2015;
originally announced February 2015.
-
Sequential Monte Carlo Samplers for capital allocation under copula-dependent risk models
Authors:
Rodrigo S. Targino,
Gareth W. Peters,
Pavel V. Shevchenko
Abstract:
In this paper we assume a multivariate risk model has been developed for a portfolio and its capital derived as a homogeneous risk measure. The Euler (or gradient) principle, then, states that the capital to be allocated to each component of the portfolio has to be calculated as an expectation conditional to a rare event, which can be challenging to evaluate in practice. We exploit the copula-depe…
▽ More
In this paper we assume a multivariate risk model has been developed for a portfolio and its capital derived as a homogeneous risk measure. The Euler (or gradient) principle, then, states that the capital to be allocated to each component of the portfolio has to be calculated as an expectation conditional to a rare event, which can be challenging to evaluate in practice. We exploit the copula-dependence within the portfolio risks to design a Sequential Monte Carlo Samplers based estimate to the marginal conditional expectations involved in the problem, showing its efficiency through a series of computational examples.
△ Less
Submitted 17 February, 2015; v1 submitted 4 October, 2014;
originally announced October 2014.
-
Risk Margin Quantile Function Via Parametric and Non-Parametric Bayesian Quantile Regression
Authors:
Alice X. D. Dong,
Jennifer S. K. Chan,
Gareth W. Peters
Abstract:
We develop quantile regression models in order to derive risk margin and to evaluate capital in non-life insurance applications. By utilizing the entire range of conditional quantile functions, especially higher quantile levels, we detail how quantile regression is capable of providing an accurate estimation of risk margin and an overview of implied capital based on the historical volatility of a…
▽ More
We develop quantile regression models in order to derive risk margin and to evaluate capital in non-life insurance applications. By utilizing the entire range of conditional quantile functions, especially higher quantile levels, we detail how quantile regression is capable of providing an accurate estimation of risk margin and an overview of implied capital based on the historical volatility of a general insurers loss portfolio. Two modelling frameworks are considered based around parametric and nonparametric quantile regression models which we develop specifically in this insurance setting.
In the parametric quantile regression framework, several models including the flexible generalized beta distribution family, asymmetric Laplace (AL) distribution and power Pareto distribution are considered under a Bayesian regression framework. The Bayesian posterior quantile regression models in each case are studied via Markov chain Monte Carlo (MCMC) sampling strategies.
In the nonparametric quantile regression framework, that we contrast to the parametric Bayesian models, we adopted an AL distribution as a proxy and together with the parametric AL model, we expressed the solution as a scale mixture of uniform distributions to facilitate implementation. The models are extended to adopt dynamic mean, variance and skewness and applied to analyze two real loss reserve data sets to perform inference and discuss interesting features of quantile regression for risk margin calculations.
△ Less
Submitted 11 February, 2014;
originally announced February 2014.
-
Optimal insurance purchase strategies via optimal multiple stopping times
Authors:
Rodrigo S. Targino,
Gareth W. Peters,
Georgy Sofronov,
Pavel V. Shevchenko
Abstract:
In this paper we study a class of insurance products where the policy holder has the option to insure $k$ of its annual Operational Risk losses in a horizon of $T$ years. This involves a choice of $k$ out of $T$ years in which to apply the insurance policy coverage by making claims against losses in the given year. The insurance product structure presented can accommodate any kind of annual mitiga…
▽ More
In this paper we study a class of insurance products where the policy holder has the option to insure $k$ of its annual Operational Risk losses in a horizon of $T$ years. This involves a choice of $k$ out of $T$ years in which to apply the insurance policy coverage by making claims against losses in the given year. The insurance product structure presented can accommodate any kind of annual mitigation, but we present three basic generic insurance policy structures that can be combined to create more complex types of coverage. Following the Loss Distributional Approach (LDA) with Poisson distributed annual loss frequencies and Inverse-Gaussian loss severities we are able to characterize in closed form analytical expressions for the multiple optimal decision strategy that minimizes the expected Operational Risk loss over the next $T$ years. For the cases where the combination of insurance policies and LDA model does not lead to closed form expressions for the multiple optimal decision rules, we also develop a principled class of closed form approximations to the optimal decision rule. These approximations are developed based on a class of orthogonal Askey polynomial series basis expansion representations of the annual loss compound process distribution and functions of this annual loss.
△ Less
Submitted 2 December, 2013;
originally announced December 2013.
-
Heavy-Tailed Features and Empirical Analysis of the Limit Order Book Volume Profiles in Futures Markets
Authors:
Kylie-Anne Richards,
Gareth W. Peters,
William Dunsmuir
Abstract:
This paper poses a few fundamental questions regarding the attributes of the volume profile of a Limit Order Books stochastic structure by taking into consideration aspects of intraday and interday statistical features, the impact of different exchange features and the impact of market participants in different asset sectors. This paper aims to address the following questions:
1. Is there statis…
▽ More
This paper poses a few fundamental questions regarding the attributes of the volume profile of a Limit Order Books stochastic structure by taking into consideration aspects of intraday and interday statistical features, the impact of different exchange features and the impact of market participants in different asset sectors. This paper aims to address the following questions:
1. Is there statistical evidence that heavy-tailed sub-exponential volume profiles occur at different levels of the Limit Order Book on the bid and ask and if so does this happen on intra or interday time scales ?
2.In futures exchanges, are heavy tail features exchange (CBOT, CME, EUREX, SGX and COMEX) or asset class (government bonds, equities and precious metals) dependent and do they happen on ultra-high (<1sec) or mid-range (1sec -10min) high frequency data?
3.Does the presence of stochastic heavy-tailed volume profile features evolve in a manner that would inform or be indicative of market participant behaviors, such as high frequency algorithmic trading, quote stuffing and price discovery intra-daily?
4. Is there statistical evidence for a need to consider dynamic behavior of the parameters of models for Limit Order Book volume profiles on an intra-daily time scale ?
Progress on aspects of each question is obtained via statistically rigorous results to verify the empirical findings for an unprecedentedly large set of futures market LOB data. The data comprises several exchanges, several futures asset classes and all trading days of 2010, using market depth (Type II) order book data to 5 levels on the bid and ask.
△ Less
Submitted 22 April, 2015; v1 submitted 26 October, 2012;
originally announced October 2012.
-
A Copula Based Bayesian Approach for Paid-Incurred Claims Models for Non-Life Insurance Reserving
Authors:
Gareth W. Peters,
Alice X. D. Dong,
Robert Kohn
Abstract:
Our article considers the class of recently developed stochastic models that combine claims payments and incurred losses information into a coherent reserving methodology. In particular, we develop a family of Heirarchical Bayesian Paid-Incurred-Claims models, combining the claims reserving models of Hertig et al. (1985) and Gogol et al. (1993). In the process we extend the independent log-normal…
▽ More
Our article considers the class of recently developed stochastic models that combine claims payments and incurred losses information into a coherent reserving methodology. In particular, we develop a family of Heirarchical Bayesian Paid-Incurred-Claims models, combining the claims reserving models of Hertig et al. (1985) and Gogol et al. (1993). In the process we extend the independent log-normal model of Merz et al. (2010) by incorporating different dependence structures using a Data-Augmented mixture Copula Paid-Incurred claims model.
The utility and influence of incorporating both payment and incurred losses into estimating of the full predictive distribution of the outstanding loss liabilities and the resulting reserves is demonstrated in the following cases: (i) an independent payment (P) data model; (ii) the independent Payment-Incurred Claims (PIC) data model of Merz et al. (2010); (iii) a novel dependent lag-year telescoping block diagonal Gaussian Copula PIC data model incorporating conjugacy via transformation; (iv) a novel data-augmented mixture Archimedean copula dependent PIC data model.
Inference in such models is developed via a class of adaptive Markov chain Monte Carlo sampling algorithms. These incorporate a data-augmentation framework utilized to efficiently evaluate the likelihood for the copula based PIC model in the loss reserving triangles. The adaptation strategy is based on representing a positive definite covariance matrix by the exponential of a symmetric matrix as proposed by Leonard et al. (1992).
△ Less
Submitted 9 December, 2012; v1 submitted 14 October, 2012;
originally announced October 2012.
-
Generalized Interference Models in Doubly Stochastic Poisson Random Fields for Wideband Communications: the PNSC(alpha) model
Authors:
Gareth W. Peters,
Ido Nevat,
Francois Septier,
Laurent Clavier
Abstract:
A general stochastic model is developed for the total interference in wideband systems, denoted as the PNSC(alpha) Interference Model. It allows one to obtain, analytic representations in situations where (a) interferers are distributed according to either a homogeneous or an inhomogeneous in time or space Cox point process and (b) when the frequency bands occupied by each of the unknown number of…
▽ More
A general stochastic model is developed for the total interference in wideband systems, denoted as the PNSC(alpha) Interference Model. It allows one to obtain, analytic representations in situations where (a) interferers are distributed according to either a homogeneous or an inhomogeneous in time or space Cox point process and (b) when the frequency bands occupied by each of the unknown number of interferers is also a random variable in the allowable bandwidth. The analytic representations obtained are generalizations of Cox processes to the family of sub-exponential models characterized by distributions from the alpha-stable family. We develop general parametric density representations for the interference models via doubly stochastic Poisson mixture representations of Scaled Mixture of Normal's via the Normal-Stable variance mixture. To illustrate members of this class of interference model we also develop two special cases for a moderately impulsive interference (alpha=3/2) and a highly impulsive interference (alpha=2/3) where closed form representations can be obtained either by the SMiN representation or via function expansions based on the Holtsmark distribution or Whittaker functions. To illustrate the paper we propose expressions for the Capacity of a BPSK system under a PNSC(alpha) interference, via analytic expressions for the Likelihood Ratio Test statistic.
△ Less
Submitted 6 July, 2012;
originally announced July 2012.
-
Adaptive Markov Chain Monte Carlo Forward Simulation for Statistical Analysis in Epidemic Modelling of Human Papillomavirus
Authors:
Igor A. Korostil,
Gareth W. Peters,
Julien Cornebise,
David G. Regan
Abstract:
We develop a Bayesian statistical model and estimation methodology based on Forward Projection Adaptive Markov chain Monte Carlo in order to perform the calibration of a high-dimensional non-linear system of Ordinary Differential Equations representing an epidemic model for Human Papillomavirus types 6 and 11 (HPV-6, HPV-11). The model is compartmental and involves stratification by age, gender an…
▽ More
We develop a Bayesian statistical model and estimation methodology based on Forward Projection Adaptive Markov chain Monte Carlo in order to perform the calibration of a high-dimensional non-linear system of Ordinary Differential Equations representing an epidemic model for Human Papillomavirus types 6 and 11 (HPV-6, HPV-11). The model is compartmental and involves stratification by age, gender and sexual activity-group. Developing this model and a means to calibrate it efficiently is relevant since HPV is a very multi-typed and common sexually transmitted infection with more than 100 types currently known. The two types studied in this paper, types 6 and 11, are causing about 90% of anogenital warts.
We extend the development of a sexual mixing matrix for the population, based on a formulation first suggested by Garnett and Anderson. In particular we consider a stochastic mixing matrix framework which allows us to jointly estimate unknown attributes and parameters of the mixing matrix along with the parameters involved in the calibration of the HPV epidemic model. This matrix describes the sexual interactions between members of the population under study and relies on several quantities which are a-priori unknown. The Bayesian model developed allows one to estimate jointly the HPV-6 and HPV-11 epidemic model parameters such as the probability of transmission, HPV incubation period, duration of infection, duration of genital warts treatment, duration of immunity, the probability of seroconversion, per gender, age-group and sexual activity-group, as well as unknown sexual mixing matrix parameters related to assortativity. We conclude with simulation studies on synthetic and actual data from studies undertaken recently in Australia.
△ Less
Submitted 15 August, 2011;
originally announced August 2011.
-
System Identification in Wireless Relay Networks via Gaussian Process
Authors:
Gareth W. Peters,
Ido Nevat,
Jinhong Yuan,
Ian B. Collings
Abstract:
We present a flexible stochastic model for a class of cooperative wireless relay networks, in which the relay processing functionality is not known at the destination. In addressing this problem we develop efficient algorithms to perform relay identification in a wireless relay network. We first construct a statistical model based on a representation of the system using Gaussian Processes in a non…
▽ More
We present a flexible stochastic model for a class of cooperative wireless relay networks, in which the relay processing functionality is not known at the destination. In addressing this problem we develop efficient algorithms to perform relay identification in a wireless relay network. We first construct a statistical model based on a representation of the system using Gaussian Processes in a non-standard manner due to the way we treat the imperfect channel state information. We then formulate the estimation problem to perform system identification, taking into account complexity and computational efficiency. Next we develop a set of three algorithms to solve the identification problem each of decreasing complexity, trading-off the estimation bias for computational efficiency. The joint optimisation problem is tackled via a Bayesian framework using the Iterated Conditioning on the Modes methodology. We develop a lower bound and several sub-optimal computationally efficient solutions to the identification problem, for comparison. We illustrate the estimation performance of our methodology for a range of widely used relay functionalities. The relative total error attained by our algorithm when compared to the lower bound is found to be at worst 9% for low SNR values under all functions considered. The effect of the relay functional estimation error is also studied via BER simulations and is shown to be less than 2dB worse than the lower bound.
△ Less
Submitted 17 January, 2012; v1 submitted 17 June, 2011;
originally announced June 2011.
-
Calibration and filtering for multi factor commodity models with seasonality: incorporating panel data from futures contracts
Authors:
Gareth W. Peters,
Mark Briers,
Pavel V. Shevchenko,
Arnaud Doucet
Abstract:
We examine a general multi-factor model for commodity spot prices and futures valuation. We extend the multi-factor long-short model in Schwartz and Smith (2000) and Yan (2002) in two important aspects: firstly we allow for both the long and short term dynamic factors to be mean reverting incorporating stochastic volatility factors and secondly we develop an additive structural seasonality model.…
▽ More
We examine a general multi-factor model for commodity spot prices and futures valuation. We extend the multi-factor long-short model in Schwartz and Smith (2000) and Yan (2002) in two important aspects: firstly we allow for both the long and short term dynamic factors to be mean reverting incorporating stochastic volatility factors and secondly we develop an additive structural seasonality model. Then a Milstein discretized non-linear stochastic volatility state space representation for the model is developed which allows for futures and options contracts in the observation equation. We then develop numerical methodology based on an advanced Sequential Monte Carlo algorithm utilising Particle Markov chain Monte Carlo to perform calibration of the model jointly with the filtering of the latent processes for the long-short dynamics and volatility factors. In this regard we explore and develop a novel methodology based on an adaptive Rao-Blackwellised version of the Particle Markov chain Monte Carlo methodology. In doing this we deal accurately with the non-linearities in the state-space model which are therefore introduced into the filtering framework. We perform analysis on synthetic and real data for oil commodities.
△ Less
Submitted 29 May, 2011;
originally announced May 2011.
-
Parameter Estimation for Hidden Markov Models with Intractable Likelihoods
Authors:
Thomas A. Dean,
Sumeetpal S. Singh,
Ajay Jasra,
Gareth W. Peters
Abstract:
Approximate Bayesian computation (ABC) is a popular technique for approximating likelihoods and is often used in parameter estimation when the likelihood functions are analytically intractable. Although the use of ABC is widespread in many fields, there has been little investigation of the theoretical properties of the resulting estimators. In this paper we give a theoretical analysis of the asymp…
▽ More
Approximate Bayesian computation (ABC) is a popular technique for approximating likelihoods and is often used in parameter estimation when the likelihood functions are analytically intractable. Although the use of ABC is widespread in many fields, there has been little investigation of the theoretical properties of the resulting estimators. In this paper we give a theoretical analysis of the asymptotic properties of ABC based maximum likelihood parameter estimation for hidden Markov models. In particular, we derive results analogous to those of consistency and asymptotic normality for standard maximum likelihood estimation. We also discuss how Sequential Monte Carlo methods provide a natural method for implementing likelihood based ABC procedures.
△ Less
Submitted 28 March, 2011;
originally announced March 2011.
-
Analytic Loss Distributional Approach Model for Operational Risk from the alpha-Stable Doubly Stochastic Compound Processes and Implications for Capital Allocation
Authors:
Gareth W. Peters,
Pavel Shevchenko,
Mark Young,
Wendy Yip
Abstract:
Under the Basel II standards, the Operational Risk (OpRisk) advanced measurement approach is not prescriptive regarding the class of statistical model utilised to undertake capital estimation. It has however become well accepted to utlise a Loss Distributional Approach (LDA) paradigm to model the individual OpRisk loss process corresponding to the Basel II Business line/event type. In this paper w…
▽ More
Under the Basel II standards, the Operational Risk (OpRisk) advanced measurement approach is not prescriptive regarding the class of statistical model utilised to undertake capital estimation. It has however become well accepted to utlise a Loss Distributional Approach (LDA) paradigm to model the individual OpRisk loss process corresponding to the Basel II Business line/event type. In this paper we derive a novel class of doubly stochastic alpha-stable family LDA models. These models provide the ability to capture the heavy tailed loss process typical of OpRisk whilst also providing analytic expressions for the compound process annual loss density and distributions as well as the aggregated compound process annual loss models. In particular we develop models of the annual loss process in two scenarios. The first scenario considers the loss process with a stochastic intensity parameter, resulting in an inhomogeneous compound Poisson processes annually. The resulting arrival process of losses under such a model will have independent counts over increments within the year. The second scenario considers discretization of the annual loss process into monthly increments with dependent time increments as captured by a Binomial process with a stochastic probability of success changing annually. Each of these models will be coupled under an LDA framework with heavy-tailed severity models comprised of $α$-stable severities for the loss amounts per loss event. In this paper we will derive analytic results for the annual loss distribution density and distribution under each of these models and study their properties.
△ Less
Submitted 17 February, 2011;
originally announced February 2011.
-
Discussion of "Riemann manifold Langevin and Hamiltonian Monte Carlo methods'' by M. Girolami and B. Calderhead
Authors:
Luke Bornn,
Julien Cornebise,
Gareth W. Peters
Abstract:
This technical report is the union of two contributions to the discussion of the Read Paper "Riemann manifold Langevin and Hamiltonian Monte Carlo methods" by B. Calderhead and M. Girolami, presented in front of the Royal Statistical Society on October 13th 2010 and to appear in the Journal of the Royal Statistical Society Series B. The first comment establishes a parallel and possible interaction…
▽ More
This technical report is the union of two contributions to the discussion of the Read Paper "Riemann manifold Langevin and Hamiltonian Monte Carlo methods" by B. Calderhead and M. Girolami, presented in front of the Royal Statistical Society on October 13th 2010 and to appear in the Journal of the Royal Statistical Society Series B. The first comment establishes a parallel and possible interactions with Adaptive Monte Carlo methods. The second comment exposes a detailed study of Riemannian Manifold Hamiltonian Monte Carlo (RMHMC) for a weakly identifiable model presenting a strong ridge in its geometry.
△ Less
Submitted 30 October, 2010;
originally announced November 2010.
-
Impact of Insurance for Operational Risk: Is it worthwhile to insure or be insured for severe losses?
Authors:
Gareth W. Peters,
Aaron D. Byrnes,
Pavel V. Shevchenko
Abstract:
Under the Basel II standards, the Operational Risk (OpRisk) advanced measurement approach allows a provision for reduction of capital as a result of insurance mitigation of up to 20%. This paper studies the behaviour of different insurance policies in the context of capital reduction for a range of possible extreme loss models and insurance policy scenarios in a multi-period, multiple risk setting…
▽ More
Under the Basel II standards, the Operational Risk (OpRisk) advanced measurement approach allows a provision for reduction of capital as a result of insurance mitigation of up to 20%. This paper studies the behaviour of different insurance policies in the context of capital reduction for a range of possible extreme loss models and insurance policy scenarios in a multi-period, multiple risk settings. A Loss Distributional Approach (LDA) for modelling of the annual loss process, involving homogeneous compound Poisson processes for the annual losses, with heavy tailed severity models comprised of alpha-stable severities is considered. There has been little analysis of such models to date and it is believed, insurance models will play more of a role in OpRisk mitigation and capital reduction in future. The first question of interest is when would it be equitable for a bank or financial institution to purchase insurance for heavy tailed OpRisk losses under different insurance policy scenarios? The second question then pertains to Solvency II and addresses what the insurers capital would be for such operational risk scenarios under different policy offerings. In addition we consider the insurers perspective with respect to fair premium as a percentage above the expected annual claim for each insurance policy. The intention being to address questions related to VaR reduction under Basel II, SCR under Solvency II and fair insurance premiums in OpRisk for different extreme loss scenarios. In the process we provide closed form solutions for the distribution of loss process and claims process in an LDA structure as well as closed form analytic solutions for the Expected Shortfall, SCR and MCR under Basel II and Solvency II. We also provide closed form analytic solutions for the annual loss distribution of multiple risks including insurance mitigation.
△ Less
Submitted 2 November, 2010; v1 submitted 21 October, 2010;
originally announced October 2010.
-
Bayesian Cointegrated Vector Autoregression models incorporating Alpha-stable noise for inter-day price movements via Approximate Bayesian Computation
Authors:
Gareth W. Peters,
Balakrishnan B. Kannan,
Ben Lasscock,
Chris Mellen,
Simon Godsill
Abstract:
We consider a statistical model for pairs of traded assets, based on a Cointegrated Vector Auto Regression (CVAR) Model. We extend standard CVAR models to incorporate estimation of model parameters in the presence of price series level shifts which are not accurately modeled in the standard Gaussian error correction model (ECM) framework. This involves developing a novel matrix variate Bayesian CV…
▽ More
We consider a statistical model for pairs of traded assets, based on a Cointegrated Vector Auto Regression (CVAR) Model. We extend standard CVAR models to incorporate estimation of model parameters in the presence of price series level shifts which are not accurately modeled in the standard Gaussian error correction model (ECM) framework. This involves developing a novel matrix variate Bayesian CVAR mixture model comprised of Gaussian errors intra-day and Alpha-stable errors inter-day in the ECM framework. To achieve this we derive a novel conjugate posterior model for the Scaled Mixtures of Normals (SMiN CVAR) representation of Alpha-stable inter-day innovations. These results are generalized to asymmetric models for the innovation noise at inter-day boundaries allowing for skewed Alpha-stable models.
Our proposed model and sampling methodology is general, incorporating the current literature on Gaussian models as a special subclass and also allowing for price series level shifts either at random estimated time points or known a priori time points. We focus analysis on regularly observed non-Gaussian level shifts that can have significant effect on estimation performance in statistical models failing to account for such level shifts, such as at the close and open of markets. We compare the estimation accuracy of our model and estimation approach to standard frequentist and Bayesian procedures for CVAR models when non-Gaussian price series level shifts are present in the individual series, such as inter-day boundaries. We fit a bi-variate Alpha-stable model to the inter-day jumps and model the effect of such jumps on estimation of matrix-variate CVAR model parameters using the likelihood based Johansen procedure and a Bayesian estimation. We illustrate our model and the corresponding estimation procedures we develop on both synthetic and actual data.
△ Less
Submitted 1 August, 2010;
originally announced August 2010.
-
Bayesian Symbol Detection in Wireless Relay Networks via Likelihood-Free Inference
Authors:
Gareth W. Peters,
Ido Nevat,
Scott A. Sisson,
Yanan Fan,
Jinhong Yuan
Abstract:
This paper presents a general stochastic model developed for a class of cooperative wireless relay networks, in which imperfect knowledge of the channel state information at the destination node is assumed. The framework incorporates multiple relay nodes operating under general known non-linear processing functions. When a non-linear relay function is considered, the likelihood function is general…
▽ More
This paper presents a general stochastic model developed for a class of cooperative wireless relay networks, in which imperfect knowledge of the channel state information at the destination node is assumed. The framework incorporates multiple relay nodes operating under general known non-linear processing functions. When a non-linear relay function is considered, the likelihood function is generally intractable resulting in the maximum likelihood and the maximum a posteriori detectors not admitting closed form solutions. We illustrate our methodology to overcome this intractability under the example of a popular optimal non-linear relay function choice and demonstrate how our algorithms are capable of solving the previously intractable detection problem. Overcoming this intractability involves development of specialised Bayesian models. We develop three novel algorithms to perform detection for this Bayesian model, these include a Markov chain Monte Carlo Approximate Bayesian Computation (MCMC-ABC) approach; an Auxiliary Variable MCMC (MCMC-AV) approach; and a Suboptimal Exhaustive Search Zero Forcing (SES-ZF) approach. Finally, numerical examples comparing the symbol error rate (SER) performance versus signal to noise ratio (SNR) of the three detection algorithms are studied in simulated examples.
△ Less
Submitted 26 July, 2010;
originally announced July 2010.
-
A note on target distribution ambiguity of likelihood-free samplers
Authors:
S. A. Sisson,
G. W. Peters,
M. Briers,
Y. Fan
Abstract:
Methods for Bayesian simulation in the presence of computationally intractable likelihood functions are of growing interest. Termed likelihood-free samplers, standard simulation algorithms such as Markov chain Monte Carlo have been adapted for this setting. In this article, by presenting generalisations of existing algorithms, we demonstrate that likelihood-free samplers can be ambiguous over the…
▽ More
Methods for Bayesian simulation in the presence of computationally intractable likelihood functions are of growing interest. Termed likelihood-free samplers, standard simulation algorithms such as Markov chain Monte Carlo have been adapted for this setting. In this article, by presenting generalisations of existing algorithms, we demonstrate that likelihood-free samplers can be ambiguous over the form of the target distribution. We also consider the theoretical justification of these samplers. Distinguishing between the forms of the target distribution may have implications for the future development of likelihood-free samplers.
△ Less
Submitted 27 May, 2010;
originally announced May 2010.
-
Ecological non-linear state space model selection via adaptive particle Markov chain Monte Carlo (AdPMCMC)
Authors:
Gareth W. Peters,
Geoff R. Hosack,
Keith R. Hayes
Abstract:
We develop a novel advanced Particle Markov chain Monte Carlo algorithm that is capable of sampling from the posterior distribution of non-linear state space models for both the unobserved latent states and the unknown model parameters. We apply this novel methodology to five population growth models, including models with strong and weak Allee effects, and test if it can efficiently sample from t…
▽ More
We develop a novel advanced Particle Markov chain Monte Carlo algorithm that is capable of sampling from the posterior distribution of non-linear state space models for both the unobserved latent states and the unknown model parameters. We apply this novel methodology to five population growth models, including models with strong and weak Allee effects, and test if it can efficiently sample from the complex likelihood surface that is often associated with these models. Utilising real and also synthetically generated data sets we examine the extent to which observation noise and process error may frustrate efforts to choose between these models. Our novel algorithm involves an Adaptive Metropolis proposal combined with an SIR Particle MCMC algorithm (AdPMCMC). We show that the AdPMCMC algorithm samples complex, high-dimensional spaces efficiently, and is therefore superior to standard Gibbs or Metropolis Hastings algorithms that are known to converge very slowly when applied to the non-linear state space ecological models considered in this paper. Additionally, we show how the AdPMCMC algorithm can be used to recursively estimate the Bayesian Cramér-Rao Lower Bound of Tichavský (1998). We derive expressions for these Cramér-Rao Bounds and estimate them for the models considered. Our results demonstrate a number of important features of common population growth models, most notably their multi-modal posterior surfaces and dependence between the static and dynamic parameters. We conclude by sampling from the posterior distribution of each of the models, and use Bayes factors to highlight how observation noise significantly diminishes our ability to select among some of the models, particularly those that are designed to reproduce an Allee effect.
△ Less
Submitted 12 May, 2010;
originally announced May 2010.
-
Model Selection and Adaptive Markov chain Monte Carlo for Bayesian Cointegrated VAR model
Authors:
Gareth W. Peters,
Balakrishnan Kannan,
Ben Lasscock,
Chris Mellen
Abstract:
This paper develops a matrix-variate adaptive Markov chain Monte Carlo (MCMC) methodology for Bayesian Cointegrated Vector Auto Regressions (CVAR). We replace the popular approach to sampling Bayesian CVAR models, involving griddy Gibbs, with an automated efficient alternative, based on the Adaptive Metropolis algorithm of Roberts and Rosenthal, (2009). Developing the adaptive MCMC framework for B…
▽ More
This paper develops a matrix-variate adaptive Markov chain Monte Carlo (MCMC) methodology for Bayesian Cointegrated Vector Auto Regressions (CVAR). We replace the popular approach to sampling Bayesian CVAR models, involving griddy Gibbs, with an automated efficient alternative, based on the Adaptive Metropolis algorithm of Roberts and Rosenthal, (2009). Developing the adaptive MCMC framework for Bayesian CVAR models allows for efficient estimation of posterior parameters in significantly higher dimensional CVAR series than previously possible with existing griddy Gibbs samplers. For a n-dimensional CVAR series, the matrix-variate posterior is in dimension $3n^2 + n$, with significant correlation present between the blocks of matrix random variables. We also treat the rank of the CVAR model as a random variable and perform joint inference on the rank and model parameters. This is achieved with a Bayesian posterior distribution defined over both the rank and the CVAR model parameters, and inference is made via Bayes Factor analysis of rank. Practically the adaptive sampler also aids in the development of automated Bayesian cointegration models for algorithmic trading systems considering instruments made up of several assets, such as currency baskets. Previously the literature on financial applications of CVAR trading models typically only considers pairs trading (n=2) due to the computational cost of the griddy Gibbs. We are able to extend under our adaptive framework to $n >> 2$ and demonstrate an example with n = 10, resulting in a posterior distribution with parameters up to dimension 310. By also considering the rank as a random quantity we can ensure our resulting trading models are able to adjust to potentially time varying market conditions in a coherent statistical framework.
△ Less
Submitted 21 April, 2010;
originally announced April 2010.
-
Chain ladder method: Bayesian bootstrap versus classical bootstrap
Authors:
Gareth W. Peters,
Mario V. Wüthrich,
Pavel V. Shevchenko
Abstract:
The intention of this paper is to estimate a Bayesian distribution-free chain ladder (DFCL) model using approximate Bayesian computation (ABC) methodology. We demonstrate how to estimate quantities of interest in claims reserving and compare the estimates to those obtained from classical and credibility approaches. In this context, a novel numerical procedure utilising Markov chain Monte Carlo (MC…
▽ More
The intention of this paper is to estimate a Bayesian distribution-free chain ladder (DFCL) model using approximate Bayesian computation (ABC) methodology. We demonstrate how to estimate quantities of interest in claims reserving and compare the estimates to those obtained from classical and credibility approaches. In this context, a novel numerical procedure utilising Markov chain Monte Carlo (MCMC), ABC and a Bayesian bootstrap procedure was developed in a truly distribution-free setting. The ABC methodology arises because we work in a distribution-free setting in which we make no parametric assumptions, meaning we can not evaluate the likelihood point-wise or in this case simulate directly from the likelihood model. The use of a bootstrap procedure allows us to generate samples from the intractable likelihood without the requirement of distributional assumptions, this is crucial to the ABC framework. The developed methodology is used to obtain the empirical distribution of the DFCL model parameters and the predictive distribution of the outstanding loss liabilities conditional on the observed claims. We then estimate predictive Bayesian capital estimates, the Value at Risk (VaR) and the mean square error of prediction (MSEP). The latter is compared with the classical bootstrap and credibility methods.
△ Less
Submitted 15 April, 2010;
originally announced April 2010.
-
Likelihood-free Bayesian inference for alpha-stable models
Authors:
G. W. Peters,
S. A. Sisson,
Y. Fan
Abstract:
$α$-stable distributions are utilised as models for heavy-tailed noise in many areas of statistics, finance and signal processing engineering.
However, in general, neither univariate nor multivariate $α$-stable models admit closed form densities which can be evaluated pointwise. This complicates the inferential procedure.
As a result, $α$-stable models are practically limited to the univaria…
▽ More
$α$-stable distributions are utilised as models for heavy-tailed noise in many areas of statistics, finance and signal processing engineering.
However, in general, neither univariate nor multivariate $α$-stable models admit closed form densities which can be evaluated pointwise. This complicates the inferential procedure.
As a result, $α$-stable models are practically limited to the univariate setting under the Bayesian paradigm, and to bivariate models under the classical framework.
In this article we develop a novel Bayesian approach to modelling univariate and multivariate $α$-stable distributions based on recent advances in "likelihood-free" inference.
We present an evaluation of the performance of this procedure in 1, 2 and 3 dimensions, and provide an analysis of real daily currency exchange rate data. The proposed approach provides a feasible inferential methodology at a moderate computational cost.
△ Less
Submitted 23 December, 2009;
originally announced December 2009.
-
Comments on "Particle Markov Chain Monte Carlo" by C. Andrieu, A. Doucet and R. Hollenstein
Authors:
Julien Cornebise,
Gareth W. Peters
Abstract:
We merge in this note our two discussions about the Read Paper "Particle Markov chain Monte Carlo" (Andrieu, Doucet, and Holenstein, 2010) presented on October 16th 2009 at the Royal Statistical Society, appearing in the Journal of the Royal Statistical Society Series B. We also present a more detailed version of the ABC extension.
We merge in this note our two discussions about the Read Paper "Particle Markov chain Monte Carlo" (Andrieu, Doucet, and Holenstein, 2010) presented on October 16th 2009 at the Royal Statistical Society, appearing in the Journal of the Royal Statistical Society Series B. We also present a more detailed version of the ABC extension.
△ Less
Submitted 19 November, 2009;
originally announced November 2009.
-
On sequential Monte Carlo, partial rejection control and approximate Bayesian computation
Authors:
G. W. Peters,
Y. Fan,
S. A. Sisson
Abstract:
We present a sequential Monte Carlo sampler variant of the partial rejection control algorithm, and show that this variant can be considered as a sequential Monte Carlo sampler with a modified mutation kernel. We prove that the new sampler can reduce the variance of the incremental importance weights when compared with standard sequential Monte Carlo samplers. We provide a study of theoretical p…
▽ More
We present a sequential Monte Carlo sampler variant of the partial rejection control algorithm, and show that this variant can be considered as a sequential Monte Carlo sampler with a modified mutation kernel. We prove that the new sampler can reduce the variance of the incremental importance weights when compared with standard sequential Monte Carlo samplers. We provide a study of theoretical properties of the new algorithm, and make connections with some existing algorithms. Finally, the sampler is adapted for application under the challenging "likelihood free," approximate Bayesian computation modelling framework, where we demonstrate superior performance over existing likelihood-free samplers.
△ Less
Submitted 11 November, 2009; v1 submitted 26 August, 2008;
originally announced August 2008.