-
A D-vine copula based model for repeated measurements extending linear mixed models with homogeneous correlation structure
Authors:
Matthias Killiches,
Claudia Czado
Abstract:
We propose a model for unbalanced longitudinal data, where the univariate margins can be selected arbitrarily and the dependence structure is described with the help of a D-vine copula. We show that our approach is an extremely flexible extension of the widely used linear mixed model if the correlation is homogeneous over the considered individuals. As an alternative to joint maximum-likelihood a…
▽ More
We propose a model for unbalanced longitudinal data, where the univariate margins can be selected arbitrarily and the dependence structure is described with the help of a D-vine copula. We show that our approach is an extremely flexible extension of the widely used linear mixed model if the correlation is homogeneous over the considered individuals. As an alternative to joint maximum-likelihood a sequential estimation approach for the D-vine copula is provided and validated in a simulation study. The model can handle missing values without being forced to discard data. Since conditional distributions are known analytically, we easily make predictions for future events. For model selection we adjust the Bayesian information criterion to our situation. In an application to heart surgery data our model performs clearly better than competing linear mixed models.
△ Less
Submitted 17 May, 2017;
originally announced May 2017.
-
Using model distances to investigate the simplifying assumption, model selection and truncation levels for vine copulas
Authors:
Matthias Killiches,
Daniel Kraus,
Claudia Czado
Abstract:
Vine copulas are a useful statistical tool to describe the dependence structure between several random variables, especially when the number of variables is very large. When modeling data with vine copulas, one often is confronted with a set of candidate models out of which the best one is supposed to be selected. For example, this may arise in the context of non-simplified vine copulas, truncatio…
▽ More
Vine copulas are a useful statistical tool to describe the dependence structure between several random variables, especially when the number of variables is very large. When modeling data with vine copulas, one often is confronted with a set of candidate models out of which the best one is supposed to be selected. For example, this may arise in the context of non-simplified vine copulas, truncations of vines and other simplifications regarding pair-copula families or the vine structure. With the help of distance measures we develop a parametric bootstrap based testing procedure to decide between copulas from nested model classes. In addition we use distance measures to select among different candidate models. All commonly used distance measures, e.g. the Kullback-Leibler distance, suffer from the curse of dimensionality due to high-dimensional integrals. As a remedy for this problem, Killiches, Kraus and Czado (2017) propose several modifications of the Kullback-Leibler distance. We apply these distance measures to the above mentioned model selection problems and substantiate their usefulness.
△ Less
Submitted 9 May, 2017; v1 submitted 27 October, 2016;
originally announced October 2016.
-
Vine copula based likelihood estimation of dependence patterns in multivariate event time data
Authors:
Nicole Barthel,
Candida Geerdens,
Matthias Killiches,
Paul Janssen,
Claudia Czado
Abstract:
In many studies multivariate event time data are generated from clusters having a possibly complex association pattern. Flexible models are needed to capture this dependence. Vine copulas serve this purpose. Inference methods for vine copulas are available for complete data. Event time data, however, are often subject to right-censoring. As a consequence, the existing inferential tools, e.g. likel…
▽ More
In many studies multivariate event time data are generated from clusters having a possibly complex association pattern. Flexible models are needed to capture this dependence. Vine copulas serve this purpose. Inference methods for vine copulas are available for complete data. Event time data, however, are often subject to right-censoring. As a consequence, the existing inferential tools, e.g. likelihood estimation, need to be adapted. A two-stage estimation approach is proposed. First, the marginal distributions are modeled. Second, the dependence structure modeled by a vine copula is estimated via likelihood maximization. Due to the right-censoring single and double integrals show up in the copula likelihood expression such that numerical integration is needed for its evaluation. For the dependence modeling a sequential estimation approach that facilitates the computational challenges of the likelihood optimization is provided. A three-dimensional simulation study provides evidence for the good finite sample performance of the proposed method. Using four-dimensional mastitis data, it is shown how an appropriate vine copula model can be selected for data at hand.
△ Less
Submitted 22 July, 2017; v1 submitted 4 March, 2016;
originally announced March 2016.
-
Examination and visualisation of the simplifying assumption for vine copulas in three dimensions
Authors:
Matthias Killiches,
Daniel Kraus,
Claudia Czado
Abstract:
Vine copulas are a highly flexible class of dependence models, which are based on the decomposition of the density into bivariate building blocks. For applications one usually makes the simplifying assumption that copulas of conditional distributions are independent of the variables on which they are conditioned. However this assumption has been criticised for being too restrictive. We examine bot…
▽ More
Vine copulas are a highly flexible class of dependence models, which are based on the decomposition of the density into bivariate building blocks. For applications one usually makes the simplifying assumption that copulas of conditional distributions are independent of the variables on which they are conditioned. However this assumption has been criticised for being too restrictive. We examine both simplified and non-simplified vine copulas in three dimensions and investigate conceptual differences. We show and compare contour surfaces of three-dimensional vine copula models, which prove to be much more informative than the contour lines of the bivariate marginals. Our investigation shows that non-simplified vine copulas can exhibit arbitrarily irregular shapes, whereas simplified vine copulas appear to be smooth extrapolations of their bivariate margins to three dimensions. In addition to a variety of constructed examples, we also investigate a three-dimensional subset of the well-known uranium data set and visually detect that a non-simplified vine copula is necessary to capture its complex dependence structure.
△ Less
Submitted 28 October, 2016; v1 submitted 18 February, 2016;
originally announced February 2016.
-
Model distances for vine copulas in high dimensions
Authors:
Matthias Killiches,
Daniel Kraus,
Claudia Czado
Abstract:
Vine copulas are a flexible class of dependence models consisting of bivariate building blocks and have proven to be particularly useful in high dimensions. Classical model distance measures require multivariate integration and thus suffer from the curse of dimensionality. In this paper we provide numerically tractable methods to measure the distance between two vine copulas even in high dimension…
▽ More
Vine copulas are a flexible class of dependence models consisting of bivariate building blocks and have proven to be particularly useful in high dimensions. Classical model distance measures require multivariate integration and thus suffer from the curse of dimensionality. In this paper we provide numerically tractable methods to measure the distance between two vine copulas even in high dimensions. For this purpose, we consecutively develop three new distance measures based on the Kullback-Leibler distance, using the result that it can be expressed as the sum over expectations of KL distances between univariate conditional densities, which can be easily obtained for vine copulas. To reduce numerical calculations we approximate these expectations on adequately designed grids, outperforming Monte Carlo-integration with respect to computational time. In numerous examples and applications we illustrate the strengths and weaknesses of the developed distance measures.
△ Less
Submitted 21 April, 2016; v1 submitted 13 October, 2015;
originally announced October 2015.
-
Block-Maxima of Vines
Authors:
Matthias Killiches,
Claudia Czado
Abstract:
We examine the dependence structure of finite block-maxima of multivariate distributions. We provide a closed form expression for the copula density of the vector of the block-maxima. Further, we show how partial derivatives of three-dimensional vine copulas can be obtained by only one-dimensional integration. Combining these results allows the numerical treatment of the block-maxima of any three-…
▽ More
We examine the dependence structure of finite block-maxima of multivariate distributions. We provide a closed form expression for the copula density of the vector of the block-maxima. Further, we show how partial derivatives of three-dimensional vine copulas can be obtained by only one-dimensional integration. Combining these results allows the numerical treatment of the block-maxima of any three-dimensional vine copula for finite block-sizes. We look at certain vine copula specifications and examine how the density of the block-maxima behaves for different block-sizes. Additionally, a real data example from hydrology is considered. In extreme-value theory for multivariate normal distributions, a certain scaling of each variable and the correlation matrix is necessary to obtain a non-trivial limiting distribution when the block-size goes to infinity. This scaling is applied to different three-dimensional vine copula specifications.
△ Less
Submitted 11 April, 2015;
originally announced April 2015.