-
One-sample location tests based on center-outward signs and ranks
Authors:
Daniel Hlubinka,
Šárka Hudecová
Abstract:
A multivariate one-sample location test based on the center-outward ranks and signs is considered, and two different testing procedures are proposed for centrally symmetric distributions. The first test is based on a random division of the data into two samples, while the second one uses a symmetrized sample. The asymptotic distributions of the proposed tests are provided. For univariate data, two…
▽ More
A multivariate one-sample location test based on the center-outward ranks and signs is considered, and two different testing procedures are proposed for centrally symmetric distributions. The first test is based on a random division of the data into two samples, while the second one uses a symmetrized sample. The asymptotic distributions of the proposed tests are provided. For univariate data, two variants of the symmetrized test statistic are shown to be equivalent to the standard sign and Wilcoxon test respectively. The small sample behavior of the proposed techniques is illustrated by a simulation study that also provides a power comparison for various transportation grids.
△ Less
Submitted 21 May, 2025;
originally announced May 2025.
-
Functional $K$ Sample Problem via Multivariate Optimal Measure Transport-Based Permutation Test
Authors:
Šárka Hudecová,
Daniel Hlubinka,
Zdeněk Hlávka
Abstract:
The null hypothesis of equality of distributions of functional data coming from $K$ samples is considered. The proposed test statistic is multivariate and its components are based on pairwise Cramér von Mises comparisons of empirical characteristic functionals. The significance of the test statistic is evaluated via the novel multivariate permutation test, where the final single $p$-value is compu…
▽ More
The null hypothesis of equality of distributions of functional data coming from $K$ samples is considered. The proposed test statistic is multivariate and its components are based on pairwise Cramér von Mises comparisons of empirical characteristic functionals. The significance of the test statistic is evaluated via the novel multivariate permutation test, where the final single $p$-value is computed using the discrete optimal measure transport. The methodology is illustrated by real data on cumulative intraday returns of Bitcoin.
△ Less
Submitted 24 April, 2025;
originally announced April 2025.
-
Fully distribution-free center-outward rank tests for multiple-output regression and MANOVA
Authors:
Marc Hallin,
Daniel Hlubinka,
Šárka Hudecová
Abstract:
Extending rank-based inference to a multivariate setting such as multiple-output regression or MANOVA with unspecified d-dimensional error density has remained an open problem for more than half a century. None of the many solutions proposed so far is enjoying the combination of distribution-freeness and efficiency that makes rank-based inference a successful tool in the univariate setting. A conc…
▽ More
Extending rank-based inference to a multivariate setting such as multiple-output regression or MANOVA with unspecified d-dimensional error density has remained an open problem for more than half a century. None of the many solutions proposed so far is enjoying the combination of distribution-freeness and efficiency that makes rank-based inference a successful tool in the univariate setting. A concept of center-outward multivariate ranks and signs based on measure transportation ideas has been introduced recently. Center-outward ranks and signs are not only distribution-free but achieve in dimension d > 1 the (essential) maximal ancillarity property of traditional univariate ranks, hence carry all the "distribution-free information" available in the sample. We derive here the Hájek representation and asymptotic normality results required in the construction of center-outward rank tests for multiple-output regression and MANOVA. When based on appropriate spherical scores, these fully distribution-free tests achieve parametric efficiency in the corresponding models.
△ Less
Submitted 20 December, 2021; v1 submitted 30 July, 2020;
originally announced July 2020.
-
Maximum pseudo-likelihood estimation based on estimated residuals in copula semiparametric models
Authors:
Marek Omelka,
Šárka Hudecová,
Natalie Neumeyer
Abstract:
This paper deals with a situation when one is interested in the dependence structure of a multidimensional response variable in the presence of a multivariate covariate. It is assumed that the covariate affects only the marginal distributions through regression models while the dependence structure, which is described by a copula, is unaffected. A parametric estimation of the copula function is co…
▽ More
This paper deals with a situation when one is interested in the dependence structure of a multidimensional response variable in the presence of a multivariate covariate. It is assumed that the covariate affects only the marginal distributions through regression models while the dependence structure, which is described by a copula, is unaffected. A parametric estimation of the copula function is considered with focus on the maximum pseudo-likelihood method. It is proved that under some appropriate regularity assumptions the estimator calculated from the residuals is asymptotically equivalent to the estimator based on the unobserved errors. In such case one can ignore the fact that the response is first adjusted for the effect of the covariate. A Monte Carlo simulation study explores (among others) situations where the regularity assumptions are not satisfied and the claimed result does not hold. It shows that in such situations the maximum pseudo-likelihood estimator may behave poorly and the moment estimation of the copula parameter is of interest. Our results complement the results available for nonparametric estimation of the copula function.
△ Less
Submitted 11 March, 2019;
originally announced March 2019.
-
A copula approach for dependence modeling in multivariate nonparametric time series
Authors:
Natalie Neumeyer,
Marek Omelka,
Sarka Hudecova
Abstract:
This paper is concerned with modeling the dependence structure of two (or more) time-series in the presence of a (possible multivariate) covariate which may include past values of the time series. We assume that the covariate influences only the conditional mean and the conditional variance of each of the time series but the distribution of the standardized innovations is not influenced by the cov…
▽ More
This paper is concerned with modeling the dependence structure of two (or more) time-series in the presence of a (possible multivariate) covariate which may include past values of the time series. We assume that the covariate influences only the conditional mean and the conditional variance of each of the time series but the distribution of the standardized innovations is not influenced by the covariate and is stable in time. The joint distribution of the time series is then determined by the conditional means, the conditional variances and the marginal distributions of the innovations, which we estimate nonparametrically, and the copula of the innovations, which represents the dependency structure. We consider a nonparametric as well as a semiparametric estimator based on the estimated residuals. We show that under suitable assumptions these copula estimators are asymptotically equivalent to estimators that would be based on the unobserved innovations. The theoretical results are illustrated by simulations and a real data example.
△ Less
Submitted 10 December, 2018; v1 submitted 22 May, 2017;
originally announced May 2017.
-
Tests for Time Series of Counts Based on the Probability Generating Function
Authors:
Šárka Hudecová,
Marie Hušková,
Simos G. Meintanis
Abstract:
We propose testing procedures for the hypothesis that a given set of discrete observations may be formulated as a particular time series of counts with a specific conditional law. The new test statistics incorporate the empirical probability generating function computed from the observations. Special emphasis is given to the popular models of integer autoregression and Poisson autoregression. The…
▽ More
We propose testing procedures for the hypothesis that a given set of discrete observations may be formulated as a particular time series of counts with a specific conditional law. The new test statistics incorporate the empirical probability generating function computed from the observations. Special emphasis is given to the popular models of integer autoregression and Poisson autoregression. The asymptotic properties of the proposed test statistics are studied under the null hypothesis as well as under alternatives. A Monte Carlo power study on bootstrap versions of the new methods is included as well as real-data examples.
△ Less
Submitted 22 October, 2014;
originally announced October 2014.
-
Modeling Dependencies in Claims Reserving with GEE
Authors:
Šárka Hudecová,
Michal Pešta
Abstract:
A common approach to the claims reserving problem is based on generalized linear models (GLM). Within this framework, the claims in different origin and development years are assumed to be independent variables. If this assumption is violated, the classical techniques may provide incorrect predictions of the claims reserves or even misleading estimates of the prediction error.
In this article, t…
▽ More
A common approach to the claims reserving problem is based on generalized linear models (GLM). Within this framework, the claims in different origin and development years are assumed to be independent variables. If this assumption is violated, the classical techniques may provide incorrect predictions of the claims reserves or even misleading estimates of the prediction error.
In this article, the application of generalized estimating equations (GEE) for estimation of the claims reserves is shown. Claim triangles are handled as panel data, where claim amounts within the same accident year are dependent. Since the GEE allow to incorporate dependencies, various correlation structures are introduced and some practical recommendations are given.
Model selection criteria within the GEE reserving method are proposed. Moreover, an estimate for the mean square error of prediction for the claims reserves is derived in a nonstandard way and its advantages are discussed. Real data examples are provided as an illustration of the potential benefits of the presented approach.
△ Less
Submitted 17 June, 2013;
originally announced June 2013.