-
General linear hypothesis testing in ill-conditioned functional response model
Authors:
Łukasz Smaga,
Natalia Stefańska
Abstract:
The paper concerns inference in the ill-conditioned functional response model, which is a part of functional data analysis. In this regression model, the functional response is modeled using several independent scalar variables. To verify linear hypotheses, we develop new test statistics by aggregating pointwise statistics using either the integral or the supremum. The new tests are scale-invariant, in contrast to the existing ones. To construct the tests, we use different bootstrap methods. The performance of the new tests is compared with that of known tests through a simulation study and an application to a real data example.
Submitted 4 October, 2024;
originally announced October 2024.
-
Forensically useful mid-term and short-term temperature reconstruction for quasi-indoor death scenes
Authors:
Jędrzej Wydra,
Łukasz Smaga,
Szymon Matuszewski
Abstract:
Accurate reconstruction of ambient temperature at death scenes is crucial for estimating the postmortem interval (PMI) in forensic science. Typically, this is done by correcting weather station temperatures using measurements from the scene, often through linear regression. While recent attempts to use alternative algorithms like GAM have improved accuracy, they usually require additional variables such as humidity, making them impractical. This study presents two methods for accurate temperature reconstruction using only temperature data. The first, a concurrent regression model, is known in mathematics and is applied here for mid-term reconstructions (several days of measurements). The second, a new method based on Fourier expansion, is designed for short-term reconstructions (only a few hours of measurements). Both models were tested in quasi-indoor conditions, using data from six different environments. The concurrent regression model provided nearly perfect reconstructions for periods longer than six days, while the short-term model achieved similar accuracy after just 4-5 hours of measurements. These findings demonstrate that reliable temperature corrections for PMI estimation can be made with significantly reduced measurement periods, enhancing the practicality of the method in forensic applications.
Submitted 14 September, 2024;
originally announced September 2024.
-
Multiple Comparison Procedures for Simultaneous Inference in Functional MANOVA
Authors:
Merle Munko,
Marc Ditzhaus,
Markus Pauly,
Łukasz Smaga
Abstract:
Functional data analysis is becoming increasingly popular for studying data from real-valued random functions. Nevertheless, there is a lack of multiple testing procedures for such data. These are particularly important in factorial designs to compare different groups or to infer factor effects. We propose a new class of testing procedures for arbitrary linear hypotheses in general factorial designs with functional data. Our methods allow global as well as multiple inference of both univariate and multivariate mean functions without assuming particular error distributions or homoscedasticity. That is, we allow for different structures of the covariance functions between groups. To this end, we use point-wise quadratic-form-type test functions that take potential heteroscedasticity into account. Taking the supremum over each test function, we define a class of local test statistics. We analyse their (joint) asymptotic behaviour and propose a resampling approach to approximate the limit distributions. The resulting global and multiple testing procedures are asymptotically valid under weak conditions and applicable in general functional MANOVA settings. We evaluate their small-sample performance in extensive simulations and finally illustrate their applicability by analysing a multivariate functional air pollution data set.
Submitted 3 June, 2024;
originally announced June 2024.
-
General multiple tests for functional data
Authors:
Merle Munko,
Marc Ditzhaus,
Markus Pauly,
Łukasz Smaga,
Jin-Ting Zhang
Abstract:
While there exist several inferential methods for analyzing functional data in factorial designs, there is a lack of statistical tests that (i) are valid in general designs, (ii) are valid under non-restrictive assumptions on the data generating process and (iii) allow for coherent post-hoc analyses. In particular, most existing methods assume Gaussianity or equal covariance functions across groups (homoscedasticity) and are only applicable for specific study designs that do not allow for evaluation of interactions. Moreover, all available strategies are only designed for testing global hypotheses and do not directly allow a more in-depth analysis of multiple local hypotheses. To address the first two problems (i)-(ii), we propose flexible integral-type test statistics that are applicable in general factorial designs under minimal assumptions on the data generating process. In particular, we postulate neither homoscedasticity nor Gaussianity. To approximate the statistics' null distribution, we adopt a resampling approach and validate it methodologically. Finally, we use our flexible testing framework to (iii) infer several local null hypotheses simultaneously. To allow for powerful data analysis, we thereby take the complex dependencies of the different local test statistics into account. In extensive simulations we confirm that the new methods are flexibly applicable. Two illustrative data analyses complete our study. The new testing procedures are implemented in the R package multiFANOVA, which will be available on CRAN soon.
Submitted 27 June, 2023;
originally announced June 2023.
-
Functional repeated measures analysis of variance and its application
Authors:
Katarzyna Kuryło,
Łukasz Smaga
Abstract:
This paper is motivated by medical studies in which the same patients with multiple sclerosis are examined at several successive visits and described by fractional anisotropy tract profiles, which can be represented as functions. Since the observations for each patient are dependent random processes, they follow a repeated measures design for functional data. To compare the results for different visits, we thus consider functional repeated measures analysis of variance. For this purpose, a pointwise test statistic is constructed by adapting the classical test statistic for one-way repeated measures analysis of variance to the functional data framework. By integrating and taking the supremum of the pointwise test statistic, we create two global test statistics. Apart from verifying the general null hypothesis on the equality of mean functions corresponding to different objects, we also propose a simple method for post hoc analysis. We illustrate the finite sample properties of permutation and bootstrap testing procedures in an extensive simulation study. Finally, we analyze a motivating real data example in detail.
Submitted 6 June, 2023;
originally announced June 2023.
-
Inference for all variants of the multivariate coefficient of variation in factorial designs
Authors:
Marc Ditzhaus,
Łukasz Smaga
Abstract:
The multivariate coefficient of variation (MCV) is an attractive and easy-to-interpret effect size for the dispersion in multivariate data. Recently, the first inference methods for the MCV were proposed by Ditzhaus and Smaga (2022) for general factorial designs covering not only k-sample settings but also complex higher-way layouts. However, two questions are still pending: (1) The theory on inference methods for the MCV is primarily derived for one special MCV variant while there are several reasonable proposals. (2) When rejecting a global null hypothesis in factorial designs, a more in-depth analysis is typically of high interest to find the specific contrasts of the MCV leading to the aforementioned rejection. In this paper, we tackle both by, first, extending the aforementioned nonparametric permutation procedure to the other MCV variants and, second, proposing a max-type test for post hoc analysis. To improve the small sample performance of the latter, we suggest a novel studentized bootstrap strategy and prove its asymptotic validity. The actual performance of all proposed tests and post hoc procedures is compared in an extensive simulation study and illustrated by a real data analysis.
Submitted 27 January, 2023;
originally announced January 2023.
-
Permutation test for the multivariate coefficient of variation in factorial designs
Authors:
Marc Ditzhaus,
Łukasz Smaga
Abstract:
New inference methods for the multivariate coefficient of variation and its reciprocal, the standardized mean, are presented. While there are various testing procedures for both parameters in the univariate case, it is less well known how to do inference in the multivariate setting appropriately. There are some existing procedures, but they rely on restrictive assumptions on the underlying distributions. We tackle this problem by applying Wald-type statistics in the context of general, potentially heteroscedastic factorial designs. In addition to the $k$-sample case, higher-way layouts can be incorporated into this framework, allowing the discussion of main and interaction effects. The resulting procedures are shown to be asymptotically valid under the null hypothesis and consistent under general alternatives. To improve the finite sample performance, we suggest permutation versions of the tests and show that the tests' asymptotic properties can be transferred to them. An exhaustive simulation study compares the new tests, their permutation counterparts and existing methods. To further analyse the differences between the tests, we conduct two illustrative real data examples.
Submitted 30 March, 2020;
originally announced March 2020.