-
Data-adaptive structural change-point detection via isolation
Authors:
Andreas Anastasiou,
Sophia Loizidou
Abstract:
In this paper, a new data-adaptive method, called DAIS (Data Adaptive ISolation), is introduced for the estimation of the number and the location of change-points in a given data sequence. The proposed method can detect changes in various different signal structures; we focus on the examples of piecewise-constant and continuous, piecewise-linear signals. The novelty of the proposed algorithm comes…
▽ More
In this paper, a new data-adaptive method, called DAIS (Data Adaptive ISolation), is introduced for the estimation of the number and the location of change-points in a given data sequence. The proposed method can detect changes in various different signal structures; we focus on the examples of piecewise-constant and continuous, piecewise-linear signals. The novelty of the proposed algorithm comes from the data-adaptive nature of the methodology. At each step, and for the data under consideration, we search for the most prominent change-point in a targeted neighborhood of the data sequence that contains this change-point with high probability. Using a suitably chosen contrast function, the change-point will then get detected after being isolated in an interval. The isolation feature enhances estimation accuracy, while the data-adaptive nature of DAIS is advantageous regarding, mainly, computational complexity. The methodology can be applied to both univariate and multivariate signals. The simulation results presented indicate that DAIS is at least as accurate as state-of-the-art competitors and in many cases significantly less computationally expensive.
△ Less
Submitted 23 September, 2024; v1 submitted 30 April, 2024;
originally announced April 2024.
-
Generalized multiple change-point detection in the structure of multivariate, possibly high-dimensional, data sequences
Authors:
Andreas Anastasiou,
Angelos Papanastasiou
Abstract:
The extensive emergence of big data techniques has led to an increasing interest in the development of change-point detection algorithms that can perform well in a multivariate, possibly high-dimensional setting. In the current paper, we propose a new method for the consistent estimation of the number and location of multiple generalized change-points in multivariate, possibly high-dimensional, no…
▽ More
The extensive emergence of big data techniques has led to an increasing interest in the development of change-point detection algorithms that can perform well in a multivariate, possibly high-dimensional setting. In the current paper, we propose a new method for the consistent estimation of the number and location of multiple generalized change-points in multivariate, possibly high-dimensional, noisy data sequences. The number of change-points is allowed to increase with the sample size and the dimensionality of the given data sequence. Having a number of univariate signals, which constitute the unknown multivariate signal, our algorithm can deal with general structural changes; we focus on changes in the mean vector of a multivariate piecewise-constant signal, as well as changes in the linear trend of any of the univariate component signals. Our proposed algorithm, labeled Multivariate Isolate-Detect (MID), allows for consistent change-point detection in the presence of frequent changes of possibly small magnitudes in a computationally fast way.
△ Less
Submitted 13 November, 2022;
originally announced November 2022.
-
Detecting change-points in noisy GPS time series with continuous piecewise structures
Authors:
Yiming Ma,
Andreas Anastasiou,
Ting Wang,
Fabien Montiel
Abstract:
Detecting change-points in noisy data sequences with an underlying continuous piecewise structure is a challenging problem, especially when prior knowledge of the exact nature of the structural changes is unknown. One important application is the automatic detection of slow slip events (SSEs), a type of slow earthquakes, in GPS measurements of ground deformation. We propose a new method based on S…
▽ More
Detecting change-points in noisy data sequences with an underlying continuous piecewise structure is a challenging problem, especially when prior knowledge of the exact nature of the structural changes is unknown. One important application is the automatic detection of slow slip events (SSEs), a type of slow earthquakes, in GPS measurements of ground deformation. We propose a new method based on Singular Spectrum Analysis to obscure the deviation from the piecewise-linear structure, allowing us to apply Isolate-Detect to detect change-points in SSE data with piecewise-non-linear structures. We demonstrate its effectiveness in both simulated and real SSE data.
△ Less
Submitted 24 February, 2022;
originally announced February 2022.
-
Stein's Method Meets Computational Statistics: A Review of Some Recent Developments
Authors:
Andreas Anastasiou,
Alessandro Barp,
François-Xavier Briol,
Bruno Ebner,
Robert E. Gaunt,
Fatemeh Ghaderinezhad,
Jackson Gorham,
Arthur Gretton,
Christophe Ley,
Qiang Liu,
Lester Mackey,
Chris. J. Oates,
Gesine Reinert,
Yvik Swan
Abstract:
Stein's method compares probability distributions through the study of a class of linear operators called Stein operators. While mainly studied in probability and used to underpin theoretical statistics, Stein's method has led to significant advances in computational statistics in recent years. The goal of this survey is to bring together some of these recent developments and, in doing so, to stim…
▽ More
Stein's method compares probability distributions through the study of a class of linear operators called Stein operators. While mainly studied in probability and used to underpin theoretical statistics, Stein's method has led to significant advances in computational statistics in recent years. The goal of this survey is to bring together some of these recent developments and, in doing so, to stimulate further research into the successful field of Stein's method and statistics. The topics we discuss include tools to benchmark and compare sampling methods such as approximate Markov chain Monte Carlo, deterministic alternatives to sampling methods, control variate techniques, parameter estimation and goodness-of-fit testing.
△ Less
Submitted 22 June, 2022; v1 submitted 7 May, 2021;
originally announced May 2021.
-
Modeling of Covid-19 Pandemic in Cyprus
Authors:
Sergios Agapiou,
Andreas Anastasiou,
Anastassia Baxevani,
Tasos Christofides,
Elisavet Constantinou,
Georgios Hadjigeorgiou,
Christos Nicolaides,
Georgios Nikolopoulos,
Konstantinos Fokianos
Abstract:
The Republic of Cyprus is a small island in the southeast of Europe and member of the European Union. The first wave of COVID-19 in Cyprus started in early March, 2020 (imported cases) and peaked in late March-early April. The health authorities responded rapidly and rigorously to the COVID-19 pandemic by scaling-up testing, increasing efforts to trace and isolate contacts of cases, and implementi…
▽ More
The Republic of Cyprus is a small island in the southeast of Europe and member of the European Union. The first wave of COVID-19 in Cyprus started in early March, 2020 (imported cases) and peaked in late March-early April. The health authorities responded rapidly and rigorously to the COVID-19 pandemic by scaling-up testing, increasing efforts to trace and isolate contacts of cases, and implementing measures such as closures of educational institutions, and travel and movement restrictions. The pandemic was also a unique opportunity that brought together experts from various disciplines including epidemiologists, clinicians, mathematicians, and statisticians. The aim of this paper is to present the efforts of this new, multidisciplinary research team in modelling the COVID-19 pandemic in the Republic of Cyprus.
△ Less
Submitted 5 October, 2020;
originally announced October 2020.
-
Normal Approximation for Stochastic Gradient Descent via Non-Asymptotic Rates of Martingale CLT
Authors:
Andreas Anastasiou,
Krishnakumar Balasubramanian,
Murat A. Erdogdu
Abstract:
We provide non-asymptotic convergence rates of the Polyak-Ruppert averaged stochastic gradient descent (SGD) to a normal random vector for a class of twice-differentiable test functions. A crucial intermediate step is proving a non-asymptotic martingale central limit theorem (CLT), i.e., establishing the rates of convergence of a multivariate martingale difference sequence to a normal random vecto…
▽ More
We provide non-asymptotic convergence rates of the Polyak-Ruppert averaged stochastic gradient descent (SGD) to a normal random vector for a class of twice-differentiable test functions. A crucial intermediate step is proving a non-asymptotic martingale central limit theorem (CLT), i.e., establishing the rates of convergence of a multivariate martingale difference sequence to a normal random vector, which might be of independent interest. We obtain the explicit rates for the multivariate martingale CLT using a combination of Stein's method and Lindeberg's argument, which is then used in conjunction with a non-asymptotic analysis of averaged SGD proposed in [PJ92]. Our results have potentially interesting consequences for computing confidence intervals for parameter estimation with SGD and constructing hypothesis tests with SGD that are valid in a non-asymptotic sense.
△ Less
Submitted 3 April, 2019;
originally announced April 2019.
-
Detecting multiple generalized change-points by isolating single ones
Authors:
Andreas Anastasiou,
Piotr Fryzlewicz
Abstract:
We introduce a new approach, called Isolate-Detect (ID), for the consistent estimation of the number and location of multiple generalized change-points in noisy data sequences. Examples of signal changes that ID can deal with are changes in the mean of a piecewise-constant signal and changes, continuous or not, in the linear trend. The number of change-points can increase with the sample size. Our…
▽ More
We introduce a new approach, called Isolate-Detect (ID), for the consistent estimation of the number and location of multiple generalized change-points in noisy data sequences. Examples of signal changes that ID can deal with are changes in the mean of a piecewise-constant signal and changes, continuous or not, in the linear trend. The number of change-points can increase with the sample size. Our method is based on an isolation technique, which prevents the consideration of intervals that contain more than one change-point. This isolation enhances ID's accuracy as it allows for detection in the presence of frequent changes of possibly small magnitudes. In ID, model selection is carried out via thresholding, or an information criterion, or SDLL, or a hybrid involving the former two. The hybrid model selection leads to a general method with very good practical performance and minimal parameter choice. In the scenarios tested, ID is at least as accurate as the state-of-the-art methods; most of the times it outperforms them. ID is implemented in the R packages IDetect and breakfast, available from CRAN.
△ Less
Submitted 1 June, 2021; v1 submitted 30 January, 2019;
originally announced January 2019.