-
Cluster-based dual evolution for multivariate time series: analyzing COVID-19
Authors:
Nick James,
Max Menzies
Abstract:
This paper proposes a cluster-based method to analyze the evolution of multivariate time series and applies this to the COVID-19 pandemic. On each day, we partition countries into clusters according to both their case and death counts. The total number of clusters and individual countries' cluster memberships are algorithmically determined. We study the change in both quantities over time, demonst…
▽ More
This paper proposes a cluster-based method to analyze the evolution of multivariate time series and applies this to the COVID-19 pandemic. On each day, we partition countries into clusters according to both their case and death counts. The total number of clusters and individual countries' cluster memberships are algorithmically determined. We study the change in both quantities over time, demonstrating a close similarity in the evolution of cases and deaths. The changing number of clusters of the case counts precedes that of the death counts by 32 days. On the other hand, there is an optimal offset of 16 days with respect to the greatest consistency between cluster groupings, determined by a new method of comparing affinity matrices. With this offset in mind, we identify anomalous countries in the progression from COVID-19 cases to deaths. This analysis can aid in highlighting the most and least significant public policies in minimizing a country's COVID-19 mortality rate.
△ Less
Submitted 14 June, 2020; v1 submitted 5 May, 2020;
originally announced May 2020.
-
Equivalence relations and $L^p$ distances between time series with application to the Black Summer Australian bushfires
Authors:
Nick James,
Max Menzies
Abstract:
This paper introduces a new framework of algebraic equivalence relations between time series and new distance metrics between them, then applies these to investigate the Australian ``Black Summer'' bushfire season of 2019-2020. First, we introduce a general framework for defining equivalence between time series, heuristically intended to be equivalent if they differ only up to noise. Our first spe…
▽ More
This paper introduces a new framework of algebraic equivalence relations between time series and new distance metrics between them, then applies these to investigate the Australian ``Black Summer'' bushfire season of 2019-2020. First, we introduce a general framework for defining equivalence between time series, heuristically intended to be equivalent if they differ only up to noise. Our first specific implementation is based on using change point algorithms and comparing statistical quantities such as mean or variance in stationary segments. We thus derive the existence of such equivalence relations on the space of time series, such that the quotient spaces can be equipped with a metrizable topology. Next, we illustrate specifically how to define and compute such distances among a collection of time series and perform clustering and additional analysis thereon. Then, we apply these insights to analyze air quality data across New South Wales, Australia, during the 2019-2020 bushfires. There, we investigate structural similarity with respect to this data and identify locations that were impacted anonymously by the fires relative to their location. This may have implications regarding the appropriate management of resources to avoid gaps in the defense against future fires.
△ Less
Submitted 28 February, 2023; v1 submitted 6 February, 2020;
originally announced February 2020.
-
Changes to the extreme and erratic behaviour of cryptocurrencies during COVID-19
Authors:
Nick James,
Max Menzies,
Jennifer Chan
Abstract:
This paper introduces new methods for analysing the extreme and erratic behaviour of time series to evaluate the impact of COVID-19 on cryptocurrency market dynamics. Across 51 cryptocurrencies, we examine extreme behaviour through a study of distribution extremities, and erratic behaviour through structural breaks. First, we analyse the structure of the market as a whole and observe a reduction i…
▽ More
This paper introduces new methods for analysing the extreme and erratic behaviour of time series to evaluate the impact of COVID-19 on cryptocurrency market dynamics. Across 51 cryptocurrencies, we examine extreme behaviour through a study of distribution extremities, and erratic behaviour through structural breaks. First, we analyse the structure of the market as a whole and observe a reduction in self-similarity as a result of COVID-19, particularly with respect to structural breaks in variance. Second, we compare and contrast these two behaviours, and identify individual anomalous cryptocurrencies. Tether (USDT) and TrueUSD (TUSD) are consistent outliers with respect to their returns, while Holo (HOT), NEXO (NEXO), Maker (MKR) and NEM (XEM) are frequently observed as anomalous with respect to both behaviours and time. Even among a market known as consistently volatile, this identifies individual cryptocurrencies that behave most irregularly in their extreme and erratic behaviour and shows these were more affected during the COVID-19 market crisis.
△ Less
Submitted 29 November, 2020; v1 submitted 12 December, 2019;
originally announced December 2019.
-
Novel semi-metrics for multivariate change point analysis and anomaly detection
Authors:
Nick James,
Max Menzies,
Lamiae Azizi,
Jennifer Chan
Abstract:
This paper proposes a new method for determining similarity and anomalies between time series, most practically effective in large collections of (likely related) time series, by measuring distances between structural breaks within such a collection. We introduce a class of \emph{semi-metric} distance measures, which we term \emph{MJ distances}. These semi-metrics provide an advantage over existin…
▽ More
This paper proposes a new method for determining similarity and anomalies between time series, most practically effective in large collections of (likely related) time series, by measuring distances between structural breaks within such a collection. We introduce a class of \emph{semi-metric} distance measures, which we term \emph{MJ distances}. These semi-metrics provide an advantage over existing options such as the Hausdorff and Wasserstein metrics. We prove they have desirable properties, including better sensitivity to outliers, while experiments on simulated data demonstrate that they uncover similarity within collections of time series more effectively. Semi-metrics carry a potential disadvantage: without the triangle inequality, they may not satisfy a "transitivity property of closeness." We analyse this failure with proof and introduce an computational method to investigate, in which we demonstrate that our semi-metrics violate transitivity infrequently and mildly. Finally, we apply our methods to cryptocurrency and measles data, introducing a judicious application of eigenvalue analysis.
△ Less
Submitted 3 July, 2020; v1 submitted 3 November, 2019;
originally announced November 2019.
-
An immersed method based on cut-cells for the simulation of 2D incompressible fluid flows past solid structures
Authors:
François Bouchon,
Thierry Dubois,
Nicolas James
Abstract:
We present a cut-cell method for the simulation of 2D incompressible flows past obstacles. It consists in using the MAC scheme on cartesian grids and imposing Dirchlet boundary conditions for the velocity field on the boundary of solid structures following the Shortley-Weller formulation. In order to ensure local conservation properties, viscous and convecting terms are discretized in a finite vol…
▽ More
We present a cut-cell method for the simulation of 2D incompressible flows past obstacles. It consists in using the MAC scheme on cartesian grids and imposing Dirchlet boundary conditions for the velocity field on the boundary of solid structures following the Shortley-Weller formulation. In order to ensure local conservation properties, viscous and convecting terms are discretized in a finite volume way. The scheme is second order implicit in time for the linear part, the linear systems are solved by the use of the capacitance matrix method for non-moving obstacles. Numerical results of flows around an impulsively started circular cylinder are presented which confirm the efficiency of the method, for Reynolds numbers 1000 and 3000. An example of flows around a moving rigid body at Reynolds number 800 is also shown, a solver using the PETSc-Library has been prefered in this context to solve the linear systems.
△ Less
Submitted 24 January, 2019;
originally announced January 2019.
-
A transient Markov chain with finitely many cutpoints
Authors:
Nicholas James,
Russell Lyons,
Yuval Peres
Abstract:
We give an example of a transient reversible Markov chain that almost surely has only a finite number of cutpoints. We explain how this is relevant to a conjecture of Diaconis and Freedman and a question of Kaimanovich. We also answer Kaimanovich's question when the Markov chain is a nearest-neighbor random walk on a tree.
We give an example of a transient reversible Markov chain that almost surely has only a finite number of cutpoints. We explain how this is relevant to a conjecture of Diaconis and Freedman and a question of Kaimanovich. We also answer Kaimanovich's question when the Markov chain is a nearest-neighbor random walk on a tree.
△ Less
Submitted 19 May, 2008; v1 submitted 13 June, 2007;
originally announced June 2007.