-
Stylized Facts of High-Frequency Bitcoin Time Series
Authors:
Yaoyue Tang,
Karina Arias-Calluari,
M. N. Najafi,
Michael S. Harré,
Fernando Alonso-Marroquin
Abstract:
This paper analyses the high-frequency intraday Bitcoin dataset from 2019 to 2022. During this time frame, the Bitcoin market index exhibited two distinct periods, 2019-20 and 2021-22, characterized by an abrupt change in volatility. The Bitcoin price returns for both periods can be described by an anomalous diffusion process, transitioning from subdiffusion for short intervals to weak superdiffus…
▽ More
This paper analyses the high-frequency intraday Bitcoin dataset from 2019 to 2022. During this time frame, the Bitcoin market index exhibited two distinct periods, 2019-20 and 2021-22, characterized by an abrupt change in volatility. The Bitcoin price returns for both periods can be described by an anomalous diffusion process, transitioning from subdiffusion for short intervals to weak superdiffusion over longer time intervals. The characteristic features related to this anomalous behavior studied in the present paper include heavy tails, which can be described using a $q$-Gaussian distribution and correlations. When we sample the autocorrelation of absolute returns, we observe a power-law relationship, indicating time dependence in both periods initially. The ensemble autocorrelation of the returns decays rapidly. We fitted the autocorrelation with a power law to capture the decay and found that the second period experienced a slightly higher decay rate. The further study involves the analysis of endogenous effects within the Bitcoin time series, which are examined through detrending analysis. We found that both periods are multifractal and present self-similarity in the detrended probability density function (PDF). The Hurst exponent over short time intervals shifts from less than 0.5 ($\sim$ 0.42) in Period 1 to closer to 0.5 in Period 2 ($\sim$ 0.49), indicating that the market has gained efficiency over time.
△ Less
Submitted 22 June, 2025; v1 submitted 19 February, 2024;
originally announced February 2024.
-
Error-Robust Multi-View Clustering: Progress, Challenges and Opportunities
Authors:
Mehrnaz Najafi,
Lifang He,
Philip S. Yu
Abstract:
With recent advances in data collection from multiple sources, multi-view data has received significant attention. In multi-view data, each view represents a different perspective of data. Since label information is often expensive to acquire, multi-view clustering has gained growing interest, which aims to obtain better clustering solution by exploiting complementary and consistent information ac…
▽ More
With recent advances in data collection from multiple sources, multi-view data has received significant attention. In multi-view data, each view represents a different perspective of data. Since label information is often expensive to acquire, multi-view clustering has gained growing interest, which aims to obtain better clustering solution by exploiting complementary and consistent information across all views rather than only using an individual view. Due to inevitable sensor failures, data in each view may contain error. Error often exhibits as noise or feature-specific corruptions or outliers. Multi-view data may contain any or combination of these error types. Blindly clustering multi-view data i.e., without considering possible error in view(s) could significantly degrade the performance. The goal of error-robust multi-view clustering is to obtain useful outcome even if the multi-view data is corrupted. Existing error-robust multi-view clustering approaches with explicit error removal formulation can be structured into five broad research categories - sparsity norm based approaches, graph based methods, subspace based learning approaches, deep learning based methods and hybrid approaches, this survey summarizes and reviews recent advances in error-robust clustering for multi-view data. Finally, we highlight the challenges and provide future research opportunities.
△ Less
Submitted 7 May, 2021;
originally announced May 2021.
-
Persistent Homology of Fractional Gaussian Noise
Authors:
H. Masoomy,
B. Askari,
M. N. Najafi,
S. M. S. Movahed
Abstract:
In this paper, we employ the persistent homology (PH) technique to examine the topological properties of fractional Gaussian noise (fGn). We develop the weighted natural visibility graph algorithm, and the associated simplicial complexes through the filtration process are quantified by PH. The evolution of the homology group dimension represented by Betti numbers demonstrates a strong dependency o…
▽ More
In this paper, we employ the persistent homology (PH) technique to examine the topological properties of fractional Gaussian noise (fGn). We develop the weighted natural visibility graph algorithm, and the associated simplicial complexes through the filtration process are quantified by PH. The evolution of the homology group dimension represented by Betti numbers demonstrates a strong dependency on the Hurst exponent ($H$). The coefficients of the birth and death curve of the $k$-dimensional topological holes ($k$-holes) at a given threshold depend on $H$ which is almost not affected by finite sample size. We show that the distribution function of a lifetime for $k$-holes decays exponentially and the corresponding slope is an increasing function versus $H$, and more interestingly, the sample size effect completely disappears in this quantity. The persistence entropy logarithmically grows with the size of the visibility graph of a system with almost $H$-dependent prefactors. On the contrary, the local statistical features are not able to determine the corresponding Hurst exponent of fGn data, while the moments of eigenvalue distribution ($M_{n}$) for $n\ge1$ reveal a dependency on $H$, containing the sample size effect. Finally, the PH shows the correlated behavior of electroencephalography for both healthy and schizophrenic samples.
△ Less
Submitted 16 June, 2022; v1 submitted 9 January, 2021;
originally announced January 2021.
-
Prediction of Sewer Pipe Deterioration Using Random Forest Classification
Authors:
Razieh Tavakoli,
Ali Sharifara,
Mohammad Najafi
Abstract:
Wastewater infrastructure systems deteriorate over time due to a combination of physical and chemical factors. Failure of this significant infrastructure could affect important social, environmental, and economic impacts. Furthermore, recognizing the optimized timeline for inspection of sewer pipelines are challenging tasks for the utility managers and other authorities. Regular examination of sew…
▽ More
Wastewater infrastructure systems deteriorate over time due to a combination of physical and chemical factors. Failure of this significant infrastructure could affect important social, environmental, and economic impacts. Furthermore, recognizing the optimized timeline for inspection of sewer pipelines are challenging tasks for the utility managers and other authorities. Regular examination of sewer networks is not cost-effective due to limited time and high cost of assessment technologies and a large inventory of pipes. To avoid such obstacles, various researchers endeavored to improve infrastructure condition assessment methodologies to maintain sewer pipe systems at the desired condition. Sewer condition prediction models are developed to provide a framework to forecast the future condition of pipes to schedule inspection frequencies. The main goal of this study is to develop a predictive model for wastewater pipes using random forest classification. Predictive models can effectively predict sewer pipe condition and can increase the certainty level of the predictive results and decrease uncertainty in the current condition of wastewater pipes. The developed random forest classification model has achieved a stratified test set false negative rate, the false positive rate, and an excellent area under the ROC curve of 0.81 in a case study application for the City of LA, California. An area under the ROC curve > 0.80 indicates the developed model is an "excellent" choice for predicting the condition of individual pipes in a sewer network. The deterioration models can be used in the industry to improve the inspection timeline and maintenance planning.
△ Less
Submitted 9 December, 2019;
originally announced December 2019.
-
Artificial Neural Networks and Adaptive Neuro-fuzzy Models for Prediction of Remaining Useful Life
Authors:
Razieh Tavakoli,
Mohammad Najafi,
Ali Sharifara
Abstract:
The U.S. water distribution system contains thousands of miles of pipes constructed from different materials, and of various sizes, and age. These pipes suffer from physical, environmental, structural and operational stresses, causing deterioration which eventually leads to their failure. Pipe deterioration results in increased break rates, reduced hydraulic capacity, and detrimental impacts on wa…
▽ More
The U.S. water distribution system contains thousands of miles of pipes constructed from different materials, and of various sizes, and age. These pipes suffer from physical, environmental, structural and operational stresses, causing deterioration which eventually leads to their failure. Pipe deterioration results in increased break rates, reduced hydraulic capacity, and detrimental impacts on water quality. Therefore, it is crucial to use accurate models to forecast deterioration rates along with estimating the remaining useful life of the pipes to implement essential interference plans in order to prevent catastrophic failures. This paper discusses a computational model that forecasts the RUL of water pipes by applying Artificial Neural Networks (ANNs) as well as Adaptive Neural Fuzzy Inference System (ANFIS). These models are trained and tested acquired field data to identify the significant parameters that impact the prediction of RUL. It is concluded that, on average, with approximately 10\% of wall thickness loss in existing cast iron, ductile iron, asbestos-cement, and steel water pipes, the reduction of the remaining useful life is approximately 50%
△ Less
Submitted 27 August, 2019;
originally announced September 2019.