-
Efficient Decomposition of Forman-Ricci Curvature on Vietoris-Rips Complexes and Data Applications
Authors:
Danillo Barros de Souza,
Jonatas Teodomiro,
Fernando A. N. Santos,
Mengjun Ding,
Weiqiang Sun,
Mathieu Desroches,
Jürgen Jost,
Serafim Rodrigues
Abstract:
Discrete Forman-Ricci curvature (FRC) is an efficient tool that characterizes essential geometrical features and associated transitions of real-world networks, extending seamlessly to higher-dimensional computations in simplicial complexes. In this article, we provide two major advancements: First, we give a decomposition for FRC, enabling local computations of FRC. Second, we construct a set-theo…
▽ More
Discrete Forman-Ricci curvature (FRC) is an efficient tool that characterizes essential geometrical features and associated transitions of real-world networks, extending seamlessly to higher-dimensional computations in simplicial complexes. In this article, we provide two major advancements: First, we give a decomposition for FRC, enabling local computations of FRC. Second, we construct a set-theoretical proof enabling an efficient algorithm for the local computation of FRC in Vietoris-Rips (VR) complexes.Strikingly, this approach reveals critical information and geometric insights often overlooked by conventional classification techniques. Our findings open new avenues for geometric computations in VR complexes and highlight an essential yet under-explored aspect of data classification: the geometry underpinning statistical patterns.
△ Less
Submitted 5 May, 2025; v1 submitted 30 April, 2025;
originally announced April 2025.
-
Alternative set-theoretical algorithms for efficient computations of cliques in Vietoris-Rips complexes
Authors:
Danillo Barros de Souza,
Jonatas Teodomiro,
Fernando A. N. Santos,
Mathieu Desroches,
Serafim Rodrigues
Abstract:
Identifying cliques in dense networks remains a formidable challenge, even with significant advances in computational power and methodologies. To tackle this, numerous algorithms have been developed to optimize time and memory usage, implemented across diverse programming languages. Yet, the inherent NP-completeness of the problem continues to hinder performance on large-scale networks, often resu…
▽ More
Identifying cliques in dense networks remains a formidable challenge, even with significant advances in computational power and methodologies. To tackle this, numerous algorithms have been developed to optimize time and memory usage, implemented across diverse programming languages. Yet, the inherent NP-completeness of the problem continues to hinder performance on large-scale networks, often resulting in memory leaks and slow computations. In the present study, we critically evaluate classic algorithms to pinpoint computational bottlenecks and introduce novel set-theoretical approaches tailored for network clique computation. Our proposed algorithms are rigorously implemented and benchmarked against existing Python-based solutions, demonstrating superior performance. These findings underscore the potential of set-theoretical techniques to drive substantial performance gains in network analysis.
△ Less
Submitted 26 February, 2025; v1 submitted 20 February, 2025;
originally announced February 2025.
-
Data Augmentation of Multivariate Sensor Time Series using Autoregressive Models and Application to Failure Prognostics
Authors:
Douglas Baptista de Souza,
Bruno Paes Leao
Abstract:
This work presents a novel data augmentation solution for non-stationary multivariate time series and its application to failure prognostics. The method extends previous work from the authors which is based on time-varying autoregressive processes. It can be employed to extract key information from a limited number of samples and generate new synthetic samples in a way that potentially improves th…
▽ More
This work presents a novel data augmentation solution for non-stationary multivariate time series and its application to failure prognostics. The method extends previous work from the authors which is based on time-varying autoregressive processes. It can be employed to extract key information from a limited number of samples and generate new synthetic samples in a way that potentially improves the performance of PHM solutions. This is especially valuable in situations of data scarcity which are very usual in PHM, especially for failure prognostics. The proposed approach is tested based on the CMAPSS dataset, commonly employed for prognostics experiments and benchmarks. An AutoML approach from PHM literature is employed for automating the design of the prognostics solution. The empirical evaluation provides evidence that the proposed method can substantially improve the performance of PHM solutions.
△ Less
Submitted 24 October, 2024; v1 submitted 21 October, 2024;
originally announced October 2024.
-
Efficient set-theoretic algorithms for computing high-order Forman-Ricci curvature on abstract simplicial complexes
Authors:
Danillo Barros de Souza,
Jonatas T. S. da Cunha,
Fernando A. N. Santos,
Jürgen Jost,
Serafim Rodrigues
Abstract:
Forman-Ricci curvature (FRC) is a potent and powerful tool for analysing empirical networks, as the distribution of the curvature values can identify structural information that is not readily detected by other geometrical methods. Crucially, FRC captures higher-order structural information of clique complexes of a graph or Vietoris-Rips complexes, which is not readily accessible to alternative me…
▽ More
Forman-Ricci curvature (FRC) is a potent and powerful tool for analysing empirical networks, as the distribution of the curvature values can identify structural information that is not readily detected by other geometrical methods. Crucially, FRC captures higher-order structural information of clique complexes of a graph or Vietoris-Rips complexes, which is not readily accessible to alternative methods. However, existing FRC platforms are prohibitively computationally expensive. Therefore, herein we develop an efficient set-theoretic formulation for computing such high-order FRC in simplicial complexes. Significantly, our set theory representation reveals previous computational bottlenecks and also accelerates the computation of FRC. Finally, We provide a pseudo-code, a software implementation coined FastForman, as well as a benchmark comparison with alternative implementations. We envisage that FastForman will be used in Topological and Geometrical Data analysis for high-dimensional complex data sets. Moreover, our development paves the way for future generalisations towards efficient computations of FRC on cell complexes.
△ Less
Submitted 9 May, 2024; v1 submitted 22 August, 2023;
originally announced August 2023.