-
PH-STAT
Authors:
Moo K. Chung
Abstract:
We introduce PH-STAT, a comprehensive MATLAB toolbox designed for performing a wide range of statistical inferences and machine learning tasks on persistent homology, primarily for network and graph data, with an emphasis on brain network analysis. Persistent homology is a prominent tool in topological data analysis (TDA) that captures the underlying topological features of complex data sets. The toolbox aims to provide users with an accessible and user-friendly interface for analyzing and interpreting topological data. The MATLAB package is distributed at https://github.com/laplcebeltrami/PH-STAT.
Submitted 18 February, 2025; v1 submitted 12 April, 2023;
originally announced April 2023.
-
Persistent Homological State-Space Estimation of Functional Human Brain Networks at Rest
Authors:
Moo K. Chung,
Shih-Gu Huang,
Ian C. Carroll,
Vince D. Calhoun,
H. Hill Goldsmith
Abstract:
We introduce an innovative, data-driven topological data analysis (TDA) technique for estimating the state spaces of dynamically changing functional human brain networks at rest. Our method utilizes the Wasserstein distance to measure topological differences, enabling the clustering of brain networks into distinct topological states. This technique outperforms the commonly used k-means clustering in identifying brain network state spaces by effectively incorporating the temporal dynamics of the data without the need for explicit model specification. We further investigate the genetic underpinnings of these topological features using a twin study design, examining the heritability of such state changes. Our findings suggest that the topology of brain networks, particularly in their dynamic state changes, may hold significant hidden genetic information. MATLAB code for the method is available at https://github.com/laplcebeltrami/PH-STAT.
Submitted 16 April, 2024; v1 submitted 31 December, 2021;
originally announced January 2022.
-
Reviews: Topological Distances and Losses for Brain Networks
Authors:
Moo K. Chung,
Alexander Smith,
Gary Shiu
Abstract:
Almost all statistical and machine learning methods for analyzing brain networks rely on distances and loss functions, which are mostly Euclidean or matrix norms. The Euclidean or matrix distances may fail to capture underlying subtle topological differences in brain networks. Further, Euclidean distances are sensitive to outliers. A few extreme edge weights may severely affect the distance. Thus it is necessary to use distances and loss functions that recognize the topology of data. In this review paper, we survey various topological distance and loss functions from topological data analysis (TDA) and persistent homology that can be used in brain network analysis more effectively. Although there are many recent brain imaging studies that are based on TDA methods, TDA has not yet become a mainstream tool in the brain imaging field, possibly due to a lack of awareness of the methods. The main purpose of this paper is to provide a relevant technical survey of these powerful tools that are immediately applicable to brain network data.
Submitted 17 February, 2021;
originally announced February 2021.
-
Diffusion Equations for Medical Images
Authors:
Moo K. Chung
Abstract:
In brain imaging, image acquisition and processing are themselves likely to introduce noise into the images. It is therefore imperative to reduce the noise while preserving the geometric details of the anatomical structures for various applications. Traditionally, Gaussian kernel smoothing has often been used in brain image processing and analysis. However, the direct application of Gaussian kernel smoothing tends to cause various numerical issues in irregular domains with boundaries. For example, if one uses a large bandwidth in kernel smoothing in a bounded cortical region, the smoothing will blur signals across boundaries. So in the kernel smoothing and regression literature, various ad hoc procedures were introduced to remedy the boundary effect. Diffusion equations have been widely used in brain imaging as a form of noise reduction. The most natural and straightforward way to smooth images in irregular domains with boundaries is to formulate the problem as a boundary value problem using partial differential equations. Numerous diffusion-based techniques have been developed in image processing. In this paper, we will overview the basics of isotropic diffusion equations and explain how to solve them on regular grids and irregular grids such as graphs.
Submitted 2 January, 2022; v1 submitted 27 July, 2020;
originally announced July 2020.
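The idea of solving an isotropic diffusion equation on a graph can be illustrated with a minimal sketch. It is not taken from the paper: it assumes an unweighted, undirected graph, the combinatorial Laplacian L = D - A, and plain forward Euler time stepping of df/dt = -Lf. The function names `graph_laplacian` and `diffuse` are hypothetical.

```python
def graph_laplacian(n, edges):
    """Combinatorial Laplacian L = D - A of an undirected, unweighted graph.

    n is the number of nodes; edges is a list of (i, j) index pairs.
    Returned as a dense list of rows (fine for a small illustration).
    """
    L = [[0.0] * n for _ in range(n)]
    for i, j in edges:
        L[i][j] -= 1.0
        L[j][i] -= 1.0
        L[i][i] += 1.0
        L[j][j] += 1.0
    return L


def diffuse(f, L, dt, steps):
    """Forward Euler integration of df/dt = -L f for a node signal f.

    Assumption: dt is small enough for stability (dt < 2 / lambda_max,
    where lambda_max is the largest eigenvalue of L).
    """
    f = list(f)
    n = len(f)
    for _ in range(steps):
        Lf = [sum(L[i][j] * f[j] for j in range(n)) for i in range(n)]
        f = [f[i] - dt * Lf[i] for i in range(n)]
    return f


# Path graph with 5 nodes and a unit spike at the middle node.
edges = [(0, 1), (1, 2), (2, 3), (3, 4)]
L = graph_laplacian(5, edges)
f0 = [0.0, 0.0, 1.0, 0.0, 0.0]
f = diffuse(f0, L, dt=0.1, steps=50)
print(f)
```

Because every column of the Laplacian sums to zero, the total signal is conserved while the spike spreads toward its neighbors. Diffusion happens only along graph edges, so there is no blurring across a boundary that the graph does not connect.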
-
Introduction to Random Fields
Authors:
Moo K. Chung
Abstract:
General linear models (GLM) are often constructed and used in statistical inference at the voxel level in brain imaging. In this paper, we explore the basics of random fields and multiple comparisons on random fields, which are necessary to properly threshold statistical maps for the whole image at a specific statistical significance level. Multiple comparisons are crucial in determining overall statistical significance in correlated test statistics over the whole brain. In practice, t- or F-statistics in adjacent voxels are correlated. So there is the problem of multiple comparisons, which we have simply neglected up to now. For multiple comparisons that account for spatially correlated test statistics, various methods have been proposed: Bonferroni correction, random field theory, false discovery rates and permutation tests. Among them, we will explore the random field approach.
Submitted 19 July, 2020;
originally announced July 2020.
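Of the corrections listed in the abstract, Bonferroni is the simplest to illustrate. The sketch below is not from the paper, and the function name `bonferroni_reject` and the p-values are made up for illustration. Each of the m tests is compared against alpha / m, which controls the family-wise error rate but ignores correlation between neighboring voxels and is therefore conservative.

```python
def bonferroni_reject(p_values, alpha=0.05):
    """Return a list of booleans: True where the null hypothesis is rejected.

    Each p-value is compared against the Bonferroni threshold alpha / m,
    where m is the number of tests (for example, the number of voxels).
    """
    m = len(p_values)
    threshold = alpha / m
    return [p <= threshold for p in p_values]


# Hypothetical p-values from four voxel-level tests.
p_values = [0.001, 0.02, 0.04, 0.0004]
decisions = bonferroni_reject(p_values, alpha=0.05)
print(decisions)  # threshold is 0.05 / 4 = 0.0125
```

Only the first and last tests fall below the corrected threshold of 0.0125, even though all four would pass an uncorrected 0.05 level.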