-
Scientific Applications Leveraging Randomized Linear Algebra
Authors:
Vivak Patel,
D. Adrian Maldonado,
Maksim Melnichenko,
Nathaniel Pritchard,
Vishwas Rao,
Elizaveta Rebrova,
Sriram Sankararaman
Abstract:
This report showcases the role of, and future directions for, the field of Randomized Numerical Linear Algebra (RNLA) in a selection of scientific applications. These applications span the domains of imaging, genomics and time-varying systems, and are thematically connected by needing to perform linear algebra routines on large-scale matrices (with up to quantillions of entries). At such scales, t…
▽ More
This report showcases the role of, and future directions for, the field of Randomized Numerical Linear Algebra (RNLA) in a selection of scientific applications. These applications span the domains of imaging, genomics and time-varying systems, and are thematically connected by needing to perform linear algebra routines on large-scale matrices (with up to quantillions of entries). At such scales, the linear algebra routines face typical bottlenecks: memory constraints, data access latencies, and substantial floating-point operation costs. RNLA routines are discussed at a high level to demonstrate how RNLA is able to solve the challenges faced by traditional linear algebra routines, and, consequently, address the computational problem posed in the underlying application. For each application, RNLA's open challenges and possible future directions are also presented, which broadly fall into the categories: creating structure-aware RNLA algorithms; co-designing RNLA algorithms with hardware and mixed-precision considerations; and advancing modular, composable software infrastructure. Ultimately, this report serves two purposes: it invites domain scientists to engage with RNLA; and it offers a guide for future RNLA research grounded in real applications.
△ Less
Submitted 19 June, 2025;
originally announced June 2025.
-
Coarse embeddability of Wasserstein space and the space of persistence diagrams
Authors:
Neil Pritchard,
Thomas Weighill
Abstract:
We prove an equivalence between open questions about the embeddability of the space of persistence diagrams and the space of probability distributions (i.e.~Wasserstein space). It is known that for many natural metrics, no coarse embedding of either of these two spaces into Hilbert space exists. Some cases remain open, however. In particular, whether coarse embeddings exist with respect to the…
▽ More
We prove an equivalence between open questions about the embeddability of the space of persistence diagrams and the space of probability distributions (i.e.~Wasserstein space). It is known that for many natural metrics, no coarse embedding of either of these two spaces into Hilbert space exists. Some cases remain open, however. In particular, whether coarse embeddings exist with respect to the $p$-Wasserstein distance for $1\leq p\leq 2$ remains an open question for the space of persistence diagrams and for Wasserstein space on the plane. In this paper, we show that embeddability for persistence diagrams \redd{is equivalent to} embeddability for Wasserstein space on $\mathbb{R}^2$. \redd{When $p > 1$, Wasserstein space on $\mathbb{R}^2$ is snowflake universal (an obstruction to embeddability into any Banach space of non-trivial type) if and only if the space of persistence diagrams is snowflake universal.
△ Less
Submitted 6 November, 2023; v1 submitted 24 July, 2023;
originally announced July 2023.
-
Gaussian Persistence Curves
Authors:
Yu-Min Chung,
Michael Hull,
Austin Lawson,
Neil Pritchard
Abstract:
Topological data analysis (TDA) is a rising field in the intersection of mathematics, statistics, and computer science/data science. The cornerstone of TDA is persistent homology, which produces a summary of topological information called a persistence diagram. To utilize machine and deep learning methods on persistence diagrams, These diagrams are further summarized by transforming them into func…
▽ More
Topological data analysis (TDA) is a rising field in the intersection of mathematics, statistics, and computer science/data science. The cornerstone of TDA is persistent homology, which produces a summary of topological information called a persistence diagram. To utilize machine and deep learning methods on persistence diagrams, These diagrams are further summarized by transforming them into functions. In this paper we investigate the stability and injectivity of a class of smooth, one-dimensional functional summaries called Gaussian persistence curves.
△ Less
Submitted 23 May, 2022;
originally announced May 2022.
-
Solving, Tracking and Stopping Streaming Linear Inverse Problems
Authors:
Nathaniel Pritchard,
Vivak Patel
Abstract:
In large-scale applications including medical imaging, collocation differential equation solvers, and estimation with differential privacy, the underlying linear inverse problem can be reformulated as a streaming problem. In theory, the streaming problem can be effectively solved using memory-efficient, exponentially-converging streaming solvers. In practice, a streaming solver's effectiveness is…
▽ More
In large-scale applications including medical imaging, collocation differential equation solvers, and estimation with differential privacy, the underlying linear inverse problem can be reformulated as a streaming problem. In theory, the streaming problem can be effectively solved using memory-efficient, exponentially-converging streaming solvers. In practice, a streaming solver's effectiveness is undermined if it is stopped before, or well-after, the desired accuracy is achieved. In special cases when the underlying linear inverse problem is finite-dimensional, streaming solvers can periodically evaluate the residual norm at a substantial computational cost. When the underlying system is infinite dimensional, streaming solver can only access noisy estimates of the residual. While such noisy estimates are computationally efficient, they are useful only when their accuracy is known. In this work, we rigorously develop a general family of computationally-practical residual estimators and their uncertainty sets for streaming solvers, and we demonstrate the accuracy of our methods on a number of large-scale linear problems. Thus, we further enable the practical use of streaming solvers for important classes of linear inverse problems.
△ Less
Submitted 30 January, 2024; v1 submitted 14 January, 2022;
originally announced January 2022.
-
The space of persistence diagrams fails to have Yu's property A
Authors:
Greg Bell,
Austin Lawson,
C. Neil Pritchard,
Dan Yasaki
Abstract:
We define a simple obstruction to Yu's property A that we call $k$-prisms. This structure allows for a straightforward proof that the space of persistence diagrams fails to have property A in a Wasserstein metric.
We define a simple obstruction to Yu's property A that we call $k$-prisms. This structure allows for a straightforward proof that the space of persistence diagrams fails to have property A in a Wasserstein metric.
△ Less
Submitted 20 January, 2021; v1 submitted 6 February, 2019;
originally announced February 2019.
-
An exploration of Nathanson's $g$-adic representations of integers
Authors:
Greg Bell,
Austin Lawson,
Neil Pritchard,
Dan Yasaki
Abstract:
We use Nathanson's $g$-adic representation of integers to relate metric properties of Cayley graphs of the integers with respect to various infinite generating sets $S$ to problems in additive number theory. If $S$ consists of all powers of a fixed integer $g$, we find explicit formulas for the smallest positive integer of a given length. This is related to finding the smallest positive integer ex…
▽ More
We use Nathanson's $g$-adic representation of integers to relate metric properties of Cayley graphs of the integers with respect to various infinite generating sets $S$ to problems in additive number theory. If $S$ consists of all powers of a fixed integer $g$, we find explicit formulas for the smallest positive integer of a given length. This is related to finding the smallest positive integer expressible as a fixed number of sums and differences of powers of $g$. We also consider $S$ to be the set of all powers of all primes and bound the diameter of Cayley graph by relating it to Goldbach's conjecture.
△ Less
Submitted 17 January, 2019; v1 submitted 2 November, 2017;
originally announced November 2017.