
The Pseudoinverse of $A=CR$ is $A^+=R^+C^+$ (?)

Authors: Michał P. Karpowicz, Gilbert Strang

Abstract: This paper gives three formulas for the pseudoinverse of a matrix product $A = CR$. The first is sometimes correct, the second is always correct, and the third is almost never correct. But that third randomized pseudoinverse $A^+_r$ may be very useful when $A$ is a very large matrix. 1. $A^+ = R^+C^+$ when $A = CR$ and $C$ has independent columns and $R$ has independent rows. 2. $A^+ = (C^+CR)^+(CRR^+)^+$ is always correct. 3. $A^+_r = (P^TCR)^+P^TCRQ(CRQ)^+ = A^+$ only when $\mathrm{rank}(P^TA) = \mathrm{rank}(AQ) = \mathrm{rank}(A)$ with $A = CR$.

Submitted 26 March, 2024; v1 submitted 2 May, 2023; originally announced May 2023.

Comments: 10 pages, 5 figures, matlab code, new paragraphs introduce general formulas for the pseudoinverse of CR, new Figures and the randomized pseudoinverse algorithm

MSC Class: 15A09; 15A23; 15A24; 65F45; 65F55; 68W20
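Formula 1 in the abstract above can be checked on a small example. The following is a minimal sketch in exact rational arithmetic, not the authors' MATLAB code; every helper name (`matmul`, `inverse`, `pinv_full_column_rank`, `pinv_full_row_rank`, `is_pseudoinverse`) is hypothetical. It assumes the standard full-rank formulas $C^+ = (C^TC)^{-1}C^T$ and $R^+ = R^T(RR^T)^{-1}$ and tests the result against the four Penrose conditions. A second pair of factors, where $C$ does not have independent columns, illustrates why the abstract calls the formula only "sometimes correct".

```python
from fractions import Fraction

def matmul(X, Y):
    """Multiply two matrices stored as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*Y)] for row in X]

def transpose(X):
    return [list(col) for col in zip(*X)]

def inverse(M):
    """Invert a square nonsingular matrix by Gauss-Jordan elimination."""
    n = len(M)
    aug = [[Fraction(v) for v in row] + [Fraction(int(i == j)) for j in range(n)]
           for i, row in enumerate(M)]
    for col in range(n):
        pivot = next(r for r in range(col, n) if aug[r][col] != 0)
        aug[col], aug[pivot] = aug[pivot], aug[col]
        p = aug[col][col]
        aug[col] = [v / p for v in aug[col]]
        for r in range(n):
            if r != col and aug[r][col] != 0:
                f = aug[r][col]
                aug[r] = [a - f * b for a, b in zip(aug[r], aug[col])]
    return [row[n:] for row in aug]

def pinv_full_column_rank(C):
    """C^+ = (C^T C)^{-1} C^T, valid only when C has independent columns."""
    Ct = transpose(C)
    return matmul(inverse(matmul(Ct, C)), Ct)

def pinv_full_row_rank(R):
    """R^+ = R^T (R R^T)^{-1}, valid only when R has independent rows."""
    Rt = transpose(R)
    return matmul(Rt, inverse(matmul(R, Rt)))

def is_pseudoinverse(A, X):
    """Check the four Penrose conditions for real matrices."""
    AX, XA = matmul(A, X), matmul(X, A)
    return (matmul(AX, A) == A and matmul(XA, X) == X
            and transpose(AX) == AX and transpose(XA) == XA)

# Case 1: C has independent columns, R has independent rows.
C = [[1, 0], [0, 1], [1, 1]]
R = [[1, 0, 2], [0, 1, 3]]
A = matmul(C, R)
candidate = matmul(pinv_full_row_rank(R), pinv_full_column_rank(C))
formula_one_holds = is_pseudoinverse(A, candidate)

# Case 2: C2 is 1x2 (dependent columns), R2 is 2x1 (dependent rows), A2 = [[1]].
C2 = [[1, 1]]
R2 = [[1], [0]]
A2 = matmul(C2, R2)
candidate2 = matmul(pinv_full_column_rank(R2), pinv_full_row_rank(C2))
formula_one_fails = not is_pseudoinverse(A2, candidate2)

print(formula_one_holds, formula_one_fails)
```

Exact fractions are used so that the Penrose conditions can be compared with `==` instead of a floating-point tolerance; for large matrices a numerical library would be the practical choice.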

arXiv:2304.02659 [pdf, ps, other]

Elimination and Factorization

Authors: Gilbert Strang

Abstract: If a matrix $A$ has rank $r$, then its row echelon form (from elimination) contains the identity matrix in its first $r$ independent columns. How do we interpret the matrix $F$ that appears in the remaining columns of that echelon form? $F$ multiplies those first $r$ independent columns of $A$ to give its $n-r$ dependent columns. Then $F$ reveals bases for the row space and the nullspace of the original matrix $A$. And $F$ is the key to the column-row factorization $\boldsymbol{A}=\boldsymbol{CR}$.

Submitted 5 April, 2023; originally announced April 2023.

Comments: 5 pages, no figures, 4 references
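The role of $F$ described in the abstract above can be illustrated on a 3 by 3 matrix of rank 2. This is a minimal sketch, assuming (as the abstract does) that the first $r$ columns of $A$ are independent; the function names `rref` and `matmul` and the example matrix are illustrative choices, not taken from the paper.

```python
from fractions import Fraction

def rref(A):
    """Return the reduced row echelon form of A and its pivot column indices."""
    M = [[Fraction(v) for v in row] for row in A]
    rows, cols = len(M), len(M[0])
    pivots, r = [], 0
    for c in range(cols):
        pivot = next((i for i in range(r, rows) if M[i][c] != 0), None)
        if pivot is None:
            continue  # no pivot in this column: it depends on earlier columns
        M[r], M[pivot] = M[pivot], M[r]
        p = M[r][c]
        M[r] = [v / p for v in M[r]]
        for i in range(rows):
            if i != r and M[i][c] != 0:
                f = M[i][c]
                M[i] = [a - f * b for a, b in zip(M[i], M[r])]
        pivots.append(c)
        r += 1
        if r == rows:
            break
    return M, pivots

def matmul(X, Y):
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*Y)] for row in X]

# Column 3 equals column 1 plus column 2, so the rank is 2.
A = [[1, 2, 3],
     [4, 5, 9],
     [7, 8, 15]]

echelon, pivots = rref(A)
r = len(pivots)
assert pivots == list(range(r)), "sketch assumes the first r columns are independent"

R = echelon[:r]                      # R = [I F], the nonzero rows of the echelon form
F = [row[r:] for row in R]           # r by (n - r) block of the echelon form
C = [row[:r] for row in A]           # first r (independent) columns of A
dependent = [row[r:] for row in A]   # remaining n - r columns of A

print(F)
print(matmul(C, F) == dependent, matmul(C, R) == A)
```

Here `matmul(C, F)` reproduces the dependent column of `A`, and `matmul(C, R)` reproduces `A` itself, which is the column-row factorization $A = CR$ for this example.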

arXiv:1610.01230 [pdf, other]

A Local Inverse Formula and a Factorization

Authors: Gilbert Strang, Shev MacNamara

Abstract: When a matrix has a banded inverse there is a remarkable formula that quickly computes that inverse, using only local information in the original matrix. This local inverse formula holds more generally, for matrices with sparsity patterns that are examples of chordal graphs or perfect eliminators. The formula has a long history going back at least as far as the completion problem for covariance matrices with missing data. Maximum entropy estimates, log-determinants, rank conditions, the Nullity Theorem and wavelets are all closely related, and the formula has found wide applications in machine learning and graphical models. We describe that local inverse and explain how it can be understood as a matrix factorization.

Submitted 4 October, 2016; originally announced October 2016.

arXiv:1501.03053 [pdf, other]

Random Triangle Theory with Geometry and Applications

Authors: Alan Edelman, Gilbert Strang

Abstract: What is the probability that a random triangle is acute? We explore this old question from a modern viewpoint, taking into account linear algebra, shape theory, numerical analysis, random matrix theory, the Hopf fibration, and much much more. One of the best distributions of random triangles takes all six vertex coordinates as independent standard Gaussians. Six can be reduced to four by translation of the center to $(0,0)$ or reformulation as a 2x2 matrix problem. In this note, we develop shape theory in its historical context for a wide audience. We hope to encourage others to look again (and differently) at triangles. We provide a new constructive proof, using the geometry of parallelians, of a central result of shape theory: Triangle shapes naturally fall on a hemisphere. We give several proofs of the key random result: that triangles are uniformly distributed when the normal distribution is transferred to the hemisphere. A new proof connects to the distribution of random condition numbers. Generalizing to higher dimensions, we obtain the "square root ellipticity statistic" of random matrix theory. Another proof connects the Hopf map to the SVD of 2 by 2 matrices. A new theorem describes three similar triangles hidden in the hemisphere. Many triangle properties are reformulated as matrix theorems, providing insight to both. This paper argues for a shift of viewpoint to the modern approaches of random matrix theory. As one example, we propose that the smallest singular value is an effective test for uniformity. New software is developed and applications are proposed.

Submitted 9 January, 2015; originally announced January 2015.
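The opening question of the abstract above can be explored numerically. The following is a rough Monte Carlo sketch, assuming the distribution the abstract names (all six vertex coordinates drawn as independent standard Gaussians); it is not the software mentioned in the paper, and the function names, sample size, and seed are arbitrary illustrative choices.

```python
import random

def is_acute(p, q, r):
    """A triangle is acute when the two smaller squared sides sum to more than the largest."""
    def dist2(u, v):
        return (u[0] - v[0]) ** 2 + (u[1] - v[1]) ** 2
    s = sorted([dist2(p, q), dist2(q, r), dist2(r, p)])
    return s[0] + s[1] > s[2]

def estimate_acute_probability(samples, seed=0):
    """Estimate P(acute) for triangles whose six coordinates are i.i.d. N(0, 1)."""
    rng = random.Random(seed)  # fixed seed so the run is reproducible
    hits = 0
    for _ in range(samples):
        vertices = [(rng.gauss(0.0, 1.0), rng.gauss(0.0, 1.0)) for _ in range(3)]
        hits += is_acute(*vertices)
    return hits / samples

estimate = estimate_acute_probability(100_000)
print(f"estimated P(acute) = {estimate:.3f}")
```

The estimate carries sampling error of roughly $1/\sqrt{N}$ for $N$ samples, so it is suited to sanity-checking a closed-form answer, not to replacing one.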

arXiv:1112.0582 [pdf, ps, other]

The Main Diagonal of a Permutation Matrix

Authors: Marko Lindner, Gilbert Strang

Abstract: By counting 1's in the "right half" of $2w$ consecutive rows, we locate the main diagonal of any doubly infinite permutation matrix with bandwidth $w$. Then the matrix can be correctly centered and factored into block-diagonal permutation matrices. Part II of the paper discusses the same questions for the much larger class of band-dominated matrices. The main diagonal is determined by the Fredholm index of a singly infinite submatrix. Thus the main diagonal is determined "at infinity" in general, but from only $2w$ rows for banded permutations.

Submitted 15 December, 2011; v1 submitted 2 December, 2011; originally announced December 2011.

MSC Class: 15A23; 47A53; 47B36

arXiv:math/9304214 [pdf, ps, other]

Wavelet transforms versus Fourier transforms

Authors: Gilbert Strang

Abstract: This note is a very basic introduction to wavelets. It starts with an orthogonal basis of piecewise constant functions, constructed by dilation and translation. The "wavelet transform" maps each $f(x)$ to its coefficients with respect to this basis. The mathematics is simple and the transform is fast (faster than the Fast Fourier Transform, which we briefly explain), but approximation by piecewise constants is poor. To improve this first wavelet, we are led to dilation equations and their unusual solutions. Higher-order wavelets are constructed, and it is surprisingly quick to compute with them, always indirectly and recursively. We comment informally on the contest between these transforms in signal processing, especially for video and image compression (including high-definition television). So far the Fourier Transform (or its 8 by 8 windowed version, the Discrete Cosine Transform) is often chosen. But wavelets are already competitive, and they are ahead for fingerprints. We present a sample of this developing theory.

Submitted 31 March, 1993; originally announced April 1993.

Comments: 18 pages

Report number: Bulletin migration 11/99

Journal ref: Bull. Amer. Math. Soc. (N.S.) 28 (1993) 288-305
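The piecewise-constant wavelet transform described in the abstract above is commonly known as the Haar transform. The following is a minimal sketch of a discrete orthonormal version, assuming a signal whose length is a power of two; the function names and the $1/\sqrt{2}$ normalization are illustrative choices, not taken from the note. Each level halves the working length, so the total work is proportional to the signal length.

```python
import math

def haar_forward(signal):
    """Orthonormal Haar transform: repeated scaled sums and differences of neighbours."""
    n = len(signal)
    if n == 0 or n & (n - 1):
        raise ValueError("signal length must be a power of two")
    out = [float(v) for v in signal]
    s = math.sqrt(2.0)
    length = n
    while length > 1:
        half = length // 2
        sums = [(out[2 * i] + out[2 * i + 1]) / s for i in range(half)]
        diffs = [(out[2 * i] - out[2 * i + 1]) / s for i in range(half)]
        out[:length] = sums + diffs   # coarse part first, detail part after it
        length = half                 # recurse on the coarse part only
    return out

def haar_inverse(coeffs):
    """Undo haar_forward by rebuilding neighbour pairs from coarse to fine."""
    n = len(coeffs)
    out = [float(v) for v in coeffs]
    s = math.sqrt(2.0)
    length = 2
    while length <= n:
        half = length // 2
        rebuilt = []
        for a, d in zip(out[:half], out[half:length]):
            rebuilt += [(a + d) / s, (a - d) / s]
        out[:length] = rebuilt
        length *= 2
    return out

signal = [4, 6, 10, 12, 8, 6, 5, 5]
coeffs = haar_forward(signal)
restored = haar_inverse(coeffs)
print([round(c, 3) for c in coeffs])
print([round(v, 3) for v in restored])
```

Because the basis is orthonormal, the sum of squares of `coeffs` matches that of `signal` up to rounding, and a constant signal produces a single nonzero coefficient.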

Showing 1–6 of 6 results for author: Strang, G