-
Hermitian Quaternion Toeplitz Matrices by Quaternion-valued Generating Functions
Authors:
Xue-lei Lin,
Michael K. Ng,
Junjun Pan
Abstract:
In this paper, we study Hermitian quaternion Toeplitz matrices generated by quaternion-valued functions. We show that such generating function must be the sum of a real-valued function and an odd function with imaginary component. This setting is different from the case of Hermitian complex Toeplitz matrices generated by real-valued functions only. By using of 2-by-2 block complex representation o…
▽ More
In this paper, we study Hermitian quaternion Toeplitz matrices generated by quaternion-valued functions. We show that such generating function must be the sum of a real-valued function and an odd function with imaginary component. This setting is different from the case of Hermitian complex Toeplitz matrices generated by real-valued functions only. By using of 2-by-2 block complex representation of quaternion matrices, we give a quaternion version of Grenander-Szegö theorem stating the distribution of eigenvalues of Hermitian quaternion Toeplitz matrices in terms of its generating function. As an application, we investigate Strang's circulant preconditioners for Hermitian quaternion Toeplitz linear systems arising from quaternion signal processing. We show that Strang's circulant preconditioners can be diagionalized by discrete quaternion Fourier transform matrices whereas general quaternion circulant matrices cannot be diagonalized by them. Also we verify the theoretical and numerical convergence results of Strang's circulant preconditioned conjugate gradient method for solving Hermitian quaternion Toeplitz systems.
△ Less
Submitted 21 April, 2025;
originally announced April 2025.
-
Truncated Huber Penalty for Sparse Signal Recovery with Convergence Analysis
Authors:
Li Yang,
Serena Morigi,
Michael K. Ng,
You-wei Wen
Abstract:
Sparse signal recovery from under-determined systems presents significant challenges when using conventional L_0 and L_1 penalties, primarily due to computational complexity and estimation bias. This paper introduces a truncated Huber penalty, a non-convex metric that effectively bridges the gap between unbiased sparse recovery and differentiable optimization. The proposed penalty applies quadrati…
▽ More
Sparse signal recovery from under-determined systems presents significant challenges when using conventional L_0 and L_1 penalties, primarily due to computational complexity and estimation bias. This paper introduces a truncated Huber penalty, a non-convex metric that effectively bridges the gap between unbiased sparse recovery and differentiable optimization. The proposed penalty applies quadratic regularization to small entries while truncating large magnitudes, avoiding non-differentiable points at optima. Theoretical analysis demonstrates that, for an appropriately chosen threshold, any s-sparse solution recoverable via conventional penalties remains a local optimum under the truncated Huber function. This property allows the exact and robust recovery theories developed for other penalty regularization functions to be directly extended to the truncated Huber function. To solve the optimization problem, we develop a block coordinate descent (BCD) algorithm with finite-step convergence guarantees under spark conditions. Numerical experiments are conducted to validate the effectiveness and robustness of the proposed approach. Furthermore, we extend the truncated Huber-penalized model to the gradient domain, illustrating its applicability in signal denoising and image smoothing.
△ Less
Submitted 6 April, 2025;
originally announced April 2025.
-
A Graph-Partitioning Based Continuous Optimization Approach to Semi-supervised Clustering Problems
Authors:
Wei Liu,
Xin Liu,
Michael K. Ng,
Zaikun Zhang
Abstract:
Semi-supervised clustering is a basic problem in various applications. Most existing methods require knowledge of the ideal cluster number, which is often difficult to obtain in practice. Besides, satisfying the must-link constraints is another major challenge for these methods. In this work, we view the semi-supervised clustering task as a partitioning problem on a graph associated with the given…
▽ More
Semi-supervised clustering is a basic problem in various applications. Most existing methods require knowledge of the ideal cluster number, which is often difficult to obtain in practice. Besides, satisfying the must-link constraints is another major challenge for these methods. In this work, we view the semi-supervised clustering task as a partitioning problem on a graph associated with the given dataset, where the similarity matrix includes a scaling parameter to reflect the must-link constraints. Utilizing a relaxation technique, we formulate the graph partitioning problem into a continuous optimization model that does not require the exact cluster number, but only an overestimate of it. We then propose a block coordinate descent algorithm to efficiently solve this model, and establish its convergence result. Based on the obtained solution, we can construct the clusters that theoretically meet the must-link constraints under mild assumptions. Furthermore, we verify the effectiveness and efficiency of our proposed method through comprehensive numerical experiments.
△ Less
Submitted 6 March, 2025;
originally announced March 2025.
-
Non-Negative Reduced Biquaternion Matrix Factorization with Applications in Color Face Recognition
Authors:
Jifei Miao,
Junjun Pan,
Michael K. Ng
Abstract:
Reduced biquaternion (RB), as a four-dimensional algebra highly suitable for representing color pixels, has recently garnered significant attention from numerous scholars. In this paper, for color image processing problems, we introduce a concept of the non-negative RB matrix and then use the multiplication properties of RB to propose a non-negative RB matrix factorization (NRBMF) model. The NRBMF…
▽ More
Reduced biquaternion (RB), as a four-dimensional algebra highly suitable for representing color pixels, has recently garnered significant attention from numerous scholars. In this paper, for color image processing problems, we introduce a concept of the non-negative RB matrix and then use the multiplication properties of RB to propose a non-negative RB matrix factorization (NRBMF) model. The NRBMF model is introduced to address the challenge of reasonably establishing a non-negative quaternion matrix factorization model, which is primarily hindered by the multiplication properties of traditional quaternions. Furthermore, this paper transforms the problem of solving the NRBMF model into an RB alternating non-negative least squares (RB-ANNLS) problem. Then, by introducing a method to compute the gradient of the real function with RB matrix variables, we solve the RB-ANNLS optimization problem using the RB projected gradient algorithm and conduct a convergence analysis of the algorithm. Finally, we validate the effectiveness and superiority of the proposed NRBMF model in color face recognition.
△ Less
Submitted 9 July, 2025; v1 submitted 10 August, 2024;
originally announced August 2024.
-
A New Cross-Space Total Variation Regularization Model for Color Image Restoration with Quaternion Blur Operator
Authors:
Zhigang Jia,
Yuelian Xiang,
Meixiang Zhao,
Tingting Wu,
Michael K. Ng
Abstract:
The cross-channel deblurring problem in color image processing is difficult to solve due to the complex coupling and structural blurring of color pixels. Until now, there are few efficient algorithms that can reduce color artifacts in deblurring process. To solve this challenging problem, we present a novel cross-space total variation (CSTV) regularization model for color image deblurring by intro…
▽ More
The cross-channel deblurring problem in color image processing is difficult to solve due to the complex coupling and structural blurring of color pixels. Until now, there are few efficient algorithms that can reduce color artifacts in deblurring process. To solve this challenging problem, we present a novel cross-space total variation (CSTV) regularization model for color image deblurring by introducing a quaternion blur operator and a cross-color space regularization functional. The existence and uniqueness of the solution are proved and a new L-curve method is proposed to find a balance of regularization terms on different color spaces. The Euler-Lagrange equation is derived to show that CSTV has taken into account the coupling of all color channels and the local smoothing within each color channel. A quaternion operator splitting method is firstly proposed to enhance the ability of color artifacts reduction of the CSTV regularization model. This strategy also applies to the well-known color deblurring models. Numerical experiments on color image databases illustrate the efficiency and effectiveness of the new model and algorithms. The color images restored by them successfully maintain the color and spatial information and are of higher quality in terms of PSNR, SSIM, MSE and CIEde2000 than the restorations of the-state-of-the-art methods.
△ Less
Submitted 26 January, 2025; v1 submitted 20 May, 2024;
originally announced May 2024.
-
A $τ$-preconditioner for space fractional diffusion equation with non-separable variable coefficients
Authors:
Xue-Lei Lin,
Michael K. Ng
Abstract:
In this paper, we study a $τ$-matrix approximation based preconditioner for the linear systems arising from discretization of unsteady state Riesz space fractional diffusion equation with non-separable variable coefficients. The structure of coefficient matrices of the linear systems is identity plus summation of diagonal-times-multilevel-Toeplitz matrices. In our preconditioning technique, the di…
▽ More
In this paper, we study a $τ$-matrix approximation based preconditioner for the linear systems arising from discretization of unsteady state Riesz space fractional diffusion equation with non-separable variable coefficients. The structure of coefficient matrices of the linear systems is identity plus summation of diagonal-times-multilevel-Toeplitz matrices. In our preconditioning technique, the diagonal matrices are approximated by scalar identity matrices and the Toeplitz matrices are approximated by τ-matrices (a type of matrices diagonalizable by discrete sine transforms). The proposed preconditioner is fast invertible through the fast sine transform (FST) algorithm. Theoretically, we show that the GMRES solver for the preconditioned systems has an optimal convergence rate (a convergence rate independent of discretization stepsizes). To the best of our knowledge, this is the first preconditioning method with the optimal convergence rate for the variable-coefficients space fractional diffusion equation. Numerical results are reported to demonstrate the efficiency of the proposed method.
△ Less
Submitted 17 April, 2024;
originally announced April 2024.
-
Multispectral Image Restoration by Generalized Opponent Transformation Total Variation
Authors:
Zhantao Ma,
Michael K. Ng
Abstract:
Multispectral images (MSI) contain light information in different wavelengths of objects, which convey spectral-spatial information and help improve the performance of various image processing tasks. Numerous techniques have been created to extend the application of total variation regularization in restoring multispectral images, for example, based on channel coupling and adaptive total variation…
▽ More
Multispectral images (MSI) contain light information in different wavelengths of objects, which convey spectral-spatial information and help improve the performance of various image processing tasks. Numerous techniques have been created to extend the application of total variation regularization in restoring multispectral images, for example, based on channel coupling and adaptive total variation regularization. The primary contribution of this paper is to propose and develop a new multispectral total variation regularization in a generalized opponent transformation domain instead of the original multispectral image domain. Here opponent transformations for multispectral images are generalized from a well-known opponent transformation for color images. We will explore the properties of generalized opponent transformation total variation (GOTTV) regularization and the corresponding optimization formula for multispectral image restoration. To evaluate the effectiveness of the new GOTTV method, we provide numerical examples that showcase its superior performance compared to existing multispectral image total variation methods, using criteria such as MPSNR and MSSIM.
△ Less
Submitted 19 March, 2024;
originally announced March 2024.
-
A One-step Image Retargeing Algorithm Based on Conformal Energy
Authors:
Chengyang Liu,
Michael K. Ng
Abstract:
The image retargeting problem is to find a proper mapping to resize an image to one with a prescribed aspect ratio, which is quite popular these days. In this paper, we propose an efficient and orientation-preserving one-step image retargeting algorithm based on minimizing the harmonic energy, which can well preserve the regions of interest (ROIs) and line structures in the image. We also give som…
▽ More
The image retargeting problem is to find a proper mapping to resize an image to one with a prescribed aspect ratio, which is quite popular these days. In this paper, we propose an efficient and orientation-preserving one-step image retargeting algorithm based on minimizing the harmonic energy, which can well preserve the regions of interest (ROIs) and line structures in the image. We also give some mathematical proofs in the paper to ensure the well-posedness and accuracy of our algorithm.
△ Less
Submitted 28 February, 2024;
originally announced February 2024.
-
AlgoFormer: An Efficient Transformer Framework with Algorithmic Structures
Authors:
Yihang Gao,
Chuanyang Zheng,
Enze Xie,
Han Shi,
Tianyang Hu,
Yu Li,
Michael K. Ng,
Zhenguo Li,
Zhaoqiang Liu
Abstract:
Besides natural language processing, transformers exhibit extraordinary performance in solving broader applications, including scientific computing and computer vision. Previous works try to explain this from the expressive power and capability perspectives that standard transformers are capable of performing some algorithms. To empower transformers with algorithmic capabilities and motivated by t…
▽ More
Besides natural language processing, transformers exhibit extraordinary performance in solving broader applications, including scientific computing and computer vision. Previous works try to explain this from the expressive power and capability perspectives that standard transformers are capable of performing some algorithms. To empower transformers with algorithmic capabilities and motivated by the recently proposed looped transformer, we design a novel transformer framework, dubbed Algorithm Transformer (abbreviated as AlgoFormer). We provide an insight that efficient transformer architectures can be designed by leveraging prior knowledge of tasks and the underlying structure of potential algorithms. Compared with the standard transformer and vanilla looped transformer, the proposed AlgoFormer can perform efficiently in algorithm representation in some specific tasks. In particular, inspired by the structure of human-designed learning algorithms, our transformer framework consists of a pre-transformer that is responsible for task preprocessing, a looped transformer for iterative optimization algorithms, and a post-transformer for producing the desired results after post-processing. We provide theoretical evidence of the expressive power of the AlgoFormer in solving some challenging problems, mirroring human-designed algorithms. Furthermore, some theoretical and empirical results are presented to show that the designed transformer has the potential to perform algorithm representation and learning. Experimental results demonstrate the empirical superiority of the proposed transformer in that it outperforms the standard transformer and vanilla looped transformer in some specific tasks. An extensive experiment on real language tasks (e.g., neural machine translation of German and English, and text classification) further validates the expressiveness and effectiveness of AlgoFormer.
△ Less
Submitted 10 January, 2025; v1 submitted 21 February, 2024;
originally announced February 2024.
-
Block Diagonalization of Quaternion Circulant Matrices with Applications
Authors:
Junjun Pan,
Michael K. Ng
Abstract:
It is well-known that a complex circulant matrix can be diagonalized by a discrete Fourier matrix with imaginary unit $\mathtt{i}$. The main aim of this paper is to demonstrate that a quaternion circulant matrix cannot be diagonalized by a discrete quaternion Fourier matrix with three imaginary units $\mathtt{i}$, $\mathtt{j}$ and $\mathtt{k}$. Instead, a quaternion circulant matrix can be block-d…
▽ More
It is well-known that a complex circulant matrix can be diagonalized by a discrete Fourier matrix with imaginary unit $\mathtt{i}$. The main aim of this paper is to demonstrate that a quaternion circulant matrix cannot be diagonalized by a discrete quaternion Fourier matrix with three imaginary units $\mathtt{i}$, $\mathtt{j}$ and $\mathtt{k}$. Instead, a quaternion circulant matrix can be block-diagonalized into 1-by-1 block and 2-by-2 block matrices by permuted discrete quaternion Fourier transform matrix. With such a block-diagonalized form, the inverse of a quaternion circulant matrix can be determined efficiently similar to the inverse of a complex circulant matrix. We make use of this block-diagonalized form to study quaternion tensor singular value decomposition of quaternion tensors where the entries are quaternion numbers. The applications including computing the inverse of a quaternion circulant matrix, and solving quaternion Toeplitz system arising from linear prediction of quaternion signals are employed to validate the efficiency of our proposed block diagonalized results. A numerical example of color video as third-order quaternion tensor is employed to validate the effectiveness of quaternion tensor singular value decomposition.
△ Less
Submitted 8 February, 2024; v1 submitted 8 February, 2023;
originally announced February 2023.
-
Quantizing Heavy-tailed Data in Statistical Estimation: (Near) Minimax Rates, Covariate Quantization, and Uniform Recovery
Authors:
Junren Chen,
Michael K. Ng,
Di Wang
Abstract:
This paper studies the quantization of heavy-tailed data in some fundamental statistical estimation problems, where the underlying distributions have bounded moments of some order. We propose to truncate and properly dither the data prior to a uniform quantization. Our major standpoint is that (near) minimax rates of estimation error are achievable merely from the quantized data produced by the pr…
▽ More
This paper studies the quantization of heavy-tailed data in some fundamental statistical estimation problems, where the underlying distributions have bounded moments of some order. We propose to truncate and properly dither the data prior to a uniform quantization. Our major standpoint is that (near) minimax rates of estimation error are achievable merely from the quantized data produced by the proposed scheme. In particular, concrete results are worked out for covariance estimation, compressed sensing, and matrix completion, all agreeing that the quantization only slightly worsens the multiplicative factor. Besides, we study compressed sensing where both covariate (i.e., sensing vector) and response are quantized. Under covariate quantization, although our recovery program is non-convex because the covariance matrix estimator lacks positive semi-definiteness, all local minimizers are proved to enjoy near optimal error bound. Moreover, by the concentration inequality of product process and covering argument, we establish near minimax uniform recovery guarantee for quantized compressed sensing with heavy-tailed noise.
△ Less
Submitted 26 July, 2023; v1 submitted 30 December, 2022;
originally announced December 2022.
-
SVD-PINNs: Transfer Learning of Physics-Informed Neural Networks via Singular Value Decomposition
Authors:
Yihang Gao,
Ka Chun Cheung,
Michael K. Ng
Abstract:
Physics-informed neural networks (PINNs) have attracted significant attention for solving partial differential equations (PDEs) in recent years because they alleviate the curse of dimensionality that appears in traditional methods. However, the most disadvantage of PINNs is that one neural network corresponds to one PDE. In practice, we usually need to solve a class of PDEs, not just one. With the…
▽ More
Physics-informed neural networks (PINNs) have attracted significant attention for solving partial differential equations (PDEs) in recent years because they alleviate the curse of dimensionality that appears in traditional methods. However, the most disadvantage of PINNs is that one neural network corresponds to one PDE. In practice, we usually need to solve a class of PDEs, not just one. With the explosive growth of deep learning, many useful techniques in general deep learning tasks are also suitable for PINNs. Transfer learning methods may reduce the cost for PINNs in solving a class of PDEs. In this paper, we proposed a transfer learning method of PINNs via keeping singular vectors and optimizing singular values (namely SVD-PINNs). Numerical experiments on high dimensional PDEs (10-d linear parabolic equations and 10-d Allen-Cahn equations) show that SVD-PINNs work for solving a class of PDEs with different but close right-hand-side functions.
△ Less
Submitted 14 March, 2024; v1 submitted 16 November, 2022;
originally announced November 2022.
-
Stochastic Variance Reduced Gradient for affine rank minimization problem
Authors:
Ningning Han,
Juan Nie,
Jian Lu,
Michael K. Ng
Abstract:
We develop an efficient stochastic variance reduced gradient descent algorithm to solve the affine rank minimization problem consists of finding a matrix of minimum rank from linear measurements. The proposed algorithm as a stochastic gradient descent strategy enjoys a more favorable complexity than full gradients. It also reduces the variance of the stochastic gradient at each iteration and accel…
▽ More
We develop an efficient stochastic variance reduced gradient descent algorithm to solve the affine rank minimization problem consists of finding a matrix of minimum rank from linear measurements. The proposed algorithm as a stochastic gradient descent strategy enjoys a more favorable complexity than full gradients. It also reduces the variance of the stochastic gradient at each iteration and accelerate the rate of convergence. We prove that the proposed algorithm converges linearly in expectation to the solution under a restricted isometry condition. The numerical experiments show that the proposed algorithm has a clearly advantageous balance of efficiency, adaptivity, and accuracy compared with other state-of-the-art greedy algorithms.
△ Less
Submitted 4 November, 2022;
originally announced November 2022.
-
A Momentum Accelerated Adaptive Cubic Regularization Method for Nonconvex Optimization
Authors:
Yihang Gao,
Michael K. Ng
Abstract:
The cubic regularization method (CR) and its adaptive version (ARC) are popular Newton-type methods in solving unconstrained non-convex optimization problems, due to its global convergence to local minima under mild conditions. The main aim of this paper is to develop a momentum-accelerated adaptive cubic regularization method (ARCm) to improve the convergent performance. With the proper choice of…
▽ More
The cubic regularization method (CR) and its adaptive version (ARC) are popular Newton-type methods in solving unconstrained non-convex optimization problems, due to its global convergence to local minima under mild conditions. The main aim of this paper is to develop a momentum-accelerated adaptive cubic regularization method (ARCm) to improve the convergent performance. With the proper choice of momentum step size, we show the global convergence of ARCm and the local convergence can also be guaranteed under the \KL property. Such global and local convergence can also be established when inexact solvers with low computational costs are employed in the iteration procedure. Numerical results for non-convex logistic regression and robust linear regression models are reported to demonstrate that the proposed ARCm significantly outperforms state-of-the-art cubic regularization methods (e.g., CR, momentum-based CR, ARC) and the trust region method. In particular, the number of iterations required by ARCm is less than 10\% to 50\% required by the most competitive method (ARC) in the experiments.
△ Less
Submitted 12 October, 2022;
originally announced October 2022.
-
Approximate Secular Equations for the Cubic Regularization Subproblem
Authors:
Yihang Gao,
Man-Chung Yue,
Michael K. Ng
Abstract:
The cubic regularization method (CR) is a popular algorithm for unconstrained non-convex optimization. At each iteration, CR solves a cubically regularized quadratic problem, called the cubic regularization subproblem (CRS). One way to solve the CRS relies on solving the secular equation, whose computational bottleneck lies in the computation of all eigenvalues of the Hessian matrix. In this paper…
▽ More
The cubic regularization method (CR) is a popular algorithm for unconstrained non-convex optimization. At each iteration, CR solves a cubically regularized quadratic problem, called the cubic regularization subproblem (CRS). One way to solve the CRS relies on solving the secular equation, whose computational bottleneck lies in the computation of all eigenvalues of the Hessian matrix. In this paper, we propose and analyze a novel CRS solver based on an approximate secular equation, which requires only some of the Hessian eigenvalues and is therefore much more efficient. Two approximate secular equations (ASEs) are developed. For both ASEs, we first study the existence and uniqueness of their roots and then establish an upper bound on the gap between the root and that of the standard secular equation. Such an upper bound can in turn be used to bound the distance from the approximate CRS solution based ASEs to the true CRS solution, thus offering a theoretical guarantee for our CRS solver. A desirable feature of our CRS solver is that it requires only matrix-vector multiplication but not matrix inversion, which makes it particularly suitable for high-dimensional applications of unconstrained non-convex optimization, such as low-rank recovery and deep learning. Numerical experiments with synthetic and real data-sets are conducted to investigate the practical performance of the proposed CRS solver. Experimental results show that the proposed solver outperforms two state-of-the-art methods.
△ Less
Submitted 27 September, 2022;
originally announced September 2022.
-
Expressing Multivariate Time Series as Graphs with Time Series Attention Transformer
Authors:
William T. Ng,
K. Siu,
Albert C. Cheung,
Michael K. Ng
Abstract:
A reliable and efficient representation of multivariate time series is crucial in various downstream machine learning tasks. In multivariate time series forecasting, each variable depends on its historical values and there are inter-dependencies among variables as well. Models have to be designed to capture both intra- and inter-relationships among the time series. To move towards this goal, we pr…
▽ More
A reliable and efficient representation of multivariate time series is crucial in various downstream machine learning tasks. In multivariate time series forecasting, each variable depends on its historical values and there are inter-dependencies among variables as well. Models have to be designed to capture both intra- and inter-relationships among the time series. To move towards this goal, we propose the Time Series Attention Transformer (TSAT) for multivariate time series representation learning. Using TSAT, we represent both temporal information and inter-dependencies of multivariate time series in terms of edge-enhanced dynamic graphs. The intra-series correlations are represented by nodes in a dynamic graph; a self-attention mechanism is modified to capture the inter-series correlations by using the super-empirical mode decomposition (SMD) module. We applied the embedded dynamic graphs to times series forecasting problems, including two real-world datasets and two benchmark datasets. Extensive experiments show that TSAT clearly outerperforms six state-of-the-art baseline methods in various forecasting horizons. We further visualize the embedded dynamic graphs to illustrate the graph representation power of TSAT. We share our code at https://github.com/RadiantResearch/TSAT.
△ Less
Submitted 19 August, 2022;
originally announced August 2022.
-
Separable Quaternion Matrix Factorization for Polarization Images
Authors:
Junjun Pan,
Michael K. Ng
Abstract:
Polarization is a unique characteristic of transverse wave and is represented by Stokes parameters. Analysis of polarization states can reveal valuable information about the sources. In this paper, we propose a separable low-rank quaternion linear mixing model to polarized signals: we assume each column of the source factor matrix equals a column of polarized data matrix and refer to the correspon…
▽ More
Polarization is a unique characteristic of transverse wave and is represented by Stokes parameters. Analysis of polarization states can reveal valuable information about the sources. In this paper, we propose a separable low-rank quaternion linear mixing model to polarized signals: we assume each column of the source factor matrix equals a column of polarized data matrix and refer to the corresponding problem as separable quaternion matrix factorization (SQMF). We discuss some properties of the matrix that can be decomposed by SQMF. To determine the source factor matrix in quaternion space, we propose a heuristic algorithm called quaternion successive projection algorithm (QSPA) inspired by the successive projection algorithm. To guarantee the effectiveness of QSPA, a new normalization operator is proposed for the quaternion matrix. We use a block coordinate descent algorithm to compute nonnegative factor activation matrix in real number space. We test our method on the applications of polarization image representation and spectro-polarimetric imaging unmixing to verify its effectiveness.
△ Less
Submitted 28 July, 2022;
originally announced July 2022.
-
HessianFR: An Efficient Hessian-based Follow-the-Ridge Algorithm for Minimax Optimization
Authors:
Yihang Gao,
Huafeng Liu,
Michael K. Ng,
Mingjie Zhou
Abstract:
Wide applications of differentiable two-player sequential games (e.g., image generation by GANs) have raised much interest and attention of researchers to study efficient and fast algorithms. Most of the existing algorithms are developed based on nice properties of simultaneous games, i.e., convex-concave payoff functions, but are not applicable in solving sequential games with different settings.…
▽ More
Wide applications of differentiable two-player sequential games (e.g., image generation by GANs) have raised much interest and attention of researchers to study efficient and fast algorithms. Most of the existing algorithms are developed based on nice properties of simultaneous games, i.e., convex-concave payoff functions, but are not applicable in solving sequential games with different settings. Some conventional gradient descent ascent algorithms theoretically and numerically fail to find the local Nash equilibrium of the simultaneous game or the local minimax (i.e., local Stackelberg equilibrium) of the sequential game. In this paper, we propose the HessianFR, an efficient Hessian-based Follow-the-Ridge algorithm with theoretical guarantees. Furthermore, the convergence of the stochastic algorithm and the approximation of Hessian inverse are exploited to improve algorithm efficiency. A series of experiments of training generative adversarial networks (GANs) have been conducted on both synthetic and real-world large-scale image datasets (e.g. MNIST, CIFAR-10 and CelebA). The experimental results demonstrate that the proposed HessianFR outperforms baselines in terms of convergence and image generation quality.
△ Less
Submitted 23 May, 2022;
originally announced May 2022.
-
Deep neural networks for solving large linear systems arising from high-dimensional problems
Authors:
Yiqi Gu,
Michael K. Ng
Abstract:
This paper studies deep neural networks for solving extremely large linear systems arising from highdimensional problems. Because of the curse of dimensionality, it is expensive to store both the solution and right-hand side vector in such extremely large linear systems. Our idea is to employ a neural network to characterize the solution with much fewer parameters than the size of the solution und…
▽ More
This paper studies deep neural networks for solving extremely large linear systems arising from highdimensional problems. Because of the curse of dimensionality, it is expensive to store both the solution and right-hand side vector in such extremely large linear systems. Our idea is to employ a neural network to characterize the solution with much fewer parameters than the size of the solution under a matrix-free setting. We present an error analysis of the proposed method, indicating that the solution error is bounded by the condition number of the matrix and the neural network approximation error. Several numerical examples from partial differential equations, queueing problems, and probabilistic Boolean networks are presented to demonstrate that the solutions of linear systems can be learned quite accurately.
△ Less
Submitted 4 March, 2023; v1 submitted 1 April, 2022;
originally announced April 2022.
-
Color Image Inpainting via Robust Pure Quaternion Matrix Completion: Error Bound and Weighted Loss
Authors:
Junren Chen,
Michael K. Ng
Abstract:
In this paper, we study color image inpainting as a pure quaternion matrix completion problem. In the literature, the theoretical guarantee for quaternion matrix completion is not well-established. Our main aim is to propose a new minimization problem with an objective combining nuclear norm and a quadratic loss weighted among three channels. To fill the theoretical vacancy, we obtain the error bo…
▽ More
In this paper, we study color image inpainting as a pure quaternion matrix completion problem. In the literature, the theoretical guarantee for quaternion matrix completion is not well-established. Our main aim is to propose a new minimization problem with an objective combining nuclear norm and a quadratic loss weighted among three channels. To fill the theoretical vacancy, we obtain the error bound in both clean and corrupted regimes, which relies on some new results of quaternion matrices. A general Gaussian noise is considered in robust completion where all observations are corrupted. Motivated by the error bound, we propose to handle unbalanced or correlated noise via a cross-channel weight in the quadratic loss, with the main purpose of rebalancing noise level, or removing noise correlation. Extensive experimental results on synthetic and color image data are presented to confirm and demonstrate our theoretical findings.
△ Less
Submitted 26 October, 2022; v1 submitted 4 February, 2022;
originally announced February 2022.
-
Deep adaptive basis Galerkin method for high-dimensional evolution equations with oscillatory solutions
Authors:
Yiqi Gu,
Micheal K. Ng
Abstract:
In this paper, we study deep neural networks (DNNs) for solving high-dimensional evolution equations with oscillatory solutions. Different from deep least-squares methods that deal with time and space variables simultaneously, we propose a deep adaptive basis Galerkin (DABG) method, which employs the spectral-Galerkin method for the time variable of oscillatory solutions and the deep neural networ…
▽ More
In this paper, we study deep neural networks (DNNs) for solving high-dimensional evolution equations with oscillatory solutions. Different from deep least-squares methods that deal with time and space variables simultaneously, we propose a deep adaptive basis Galerkin (DABG) method, which employs the spectral-Galerkin method for the time variable of oscillatory solutions and the deep neural network method for high-dimensional space variables. The proposed method can lead to a linear system of differential equations having unknown DNNs that can be trained via the loss function. We establish a posterior estimates of the solution error, which is bounded by the minimal loss function and the term $O(N^{-m})$, where $N$ is the number of basis functions and $m$ characterizes the regularity of the e'quation. We also show that if the true solution is a Barron-type function, the error bound converges to zero as $M=O(N^p)$ approaches to infinity, where $M$ is the width of the used networks, and $p$ is a positive constant. Numerical examples, including high-dimensional linear evolution equations and the nonlinear Allen-Cahn equation, are presented to demonstrate the performance of the proposed DABG method is better than that of existing DNNs.
△ Less
Submitted 31 May, 2022; v1 submitted 29 December, 2021;
originally announced December 2021.
-
Wasserstein Generative Adversarial Uncertainty Quantification in Physics-Informed Neural Networks
Authors:
Yihang Gao,
Michael K. Ng
Abstract:
In this paper, we study a physics-informed algorithm for Wasserstein Generative Adversarial Networks (WGANs) for uncertainty quantification in solutions of partial differential equations. By using groupsort activation functions in adversarial network discriminators, network generators are utilized to learn the uncertainty in solutions of partial differential equations observed from the initial/bou…
▽ More
In this paper, we study a physics-informed algorithm for Wasserstein Generative Adversarial Networks (WGANs) for uncertainty quantification in solutions of partial differential equations. By using groupsort activation functions in adversarial network discriminators, network generators are utilized to learn the uncertainty in solutions of partial differential equations observed from the initial/boundary data. Under mild assumptions, we show that the generalization error of the computed generator converges to the approximation error of the network with high probability, when the number of samples are sufficiently taken. According to our established error bound, we also find that our physics-informed WGANs have higher requirement for the capacity of discriminators than that of generators. Numerical results on synthetic examples of partial differential equations are reported to validate our theoretical results and demonstrate how uncertainty quantification can be obtained for solutions of partial differential equations and the distributions of initial/boundary data. However, the quality or the accuracy of the uncertainty quantification theory in all the points in the interior is still the theoretical vacancy, and required for further research.
△ Less
Submitted 9 August, 2022; v1 submitted 30 August, 2021;
originally announced August 2021.
-
Deep Ritz method for the spectral fractional Laplacian equation using the Caffarelli-Silvestre extension
Authors:
Yiqi Gu,
Micheal K. Ng
Abstract:
In this paper, we propose a novel method for solving high-dimensional spectral fractional Laplacian equations. Using the Caffarelli-Silvestre extension, the $d$-dimensional spectral fractional equation is reformulated as a regular partial differential equation of dimension $d+1$. We transform the extended equation as a minimal Ritz energy functional problem and search for its minimizer in a specia…
▽ More
In this paper, we propose a novel method for solving high-dimensional spectral fractional Laplacian equations. Using the Caffarelli-Silvestre extension, the $d$-dimensional spectral fractional equation is reformulated as a regular partial differential equation of dimension $d+1$. We transform the extended equation as a minimal Ritz energy functional problem and search for its minimizer in a special class of deep neural networks. Moreover, based on the approximation property of networks, we establish estimates on the error made by the deep Ritz method. Numerical results are reported to demonstrate the effectiveness of the proposed method for solving fractional Laplacian equations up to ten dimensions. Technically, in this method, we design a special network-based structure to adapt to the singularity and exponential decaying of the true solution. Also, A hybrid integration technique combining Monte Carlo method and sinc quadrature is developed to compute the loss function with higher accuracy.
△ Less
Submitted 29 December, 2021; v1 submitted 26 August, 2021;
originally announced August 2021.
-
Spectral analysis for preconditioning of multi-dimensional Riesz fractional diffusion equations
Authors:
Xin Huang,
Xue-Lei Lin,
Michael K. Ng,
Hai-Wei Sun
Abstract:
In this paper, we analyze the spectra of the preconditioned matrices arising from discretized multi-dimensional Riesz spatial fractional diffusion equations. The finite difference method is employed to approximate the multi-dimensional Riesz fractional derivatives, which will generate symmetric positive definite ill-conditioned multi-level Toeplitz matrices. The preconditioned conjugate gradient m…
▽ More
In this paper, we analyze the spectra of the preconditioned matrices arising from discretized multi-dimensional Riesz spatial fractional diffusion equations. The finite difference method is employed to approximate the multi-dimensional Riesz fractional derivatives, which will generate symmetric positive definite ill-conditioned multi-level Toeplitz matrices. The preconditioned conjugate gradient method with a preconditioner based on the sine transform is employed to solve the resulting linear system. Theoretically, we prove that the spectra of the preconditioned matrices are uniformly bounded in the open interval (1/2,3/2) and thus the preconditioned conjugate gradient method converges linearly. The proposed method can be extended to multi-level Toeplitz matrices generated by functions with zeros of fractional order. Our theoretical results fill in a vacancy in the literature. Numerical examples are presented to demonstrate our new theoretical results in the literature and show the convergence performance of the proposed preconditioner that is better than other existing preconditioners.
△ Less
Submitted 2 February, 2021;
originally announced February 2021.
-
A parallel-in-time two-sided preconditioning for all-at-once system from a non-local evolutionary equation with weakly singular kernel
Authors:
Xue-lei Lin,
Michael K. Ng,
Yajing Zhi
Abstract:
In this paper, we study a parallel-in-time (PinT) algorithm for all-at-once system from a non-local evolutionary equation with weakly singular kernel where the temporal term involves a non-local convolution with a weakly singular kernel and the spatial term is the usual Laplacian operator with variable coefficients. We propose to use a two-sided preconditioning technique for the all-at-once discre…
▽ More
In this paper, we study a parallel-in-time (PinT) algorithm for all-at-once system from a non-local evolutionary equation with weakly singular kernel where the temporal term involves a non-local convolution with a weakly singular kernel and the spatial term is the usual Laplacian operator with variable coefficients. We propose to use a two-sided preconditioning technique for the all-at-once discretization of the equation. Our preconditioner is constructed by replacing the variable diffusion coefficients with a constant coefficient to obtain a constant-coefficient all-at-once matrix. We split a square root of the constant Laplacian operator out of the constant-coefficient all-at-once matrix as a right preconditioner and take the remaining part as a left preconditioner, which constitutes our two-sided preconditioning. Exploiting the diagonalizability of the constant-Laplacian matrix and the triangular Toeplitz structure of the temporal discretization matrix, we obtain efficient representations of inverses of the right and the left preconditioners, because of which the iterative solution can be fast updated in a PinT manner. Theoretically, the condition number of the two-sided preconditioned matrix is proven to be uniformly bounded by a constant independent of the matrix size. To the best of our knowledge, for the non-local evolutionary equation with variable coefficients, this is the first attempt to develop a PinT preconditioning technique that has fast and exact implementation and that the corresponding preconditioned system has a uniformly bounded condition number. Numerical results are reported to confirm the efficiency of the proposed two-sided preconditioning technique.
△ Less
Submitted 30 January, 2021;
originally announced February 2021.
-
Low Rank Pure Quaternion Approximation for Pure Quaternion Matrices
Authors:
Guangjing Song,
Weiyang Ding,
Michael K. Ng
Abstract:
Quaternion matrices are employed successfully in many color image processing applications. In particular, a pure quaternion matrix can be used to represent red, green and blue channels of color images. A low-rank approximation for a pure quaternion matrix can be obtained by using the quaternion singular value decomposition. However, this approximation is not optimal in the sense that the resulting…
▽ More
Quaternion matrices are employed successfully in many color image processing applications. In particular, a pure quaternion matrix can be used to represent red, green and blue channels of color images. A low-rank approximation for a pure quaternion matrix can be obtained by using the quaternion singular value decomposition. However, this approximation is not optimal in the sense that the resulting low-rank approximation matrix may not be pure quaternion, i.e., the low-rank matrix contains real component which is not useful for the representation of a color image. The main contribution of this paper is to find an optimal rank-$r$ pure quaternion matrix approximation for a pure quaternion matrix (a color image). Our idea is to use a projection on a low-rank quaternion matrix manifold and a projection on a quaternion matrix with zero real component, and develop an alternating projections algorithm to find such optimal low-rank pure quaternion matrix approximation. The convergence of the projection algorithm can be established by showing that the low-rank quaternion matrix manifold and the zero real component quaternion matrix manifold has a non-trivial intersection point. Numerical examples on synthetic pure quaternion matrices and color images are presented to illustrate the projection algorithm can find optimal low-rank pure quaternion approximation for pure quaternion matrices or color images.
△ Less
Submitted 30 December, 2020;
originally announced December 2020.
-
Riemannian Conjugate Gradient Descent Method for Third-Order Tensor Completion
Authors:
Guang-Jing Song,
Xue-Zhong Wang,
Michael K. Ng
Abstract:
The goal of tensor completion is to fill in missing entries of a partially known tensor under a low-rank constraint. In this paper, we mainly study low rank third-order tensor completion problems by using Riemannian optimization methods on the smooth manifold. Here the tensor rank is defined to be a set of matrix ranks where the matrices are the slices of the transformed tensor obtained by applyin…
▽ More
The goal of tensor completion is to fill in missing entries of a partially known tensor under a low-rank constraint. In this paper, we mainly study low rank third-order tensor completion problems by using Riemannian optimization methods on the smooth manifold. Here the tensor rank is defined to be a set of matrix ranks where the matrices are the slices of the transformed tensor obtained by applying the Fourier-related transformation onto the tubes of the original tensor. We show that with suitable incoherence conditions on the underlying low rank tensor, the proposed Riemannian optimization method is guaranteed to converge and find such low rank tensor with a high probability. In addition, numbers of sample entries required for solving low rank tensor completion problem under different initialized methods are studied and derived. Numerical examples for both synthetic and image data sets are reported to demonstrate the proposed method is able to recover low rank tensors.
△ Less
Submitted 20 November, 2020;
originally announced November 2020.
-
Non-Local Robust Quaternion Matrix Completion for Color Images and Videos Inpainting
Authors:
Zhigang Jia,
Qiyu Jin,
Michael K. Ng,
Xile Zhao
Abstract:
The image nonlocal self-similarity (NSS) prior refers to the fact that a local patch often has many nonlocal similar patches to it across the image and has been widely applied in many recently proposed machining learning algorithms for image processing. However, there is no theoretical analysis on its working principle in the literature. In this paper, we discover a potential causality between NSS…
▽ More
The image nonlocal self-similarity (NSS) prior refers to the fact that a local patch often has many nonlocal similar patches to it across the image and has been widely applied in many recently proposed machining learning algorithms for image processing. However, there is no theoretical analysis on its working principle in the literature. In this paper, we discover a potential causality between NSS and low-rank property of color images, which is also available to grey images. A new patch group based NSS prior scheme is proposed to learn explicit NSS models of natural color images. The numerical low-rank property of patched matrices is also rigorously proved. The NSS-based QMC algorithm computes an optimal low-rank approximation to the high-rank color image, resulting in high PSNR and SSIM measures and particularly the better visual quality. A new tensor NSS-based QMC method is also presented to solve the color video inpainting problem based on quaternion tensor representation. The numerical experiments on color images and videos indicate the advantages of NSS-based QMC over the state-of-the-art methods.
△ Less
Submitted 13 May, 2022; v1 submitted 17 November, 2020;
originally announced November 2020.
-
Nonnegative Low Rank Tensor Approximation and its Application to Multi-dimensional Images
Authors:
Tai-Xiang Jiang,
Michael K. Ng,
Junjun Pan,
Guangjing Song
Abstract:
The main aim of this paper is to develop a new algorithm for computing nonnegative low rank tensor approximation for nonnegative tensors that arise in many multi-dimensional imaging applications. Nonnegativity is one of the important property as each pixel value refers to nonzero light intensity in image data acquisition. Our approach is different from classical nonnegative tensor factorization (N…
▽ More
The main aim of this paper is to develop a new algorithm for computing nonnegative low rank tensor approximation for nonnegative tensors that arise in many multi-dimensional imaging applications. Nonnegativity is one of the important property as each pixel value refers to nonzero light intensity in image data acquisition. Our approach is different from classical nonnegative tensor factorization (NTF) which requires each factorized matrix and/or tensor to be nonnegative. In this paper, we determine a nonnegative low Tucker rank tensor to approximate a given nonnegative tensor. We propose an alternating projections algorithm for computing such nonnegative low rank tensor approximation, which is referred to as NLRT. The convergence of the proposed manifold projection method is established. Experimental results for synthetic data and multi-dimensional images are presented to demonstrate the performance of NLRT is better than state-of-the-art NTF methods.
△ Less
Submitted 26 September, 2021; v1 submitted 28 July, 2020;
originally announced July 2020.
-
Sparse Nonnegative Tensor Factorization and Completion with Noisy Observations
Authors:
Xiongjun Zhang,
Michael K. Ng
Abstract:
In this paper, we study the sparse nonnegative tensor factorization and completion problem from partial and noisy observations for third-order tensors. Because of sparsity and nonnegativity, the underlying tensor is decomposed into the tensor-tensor product of one sparse nonnegative tensor and one nonnegative tensor. We propose to minimize the sum of the maximum likelihood estimation for the obser…
▽ More
In this paper, we study the sparse nonnegative tensor factorization and completion problem from partial and noisy observations for third-order tensors. Because of sparsity and nonnegativity, the underlying tensor is decomposed into the tensor-tensor product of one sparse nonnegative tensor and one nonnegative tensor. We propose to minimize the sum of the maximum likelihood estimation for the observations with nonnegativity constraints and the tensor $\ell_0$ norm for the sparse factor. We show that the error bounds of the estimator of the proposed model can be established under general noise observations. The detailed error bounds under specific noise distributions including additive Gaussian noise, additive Laplace noise, and Poisson observations can be derived. Moreover, the minimax lower bounds are shown to be matched with the established upper bounds up to a logarithmic factor of the sizes of the underlying tensor. These theoretical results for tensors are better than those obtained for matrices, and this illustrates the advantage of the use of nonnegative sparse tensor models for completion and denoising. Numerical experiments are provided to validate the superiority of the proposed tensor-based method compared with the matrix-based approach.
△ Less
Submitted 20 October, 2021; v1 submitted 21 July, 2020;
originally announced July 2020.
-
New Formulation and Computation for Generalized Singular Values of Grassman Matrix Pair
Authors:
Wei-Wei Xu,
Michael K. Ng,
Zheng-Jian Bai
Abstract:
In this paper, we derive new model formulations for computing generalized singular values of a Grassman matrix pair. These new formulations make use of truncated filter matrices to locate the $i$-th generalized singular value of a Grassman matrix pair. The resulting matrix optimization problems can be solved by using numerical methods involving Newton's method on Grassmann manifold. Numerical exam…
▽ More
In this paper, we derive new model formulations for computing generalized singular values of a Grassman matrix pair. These new formulations make use of truncated filter matrices to locate the $i$-th generalized singular value of a Grassman matrix pair. The resulting matrix optimization problems can be solved by using numerical methods involving Newton's method on Grassmann manifold. Numerical examples on synthetic data sets and gene expression data sets are reported to demonstrate the high accuracy and the fast computation of the proposed new ormulations for computing arbitrary generalized singular value of Grassman matrix pair.
△ Less
Submitted 5 April, 2020;
originally announced April 2020.
-
Fast Alternating Projections on Manifolds Based on Tangent Spaces
Authors:
Guangjing Song,
Michael K. Ng
Abstract:
In this paper, we study alternating projections on nontangential manifolds based on the tangent spaces. The main motivation is that the projection of a point onto a manifold can be computational expensive. We propose to use the tangent space of the point in the manifold to approximate the projection onto the manifold in order to reduce the computational cost. We show that the sequence generated by…
▽ More
In this paper, we study alternating projections on nontangential manifolds based on the tangent spaces. The main motivation is that the projection of a point onto a manifold can be computational expensive. We propose to use the tangent space of the point in the manifold to approximate the projection onto the manifold in order to reduce the computational cost. We show that the sequence generated by alternating projections on two nontangential manifolds based on tangent spaces, converges linearly to a point in the intersection of the two manifolds where the convergent point is close to the optimal solution. Numerical examples for nonnegative low rank matrix approximation and low rank image quaternion matrix (color image) approximation, are given to demonstrate that the performance of the proposed method is better than that of the classical alternating projection method in terms of computational time.
△ Less
Submitted 23 March, 2020;
originally announced March 2020.
-
Fast and High-order Accuracy Numerical Methods for Time-Dependent Nonlocal Problems in $\mathbb{R}^2
Authors:
Rongjun Cao,
Minghua Chen,
Michael K. Ng,
Yu-Jiang Wu
Abstract:
In this paper, we study the Crank-Nicolson method for temporal dimension and the piecewise quadratic polynomial collocation method for spatial dimensions of time-dependent nonlocal problems. The new theoretical results of such discretization are that the proposed numerical method is unconditionally stable and its global truncation error is of $\mathcal{O}\left(τ^2+h^{4-γ}\right)$ with $0<γ<1$, whe…
▽ More
In this paper, we study the Crank-Nicolson method for temporal dimension and the piecewise quadratic polynomial collocation method for spatial dimensions of time-dependent nonlocal problems. The new theoretical results of such discretization are that the proposed numerical method is unconditionally stable and its global truncation error is of $\mathcal{O}\left(τ^2+h^{4-γ}\right)$ with $0<γ<1$, where $τ$ and $h$ are the discretization sizes in the temporal and spatial dimensions respectively. Also we develop the conjugate gradient squared method to solving the resulting discretized nonsymmetric and indefinite systems arising from time-dependent nonlocal problems including two-dimensional cases. By using additive and multiplicative Cauchy kernels in non-local problems, structured coefficient matrix-vector multiplication can be performed efficiently in the conjugate gradient squared iteration. Numerical examples are given to illustrate our theoretical results and demonstrate that the computational cost of the proposed method is of $O(M \log M)$ operations where $M$ is the number of collocation points.
△ Less
Submitted 10 February, 2020;
originally announced February 2020.
-
Singular Vectors From Singular Values
Authors:
Weiwei Xu,
Michael K. Ng
Abstract:
In the recent paper \cite{1}, Denton et al. provided the eigenvector-eigenvalue identity for Hermitian matrices, and a survey was also given for such identity in the literature. The main aim of this paper is to present the identity related to singular vectors and singular values of a general matrix.
In the recent paper \cite{1}, Denton et al. provided the eigenvector-eigenvalue identity for Hermitian matrices, and a survey was also given for such identity in the literature. The main aim of this paper is to present the identity related to singular vectors and singular values of a general matrix.
△ Less
Submitted 2 February, 2020;
originally announced February 2020.
-
Nonnegative Low Rank Matrix Approximation for Nonnegative Matrices
Authors:
Guang-Jing Song,
Michael Kwok-Po Ng
Abstract:
This paper describes a new algorithm for computing Nonnegative Low Rank Matrix (NLRM) approximation for nonnegative matrices. Our approach is completely different from classical nonnegative matrix factorization (NMF) which has been studied for more than twenty five years. For a given nonnegative matrix, the usual NMF approach is to determine two nonnegative low rank matrices such that the distance…
▽ More
This paper describes a new algorithm for computing Nonnegative Low Rank Matrix (NLRM) approximation for nonnegative matrices. Our approach is completely different from classical nonnegative matrix factorization (NMF) which has been studied for more than twenty five years. For a given nonnegative matrix, the usual NMF approach is to determine two nonnegative low rank matrices such that the distance between their product and the given nonnegative matrix is as small as possible. However, the proposed NLRM approach is to determine a nonnegative low rank matrix such that the distance between such matrix and the given nonnegative matrix is as small as possible. There are two advantages. (i) The minimized distance by the proposed NLRM method can be smaller than that by the NMF method, and it implies that the proposed NLRM method can obtain a better low rank matrix approximation. (ii) Our low rank matrix admits a matrix singular value decomposition automatically which provides a significant index based on singular values that can be used to identify important singular basis vectors, while this information cannot be obtained in the classical NMF. The proposed NLRM approximation algorithm was derived using the alternating projection on the low rank matrix manifold and the non-negativity property. Experimental results are presented to demonstrate the above mentioned advantages of the proposed NLRM method compared the NMF method.
△ Less
Submitted 16 June, 2020; v1 submitted 14 December, 2019;
originally announced December 2019.
-
Bilinear Constraint based ADMM for Mixed Poisson-Gaussian Noise Removal
Authors:
Jie Zhang,
Yuping Duan,
Yue Lu,
Michael K. Ng,
Huibin Chang
Abstract:
In this paper, we propose new operator-splitting algorithms for the total variation regularized infimal convolution (TV-IC) model [4] in order to remove mixed Poisson-Gaussian(MPG) noise. In the existing splitting algorithm for TV-IC, an inner loop by Newton method had to be adopted for one nonlinear optimization subproblem, which increased the computation cost per outer loop. By introducing a new…
▽ More
In this paper, we propose new operator-splitting algorithms for the total variation regularized infimal convolution (TV-IC) model [4] in order to remove mixed Poisson-Gaussian(MPG) noise. In the existing splitting algorithm for TV-IC, an inner loop by Newton method had to be adopted for one nonlinear optimization subproblem, which increased the computation cost per outer loop. By introducing a new bilinear constraint and applying the alternating direction method of multipliers (ADMM), all subproblems of the proposed algorithms named as BCA (short for Bilinear Constraint based ADMM algorithm) and BCAf(short for a variant of BCA with fully splitting form) can be very efficiently solved; especially for the proposed BCAf, they can be calculated without any inner iterations. Under mild conditions, the convergence of the proposed BCA is investigated. Numerically, compared to existing primal-dual algorithms for the TV-IC model, the proposed algorithms, with fewer tunable parameters, converge much faster and produce comparable results meanwhile.
△ Less
Submitted 27 January, 2020; v1 submitted 17 October, 2019;
originally announced October 2019.
-
An efficient second-order convergent scheme for one-side space fractional diffusion equations with variable coefficients
Authors:
Xue-lei Lin,
Pin Lyu,
Michael K. Ng,
Hai-Wei Sun,
Seakweng Vong
Abstract:
In this paper, a second order finite difference scheme is investigated for time-dependent one-side space fractional diffusion equations with variable coefficients. The existing schemes for the equation with variable coefficients have temporal convergence rate no better than second order and spatial convergence rate no better than first order, theoretically. In the presented scheme, the Crank-Nicol…
▽ More
In this paper, a second order finite difference scheme is investigated for time-dependent one-side space fractional diffusion equations with variable coefficients. The existing schemes for the equation with variable coefficients have temporal convergence rate no better than second order and spatial convergence rate no better than first order, theoretically. In the presented scheme, the Crank-Nicolson temporal discretization and a second-order weighted-and-shifted Grünwald-Letnikov spatial discretization are employed. Theoretically, the unconditional stability and the second-order convergence in time and space of the proposed scheme are established under some conditions on the diffusion coefficients. Moreover, a Toeplitz preconditioner is proposed for linear systems arising from the proposed scheme. The condition number of the preconditioned matrix is proven to be bounded by a constant independent of the discretization step-sizes so that the Krylov subspace solver for the preconditioned linear systems converges linearly. Numerical results are reported to show the convergence rate and the efficiency of the proposed scheme.
△ Less
Submitted 22 February, 2019;
originally announced February 2019.
-
Parallel Active Subspace Decomposition for Scalable and Efficient Tensor Robust Principal Component Analysis
Authors:
Jonathan Q. Jiang,
Michael K. Ng
Abstract:
Tensor robust principal component analysis (TRPCA) has received a substantial amount of attention in various fields. Most existing methods, normally relying on tensor nuclear norm minimization, need to pay an expensive computational cost due to multiple singular value decompositions (SVDs) at each iteration. To overcome the drawback, we propose a scalable and efficient method, named Parallel Activ…
▽ More
Tensor robust principal component analysis (TRPCA) has received a substantial amount of attention in various fields. Most existing methods, normally relying on tensor nuclear norm minimization, need to pay an expensive computational cost due to multiple singular value decompositions (SVDs) at each iteration. To overcome the drawback, we propose a scalable and efficient method, named Parallel Active Subspace Decomposition (PASD), which divides the unfolding along each mode of the tensor into a columnwise orthonormal matrix (active subspace) and another small-size matrix in parallel. Such a transformation leads to a nonconvex optimization problem in which the scale of nulcear norm minimization is generally much smaller than that in the original problem. Furthermore, we introduce an alternating direction method of multipliers (ADMM) method to solve the reformulated problem and provide rigorous analyses for its convergence and suboptimality. Experimental results on synthetic and real-world data show that our algorithm is more accurate than the state-of-the-art approaches, and is orders of magnitude faster.
△ Less
Submitted 28 December, 2017;
originally announced December 2017.
-
Local dimensions of measures of finite type II - Measures without full support and with non-regular probabilities
Authors:
Kathryn E. Hare,
Kevin G. Hare,
Michael Ka Shing Ng
Abstract:
Consider a sequence of linear contractions $S_{j}(x)=\varrho x+d_{j}$ and probabilities $p_{j}>0$ with $\sum p_{j}=1$. We are interested in the self-similar measure $μ=\sum p_{j}μ\circ S_{j}^{-1}$, of finite type. In this paper we study the multi-fractal analysis of such measures, extending the theory to measures arising from non-regular probabilities and whose support is not necessarily an interv…
▽ More
Consider a sequence of linear contractions $S_{j}(x)=\varrho x+d_{j}$ and probabilities $p_{j}>0$ with $\sum p_{j}=1$. We are interested in the self-similar measure $μ=\sum p_{j}μ\circ S_{j}^{-1}$, of finite type. In this paper we study the multi-fractal analysis of such measures, extending the theory to measures arising from non-regular probabilities and whose support is not necessarily an interval.
Under some mild technical assumptions, we prove that there exists a subset of supp$μ$ of full $μ$ and Hausdorff measure, called the truly essential class, for which the set of (upper or lower) local dimensions is a closed interval. Within the truly essential class we show that there exists a point with local dimension exactly equal to the dimension of the support. We give an example where the set of local dimensions is a two element set, with all the elements of the truly essential class giving the same local dimension. We give general criteria for these measures to be absolutely continuous with respect to the associated Hausdorff measure of their support and we show that the dimension of the support can be computed using only information about the essential class.
To conclude, we present a detailed study of three examples. First, we show that the set of local dimensions of the biased Bernoulli convolution with contraction ratio the inverse of a simple Pisot number always admits an isolated point. We give a precise description of the essential class of a generalized Cantor set of finite type. Lastly, we study a maximal loop class that is not truly essential.
△ Less
Submitted 7 March, 2016;
originally announced March 2016.