Search | arXiv e-print repository

Open Problems in Computability Theory and Descriptive Set Theory

Authors: George Barmpalias, Nikolay Bazhenov, Chi Tat Chong, Wei Dai, Su Gao, Jun Le Goh, Jialiang He, Keng Meng Selwyn Ng, Andre Nies, Theodore Slaman, Riley Thornton, Wei Wang, Jing Yu, Liang Yu

Abstract: These open problems were presented in the Problem Sessions held during the Tianyuan Workshop on Computability Theory and Descriptive Set Theory, June 16-20, 2025. The problems are organized into sections named after their contributors, in the order of their presentations during the workshop. Notes were taken and compiled by Wei Dai, Feng Li, Ruiwen Li, Ming Xiao, Xu Wang, Víctor Hugo Yañez Salazar, and Yang Zheng.

Submitted 5 July, 2025; originally announced July 2025.

MSC Class: 03E15; 03D30

arXiv:2504.15073 [pdf, ps, other]

Hermitian Quaternion Toeplitz Matrices by Quaternion-valued Generating Functions

Authors: Xue-lei Lin, Michael K. Ng, Junjun Pan

Abstract: In this paper, we study Hermitian quaternion Toeplitz matrices generated by quaternion-valued functions. We show that such a generating function must be the sum of a real-valued function and an odd function with imaginary component. This setting is different from the case of Hermitian complex Toeplitz matrices generated by real-valued functions only. By using the 2-by-2 block complex representation of quaternion matrices, we give a quaternion version of the Grenander-Szegö theorem stating the distribution of eigenvalues of Hermitian quaternion Toeplitz matrices in terms of their generating functions. As an application, we investigate Strang's circulant preconditioners for Hermitian quaternion Toeplitz linear systems arising from quaternion signal processing. We show that Strang's circulant preconditioners can be diagonalized by discrete quaternion Fourier transform matrices whereas general quaternion circulant matrices cannot be diagonalized by them. We also verify the theoretical and numerical convergence results of Strang's circulant preconditioned conjugate gradient method for solving Hermitian quaternion Toeplitz systems.

Submitted 21 April, 2025; originally announced April 2025.

arXiv:2504.04509 [pdf, other]

Truncated Huber Penalty for Sparse Signal Recovery with Convergence Analysis

Authors: Li Yang, Serena Morigi, Michael K. Ng, You-wei Wen

Abstract: Sparse signal recovery from under-determined systems presents significant challenges when using conventional L_0 and L_1 penalties, primarily due to computational complexity and estimation bias. This paper introduces a truncated Huber penalty, a non-convex metric that effectively bridges the gap between unbiased sparse recovery and differentiable optimization. The proposed penalty applies quadratic regularization to small entries while truncating large magnitudes, avoiding non-differentiable points at optima. Theoretical analysis demonstrates that, for an appropriately chosen threshold, any s-sparse solution recoverable via conventional penalties remains a local optimum under the truncated Huber function. This property allows the exact and robust recovery theories developed for other penalty regularization functions to be directly extended to the truncated Huber function. To solve the optimization problem, we develop a block coordinate descent (BCD) algorithm with finite-step convergence guarantees under spark conditions. Numerical experiments are conducted to validate the effectiveness and robustness of the proposed approach. Furthermore, we extend the truncated Huber-penalized model to the gradient domain, illustrating its applicability in signal denoising and image smoothing.

Submitted 6 April, 2025; originally announced April 2025.

MSC Class: 90C26; 90C90; 65K10; 49N45

arXiv:2503.04447 [pdf, other]

A Graph-Partitioning Based Continuous Optimization Approach to Semi-supervised Clustering Problems

Authors: Wei Liu, Xin Liu, Michael K. Ng, Zaikun Zhang

Abstract: Semi-supervised clustering is a basic problem in various applications. Most existing methods require knowledge of the ideal cluster number, which is often difficult to obtain in practice. Besides, satisfying the must-link constraints is another major challenge for these methods. In this work, we view the semi-supervised clustering task as a partitioning problem on a graph associated with the given dataset, where the similarity matrix includes a scaling parameter to reflect the must-link constraints. Utilizing a relaxation technique, we formulate the graph partitioning problem into a continuous optimization model that does not require the exact cluster number, but only an overestimate of it. We then propose a block coordinate descent algorithm to efficiently solve this model, and establish its convergence result. Based on the obtained solution, we can construct the clusters that theoretically meet the must-link constraints under mild assumptions. Furthermore, we verify the effectiveness and efficiency of our proposed method through comprehensive numerical experiments.

Submitted 6 March, 2025; originally announced March 2025.

arXiv:2501.06928 [pdf, other]

Scissors congruence K-theory for equivariant manifolds

Authors: Mona Merling, Ming Ng, Julia Semikina, Alba Sendón Blanco, Lucas Williams

Abstract: We introduce a scissors congruence $K$-theory spectrum which lifts the equivariant scissors congruence groups for compact $G$-manifolds with boundary, and we show that on $π_0$ this is the source of a spectrum level lift of the Burnside ring valued equivariant Euler characteristic of a compact $G$-manifold. We also show that the equivariant scissors congruence groups for varying subgroups assemble into a Mackey functor, which is a shadow of a conjectural higher genuine equivariant structure.

Submitted 12 January, 2025; originally announced January 2025.

Comments: 20 pages, 1 figure

MSC Class: Primary: 19D55; 19D99; 57R91; Secondary: 19D10; 19A49; 55P91; 55S91

arXiv:2501.01615

Equity Impacts of Public Transit Network Redesign with Shared Autonomous Mobility Services

Authors: Max T. M. Ng, Meredith Raymer, Hani S. Mahmassani, Omer Verbas, Taner Cokyasar

Abstract: This study examines the equity impacts of integrating shared autonomous mobility services (SAMS) into transit system redesign. Using the Greater Chicago area as a case study, we compare two optimization objectives in multimodal transit network redesign: minimizing total generalized costs (equity-agnostic) versus prioritizing service in low-income areas (equity-focused). We evaluate the achieved accessibility of clustered zones with redesigned transit networks under two objectives, compared to driving and the existing transit network. The transit access gaps across zones and between transit and driving are found to be generally reduced with the introduction of SAMS, but less so with the subsequent improved infrastructure under budget. Differential improvement in equity is seen across suburbs and areas of the city, reflecting the disparity in current transit access and improvement potential. In particular, SAMS bridges the transit access gaps in suburban and city areas currently underserved by transit. The City of Chicago, which is also disproportionately home to vulnerable populations, offers an avenue to improve vertical equity. These findings demonstrate that SAMS can enhance both horizontal and vertical equity in transit systems, particularly when equity is explicitly incorporated into the design objective.

Submitted 8 January, 2025; v1 submitted 2 January, 2025; originally announced January 2025.

Comments: Restructuring the paper for more precise research direction

arXiv:2501.01614 [pdf]

doi 10.1177/03611981231170182

Evaluation of Rail Decarbonization Alternatives: Framework and Application

Authors: Adrian Hernandez, Max TM Ng, Nazib Siddique, Pablo L. Durango-Cohen, Amgad Elgowainy, Hani S. Mahmassani, Michael Wang, Yan Zhou

Abstract: The Northwestern University Freight Rail Infrastructure and Energy Network Decarbonization (NUFRIEND) framework is a comprehensive industry-oriented tool for simulating the deployment of new energy technologies including biofuels, e-fuels, battery-electric, and hydrogen locomotives. By classifying fuel types into two categories based on deployment requirements, the associated optimal charging/fueling facility location and sizing problems are solved with a five-step framework. Life cycle analyses (LCA) and techno-economic analyses (TEA) are used to estimate carbon reduction, capital investments, cost of carbon reduction, and operational impacts, enabling sensitivity analysis with operational and technological parameters. The framework is illustrated on lower-carbon drop-in fuels as well as battery-electric technology deployments for US Eastern and Western Class I railroad networks. Drop-in fuel deployments are modeled as admixtures with diesel in existing locomotives, while battery-electric deployments are shown for varying technology penetration levels and locomotive ranges. When mixed in a 50 percent ratio with diesel, results show biodiesel's capacity to reduce emissions by 36 percent with a cost of 0.13 USD per kilogram of CO2 reduced, while e-fuels offer a 50 percent emissions reduction potential at a cost of 0.22 USD per kilogram of CO2 reduced. Battery-electric results for 50 percent deployment over all ton-miles highlight the value of future innovations in battery energy densities as scenarios assuming 800-mile range locomotives show an estimated emissions reduction of 46 percent with a cost of 0.06 USD per kilogram of CO2 reduced, compared to 16 percent emissions reduction at a cost of 0.11 USD per kilogram of CO2 reduced for 400-mile range locomotives.

Submitted 2 January, 2025; originally announced January 2025.

Comments: 29 pages, 17 figures. This is the accepted version of a work that was published in Transportation Research Record

Journal ref: Transportation Research Record 2678.1 (2024): 102-121

arXiv:2501.00219 [pdf]

doi 10.1177/03611981221098660

Autonomous Minibus Service with Semi-on-demand Routes in Grid Networks

Authors: Max T. M. Ng, Hani S. Mahmassani

Abstract: This paper investigates the potential of autonomous minibuses which take on-demand directional routes for pick-up and drop-off in a grid network of wider area with low density, followed by fixed routes in areas with demand. Mathematical formulation for generalized costs demonstrates its benefits, with indicators proposed to select existing bus routes for conversion with the options of zonal express and parallel routes. Simulations on modeled scenarios and case studies with bus routes in Chicago show reductions in both passenger costs and generalized costs over existing fixed-route bus service between suburban areas and CBD.

Submitted 30 December, 2024; originally announced January 2025.

Comments: 38 pages, 35 figures. This is the accepted version of a work that was published in Transportation Research Record

Journal ref: Transportation Research Record 2677.1 (2023): 178-200

arXiv:2412.20667 [pdf]

doi 10.1177/03611981231185145

Highway Managed Lane Usage and Tolling for Mixed Traffic Flows with Connected Automated Vehicles (CAVs) and High-Occupancy Vehicles (HOVs)

Authors: Max T. M. Ng, Hani S. Mahmassani

Abstract: This paper investigates managed lane (ML) toll setting and its effect under mixed traffic of connected automated vehicles (CAVs), high-occupancy vehicles (HOVs), and human-driven vehicles (HDVs), with a goal to avoid flow breakdown and minimize total social cost. A mesoscopic finite-difference traffic simulation model considers the flow-density relationship at different CAV market penetration rates, lane-changing behavior, and multiple entries/exits, interacting with a reactive toll setting mechanism. The results of the Monte Carlo simulation suggest an optimal policy of untolled HOV/CAV use with HDV tolls in particular scenarios of limited CAV market penetration. Small and targeted tolling avoids flow breakdown in ML while prioritizing HOVs and other vehicles with high values of time. Extensions of the formulation and sensitivity analysis quantify the benefits of converting high-occupancy HDVs to CAVs. The optimal tolling regime combines traffic science notions of flow stability and the economics of resource allocation.

Submitted 29 December, 2024; originally announced December 2024.

Comments: 38 pages, 23 figures. This is the accepted version of a work that was published in Transportation Research Record

Journal ref: Transportation Research Record 2678.4 (2024): 505-526

arXiv:2412.19719 [pdf]

doi 10.1016/j.tre.2024.103601

Trading Off Energy Storage and Payload -- An Analytical Model for Freight Train Configuration

Authors: Max T. M. Ng, Adrian Hernandez, Pablo L. Durango-Cohen, Hani S. Mahmassani

Abstract: To support planning of alternative fuel technology (e.g., battery-electric locomotives) deployment for decarbonizing non-electrified freight rail, we develop a convex optimization formulation with a closed-form solution to determine the optimal number of energy storage tender cars in a train. The formulation shares a similar structure to an Economic Order Quantity (EOQ) model. For given market characteristics, cost forecasts, and technology parameters, our model captures the trade-offs between inventory carrying costs associated with trip times (including delays due to charging/refueling) and ordering costs associated with train dispatch and operation (energy, amortized equipment, and labor costs). To illustrate the framework, we find the optimal number of battery-electric energy tender cars in 22,501 freight markets (origin-destination pairs and commodities) for U.S. Class I railroads. The results display heterogeneity in optimal configurations with lighter, yet more time-sensitive shipments (e.g., intermodal) utilizing more battery tender cars. For heavier commodities (e.g., coal) with lower holding costs, single battery tender car configurations are generally optimal. The results also show that the optimal train configurations are sensitive to delays associated with recharging or swapping tender cars.

Submitted 27 December, 2024; originally announced December 2024.

Comments: 42 pages, 19 figures. This is the accepted version of a work that was published in Transportation Research Part E: Logistics and Transportation Review

Journal ref: Transportation Research Part E: Logistics and Transportation Review Volume 187, July 2024, 103601

arXiv:2412.19401 [pdf]

Joint Optimization of Multimodal Transit Frequency and Shared Autonomous Vehicle Fleet Size with Hybrid Metaheuristic and Nonlinear Programming

Authors: Max T. M. Ng, Hani S. Mahmassani, Draco Tong, Omer Verbas, Taner Cokyasar

Abstract: Shared autonomous vehicles (SAVs) bring competition to traditional transit services but redesigning multimodal transit network can utilize SAVs as feeders to enhance service efficiency and coverage. This paper presents an optimization framework for the joint multimodal transit frequency and SAV fleet size problem, a variant of the transit network frequency setting problem. The objective is to maximize total transit ridership (including SAV-fed trips and subtracting boarding rejections) across multiple time periods under budget constraints, considering endogenous mode choice (transit, point-to-point SAVs, driving) and route selection, while allowing for strategic route removal by setting frequencies to zero. Due to the problem's non-linear, non-convex nature and the computational challenges of large-scale networks, we develop a hybrid solution approach that combines a metaheuristic approach (particle swarm optimization) with nonlinear programming for local solution refinement. To ensure computational tractability, the framework integrates analytical approximation models for SAV waiting times based on fleet utilization, multimodal network assignment for route choice, and multinomial logit mode choice behavior, bypassing the need for computationally intensive simulations within the main optimization loop. Applied to the Chicago metropolitan area's multimodal network, our method illustrates a 33.3% increase in transit ridership through optimized transit route frequencies and SAV integration, particularly enhancing off-peak service accessibility and strategically reallocating resources.

Submitted 22 April, 2025; v1 submitted 26 December, 2024; originally announced December 2024.

Comments: 23 pages, 5 figures, a previous version is accepted for presentation at the Conference on Advanced Systems in Public Transport and TransitData 2025 in Kyoto, Japan on 1 - 4 July 2025

arXiv:2412.18991 [pdf, ps, other]

The singleton degrees of the $Σ^0_2$ sets are not dense

Authors: Thomas F. Kent, Keng Meng Ng, Andrea Sorbi

Abstract: Answering an open question raised by Cooper, we show that there exist $Δ^0_2$ sets $D$ and $E$ such that the singleton degree of $E$ is a minimal cover of the singleton degree of $D$. This shows that the $Σ^{0}_{2}$ singleton degrees, and the $Δ^{0}_{2}$ singleton degrees, are not dense (and consequently the $Π^0_2$ $Q$-degrees, and the $Δ^{0}_{2}$ $Q$-degrees, are not dense). Moreover $D$ and $E$ can be built to lie in the same enumeration degree.

Submitted 25 December, 2024; originally announced December 2024.

MSC Class: 03D25; 03D30

arXiv:2411.06043 [pdf, ps, other]

The subTuring degrees

Authors: Takayuki Kihara, Keng Meng Ng

Abstract: In this article, we introduce a notion of reducibility for partial functions on the natural numbers, which we call subTuring reducibility. One important aspect is that the subTuring degrees correspond to the structure of the realizability subtoposes of the effective topos. We show that the subTuring degrees (that is, the realizability subtoposes of the effective topos) form a dense non-modular (thus, non-distributive) lattice. We also show that there is a nonzero join-irreducible subTuring degree (which implies that there is a realizability subtopos of the effective topos that cannot be decomposed into two smaller realizability subtoposes).

Submitted 20 November, 2024; v1 submitted 8 November, 2024; originally announced November 2024.

arXiv:2409.19068 [pdf]

Joint Optimization of Pattern, Headway, and Fleet Size of Multiple Urban Transit Lines with Perceived Headway Consideration and Passenger Flow Allocation

Authors: Max T. M. Ng, Draco Tong, Hani S. Mahmassani, Omer Verbas, Taner Cokyasar

Abstract: This study addresses the urban transit pattern design problem, optimizing stop sequences, headways, and fleet sizes across multiple routes and periods simultaneously to minimize user costs (composed of riding, waiting, and transfer times) under operational constraints (e.g., vehicle capacity and fleet size). A destination-labeled multi-commodity network flow (MCNF) formulation is developed to solve the problem at a large scale more efficiently compared to the previous literature. The model allows for flexible pattern options without relying on pre-defined candidate sets and simultaneously considers multiple operational strategies such as express/local services, short-turning, and deadheading. It evaluates perceived headways of joint patterns for passengers, assigns passenger flows to each pattern accordingly, and allows transfers across patterns in different directions. The mixed-integer linear programming (MILP) model is demonstrated with a city-sized network of metro lines in Chicago, USA, achieving near-optimal solutions in hours. The total weighted journey times are reduced by 0.61% and 5.76% under single-route and multi-period multi-route scenarios respectively. The model provides transit agencies with an efficient tool for comprehensive service design and resource allocation, improving service quality and resource utilization without additional operational costs.

Submitted 26 December, 2024; v1 submitted 27 September, 2024; originally announced September 2024.

Comments: 25 pages, 4 figures, a previous version accepted for presentation in the 104th Transportation Research Board Annual Meeting in Washington, D.C. in January 2025

arXiv:2408.10547 [pdf, other]

Semi-on-Demand Off-Peak Transit Services with Shared Autonomous Vehicles -- Service Planning, Simulation, and Analysis in Munich, Germany

Authors: Max T. M. Ng, Roman Engelhardt, Florian Dandl, Vasileios Volakakis, Hani S. Mahmassani, Klaus Bogenberger

Abstract: This study investigates the implementation of semi-on-demand (SoD) hybrid-route services using Shared Autonomous Vehicles (SAVs) on existing transit lines. SoD services combine the cost efficiency of fixed-route buses with the flexibility of on-demand services. SAVs first serve all scheduled fixed-route stops, then drop off and pick up passengers in the pre-determined flexible-route portion, and return to the fixed route. This study addresses four key questions: optimal fleet and vehicle sizes for peak-hour fixed-route services with SAVs and during transition (from drivers to autonomous vehicles), optimal off-peak SoD service planning, and suitable use cases. The methodology combines analytical modeling for service planning with agent-based simulation for operational analysis. We examine ten bus routes in Munich, Germany, considering full SAV and transition scenarios with varying proportions of drivers. Our findings demonstrate that the lower operating costs of SAVs improve service quality through increased frequency and smaller vehicles, even in transition scenarios. The reduced headway lowers waiting time and also favors more flexible-route operation in SoD services. The optimal SoD settings range from fully flexible to hybrid routes, where higher occupancy from the terminus favors shorter flexible routes. During the transition phase, limited fleet size and higher headways constrain the benefits of flexible-route operations. The simulation results corroborate the SoD benefits of door-to-door convenience, attracting more passengers without excessive detours and operator costs at moderate flexible-route lengths, and validate the analytical model.

Submitted 18 December, 2024; v1 submitted 20 August, 2024; originally announced August 2024.

Comments: 38 pages, 10 figures, previous version accepted for presentation at the 104th Transportation Research Board Annual Meeting, Washington, D.C

arXiv:2408.05582 [pdf, ps, other]

Non-Negative Reduced Biquaternion Matrix Factorization with Applications in Color Face Recognition

Authors: Jifei Miao, Junjun Pan, Michael K. Ng

Abstract: Reduced biquaternion (RB), as a four-dimensional algebra highly suitable for representing color pixels, has recently garnered significant attention from numerous scholars. In this paper, for color image processing problems, we introduce a concept of the non-negative RB matrix and then use the multiplication properties of RB to propose a non-negative RB matrix factorization (NRBMF) model. The NRBMF model is introduced to address the challenge of reasonably establishing a non-negative quaternion matrix factorization model, which is primarily hindered by the multiplication properties of traditional quaternions. Furthermore, this paper transforms the problem of solving the NRBMF model into an RB alternating non-negative least squares (RB-ANNLS) problem. Then, by introducing a method to compute the gradient of the real function with RB matrix variables, we solve the RB-ANNLS optimization problem using the RB projected gradient algorithm and conduct a convergence analysis of the algorithm. Finally, we validate the effectiveness and superiority of the proposed NRBMF model in color face recognition.

Submitted 9 July, 2025; v1 submitted 10 August, 2024; originally announced August 2024.

arXiv:2405.12114 [pdf, other]

A New Cross-Space Total Variation Regularization Model for Color Image Restoration with Quaternion Blur Operator

Authors: Zhigang Jia, Yuelian Xiang, Meixiang Zhao, Tingting Wu, Michael K. Ng

Abstract: The cross-channel deblurring problem in color image processing is difficult to solve due to the complex coupling and structural blurring of color pixels. Until now, there have been few efficient algorithms that can reduce color artifacts in the deblurring process. To solve this challenging problem, we present a novel cross-space total variation (CSTV) regularization model for color image deblurring by introducing a quaternion blur operator and a cross-color space regularization functional. The existence and uniqueness of the solution are proved and a new L-curve method is proposed to find a balance of regularization terms on different color spaces. The Euler-Lagrange equation is derived to show that CSTV has taken into account the coupling of all color channels and the local smoothing within each color channel. A quaternion operator splitting method is first proposed to enhance the ability of color artifacts reduction of the CSTV regularization model. This strategy also applies to the well-known color deblurring models. Numerical experiments on color image databases illustrate the efficiency and effectiveness of the new model and algorithms. The color images restored by them successfully maintain the color and spatial information and are of higher quality in terms of PSNR, SSIM, MSE and CIEde2000 than the restorations of state-of-the-art methods.

Submitted 26 January, 2025; v1 submitted 20 May, 2024; originally announced May 2024.

Comments: 15 pages, 14 figures

arXiv:2405.04338 [pdf, ps, other]

The computational content of multidimensional discontinuity

Authors: Rupert Hölzl, Keng Meng Ng

Abstract: The Weihrauch degrees are a tool to gauge the computational difficulty of mathematical problems. Often, what makes these problems hard is their discontinuity. We look at discontinuity in its purest form, that is, at otherwise constant functions that make a single discontinuous step along each dimension of their underlying space. This is an extension of previous work of Kihara, Pauly, Westrick from a single dimension to multiple dimensions. Among other results, we obtain strict hierarchies in the Weihrauch degrees, one of which orders mathematical problems by the richness of the truth-tables determining how discontinuous steps influence the output.

Submitted 18 July, 2024; v1 submitted 7 May, 2024; originally announced May 2024.

MSC Class: 03D78; 03D30; 03F60

arXiv:2404.11390 [pdf, ps, other]

A $τ$-preconditioner for space fractional diffusion equation with non-separable variable coefficients

Authors: Xue-Lei Lin, Michael K. Ng

Abstract: In this paper, we study a $τ$-matrix approximation based preconditioner for the linear systems arising from discretization of unsteady-state Riesz space fractional diffusion equation with non-separable variable coefficients. The structure of coefficient matrices of the linear systems is identity plus summation of diagonal-times-multilevel-Toeplitz matrices. In our preconditioning technique, the diagonal matrices are approximated by scalar identity matrices and the Toeplitz matrices are approximated by $τ$-matrices (a type of matrices diagonalizable by discrete sine transforms). The proposed preconditioner is fast invertible through the fast sine transform (FST) algorithm. Theoretically, we show that the GMRES solver for the preconditioned systems has an optimal convergence rate (a convergence rate independent of discretization stepsizes). To the best of our knowledge, this is the first preconditioning method with the optimal convergence rate for the variable-coefficients space fractional diffusion equation. Numerical results are reported to demonstrate the efficiency of the proposed method.

Submitted 17 April, 2024; originally announced April 2024.

MSC Class: 65B99; 65M22; 65F08; 65F10

arXiv:2403.15804 [pdf, other]

Semi-on-Demand Hybrid Transit Route Design with Shared Autonomous Mobility Services

Authors: Max T. M. Ng, Florian Dandl, Hani S. Mahmassani, Klaus Bogenberger

Abstract: This study examines the route design of a semi-on-demand hybrid route directional service in the public transit network, offering on-demand flexible route service in low-density areas and fixed route service in higher-density areas with Shared Autonomous Mobility Service (SAMS). The study develops analytically tractable cost expressions that capture access, waiting, and riding costs for users, and distance-based operating and time-based vehicle costs for operators. Two formulations are presented for strategic and tactical decisions in flexible route portion, fleet size, headway, and vehicle size optimization, enabling the determination of route types between fixed, hybrid, and flexible routes based on demand, cost, and operational parameters. The practical applications and benefits of semi-on-demand feeders are demonstrated with numerical examples and a large-scale case study in the Chicago metropolitan area. Findings reveal scenarios in which flexible route portions serving passengers located further away reduce total costs, particularly user costs. Lower operating costs in lower-demand areas favor more flexible routes, whereas higher demand densities favor more traditional line-based operations. On two studied lines, a current cost forecast favors smaller vehicles with flexible routes, but operating constraints and higher operating costs would favor bigger vehicles with hybrid routes.
The study provides an analytical tool to design SAMS as directional services and transit feeders, and tractable continuous approximation formulations for future research in transit network design.

Submitted 7 August, 2024; v1 submitted 23 March, 2024; originally announced March 2024.

Comments: 24 pages, 12 figures, previous version presented at the 103rd Transportation Research Board Annual Meeting, Washington, D.C.

arXiv:2403.12770 [pdf, other]

Multispectral Image Restoration by Generalized Opponent Transformation Total Variation

Authors: Zhantao Ma, Michael K. Ng

Abstract: Multispectral images (MSI) contain light information in different wavelengths of objects, which convey spectral-spatial information and help improve the performance of various image processing tasks. Numerous techniques have been created to extend the application of total variation regularization in restoring multispectral images, for example, based on channel coupling and adaptive total variation regularization. The primary contribution of this paper is to propose and develop a new multispectral total variation regularization in a generalized opponent transformation domain instead of the original multispectral image domain. Here opponent transformations for multispectral images are generalized from a well-known opponent transformation for color images. We will explore the properties of generalized opponent transformation total variation (GOTTV) regularization and the corresponding optimization formula for multispectral image restoration. To evaluate the effectiveness of the new GOTTV method, we provide numerical examples that showcase its superior performance compared to existing multispectral image total variation methods, using criteria such as MPSNR and MSSIM.

Submitted 19 March, 2024; originally announced March 2024.

MSC Class: 65F22; 68U10; 35A15; 65K10; 52A41

arXiv:2403.04254 [pdf, other]

Finite final segments of the d.c.e. Turing degrees

Authors: Steffen Lempp, Yiqun Liu, Yong Liu, Keng Meng Ng, Cheng Peng, Guohua Wu

Abstract: We prove that every finite distributive lattice is isomorphic to a final segment of the d.c.e. Turing degrees (i.e., the degrees of differences of computably enumerable sets). As a corollary, we are able to infer the undecidability of the EAE-theory of the d.c.e. degrees in the language of partial ordering.

Submitted 21 March, 2024; v1 submitted 7 March, 2024; originally announced March 2024.

MSC Class: 03D28

arXiv:2402.18074 [pdf, other]

A One-step Image Retargeting Algorithm Based on Conformal Energy

Authors: Chengyang Liu, Michael K. Ng

Abstract: The image retargeting problem is to find a proper mapping to resize an image to one with a prescribed aspect ratio, which is quite popular these days. In this paper, we propose an efficient and orientation-preserving one-step image retargeting algorithm based on minimizing the harmonic energy, which can well preserve the regions of interest (ROIs) and line structures in the image. We also give some mathematical proofs in the paper to ensure the well-posedness and accuracy of our algorithm.

Submitted 28 February, 2024; originally announced February 2024.

Comments: 24 pages, 10 figures

arXiv:2402.13572 [pdf, other]

AlgoFormer: An Efficient Transformer Framework with Algorithmic Structures

Authors: Yihang Gao, Chuanyang Zheng, Enze Xie, Han Shi, Tianyang Hu, Yu Li, Michael K. Ng, Zhenguo Li, Zhaoqiang Liu

Abstract: Besides natural language processing, transformers exhibit extraordinary performance in solving broader applications, including scientific computing and computer vision. Previous works try to explain this from the expressive power and capability perspectives that standard transformers are capable of performing some algorithms. To empower transformers with algorithmic capabilities and motivated by the recently proposed looped transformer, we design a novel transformer framework, dubbed Algorithm Transformer (abbreviated as AlgoFormer). We provide an insight that efficient transformer architectures can be designed by leveraging prior knowledge of tasks and the underlying structure of potential algorithms. Compared with the standard transformer and vanilla looped transformer, the proposed AlgoFormer can perform efficiently in algorithm representation in some specific tasks. In particular, inspired by the structure of human-designed learning algorithms, our transformer framework consists of a pre-transformer that is responsible for task preprocessing, a looped transformer for iterative optimization algorithms, and a post-transformer for producing the desired results after post-processing. We provide theoretical evidence of the expressive power of the AlgoFormer in solving some challenging problems, mirroring human-designed algorithms. Furthermore, some theoretical and empirical results are presented to show that the designed transformer has the potential to perform algorithm representation and learning.
Experimental results demonstrate the empirical superiority of the proposed transformer in that it outperforms the standard transformer and vanilla looped transformer in some specific tasks. An extensive experiment on real language tasks (e.g., neural machine translation of German and English, and text classification) further validates the expressiveness and effectiveness of AlgoFormer.

Submitted 10 January, 2025; v1 submitted 21 February, 2024; originally announced February 2024.

Comments: Published at Transactions on Machine Learning Research (TMLR). The paper provides insight that the Transformer architectures can mimic the algorithm structures in (in-context) algorithm learning and representation. The incorporated algorithmic structure in AlgoFormer shows its potential in (deep learning for) scientific computing, besides the real language tasks

arXiv:2308.16472 [pdf, other]

Logical Berkovich Geometry: A Point-free Perspective

Authors: Ming Ng

Abstract: Extending our insights from \cite{NVOstrowski}, we apply point-free techniques to sharpen a foundational result in Berkovich geometry. In our language, given the ring $\mathcal{A}:=K\{R^{-1}T\}$ of convergent power series over a suitable non-Archimedean field $K$, the points of its Berkovich Spectrum $\mathcal{M}(\mathcal{A})$ correspond to $R$-good filters. The surprise is that, unlike the original result by Berkovich, we do not require the field $K$ to be non-trivially valued. Our investigations into non-Archimedean geometry can be understood as being framed by the question: what is the relationship between topology and logic?

Submitted 31 August, 2023; originally announced August 2023.

MSC Class: 03G30; 06D22; 14G22

arXiv:2308.14758 [pdf, ps, other]

A Point-Free Look at Ostrowski's Theorem and Absolute Values

Authors: Ming Ng, Steven Vickers

Abstract: This paper investigates the absolute values on $\mathbb{Z}$ valued in the upper reals (i.e. reals for which only a right Dedekind section is given). These necessarily include multiplicative seminorms corresponding to the finite prime fields $\mathbb{F}_p$. As an Ostrowski-type Theorem, the space of such absolute values is homeomorphic to a space of prime ideals (with co-Zariski topology) suitably paired with upper reals in the range $[-\infty, 1]$, and from this is recovered the standard Ostrowski's Theorem for absolute values on $\mathbb{Q}$. Our approach is fully constructive, using, in the topos-theoretic sense, geometric reasoning with point-free spaces, and that calls for a careful distinction between Dedekinds vs. upper reals. This forces attention on topological subtleties that are obscured in the classical treatment. In particular, the admission of multiplicative seminorms points to connections with Berkovich and adic spectra. The results are also intended to contribute to characterising a (point-free) space of places of $\mathbb{Q}$.

Submitted 20 August, 2023; originally announced August 2023.

MSC Class: 18F10; 18F70; 03G30; 06D22

arXiv:2307.16075 [pdf]

doi 10.1016/j.trc.2024.104575

Redesigning Large-Scale Multimodal Transit Networks with Shared Autonomous Mobility Services

Authors: Max T. M. Ng, Hani S. Mahmassani, Ömer Verbas, Taner Cokyasar, Roman Engelhardt

Abstract: This study addresses a large-scale multimodal transit network design problem, with Shared Autonomous Mobility Services (SAMS) as both transit feeders and an origin-to-destination mode. The framework captures spatial demand and modal characteristics, considers intermodal transfers and express services, determines transit infrastructure investment and path flows, and generates transit routes. A system-optimal multimodal transit network is designed with minimum total door-to-door generalized costs of users and operators, satisfying transit origin-destination demand within a pre-set infrastructure budget. Firstly, the geography, demand, and modes in each zone are characterized with continuous approximation. The decisions of network link investment and multimodal path flows in zonal connection optimization are formulated as a minimum-cost multi-commodity network flow (MCNF) problem and solved efficiently with a mixed-integer linear programming (MILP) solver. Subsequently, the route generation problem is solved by expanding the MCNF formulation to minimize intramodal transfers. The model is illustrated through a set of experiments with the Chicago network comprised of 50 zones and seven modes, under three scenarios. The computational results present savings in traveler journey time and operator cost demonstrating the potential benefits of collaboration between multimodal transit systems and SAMS.

Submitted 27 March, 2024; v1 submitted 29 July, 2023; originally announced July 2023.

Comments: 48 pages, 18 figures, accepted for publication in Transportation Research Part C: Emerging Technologies, and presentation in the 25th International Symposium on Transportation and Traffic Theory (ISTTT25)

arXiv:2302.04086 [pdf, other]

Block Diagonalization of Quaternion Circulant Matrices with Applications

Authors: Junjun Pan, Michael K. Ng

Abstract: It is well-known that a complex circulant matrix can be diagonalized by a discrete Fourier matrix with imaginary unit $\mathtt{i}$. The main aim of this paper is to demonstrate that a quaternion circulant matrix cannot be diagonalized by a discrete quaternion Fourier matrix with three imaginary units $\mathtt{i}$, $\mathtt{j}$ and $\mathtt{k}$. Instead, a quaternion circulant matrix can be block-diagonalized into 1-by-1 block and 2-by-2 block matrices by a permuted discrete quaternion Fourier transform matrix. With such a block-diagonalized form, the inverse of a quaternion circulant matrix can be determined efficiently, similar to the inverse of a complex circulant matrix. We make use of this block-diagonalized form to study quaternion tensor singular value decomposition of quaternion tensors where the entries are quaternion numbers. Applications, including computing the inverse of a quaternion circulant matrix and solving quaternion Toeplitz systems arising from linear prediction of quaternion signals, are employed to validate the efficiency of our proposed block-diagonalized results. A numerical example of color video as a third-order quaternion tensor is employed to validate the effectiveness of quaternion tensor singular value decomposition.

Submitted 8 February, 2024; v1 submitted 8 February, 2023; originally announced February 2023.

arXiv:2212.14562 [pdf, ps, other]

Quantizing Heavy-tailed Data in Statistical Estimation: (Near) Minimax Rates, Covariate Quantization, and Uniform Recovery

Authors: Junren Chen, Michael K. Ng, Di Wang

Abstract: This paper studies the quantization of heavy-tailed data in some fundamental statistical estimation problems, where the underlying distributions have bounded moments of some order. We propose to truncate and properly dither the data prior to a uniform quantization. Our major standpoint is that (near) minimax rates of estimation error are achievable merely from the quantized data produced by the proposed scheme. In particular, concrete results are worked out for covariance estimation, compressed sensing, and matrix completion, all agreeing that the quantization only slightly worsens the multiplicative factor. Besides, we study compressed sensing where both covariate (i.e., sensing vector) and response are quantized. Under covariate quantization, although our recovery program is non-convex because the covariance matrix estimator lacks positive semi-definiteness, all local minimizers are proved to enjoy near optimal error bound. Moreover, by the concentration inequality of product process and covering argument, we establish near minimax uniform recovery guarantee for quantized compressed sensing with heavy-tailed noise.

Submitted 26 July, 2023; v1 submitted 30 December, 2022; originally announced December 2022.

Comments: Major changes

arXiv:2211.08760 [pdf, other]

doi 10.1109/SSCI51031.2022.10022281

SVD-PINNs: Transfer Learning of Physics-Informed Neural Networks via Singular Value Decomposition

Authors: Yihang Gao, Ka Chun Cheung, Michael K. Ng

Abstract: Physics-informed neural networks (PINNs) have attracted significant attention for solving partial differential equations (PDEs) in recent years because they alleviate the curse of dimensionality that appears in traditional methods. However, the main disadvantage of PINNs is that one neural network corresponds to one PDE. In practice, we usually need to solve a class of PDEs, not just one. With the explosive growth of deep learning, many useful techniques in general deep learning tasks are also suitable for PINNs. Transfer learning methods may reduce the cost for PINNs in solving a class of PDEs. In this paper, we propose a transfer learning method of PINNs via keeping singular vectors and optimizing singular values (namely SVD-PINNs). Numerical experiments on high dimensional PDEs (10-d linear parabolic equations and 10-d Allen-Cahn equations) show that SVD-PINNs work for solving a class of PDEs with different but close right-hand-side functions.

Submitted 14 March, 2024; v1 submitted 16 November, 2022; originally announced November 2022.

Comments: Accepted to The 2022 IEEE Symposium Series on Computational Intelligence (IEEE SSCI 2022)

arXiv:2211.02802 [pdf, other]

Stochastic Variance Reduced Gradient for affine rank minimization problem

Authors: Ningning Han, Juan Nie, Jian Lu, Michael K. Ng

Abstract: We develop an efficient stochastic variance reduced gradient descent algorithm to solve the affine rank minimization problem, which consists of finding a matrix of minimum rank from linear measurements. The proposed algorithm, as a stochastic gradient descent strategy, enjoys a more favorable complexity than full gradients. It also reduces the variance of the stochastic gradient at each iteration and accelerates the rate of convergence. We prove that the proposed algorithm converges linearly in expectation to the solution under a restricted isometry condition. The numerical experiments show that the proposed algorithm has a clearly advantageous balance of efficiency, adaptivity, and accuracy compared with other state-of-the-art greedy algorithms.

Submitted 4 November, 2022; originally announced November 2022.

arXiv:2210.05987 [pdf, other]

A Momentum Accelerated Adaptive Cubic Regularization Method for Nonconvex Optimization

Authors: Yihang Gao, Michael K. Ng

Abstract: The cubic regularization method (CR) and its adaptive version (ARC) are popular Newton-type methods in solving unconstrained non-convex optimization problems, due to their global convergence to local minima under mild conditions. The main aim of this paper is to develop a momentum-accelerated adaptive cubic regularization method (ARCm) to improve the convergence performance. With the proper choice of momentum step size, we show the global convergence of ARCm and the local convergence can also be guaranteed under the Kurdyka-Łojasiewicz (KL) property. Such global and local convergence can also be established when inexact solvers with low computational costs are employed in the iteration procedure. Numerical results for non-convex logistic regression and robust linear regression models are reported to demonstrate that the proposed ARCm significantly outperforms state-of-the-art cubic regularization methods (e.g., CR, momentum-based CR, ARC) and the trust region method. In particular, the number of iterations required by ARCm is less than 10% to 50% of that required by the most competitive method (ARC) in the experiments.

Submitted 12 October, 2022; originally announced October 2022.

arXiv:2209.13268 [pdf, other]

Approximate Secular Equations for the Cubic Regularization Subproblem

Authors: Yihang Gao, Man-Chung Yue, Michael K. Ng

Abstract: The cubic regularization method (CR) is a popular algorithm for unconstrained non-convex optimization. At each iteration, CR solves a cubically regularized quadratic problem, called the cubic regularization subproblem (CRS). One way to solve the CRS relies on solving the secular equation, whose computational bottleneck lies in the computation of all eigenvalues of the Hessian matrix. In this paper, we propose and analyze a novel CRS solver based on an approximate secular equation, which requires only some of the Hessian eigenvalues and is therefore much more efficient. Two approximate secular equations (ASEs) are developed. For both ASEs, we first study the existence and uniqueness of their roots and then establish an upper bound on the gap between the root and that of the standard secular equation. Such an upper bound can in turn be used to bound the distance from the approximate CRS solution based on the ASEs to the true CRS solution, thus offering a theoretical guarantee for our CRS solver. A desirable feature of our CRS solver is that it requires only matrix-vector multiplication but not matrix inversion, which makes it particularly suitable for high-dimensional applications of unconstrained non-convex optimization, such as low-rank recovery and deep learning. Numerical experiments with synthetic and real datasets are conducted to investigate the practical performance of the proposed CRS solver. Experimental results show that the proposed solver outperforms two state-of-the-art methods.

Submitted 27 September, 2022; originally announced September 2022.

Comments: Accepted to NeurIPS 2022

arXiv:2209.04617 [pdf, ps, other]

doi 10.1017/jsl.2023.67

Computable topological groups

Authors: Heer Tern Koh, Alexander Melnikov, Keng Meng Ng

Abstract: We investigate what it means for a (Hausdorff, second-countable) topological group to be computable. We compare several potential definitions in the literature. We relate these notions with the well-established definitions of effective presentability for discrete and profinite groups, and compare these results with similar results in computable topology. Most of these definitions can be separated by counter-examples. Remarkably, we prove that two such definitions are equivalent for locally compact Polish and abelian Polish groups. More specifically, we prove that in these broad classes of groups, every computable topological group admits a right-c.e. (upper semi-computable) presentation with a left-invariant metric, and a computable dense sequence of points. In the locally compact case, we also show that if the group is additionally effectively locally compact, then we can produce an effectively proper left-invariant metric.

Submitted 10 September, 2022; originally announced September 2022.

MSC Class: 03D78 (Primary)

Journal ref: J. symb. log. 90 (2025) 188-220

arXiv:2208.09300 [pdf]

Expressing Multivariate Time Series as Graphs with Time Series Attention Transformer

Authors: William T. Ng, K. Siu, Albert C. Cheung, Michael K. Ng

Abstract: A reliable and efficient representation of multivariate time series is crucial in various downstream machine learning tasks. In multivariate time series forecasting, each variable depends on its historical values and there are inter-dependencies among variables as well. Models have to be designed to capture both intra- and inter-relationships among the time series. To move towards this goal, we propose the Time Series Attention Transformer (TSAT) for multivariate time series representation learning. Using TSAT, we represent both temporal information and inter-dependencies of multivariate time series in terms of edge-enhanced dynamic graphs. The intra-series correlations are represented by nodes in a dynamic graph; a self-attention mechanism is modified to capture the inter-series correlations by using the super-empirical mode decomposition (SMD) module. We applied the embedded dynamic graphs to time series forecasting problems, including two real-world datasets and two benchmark datasets. Extensive experiments show that TSAT clearly outperforms six state-of-the-art baseline methods in various forecasting horizons. We further visualize the embedded dynamic graphs to illustrate the graph representation power of TSAT. We share our code at https://github.com/RadiantResearch/TSAT.

Submitted 19 August, 2022; originally announced August 2022.

Comments: IJCAI'22 WORKSHOP AI4TS: AI FOR TIME SERIES ANALYSIS

arXiv:2208.02982 [pdf, ps, other]

Limit Complexities, Minimal Descriptions, and $n$-Randomness

Authors: Rodney Downey, Lu Liu, Keng Meng Ng, Daniel Turetsky

Abstract: Let $K$ denote prefix-free Kolmogorov Complexity, and $K^A$ denote it relative to an oracle $A$. We show that for any $n$, $K^{\emptyset^{(n)}}$ is definable purely in terms of the unrelativized notion $K$. It was already known that 2-randomness is definable in terms of $K$ (and plain complexity $C$) as those reals which infinitely often have maximal complexity. We can use our characterization to show that $n$-randomness is definable purely in terms of $K$. To do this we extend a certain ``limsup'' formula from the literature, and apply Symmetry of Information. This extension entails a novel use of semilow sets, and a more precise analysis of the complexity of $Δ_2^0$ sets of minimal descriptions.

Submitted 5 August, 2022; originally announced August 2022.

arXiv:2207.14039 [pdf, other]

Separable Quaternion Matrix Factorization for Polarization Images

Authors: Junjun Pan, Michael K. Ng

Abstract: Polarization is a unique characteristic of transverse waves and is represented by Stokes parameters. Analysis of polarization states can reveal valuable information about the sources. In this paper, we propose a separable low-rank quaternion linear mixing model for polarized signals: we assume each column of the source factor matrix equals a column of the polarized data matrix and refer to the corresponding problem as separable quaternion matrix factorization (SQMF). We discuss some properties of the matrix that can be decomposed by SQMF. To determine the source factor matrix in quaternion space, we propose a heuristic algorithm called quaternion successive projection algorithm (QSPA) inspired by the successive projection algorithm. To guarantee the effectiveness of QSPA, a new normalization operator is proposed for the quaternion matrix. We use a block coordinate descent algorithm to compute the nonnegative factor activation matrix in real number space. We test our method on the applications of polarization image representation and spectro-polarimetric imaging unmixing to verify its effectiveness.

Submitted 28 July, 2022; originally announced July 2022.

arXiv:2205.11030 [pdf, other]

HessianFR: An Efficient Hessian-based Follow-the-Ridge Algorithm for Minimax Optimization

Authors: Yihang Gao, Huafeng Liu, Michael K. Ng, Mingjie Zhou

Abstract: Wide applications of differentiable two-player sequential games (e.g., image generation by GANs) have raised much interest and attention of researchers to study efficient and fast algorithms. Most of the existing algorithms are developed based on nice properties of simultaneous games, i.e., convex-concave payoff functions, but are not applicable in solving sequential games with different settings. Some conventional gradient descent ascent algorithms theoretically and numerically fail to find the local Nash equilibrium of the simultaneous game or the local minimax (i.e., local Stackelberg equilibrium) of the sequential game. In this paper, we propose the HessianFR, an efficient Hessian-based Follow-the-Ridge algorithm with theoretical guarantees. Furthermore, the convergence of the stochastic algorithm and the approximation of Hessian inverse are exploited to improve algorithm efficiency. A series of experiments of training generative adversarial networks (GANs) have been conducted on both synthetic and real-world large-scale image datasets (e.g. MNIST, CIFAR-10 and CelebA). The experimental results demonstrate that the proposed HessianFR outperforms baselines in terms of convergence and image generation quality.

Submitted 23 May, 2022; originally announced May 2022.

MSC Class: 68U10; 68W40; 90C47

arXiv:2204.00313 [pdf, ps, other]

Deep neural networks for solving large linear systems arising from high-dimensional problems

Authors: Yiqi Gu, Michael K. Ng

Abstract: This paper studies deep neural networks for solving extremely large linear systems arising from high-dimensional problems. Because of the curse of dimensionality, it is expensive to store both the solution and right-hand side vector in such extremely large linear systems. Our idea is to employ a neural network to characterize the solution with much fewer parameters than the size of the solution under a matrix-free setting. We present an error analysis of the proposed method, indicating that the solution error is bounded by the condition number of the matrix and the neural network approximation error. Several numerical examples from partial differential equations, queueing problems, and probabilistic Boolean networks are presented to demonstrate that the solutions of linear systems can be learned quite accurately.

Submitted 4 March, 2023; v1 submitted 1 April, 2022; originally announced April 2022.

arXiv:2202.02063 [pdf, ps, other]

doi 10.1137/22M1476897

Color Image Inpainting via Robust Pure Quaternion Matrix Completion: Error Bound and Weighted Loss

Authors: Junren Chen, Michael K. Ng

Abstract: In this paper, we study color image inpainting as a pure quaternion matrix completion problem. In the literature, the theoretical guarantee for quaternion matrix completion is not well-established. Our main aim is to propose a new minimization problem with an objective combining nuclear norm and a quadratic loss weighted among three channels. To fill the theoretical vacancy, we obtain the error bound in both clean and corrupted regimes, which relies on some new results of quaternion matrices. A general Gaussian noise is considered in robust completion where all observations are corrupted. Motivated by the error bound, we propose to handle unbalanced or correlated noise via a cross-channel weight in the quadratic loss, with the main purpose of rebalancing noise level, or removing noise correlation. Extensive experimental results on synthetic and color image data are presented to confirm and demonstrate our theoretical findings.

Submitted 26 October, 2022; v1 submitted 4 February, 2022; originally announced February 2022.

Journal ref: SIAM Journal on Imaging Sciences, 15(2022), pp. 1469-1498

arXiv:2112.14418 [pdf, ps, other]

Deep adaptive basis Galerkin method for high-dimensional evolution equations with oscillatory solutions

Authors: Yiqi Gu, Micheal K. Ng

Abstract: In this paper, we study deep neural networks (DNNs) for solving high-dimensional evolution equations with oscillatory solutions. Different from deep least-squares methods that deal with time and space variables simultaneously, we propose a deep adaptive basis Galerkin (DABG) method, which employs the spectral-Galerkin method for the time variable of oscillatory solutions and the deep neural network method for high-dimensional space variables. The proposed method can lead to a linear system of differential equations having unknown DNNs that can be trained via the loss function. We establish a posteriori estimates of the solution error, which is bounded by the minimal loss function and the term $O(N^{-m})$, where $N$ is the number of basis functions and $m$ characterizes the regularity of the equation. We also show that if the true solution is a Barron-type function, the error bound converges to zero as $M=O(N^p)$ approaches infinity, where $M$ is the width of the used networks, and $p$ is a positive constant. Numerical examples, including high-dimensional linear evolution equations and the nonlinear Allen-Cahn equation, are presented to demonstrate that the performance of the proposed DABG method is better than that of existing DNNs.

Submitted 31 May, 2022; v1 submitted 29 December, 2021; originally announced December 2021.

arXiv:2109.04055 [pdf, ps, other]

doi 10.3233/COM-210375

Punctual equivalence relations and their (punctual) complexity

Authors: Nikolay Bazhenov, Keng Meng Ng, Luca San Mauro, Andrea Sorbi

Abstract: The complexity of equivalence relations has received much attention in the recent literature. The main tool for such endeavour is the following reducibility: given equivalence relations $R$ and $S$ on natural numbers, $R$ is computably reducible to $S$ if there is a computable function $f \colon ω\to ω$ that induces an injective map from $R$-equivalence classes to $S$-equivalence classes. In order to compare the complexity of equivalence relations which are computable, researchers considered also feasible variants of computable reducibility, such as the polynomial-time reducibility. In this work, we explore $\mathbf{Peq}$, the degree structure generated by primitive recursive reducibility on punctual equivalence relations (i.e., primitive recursive equivalence relations with domain $ω$). In contrast with all other known degree structures on equivalence relations, we show that $\mathbf{Peq}$ has much more structure: e.g., we show that it is a dense distributive lattice. On the other hand, we also offer evidence of the intricacy of $\mathbf{Peq}$, proving, e.g., that the structure is neither rigid nor homogeneous.

Submitted 9 September, 2021; originally announced September 2021.

Comments: 37 pages

MSC Class: 03D25; 03D30

Journal ref: Computability, vol. 11 (2022), no. 3-4, pp. 187-221

arXiv:2108.13054 [pdf, other]

doi 10.1016/j.jcp.2022.111270

Wasserstein Generative Adversarial Uncertainty Quantification in Physics-Informed Neural Networks

Authors: Yihang Gao, Michael K. Ng

Abstract: In this paper, we study a physics-informed algorithm for Wasserstein Generative Adversarial Networks (WGANs) for uncertainty quantification in solutions of partial differential equations. By using groupsort activation functions in adversarial network discriminators, network generators are utilized to learn the uncertainty in solutions of partial differential equations observed from the initial/boundary data. Under mild assumptions, we show that the generalization error of the computed generator converges to the approximation error of the network with high probability, when sufficiently many samples are taken. According to our established error bound, we also find that our physics-informed WGANs have higher requirement for the capacity of discriminators than that of generators. Numerical results on synthetic examples of partial differential equations are reported to validate our theoretical results and demonstrate how uncertainty quantification can be obtained for solutions of partial differential equations and the distributions of initial/boundary data. However, the quality or accuracy of the uncertainty quantification at all points in the interior remains a theoretical gap and requires further research.

Submitted 9 August, 2022; v1 submitted 30 August, 2021; originally announced August 2021.

Journal ref: Journal of Computational Physics 2022

arXiv:2108.11592 [pdf, ps, other]

Deep Ritz method for the spectral fractional Laplacian equation using the Caffarelli-Silvestre extension

Authors: Yiqi Gu, Micheal K. Ng

Abstract: In this paper, we propose a novel method for solving high-dimensional spectral fractional Laplacian equations. Using the Caffarelli-Silvestre extension, the $d$-dimensional spectral fractional equation is reformulated as a regular partial differential equation of dimension $d+1$. We transform the extended equation into a minimal Ritz energy functional problem and search for its minimizer in a special class of deep neural networks. Moreover, based on the approximation property of networks, we establish estimates on the error made by the deep Ritz method. Numerical results are reported to demonstrate the effectiveness of the proposed method for solving fractional Laplacian equations up to ten dimensions. Technically, in this method, we design a special network-based structure to adapt to the singularity and exponential decay of the true solution. Also, a hybrid integration technique combining the Monte Carlo method and sinc quadrature is developed to compute the loss function with higher accuracy.

Submitted 29 December, 2021; v1 submitted 26 August, 2021; originally announced August 2021.

MSC Class: 65N15; 65N30; 68T07; 41A25

arXiv:2104.00162 [pdf, other]

doi 10.46298/lmcs-18(3:15)2022

Point-free Construction of Real Exponentiation

Authors: Ming Ng, Steven Vickers

Abstract: We define a point-free construction of real exponentiation and logarithms, i.e.\ we construct the maps $\exp\colon (0, \infty)\times \mathbb{R} \rightarrow \!(0,\infty),\, (x, ζ) \mapsto x^ζ$ and $\log\colon (1,\infty)\times (0, \infty) \rightarrow\mathbb{R},\, (b, y) \mapsto \log_b(y)$, and we develop familiar algebraic rules for them. The point-free approach is constructive, and defines the points of a space as models of a geometric theory, rather than as elements of a set - in particular, this allows geometric constructions to be applied to points living in toposes other than Set. Our geometric development includes new lifting and gluing techniques in point-free topology, which highlight how properties of $\mathbb{Q}$ determine properties of real exponentiation. This work is motivated by our broader research programme of developing a version of adelic geometry via topos theory. In particular, we wish to construct the classifying topos of places of $\mathbb{Q}$, which will provide a geometric perspective into the subtle relationship between $\mathbb{R}$ and $\mathbb{Q}_p$, a question of longstanding number-theoretic interest.

Submitted 1 August, 2022; v1 submitted 31 March, 2021; originally announced April 2021.

MSC Class: 26E40; 18F10; 18F40

Journal ref: Logical Methods in Computer Science, Volume 18, Issue 3 (August 2, 2022) lmcs:7325

arXiv:2102.01371 [pdf, other]

Spectral analysis for preconditioning of multi-dimensional Riesz fractional diffusion equations

Authors: Xin Huang, Xue-Lei Lin, Michael K. Ng, Hai-Wei Sun

Abstract: In this paper, we analyze the spectra of the preconditioned matrices arising from discretized multi-dimensional Riesz spatial fractional diffusion equations. The finite difference method is employed to approximate the multi-dimensional Riesz fractional derivatives, which will generate symmetric positive definite ill-conditioned multi-level Toeplitz matrices. The preconditioned conjugate gradient method with a preconditioner based on the sine transform is employed to solve the resulting linear system. Theoretically, we prove that the spectra of the preconditioned matrices are uniformly bounded in the open interval (1/2,3/2) and thus the preconditioned conjugate gradient method converges linearly. The proposed method can be extended to multi-level Toeplitz matrices generated by functions with zeros of fractional order. Our theoretical results fill in a vacancy in the literature. Numerical examples are presented to demonstrate our new theoretical results in the literature and show the convergence performance of the proposed preconditioner that is better than other existing preconditioners.

Submitted 2 February, 2021; originally announced February 2021.

Journal ref: NUMERICAL MATHEMATICS: Theory, Methods and Applications, 2022

arXiv:2102.00363 [pdf, ps, other]

doi 10.1016/j.jcp.2021.110221

A parallel-in-time two-sided preconditioning for all-at-once system from a non-local evolutionary equation with weakly singular kernel

Authors: Xue-lei Lin, Michael K. Ng, Yajing Zhi

Abstract: In this paper, we study a parallel-in-time (PinT) algorithm for all-at-once system from a non-local evolutionary equation with weakly singular kernel where the temporal term involves a non-local convolution with a weakly singular kernel and the spatial term is the usual Laplacian operator with variable coefficients. We propose to use a two-sided preconditioning technique for the all-at-once discretization of the equation. Our preconditioner is constructed by replacing the variable diffusion coefficients with a constant coefficient to obtain a constant-coefficient all-at-once matrix. We split a square root of the constant Laplacian operator out of the constant-coefficient all-at-once matrix as a right preconditioner and take the remaining part as a left preconditioner, which constitutes our two-sided preconditioning. Exploiting the diagonalizability of the constant-Laplacian matrix and the triangular Toeplitz structure of the temporal discretization matrix, we obtain efficient representations of inverses of the right and the left preconditioners, because of which the iterative solution can be fast updated in a PinT manner. Theoretically, the condition number of the two-sided preconditioned matrix is proven to be uniformly bounded by a constant independent of the matrix size. To the best of our knowledge, for the non-local evolutionary equation with variable coefficients, this is the first attempt to develop a PinT preconditioning technique that has fast and exact implementation and that the corresponding preconditioned system has a uniformly bounded condition number. Numerical results are reported to confirm the efficiency of the proposed two-sided preconditioning technique.

Submitted 30 January, 2021; originally announced February 2021.

MSC Class: 65F10; 65F08; 15A12; 15A60

arXiv:2012.15138 [pdf, other]

Low Rank Pure Quaternion Approximation for Pure Quaternion Matrices

Authors: Guangjing Song, Weiyang Ding, Michael K. Ng

Abstract: Quaternion matrices are employed successfully in many color image processing applications. In particular, a pure quaternion matrix can be used to represent red, green and blue channels of color images. A low-rank approximation for a pure quaternion matrix can be obtained by using the quaternion singular value decomposition. However, this approximation is not optimal in the sense that the resulting low-rank approximation matrix may not be pure quaternion, i.e., the low-rank matrix contains a real component which is not useful for the representation of a color image. The main contribution of this paper is to find an optimal rank-$r$ pure quaternion matrix approximation for a pure quaternion matrix (a color image). Our idea is to use a projection on a low-rank quaternion matrix manifold and a projection on a quaternion matrix with zero real component, and develop an alternating projections algorithm to find such optimal low-rank pure quaternion matrix approximation. The convergence of the projection algorithm can be established by showing that the low-rank quaternion matrix manifold and the zero real component quaternion matrix manifold have a non-trivial intersection point. Numerical examples on synthetic pure quaternion matrices and color images are presented to illustrate that the projection algorithm can find optimal low-rank pure quaternion approximation for pure quaternion matrices or color images.

Submitted 30 December, 2020; originally announced December 2020.

arXiv:2011.11417 [pdf, other]

Riemannian Conjugate Gradient Descent Method for Third-Order Tensor Completion

Authors: Guang-Jing Song, Xue-Zhong Wang, Michael K. Ng

Abstract: The goal of tensor completion is to fill in missing entries of a partially known tensor under a low-rank constraint. In this paper, we mainly study low rank third-order tensor completion problems by using Riemannian optimization methods on the smooth manifold. Here the tensor rank is defined to be a set of matrix ranks where the matrices are the slices of the transformed tensor obtained by applying the Fourier-related transformation onto the tubes of the original tensor. We show that with suitable incoherence conditions on the underlying low rank tensor, the proposed Riemannian optimization method is guaranteed to converge and find such low rank tensor with a high probability. In addition, numbers of sample entries required for solving low rank tensor completion problem under different initialized methods are studied and derived. Numerical examples for both synthetic and image data sets are reported to demonstrate the proposed method is able to recover low rank tensors.

Submitted 20 November, 2020; originally announced November 2020.

Comments: arXiv admin note: substantial text overlap with arXiv:1603.06610 by other authors

arXiv:2011.08675 [pdf, other]

Non-Local Robust Quaternion Matrix Completion for Color Images and Videos Inpainting

Authors: Zhigang Jia, Qiyu Jin, Michael K. Ng, Xile Zhao

Abstract: The image nonlocal self-similarity (NSS) prior refers to the fact that a local patch often has many nonlocal similar patches to it across the image and has been widely applied in many recently proposed machine learning algorithms for image processing. However, there is no theoretical analysis on its working principle in the literature. In this paper, we discover a potential causality between NSS and low-rank property of color images, which is also available to grey images. A new patch group based NSS prior scheme is proposed to learn explicit NSS models of natural color images. The numerical low-rank property of patched matrices is also rigorously proved. The NSS-based QMC algorithm computes an optimal low-rank approximation to the high-rank color image, resulting in high PSNR and SSIM measures and particularly the better visual quality. A new tensor NSS-based QMC method is also presented to solve the color video inpainting problem based on quaternion tensor representation. The numerical experiments on color images and videos indicate the advantages of NSS-based QMC over the state-of-the-art methods.

Submitted 13 May, 2022; v1 submitted 17 November, 2020; originally announced November 2020.

Comments: 34 pages, 17 figures

MSC Class: 65F55; ACM Class: G.1.3

Journal ref: IEEE Transactions on Image Processing, 2022

Showing 1–50 of 73 results for author: Ng, M