-
Open Problems in Computability Theory and Descriptive Set Theory
Authors:
George Barmpalias,
Nikolay Bazhenov,
Chi Tat Chong,
Wei Dai,
Su Gao,
Jun Le Goh,
Jialiang He,
Keng Meng Selwyn Ng,
Andre Nies,
Theodore Slaman,
Riley Thornton,
Wei Wang,
Jing Yu,
Liang Yu
Abstract:
These open problems were presented in the Problem Sessions held during the Tianyuan Workshop on Computability Theory and Descriptive Set Theory, June 16-20, 2025. The problems are organized into sections named after their contributors, in the order of their presentations during the workshop. Notes were taken and compiled by Wei Dai, Feng Li, Ruiwen Li, Ming Xiao, Xu Wang, Víctor Hugo Yañez Salazar…
▽ More
These open problems were presented in the Problem Sessions held during the Tianyuan Workshop on Computability Theory and Descriptive Set Theory, June 16-20, 2025. The problems are organized into sections named after their contributors, in the order of their presentations during the workshop. Notes were taken and compiled by Wei Dai, Feng Li, Ruiwen Li, Ming Xiao, Xu Wang, Víctor Hugo Yañez Salazar, and Yang Zheng.
△ Less
Submitted 5 July, 2025;
originally announced July 2025.
-
Hermitian Quaternion Toeplitz Matrices by Quaternion-valued Generating Functions
Authors:
Xue-lei Lin,
Michael K. Ng,
Junjun Pan
Abstract:
In this paper, we study Hermitian quaternion Toeplitz matrices generated by quaternion-valued functions. We show that such generating function must be the sum of a real-valued function and an odd function with imaginary component. This setting is different from the case of Hermitian complex Toeplitz matrices generated by real-valued functions only. By using of 2-by-2 block complex representation o…
▽ More
In this paper, we study Hermitian quaternion Toeplitz matrices generated by quaternion-valued functions. We show that such generating function must be the sum of a real-valued function and an odd function with imaginary component. This setting is different from the case of Hermitian complex Toeplitz matrices generated by real-valued functions only. By using of 2-by-2 block complex representation of quaternion matrices, we give a quaternion version of Grenander-Szegö theorem stating the distribution of eigenvalues of Hermitian quaternion Toeplitz matrices in terms of its generating function. As an application, we investigate Strang's circulant preconditioners for Hermitian quaternion Toeplitz linear systems arising from quaternion signal processing. We show that Strang's circulant preconditioners can be diagionalized by discrete quaternion Fourier transform matrices whereas general quaternion circulant matrices cannot be diagonalized by them. Also we verify the theoretical and numerical convergence results of Strang's circulant preconditioned conjugate gradient method for solving Hermitian quaternion Toeplitz systems.
△ Less
Submitted 21 April, 2025;
originally announced April 2025.
-
Truncated Huber Penalty for Sparse Signal Recovery with Convergence Analysis
Authors:
Li Yang,
Serena Morigi,
Michael K. Ng,
You-wei Wen
Abstract:
Sparse signal recovery from under-determined systems presents significant challenges when using conventional L_0 and L_1 penalties, primarily due to computational complexity and estimation bias. This paper introduces a truncated Huber penalty, a non-convex metric that effectively bridges the gap between unbiased sparse recovery and differentiable optimization. The proposed penalty applies quadrati…
▽ More
Sparse signal recovery from under-determined systems presents significant challenges when using conventional L_0 and L_1 penalties, primarily due to computational complexity and estimation bias. This paper introduces a truncated Huber penalty, a non-convex metric that effectively bridges the gap between unbiased sparse recovery and differentiable optimization. The proposed penalty applies quadratic regularization to small entries while truncating large magnitudes, avoiding non-differentiable points at optima. Theoretical analysis demonstrates that, for an appropriately chosen threshold, any s-sparse solution recoverable via conventional penalties remains a local optimum under the truncated Huber function. This property allows the exact and robust recovery theories developed for other penalty regularization functions to be directly extended to the truncated Huber function. To solve the optimization problem, we develop a block coordinate descent (BCD) algorithm with finite-step convergence guarantees under spark conditions. Numerical experiments are conducted to validate the effectiveness and robustness of the proposed approach. Furthermore, we extend the truncated Huber-penalized model to the gradient domain, illustrating its applicability in signal denoising and image smoothing.
△ Less
Submitted 6 April, 2025;
originally announced April 2025.
-
A Graph-Partitioning Based Continuous Optimization Approach to Semi-supervised Clustering Problems
Authors:
Wei Liu,
Xin Liu,
Michael K. Ng,
Zaikun Zhang
Abstract:
Semi-supervised clustering is a basic problem in various applications. Most existing methods require knowledge of the ideal cluster number, which is often difficult to obtain in practice. Besides, satisfying the must-link constraints is another major challenge for these methods. In this work, we view the semi-supervised clustering task as a partitioning problem on a graph associated with the given…
▽ More
Semi-supervised clustering is a basic problem in various applications. Most existing methods require knowledge of the ideal cluster number, which is often difficult to obtain in practice. Besides, satisfying the must-link constraints is another major challenge for these methods. In this work, we view the semi-supervised clustering task as a partitioning problem on a graph associated with the given dataset, where the similarity matrix includes a scaling parameter to reflect the must-link constraints. Utilizing a relaxation technique, we formulate the graph partitioning problem into a continuous optimization model that does not require the exact cluster number, but only an overestimate of it. We then propose a block coordinate descent algorithm to efficiently solve this model, and establish its convergence result. Based on the obtained solution, we can construct the clusters that theoretically meet the must-link constraints under mild assumptions. Furthermore, we verify the effectiveness and efficiency of our proposed method through comprehensive numerical experiments.
△ Less
Submitted 6 March, 2025;
originally announced March 2025.
-
Scissors congruence K-theory for equivariant manifolds
Authors:
Mona Merling,
Ming Ng,
Julia Semikina,
Alba Sendón Blanco,
Lucas Williams
Abstract:
We introduce a scissors congruence $K$-theory spectrum which lifts the equivariant scissors congruence groups for compact $G$-manifolds with boundary, and we show that on $π_0$ this is the source of a spectrum level lift of the Burnside ring valued equivariant Euler characteristic of a compact $G$-manifold. We also show that the equivariant scissors congruence groups for varying subgroups assemble…
▽ More
We introduce a scissors congruence $K$-theory spectrum which lifts the equivariant scissors congruence groups for compact $G$-manifolds with boundary, and we show that on $π_0$ this is the source of a spectrum level lift of the Burnside ring valued equivariant Euler characteristic of a compact $G$-manifold. We also show that the equivariant scissors congruence groups for varying subgroups assemble into a Mackey functor, which is a shadow of a conjectural higher genuine equivariant structure.
△ Less
Submitted 12 January, 2025;
originally announced January 2025.
-
Equity Impacts of Public Transit Network Redesign with Shared Autonomous Mobility Services
Authors:
Max T. M. Ng,
Meredith Raymer,
Hani S. Mahmassani,
Omer Verbas,
Taner Cokyasar
Abstract:
This study examines the equity impacts of integrating shared autonomous mobility services (SAMS) into transit system redesign. Using the Greater Chicago area as a case study, we compare two optimization objectives in multimodal transit network redesign: minimizing total generalized costs (equity-agnostic) versus prioritizing service in low-income areas (equity-focused). We evaluate the achieved ac…
▽ More
This study examines the equity impacts of integrating shared autonomous mobility services (SAMS) into transit system redesign. Using the Greater Chicago area as a case study, we compare two optimization objectives in multimodal transit network redesign: minimizing total generalized costs (equity-agnostic) versus prioritizing service in low-income areas (equity-focused). We evaluate the achieved accessibility of clustered zones with redesigned transit networks under two objectives, compared to driving and the existing transit network. The transit access gaps across zones and between transit and driving are found to be generally reduced with the introduction of SAMS, but less so with the subsequent improved infrastructure under budget. Differential improvement in equity is seen across suburbs and areas of the city, reflecting the disparity in current transit access and improvement potential. In particular, SAMS bridges the transit access gaps in suburban and city areas currently underserved by transit. The City of Chicago, which is also disproportionately home to vulnerable populations, offers an avenue to improve vertical equity. These findings demonstrate that SAMS can enhance both horizontal and vertical equity in transit systems, particularly when equity is explicitly incorporated into the design objective.
△ Less
Submitted 8 January, 2025; v1 submitted 2 January, 2025;
originally announced January 2025.
-
Evaluation of Rail Decarbonization Alternatives: Framework and Application
Authors:
Adrian Hernandez,
Max TM Ng,
Nazib Siddique,
Pablo L. Durango-Cohen,
Amgad Elgowainy,
Hani S. Mahmassani,
Michael Wang,
Yan Zhou
Abstract:
The Northwestern University Freight Rail Infrastructure and Energy Network Decarbonization (NUFRIEND) framework is a comprehensive industry-oriented tool for simulating the deployment of new energy technologies including biofuels, e-fuels, battery-electric, and hydrogen locomotives. By classifying fuel types into two categories based on deployment requirements, the associated optimal charging/fuel…
▽ More
The Northwestern University Freight Rail Infrastructure and Energy Network Decarbonization (NUFRIEND) framework is a comprehensive industry-oriented tool for simulating the deployment of new energy technologies including biofuels, e-fuels, battery-electric, and hydrogen locomotives. By classifying fuel types into two categories based on deployment requirements, the associated optimal charging/fueling facility location and sizing problem are solved with a five-step framework. Life cycle analyses (LCA) and techno-economic analyses (TEA) are used to estimate carbon reduction, capital investments, cost of carbon reduction, and operational impacts, enabling sensitivity analysis with operational and technological parameters. The framework is illustrated on lower-carbon drop-in fuels as well as battery-electric technology deployments for US Eastern and Western Class I railroad networks. Drop-in fuel deployments are modeled as admixtures with diesel in existing locomotives, while battery-electric deployments are shown for varying technology penetration levels and locomotive ranges. When mixed in a 50 percent ratio with diesel, results show biodiesel's capacity to reduce emissions at 36 percent with a cost of 0.13 USD per kilogram of CO2 reduced, while e-fuels offer a 50 percent emissions reduction potential at a cost of 0.22 USD per kilogram of CO2 reduced. Battery-electric results for 50 percent deployment over all ton-miles highlight the value of future innovations in battery energy densities as scenarios assuming 800-mile range locomotives show an estimated emissions reduction of 46 percent with a cost of 0.06 USD per kilogram of CO2 reduced, compared to 16 percent emissions reduction at a cost of 0.11 USD per kilogram of CO2 reduced for 400-mile range locomotives.
△ Less
Submitted 2 January, 2025;
originally announced January 2025.
-
Autonomous Minibus Service with Semi-on-demand Routes in Grid Networks
Authors:
Max T. M. Ng,
Hani S. Mahmassani
Abstract:
This paper investigates the potential of autonomous minibuses which take on-demand directional routes for pick-up and drop-off in a grid network of wider area with low density, followed by fixed routes in areas with demand. Mathematical formulation for generalized costs demonstrates its benefits, with indicators proposed to select existing bus routes for conversion with the options of zonal expres…
▽ More
This paper investigates the potential of autonomous minibuses which take on-demand directional routes for pick-up and drop-off in a grid network of wider area with low density, followed by fixed routes in areas with demand. Mathematical formulation for generalized costs demonstrates its benefits, with indicators proposed to select existing bus routes for conversion with the options of zonal express and parallel routes. Simulations on modeled scenarios and case studies with bus routes in Chicago show reductions in both passenger costs and generalized costs over existing fixed-route bus service between suburban areas and CBD.
△ Less
Submitted 30 December, 2024;
originally announced January 2025.
-
Highway Managed Lane Usage and Tolling for Mixed Traffic Flows with Connected Automated Vehicles (CAVs) and High-Occupancy Vehicles (HOVs)
Authors:
Max T. M. Ng,
Hani S. Mahmassani
Abstract:
This paper investigates managed lane (ML) toll setting and its effect under mixed traffic of connected automated vehicles (CAVs), high-occupancy vehicles (HOVs), and human-driven vehicles (HDVs), with a goal to avoid flow breakdown and minimize total social cost. A mesoscopic finite-difference traffic simulation model considers the flow-density relationship at different CAV market penetration rate…
▽ More
This paper investigates managed lane (ML) toll setting and its effect under mixed traffic of connected automated vehicles (CAVs), high-occupancy vehicles (HOVs), and human-driven vehicles (HDVs), with a goal to avoid flow breakdown and minimize total social cost. A mesoscopic finite-difference traffic simulation model considers the flow-density relationship at different CAV market penetration rates, lane-changing behavior, and multiple entries/exits, interacting with a reactive toll setting mechanism. The results of the Monte Carlo simulation suggest an optimal policy of untolled HOV/CAV use with HDV tolls in particular scenarios of limited CAV market penetration. Small and targeted tolling avoids flow breakdown in ML while prioritizing HOVs and other vehicles with high values of time. Extensions of the formulation and sensitivity analysis quantify the benefits of converting high-occupancy HDVs to CAVs. The optimal tolling regime combines traffic science notions of flow stability and the economics of resource allocation.
△ Less
Submitted 29 December, 2024;
originally announced December 2024.
-
Trading Off Energy Storage and Payload -- An Analytical Model for Freight Train Configuration
Authors:
Max T. M. Ng,
Adrian Hernandez,
Pablo L. Durango-Cohen,
Hani S. Mahmassani
Abstract:
To support planning of alternative fuel technology (e.g., battery-electric locomotives) deployment for decarbonizing non-electrified freight rail, we develop a convex optimization formulation with a closed-form solution to determine the optimal number of energy storage tender cars in a train. The formulation shares a similar structure to an Economic Order Quantity (EOQ) model. For given market cha…
▽ More
To support planning of alternative fuel technology (e.g., battery-electric locomotives) deployment for decarbonizing non-electrified freight rail, we develop a convex optimization formulation with a closed-form solution to determine the optimal number of energy storage tender cars in a train. The formulation shares a similar structure to an Economic Order Quantity (EOQ) model. For given market characteristics, cost forecasts, and technology parameters, our model captures the trade-offs between inventory carrying costs associated with trip times (including delays due to charging/refueling) and ordering costs associated with train dispatch and operation (energy, amortized equipment, and labor costs). To illustrate the framework, we find the optimal number of battery-electric energy tender cars in 22,501 freight markets (origin-destination pairs and commodities) for U.S. Class I railroads. The results display heterogeneity in optimal configurations with lighter, yet more time-sensitive shipments (e.g., intermodal) utilizing more battery tender cars. For heavier commodities (e.g., coal) with lower holding costs, single battery tender car configurations are generally optimal. The results also show that the optimal train configurations are sensitive to delays associated with recharging or swapping tender cars.
△ Less
Submitted 27 December, 2024;
originally announced December 2024.
-
Joint Optimization of Multimodal Transit Frequency and Shared Autonomous Vehicle Fleet Size with Hybrid Metaheuristic and Nonlinear Programming
Authors:
Max T. M. Ng,
Hani S. Mahmassani,
Draco Tong,
Omer Verbas,
Taner Cokyasar
Abstract:
Shared autonomous vehicles (SAVs) bring competition to traditional transit services but redesigning multimodal transit network can utilize SAVs as feeders to enhance service efficiency and coverage. This paper presents an optimization framework for the joint multimodal transit frequency and SAV fleet size problem, a variant of the transit network frequency setting problem. The objective is to maxi…
▽ More
Shared autonomous vehicles (SAVs) bring competition to traditional transit services but redesigning multimodal transit network can utilize SAVs as feeders to enhance service efficiency and coverage. This paper presents an optimization framework for the joint multimodal transit frequency and SAV fleet size problem, a variant of the transit network frequency setting problem. The objective is to maximize total transit ridership (including SAV-fed trips and subtracting boarding rejections) across multiple time periods under budget constraints, considering endogenous mode choice (transit, point-to-point SAVs, driving) and route selection, while allowing for strategic route removal by setting frequencies to zero. Due to the problem's non-linear, non-convex nature and the computational challenges of large-scale networks, we develop a hybrid solution approach that combines a metaheuristic approach (particle swarm optimization) with nonlinear programming for local solution refinement. To ensure computational tractability, the framework integrates analytical approximation models for SAV waiting times based on fleet utilization, multimodal network assignment for route choice, and multinomial logit mode choice behavior, bypassing the need for computationally intensive simulations within the main optimization loop. Applied to the Chicago metropolitan area's multimodal network, our method illustrates a 33.3% increase in transit ridership through optimized transit route frequencies and SAV integration, particularly enhancing off-peak service accessibility and strategically reallocating resources.
△ Less
Submitted 22 April, 2025; v1 submitted 26 December, 2024;
originally announced December 2024.
-
The singleton degrees of the $Σ^0_2$ sets are not dense
Authors:
Thomas F. Kent,
Keng Meng Ng,
Andrea Sorbi
Abstract:
Answering an open question raised by Cooper, we show that there exist $Δ^0_2$ sets $D$ and $E$ such that the singleton degree of $E$ is a minimal cover of the singleton degree of $D$. This shows that the $Σ^{0}_{2}$ singleton degrees, and the $Δ^{0}_{2}$ singleton degrees, are not dense (and consequently the $Π^0_2$ $Q$-degrees, and the $Δ^{0}_{2}$ $Q$-degrees, are not dense). Moreover $D$ and…
▽ More
Answering an open question raised by Cooper, we show that there exist $Δ^0_2$ sets $D$ and $E$ such that the singleton degree of $E$ is a minimal cover of the singleton degree of $D$. This shows that the $Σ^{0}_{2}$ singleton degrees, and the $Δ^{0}_{2}$ singleton degrees, are not dense (and consequently the $Π^0_2$ $Q$-degrees, and the $Δ^{0}_{2}$ $Q$-degrees, are not dense). Moreover $D$ and $E$ can be built to lie in the same enumeration degree.
△ Less
Submitted 25 December, 2024;
originally announced December 2024.
-
The subTuring degrees
Authors:
Takayuki Kihara,
Keng Meng Ng
Abstract:
In this article, we introduce a notion of reducibility for partial functions on the natural numbers, which we call subTuring reducibility. One important aspect is that the subTuring degrees correspond to the structure of the realizability subtoposes of the effective topos. We show that the subTuring degrees (that is, the realizability subtoposes of the effective topos) form a dense non-modular (th…
▽ More
In this article, we introduce a notion of reducibility for partial functions on the natural numbers, which we call subTuring reducibility. One important aspect is that the subTuring degrees correspond to the structure of the realizability subtoposes of the effective topos. We show that the subTuring degrees (that is, the realizability subtoposes of the effective topos) form a dense non-modular (thus, non-distributive) lattice. We also show that there is a nonzero join-irreducible subTuring degree (which implies that there is a realizability subtopos of the effective topos that cannot be decomposed into two smaller realizability subtoposes).
△ Less
Submitted 20 November, 2024; v1 submitted 8 November, 2024;
originally announced November 2024.
-
Joint Optimization of Pattern, Headway, and Fleet Size of Multiple Urban Transit Lines with Perceived Headway Consideration and Passenger Flow Allocation
Authors:
Max T. M. Ng,
Draco Tong,
Hani S. Mahmassani,
Omer Verbas,
Taner Cokyasar
Abstract:
This study addresses the urban transit pattern design problem, optimizing stop sequences, headways, and fleet sizes across multiple routes and periods simultaneously to minimize user costs (composed of riding, waiting, and transfer times) under operational constraints (e.g., vehicle capacity and fleet size). A destination-labeled multi-commodity network flow (MCNF) formulation is developed to solv…
▽ More
This study addresses the urban transit pattern design problem, optimizing stop sequences, headways, and fleet sizes across multiple routes and periods simultaneously to minimize user costs (composed of riding, waiting, and transfer times) under operational constraints (e.g., vehicle capacity and fleet size). A destination-labeled multi-commodity network flow (MCNF) formulation is developed to solve the problem at a large scale more efficiently compared to the previous literature. The model allows for flexible pattern options without relying on pre-defined candidate sets and simultaneously considers multiple operational strategies such as express/local services, short-turning, and deadheading. It evaluates perceived headways of joint patterns for passengers, assigns passenger flows to each pattern accordingly, and allows transfers across patterns in different directions. The mixed-integer linear programming (MILP) model is demonstrated with a city-sized network of metro lines in Chicago, USA, achieving near-optimal solutions in hours. The total weighted journey times are reduced by 0.61% and 5.76% under single-route and multi-period multi-route scenarios respectively. The model provides transit agencies with an efficient tool for comprehensive service design and resource allocation, improving service quality and resource utilization without additional operational costs.
△ Less
Submitted 26 December, 2024; v1 submitted 27 September, 2024;
originally announced September 2024.
-
Semi-on-Demand Off-Peak Transit Services with Shared Autonomous Vehicles -- Service Planning, Simulation, and Analysis in Munich, Germany
Authors:
Max T. M. Ng,
Roman Engelhardt,
Florian Dandl,
Vasileios Volakakis,
Hani S. Mahmassani,
Klaus Bogenberger
Abstract:
This study investigates the implementation of semi-on-demand (SoD) hybrid-route services using Shared Autonomous Vehicles (SAVs) on existing transit lines. SoD services combine the cost efficiency of fixed-route buses with the flexibility of on-demand services. SAVs first serve all scheduled fixed-route stops, then drop off and pick up passengers in the pre-determined flexible-route portion, and r…
▽ More
This study investigates the implementation of semi-on-demand (SoD) hybrid-route services using Shared Autonomous Vehicles (SAVs) on existing transit lines. SoD services combine the cost efficiency of fixed-route buses with the flexibility of on-demand services. SAVs first serve all scheduled fixed-route stops, then drop off and pick up passengers in the pre-determined flexible-route portion, and return to the fixed route. This study addresses four key questions: optimal fleet and vehicle sizes for peak-hour fixed-route services with SAVs and during transition (from drivers to autonomous vehicles), optimal off-peak SoD service planning, and suitable use cases. The methodology combines analytical modeling for service planning with agent-based simulation for operational analysis. We examine ten bus routes in Munich, Germany, considering full SAV and transition scenarios with varying proportions of drivers. Our findings demonstrate that the lower operating costs of SAVs improve service quality through increased frequency and smaller vehicles, even in transition scenarios. The reduced headway lowers waiting time and also favors more flexible-route operation in SoD services. The optimal SoD settings range from fully flexible to hybrid routes, where higher occupancy from the terminus favors shorter flexible routes. During the transition phase, limited fleet size and higher headways constrain the benefits of flexible-route operations. The simulation results corroborate the SoD benefits of door-to-door convenience, attracting more passengers without excessive detours and operator costs at moderate flexible-route lengths, and validate the analytical model.
△ Less
Submitted 18 December, 2024; v1 submitted 20 August, 2024;
originally announced August 2024.
-
Non-Negative Reduced Biquaternion Matrix Factorization with Applications in Color Face Recognition
Authors:
Jifei Miao,
Junjun Pan,
Michael K. Ng
Abstract:
Reduced biquaternion (RB), as a four-dimensional algebra highly suitable for representing color pixels, has recently garnered significant attention from numerous scholars. In this paper, for color image processing problems, we introduce a concept of the non-negative RB matrix and then use the multiplication properties of RB to propose a non-negative RB matrix factorization (NRBMF) model. The NRBMF…
▽ More
Reduced biquaternion (RB), as a four-dimensional algebra highly suitable for representing color pixels, has recently garnered significant attention from numerous scholars. In this paper, for color image processing problems, we introduce a concept of the non-negative RB matrix and then use the multiplication properties of RB to propose a non-negative RB matrix factorization (NRBMF) model. The NRBMF model is introduced to address the challenge of reasonably establishing a non-negative quaternion matrix factorization model, which is primarily hindered by the multiplication properties of traditional quaternions. Furthermore, this paper transforms the problem of solving the NRBMF model into an RB alternating non-negative least squares (RB-ANNLS) problem. Then, by introducing a method to compute the gradient of the real function with RB matrix variables, we solve the RB-ANNLS optimization problem using the RB projected gradient algorithm and conduct a convergence analysis of the algorithm. Finally, we validate the effectiveness and superiority of the proposed NRBMF model in color face recognition.
△ Less
Submitted 9 July, 2025; v1 submitted 10 August, 2024;
originally announced August 2024.
-
A New Cross-Space Total Variation Regularization Model for Color Image Restoration with Quaternion Blur Operator
Authors:
Zhigang Jia,
Yuelian Xiang,
Meixiang Zhao,
Tingting Wu,
Michael K. Ng
Abstract:
The cross-channel deblurring problem in color image processing is difficult to solve due to the complex coupling and structural blurring of color pixels. Until now, there are few efficient algorithms that can reduce color artifacts in deblurring process. To solve this challenging problem, we present a novel cross-space total variation (CSTV) regularization model for color image deblurring by intro…
▽ More
The cross-channel deblurring problem in color image processing is difficult to solve due to the complex coupling and structural blurring of color pixels. Until now, there are few efficient algorithms that can reduce color artifacts in deblurring process. To solve this challenging problem, we present a novel cross-space total variation (CSTV) regularization model for color image deblurring by introducing a quaternion blur operator and a cross-color space regularization functional. The existence and uniqueness of the solution are proved and a new L-curve method is proposed to find a balance of regularization terms on different color spaces. The Euler-Lagrange equation is derived to show that CSTV has taken into account the coupling of all color channels and the local smoothing within each color channel. A quaternion operator splitting method is firstly proposed to enhance the ability of color artifacts reduction of the CSTV regularization model. This strategy also applies to the well-known color deblurring models. Numerical experiments on color image databases illustrate the efficiency and effectiveness of the new model and algorithms. The color images restored by them successfully maintain the color and spatial information and are of higher quality in terms of PSNR, SSIM, MSE and CIEde2000 than the restorations of the-state-of-the-art methods.
△ Less
Submitted 26 January, 2025; v1 submitted 20 May, 2024;
originally announced May 2024.
-
The computational content of multidimensional discontinuity
Authors:
Rupert Hölzl,
Keng Meng Ng
Abstract:
The Weihrauch degrees are a tool to gauge the computational difficulty of mathematical problems. Often, what makes these problems hard is their discontinuity. We look at discontinuity in its purest form, that is, at otherwise constant functions that make a single discontinuous step along each dimension of their underlying space. This is an extension of previous work of Kihara, Pauly, Westrick from…
▽ More
The Weihrauch degrees are a tool to gauge the computational difficulty of mathematical problems. Often, what makes these problems hard is their discontinuity. We look at discontinuity in its purest form, that is, at otherwise constant functions that make a single discontinuous step along each dimension of their underlying space. This is an extension of previous work of Kihara, Pauly, Westrick from a single dimension to multiple dimensions. Among other results, we obtain strict hierarchies in the Weihrauch degrees, one of which orders mathematical problems by the richness of the truth-tables determining how discontinuous steps influence the output.
△ Less
Submitted 18 July, 2024; v1 submitted 7 May, 2024;
originally announced May 2024.
-
A $τ$-preconditioner for space fractional diffusion equation with non-separable variable coefficients
Authors:
Xue-Lei Lin,
Michael K. Ng
Abstract:
In this paper, we study a $τ$-matrix approximation based preconditioner for the linear systems arising from discretization of unsteady state Riesz space fractional diffusion equation with non-separable variable coefficients. The structure of coefficient matrices of the linear systems is identity plus summation of diagonal-times-multilevel-Toeplitz matrices. In our preconditioning technique, the di…
▽ More
In this paper, we study a $τ$-matrix approximation based preconditioner for the linear systems arising from discretization of unsteady state Riesz space fractional diffusion equation with non-separable variable coefficients. The structure of coefficient matrices of the linear systems is identity plus summation of diagonal-times-multilevel-Toeplitz matrices. In our preconditioning technique, the diagonal matrices are approximated by scalar identity matrices and the Toeplitz matrices are approximated by τ-matrices (a type of matrices diagonalizable by discrete sine transforms). The proposed preconditioner is fast invertible through the fast sine transform (FST) algorithm. Theoretically, we show that the GMRES solver for the preconditioned systems has an optimal convergence rate (a convergence rate independent of discretization stepsizes). To the best of our knowledge, this is the first preconditioning method with the optimal convergence rate for the variable-coefficients space fractional diffusion equation. Numerical results are reported to demonstrate the efficiency of the proposed method.
△ Less
Submitted 17 April, 2024;
originally announced April 2024.
-
Semi-on-Demand Hybrid Transit Route Design with Shared Autonomous Mobility Services
Authors:
Max T. M. Ng,
Florian Dandl,
Hani S. Mahmassani,
Klaus Bogenberger
Abstract:
This study examines the route design of a semi-on-demand hybrid route directional service in the public transit network, offering on-demand flexible route service in low-density areas and fixed route service in higher-density areas with Shared Autonomous Mobility Service (SAMS). The study develops analytically tractable cost expressions that capture access, waiting, and riding costs for users, and…
▽ More
This study examines the route design of a semi-on-demand hybrid route directional service in the public transit network, offering on-demand flexible route service in low-density areas and fixed route service in higher-density areas with Shared Autonomous Mobility Service (SAMS). The study develops analytically tractable cost expressions that capture access, waiting, and riding costs for users, and distance-based operating and time-based vehicle costs for operators. Two formulations are presented for strategic and tactical decisions in flexible route portion, fleet size, headway, and vehicle size optimization, enabling the determination of route types between fixed, hybrid, and flexible routes based on demand, cost, and operational parameters. The practical applications and benefits of semi-on-demand feeders are demonstrated with numerical examples and a large-scale case study in the Chicago metropolitan area. Findings reveal scenarios in which flexible route portions serving passengers located further away reduce total costs, particularly user costs. Lower operating costs in lower-demand areas favor more flexible routes, whereas higher demand densities favor more traditional line-based operations. On two studied lines, a current cost forecast favors smaller vehicles with flexible routes, but operating constraints and higher operating costs would favor bigger vehicles with hybrid routes. The study provides an analytical tool to design SAMS as directional services and transit feeders, and tractable continuous approximation formulations for future research in transit network design.
△ Less
Submitted 7 August, 2024; v1 submitted 23 March, 2024;
originally announced March 2024.
-
Multispectral Image Restoration by Generalized Opponent Transformation Total Variation
Authors:
Zhantao Ma,
Michael K. Ng
Abstract:
Multispectral images (MSI) contain light information in different wavelengths of objects, which convey spectral-spatial information and help improve the performance of various image processing tasks. Numerous techniques have been created to extend the application of total variation regularization in restoring multispectral images, for example, based on channel coupling and adaptive total variation…
▽ More
Multispectral images (MSI) contain light information in different wavelengths of objects, which convey spectral-spatial information and help improve the performance of various image processing tasks. Numerous techniques have been created to extend the application of total variation regularization in restoring multispectral images, for example, based on channel coupling and adaptive total variation regularization. The primary contribution of this paper is to propose and develop a new multispectral total variation regularization in a generalized opponent transformation domain instead of the original multispectral image domain. Here opponent transformations for multispectral images are generalized from a well-known opponent transformation for color images. We will explore the properties of generalized opponent transformation total variation (GOTTV) regularization and the corresponding optimization formula for multispectral image restoration. To evaluate the effectiveness of the new GOTTV method, we provide numerical examples that showcase its superior performance compared to existing multispectral image total variation methods, using criteria such as MPSNR and MSSIM.
△ Less
Submitted 19 March, 2024;
originally announced March 2024.
-
Finite final segments of the d.c.e. Turing degrees
Authors:
Steffen Lempp,
Yiqun Liu,
Yong Liu,
Keng Meng Ng,
Cheng Peng,
Guohua Wu
Abstract:
We prove that every finite distributive lattice is isomorphic to a final segment of the d.c.e. Turing degrees (i.e., the degrees of differences of computably enumerable sets). As a corollary, we are able to infer the undecidability of the EAE-theory of the d.c.e. degrees in the language of partial ordering.
We prove that every finite distributive lattice is isomorphic to a final segment of the d.c.e. Turing degrees (i.e., the degrees of differences of computably enumerable sets). As a corollary, we are able to infer the undecidability of the EAE-theory of the d.c.e. degrees in the language of partial ordering.
△ Less
Submitted 21 March, 2024; v1 submitted 7 March, 2024;
originally announced March 2024.
-
A One-step Image Retargeing Algorithm Based on Conformal Energy
Authors:
Chengyang Liu,
Michael K. Ng
Abstract:
The image retargeting problem is to find a proper mapping to resize an image to one with a prescribed aspect ratio, which is quite popular these days. In this paper, we propose an efficient and orientation-preserving one-step image retargeting algorithm based on minimizing the harmonic energy, which can well preserve the regions of interest (ROIs) and line structures in the image. We also give som…
▽ More
The image retargeting problem is to find a proper mapping to resize an image to one with a prescribed aspect ratio, which is quite popular these days. In this paper, we propose an efficient and orientation-preserving one-step image retargeting algorithm based on minimizing the harmonic energy, which can well preserve the regions of interest (ROIs) and line structures in the image. We also give some mathematical proofs in the paper to ensure the well-posedness and accuracy of our algorithm.
△ Less
Submitted 28 February, 2024;
originally announced February 2024.
-
AlgoFormer: An Efficient Transformer Framework with Algorithmic Structures
Authors:
Yihang Gao,
Chuanyang Zheng,
Enze Xie,
Han Shi,
Tianyang Hu,
Yu Li,
Michael K. Ng,
Zhenguo Li,
Zhaoqiang Liu
Abstract:
Besides natural language processing, transformers exhibit extraordinary performance in solving broader applications, including scientific computing and computer vision. Previous works try to explain this from the expressive power and capability perspectives that standard transformers are capable of performing some algorithms. To empower transformers with algorithmic capabilities and motivated by t…
▽ More
Besides natural language processing, transformers exhibit extraordinary performance in solving broader applications, including scientific computing and computer vision. Previous works try to explain this from the expressive power and capability perspectives that standard transformers are capable of performing some algorithms. To empower transformers with algorithmic capabilities and motivated by the recently proposed looped transformer, we design a novel transformer framework, dubbed Algorithm Transformer (abbreviated as AlgoFormer). We provide an insight that efficient transformer architectures can be designed by leveraging prior knowledge of tasks and the underlying structure of potential algorithms. Compared with the standard transformer and vanilla looped transformer, the proposed AlgoFormer can perform efficiently in algorithm representation in some specific tasks. In particular, inspired by the structure of human-designed learning algorithms, our transformer framework consists of a pre-transformer that is responsible for task preprocessing, a looped transformer for iterative optimization algorithms, and a post-transformer for producing the desired results after post-processing. We provide theoretical evidence of the expressive power of the AlgoFormer in solving some challenging problems, mirroring human-designed algorithms. Furthermore, some theoretical and empirical results are presented to show that the designed transformer has the potential to perform algorithm representation and learning. Experimental results demonstrate the empirical superiority of the proposed transformer in that it outperforms the standard transformer and vanilla looped transformer in some specific tasks. An extensive experiment on real language tasks (e.g., neural machine translation of German and English, and text classification) further validates the expressiveness and effectiveness of AlgoFormer.
△ Less
Submitted 10 January, 2025; v1 submitted 21 February, 2024;
originally announced February 2024.
-
Logical Berkovich Geometry: A Point-free Perspective
Authors:
Ming Ng
Abstract:
Extending our insights from \cite{NVOstrowski}, we apply point-free techniques to sharpen a foundational result in Berkovich geometry. In our language, given the ring $\mathcal{A}:=K\{R^{-1}T\}$ of convergent power series over a suitable non-Archimedean field $K$, the points of its Berkovich Spectrum $\mathcal{M}(\mathcal{A})$ correspond to $R$-good filters. The surprise is that, unlike the origin…
▽ More
Extending our insights from \cite{NVOstrowski}, we apply point-free techniques to sharpen a foundational result in Berkovich geometry. In our language, given the ring $\mathcal{A}:=K\{R^{-1}T\}$ of convergent power series over a suitable non-Archimedean field $K$, the points of its Berkovich Spectrum $\mathcal{M}(\mathcal{A})$ correspond to $R$-good filters. The surprise is that, unlike the original result by Berkovich, we do not require the field $K$ to be non-trivially valued. Our investigations into non-Archimedean geometry can be understood as being framed by the question: what is the relationship between topology and logic?
△ Less
Submitted 31 August, 2023;
originally announced August 2023.
-
A Point-Free Look at Ostrowski's Theorem and Absolute Values
Authors:
Ming Ng,
Steven Vickers
Abstract:
This paper investigates the absolute values on $\mathbb{Z}$ valued in the upper reals (i.e. reals for which only a right Dedekind section is given). These necessarily include multiplicative seminorms corresponding to the finite prime fields $\mathbb{F}_p$. As an Ostrowski-type Theorem, the space of such absolute values is homeomorphic to a space of prime ideals (with co-Zariski topology) suitably…
▽ More
This paper investigates the absolute values on $\mathbb{Z}$ valued in the upper reals (i.e. reals for which only a right Dedekind section is given). These necessarily include multiplicative seminorms corresponding to the finite prime fields $\mathbb{F}_p$. As an Ostrowski-type Theorem, the space of such absolute values is homeomorphic to a space of prime ideals (with co-Zariski topology) suitably paired with upper reals in the range $[-\infty, 1]$, and from this is recovered the standard Ostrowski's Theorem for absolute values on $\mathbb{Q}$.
Our approach is fully constructive, using, in the topos-theoretic sense, geometric reasoning with point-free spaces, and that calls for a careful distinction between Dedekinds vs. upper reals. This forces attention on topological subtleties that are obscured in the classical treatment. In particular, the admission of multiplicative seminorms points to connections with Berkovich and adic spectra. The results are also intended to contribute to characterising a (point-free) space of places of $\mathbb{Q}$.
△ Less
Submitted 20 August, 2023;
originally announced August 2023.
-
Redesigning Large-Scale Multimodal Transit Networks with Shared Autonomous Mobility Services
Authors:
Max T. M. Ng,
Hani S. Mahmassani,
Ömer Verbas,
Taner Cokyasar,
Roman Engelhardt
Abstract:
This study addresses a large-scale multimodal transit network design problem, with Shared Autonomous Mobility Services (SAMS) as both transit feeders and an origin-to-destination mode. The framework captures spatial demand and modal characteristics, considers intermodal transfers and express services, determines transit infrastructure investment and path flows, and generates transit routes. A syst…
▽ More
This study addresses a large-scale multimodal transit network design problem, with Shared Autonomous Mobility Services (SAMS) as both transit feeders and an origin-to-destination mode. The framework captures spatial demand and modal characteristics, considers intermodal transfers and express services, determines transit infrastructure investment and path flows, and generates transit routes. A system-optimal multimodal transit network is designed with minimum total door-to-door generalized costs of users and operators, satisfying transit origin-destination demand within a pre-set infrastructure budget. Firstly, the geography, demand, and modes in each zone are characterized with continuous approximation. The decisions of network link investment and multimodal path flows in zonal connection optimization are formulated as a minimum-cost multi-commodity network flow (MCNF) problem and solved efficiently with a mixed-integer linear programming (MILP) solver. Subsequently, the route generation problem is solved by expanding the MCNF formulation to minimize intramodal transfers. The model is illustrated through a set of experiments with the Chicago network comprised of 50 zones and seven modes, under three scenarios. The computational results present savings in traveler journey time and operator cost demonstrating the potential benefits of collaboration between multimodal transit systems and SAMS.
△ Less
Submitted 27 March, 2024; v1 submitted 29 July, 2023;
originally announced July 2023.
-
Block Diagonalization of Quaternion Circulant Matrices with Applications
Authors:
Junjun Pan,
Michael K. Ng
Abstract:
It is well-known that a complex circulant matrix can be diagonalized by a discrete Fourier matrix with imaginary unit $\mathtt{i}$. The main aim of this paper is to demonstrate that a quaternion circulant matrix cannot be diagonalized by a discrete quaternion Fourier matrix with three imaginary units $\mathtt{i}$, $\mathtt{j}$ and $\mathtt{k}$. Instead, a quaternion circulant matrix can be block-d…
▽ More
It is well-known that a complex circulant matrix can be diagonalized by a discrete Fourier matrix with imaginary unit $\mathtt{i}$. The main aim of this paper is to demonstrate that a quaternion circulant matrix cannot be diagonalized by a discrete quaternion Fourier matrix with three imaginary units $\mathtt{i}$, $\mathtt{j}$ and $\mathtt{k}$. Instead, a quaternion circulant matrix can be block-diagonalized into 1-by-1 block and 2-by-2 block matrices by permuted discrete quaternion Fourier transform matrix. With such a block-diagonalized form, the inverse of a quaternion circulant matrix can be determined efficiently similar to the inverse of a complex circulant matrix. We make use of this block-diagonalized form to study quaternion tensor singular value decomposition of quaternion tensors where the entries are quaternion numbers. The applications including computing the inverse of a quaternion circulant matrix, and solving quaternion Toeplitz system arising from linear prediction of quaternion signals are employed to validate the efficiency of our proposed block diagonalized results. A numerical example of color video as third-order quaternion tensor is employed to validate the effectiveness of quaternion tensor singular value decomposition.
△ Less
Submitted 8 February, 2024; v1 submitted 8 February, 2023;
originally announced February 2023.
-
Quantizing Heavy-tailed Data in Statistical Estimation: (Near) Minimax Rates, Covariate Quantization, and Uniform Recovery
Authors:
Junren Chen,
Michael K. Ng,
Di Wang
Abstract:
This paper studies the quantization of heavy-tailed data in some fundamental statistical estimation problems, where the underlying distributions have bounded moments of some order. We propose to truncate and properly dither the data prior to a uniform quantization. Our major standpoint is that (near) minimax rates of estimation error are achievable merely from the quantized data produced by the pr…
▽ More
This paper studies the quantization of heavy-tailed data in some fundamental statistical estimation problems, where the underlying distributions have bounded moments of some order. We propose to truncate and properly dither the data prior to a uniform quantization. Our major standpoint is that (near) minimax rates of estimation error are achievable merely from the quantized data produced by the proposed scheme. In particular, concrete results are worked out for covariance estimation, compressed sensing, and matrix completion, all agreeing that the quantization only slightly worsens the multiplicative factor. Besides, we study compressed sensing where both covariate (i.e., sensing vector) and response are quantized. Under covariate quantization, although our recovery program is non-convex because the covariance matrix estimator lacks positive semi-definiteness, all local minimizers are proved to enjoy near optimal error bound. Moreover, by the concentration inequality of product process and covering argument, we establish near minimax uniform recovery guarantee for quantized compressed sensing with heavy-tailed noise.
△ Less
Submitted 26 July, 2023; v1 submitted 30 December, 2022;
originally announced December 2022.
-
SVD-PINNs: Transfer Learning of Physics-Informed Neural Networks via Singular Value Decomposition
Authors:
Yihang Gao,
Ka Chun Cheung,
Michael K. Ng
Abstract:
Physics-informed neural networks (PINNs) have attracted significant attention for solving partial differential equations (PDEs) in recent years because they alleviate the curse of dimensionality that appears in traditional methods. However, the most disadvantage of PINNs is that one neural network corresponds to one PDE. In practice, we usually need to solve a class of PDEs, not just one. With the…
▽ More
Physics-informed neural networks (PINNs) have attracted significant attention for solving partial differential equations (PDEs) in recent years because they alleviate the curse of dimensionality that appears in traditional methods. However, the most disadvantage of PINNs is that one neural network corresponds to one PDE. In practice, we usually need to solve a class of PDEs, not just one. With the explosive growth of deep learning, many useful techniques in general deep learning tasks are also suitable for PINNs. Transfer learning methods may reduce the cost for PINNs in solving a class of PDEs. In this paper, we proposed a transfer learning method of PINNs via keeping singular vectors and optimizing singular values (namely SVD-PINNs). Numerical experiments on high dimensional PDEs (10-d linear parabolic equations and 10-d Allen-Cahn equations) show that SVD-PINNs work for solving a class of PDEs with different but close right-hand-side functions.
△ Less
Submitted 14 March, 2024; v1 submitted 16 November, 2022;
originally announced November 2022.
-
Stochastic Variance Reduced Gradient for affine rank minimization problem
Authors:
Ningning Han,
Juan Nie,
Jian Lu,
Michael K. Ng
Abstract:
We develop an efficient stochastic variance reduced gradient descent algorithm to solve the affine rank minimization problem consists of finding a matrix of minimum rank from linear measurements. The proposed algorithm as a stochastic gradient descent strategy enjoys a more favorable complexity than full gradients. It also reduces the variance of the stochastic gradient at each iteration and accel…
▽ More
We develop an efficient stochastic variance reduced gradient descent algorithm to solve the affine rank minimization problem consists of finding a matrix of minimum rank from linear measurements. The proposed algorithm as a stochastic gradient descent strategy enjoys a more favorable complexity than full gradients. It also reduces the variance of the stochastic gradient at each iteration and accelerate the rate of convergence. We prove that the proposed algorithm converges linearly in expectation to the solution under a restricted isometry condition. The numerical experiments show that the proposed algorithm has a clearly advantageous balance of efficiency, adaptivity, and accuracy compared with other state-of-the-art greedy algorithms.
△ Less
Submitted 4 November, 2022;
originally announced November 2022.
-
A Momentum Accelerated Adaptive Cubic Regularization Method for Nonconvex Optimization
Authors:
Yihang Gao,
Michael K. Ng
Abstract:
The cubic regularization method (CR) and its adaptive version (ARC) are popular Newton-type methods in solving unconstrained non-convex optimization problems, due to its global convergence to local minima under mild conditions. The main aim of this paper is to develop a momentum-accelerated adaptive cubic regularization method (ARCm) to improve the convergent performance. With the proper choice of…
▽ More
The cubic regularization method (CR) and its adaptive version (ARC) are popular Newton-type methods in solving unconstrained non-convex optimization problems, due to its global convergence to local minima under mild conditions. The main aim of this paper is to develop a momentum-accelerated adaptive cubic regularization method (ARCm) to improve the convergent performance. With the proper choice of momentum step size, we show the global convergence of ARCm and the local convergence can also be guaranteed under the \KL property. Such global and local convergence can also be established when inexact solvers with low computational costs are employed in the iteration procedure. Numerical results for non-convex logistic regression and robust linear regression models are reported to demonstrate that the proposed ARCm significantly outperforms state-of-the-art cubic regularization methods (e.g., CR, momentum-based CR, ARC) and the trust region method. In particular, the number of iterations required by ARCm is less than 10\% to 50\% required by the most competitive method (ARC) in the experiments.
△ Less
Submitted 12 October, 2022;
originally announced October 2022.
-
Approximate Secular Equations for the Cubic Regularization Subproblem
Authors:
Yihang Gao,
Man-Chung Yue,
Michael K. Ng
Abstract:
The cubic regularization method (CR) is a popular algorithm for unconstrained non-convex optimization. At each iteration, CR solves a cubically regularized quadratic problem, called the cubic regularization subproblem (CRS). One way to solve the CRS relies on solving the secular equation, whose computational bottleneck lies in the computation of all eigenvalues of the Hessian matrix. In this paper…
▽ More
The cubic regularization method (CR) is a popular algorithm for unconstrained non-convex optimization. At each iteration, CR solves a cubically regularized quadratic problem, called the cubic regularization subproblem (CRS). One way to solve the CRS relies on solving the secular equation, whose computational bottleneck lies in the computation of all eigenvalues of the Hessian matrix. In this paper, we propose and analyze a novel CRS solver based on an approximate secular equation, which requires only some of the Hessian eigenvalues and is therefore much more efficient. Two approximate secular equations (ASEs) are developed. For both ASEs, we first study the existence and uniqueness of their roots and then establish an upper bound on the gap between the root and that of the standard secular equation. Such an upper bound can in turn be used to bound the distance from the approximate CRS solution based ASEs to the true CRS solution, thus offering a theoretical guarantee for our CRS solver. A desirable feature of our CRS solver is that it requires only matrix-vector multiplication but not matrix inversion, which makes it particularly suitable for high-dimensional applications of unconstrained non-convex optimization, such as low-rank recovery and deep learning. Numerical experiments with synthetic and real data-sets are conducted to investigate the practical performance of the proposed CRS solver. Experimental results show that the proposed solver outperforms two state-of-the-art methods.
△ Less
Submitted 27 September, 2022;
originally announced September 2022.
-
Computable topological groups
Authors:
Heer Tern Koh,
Alexander Melnikov,
Keng Meng Ng
Abstract:
We investigate what it means for a (Hausdorff, second-countable) topological group to be computable. We compare several potential definitions in the literature. We relate these notions with the well-established definitions of effective presentability for discrete and profinite groups, and compare these results with similar results in computable topology. Most of these definitions can be separated…
▽ More
We investigate what it means for a (Hausdorff, second-countable) topological group to be computable. We compare several potential definitions in the literature. We relate these notions with the well-established definitions of effective presentability for discrete and profinite groups, and compare these results with similar results in computable topology. Most of these definitions can be separated by counter-examples. Remarkably, we prove that two such definitions are equivalent for locally compact Polish and abelian Polish groups. More specifically, we prove that in these broad classes of groups, every computable topological group admits a right-c.e.~(upper semi-computable) presentation with a left-invariant metric, and a computable dense sequence of points. In the locally compact case, we also show that if the group is additionally effectively locally compact, then we can produce an effectively proper left-invariant metric.
△ Less
Submitted 10 September, 2022;
originally announced September 2022.
-
Expressing Multivariate Time Series as Graphs with Time Series Attention Transformer
Authors:
William T. Ng,
K. Siu,
Albert C. Cheung,
Michael K. Ng
Abstract:
A reliable and efficient representation of multivariate time series is crucial in various downstream machine learning tasks. In multivariate time series forecasting, each variable depends on its historical values and there are inter-dependencies among variables as well. Models have to be designed to capture both intra- and inter-relationships among the time series. To move towards this goal, we pr…
▽ More
A reliable and efficient representation of multivariate time series is crucial in various downstream machine learning tasks. In multivariate time series forecasting, each variable depends on its historical values and there are inter-dependencies among variables as well. Models have to be designed to capture both intra- and inter-relationships among the time series. To move towards this goal, we propose the Time Series Attention Transformer (TSAT) for multivariate time series representation learning. Using TSAT, we represent both temporal information and inter-dependencies of multivariate time series in terms of edge-enhanced dynamic graphs. The intra-series correlations are represented by nodes in a dynamic graph; a self-attention mechanism is modified to capture the inter-series correlations by using the super-empirical mode decomposition (SMD) module. We applied the embedded dynamic graphs to times series forecasting problems, including two real-world datasets and two benchmark datasets. Extensive experiments show that TSAT clearly outerperforms six state-of-the-art baseline methods in various forecasting horizons. We further visualize the embedded dynamic graphs to illustrate the graph representation power of TSAT. We share our code at https://github.com/RadiantResearch/TSAT.
△ Less
Submitted 19 August, 2022;
originally announced August 2022.
-
Limit Complexities, Minimal Descriptions, and $n$-Randomness
Authors:
Rodney Downey,
Lu Liu,
Keng Meng Ng,
Daniel Turetsky
Abstract:
Let $K$ denote prefix-free Kolmogorov Complexity, and $K^A$ denote it relative to an oracle $A$. We show that for any $n$, $K^{\emptyset^{(n)}}$ is definable purely in terms of the unrelativized notion $K$. It was already known that 2-randomness is definable in terms of $K$ (and plain complexity $C$) as those reals which infinitely often have maximal complexity. We can use our characterization to…
▽ More
Let $K$ denote prefix-free Kolmogorov Complexity, and $K^A$ denote it relative to an oracle $A$. We show that for any $n$, $K^{\emptyset^{(n)}}$ is definable purely in terms of the unrelativized notion $K$. It was already known that 2-randomness is definable in terms of $K$ (and plain complexity $C$) as those reals which infinitely often have maximal complexity. We can use our characterization to show that $n$-randomness is definable purely in terms of $K$. To do this we extend a certain ``limsup'' formula from the literature, and apply Symmetry of Information. This extension entails a novel use of semilow sets, and a more precise analysis of the complexity of $Δ_2^0$ sets of mimimal descriptions.
△ Less
Submitted 5 August, 2022;
originally announced August 2022.
-
Separable Quaternion Matrix Factorization for Polarization Images
Authors:
Junjun Pan,
Michael K. Ng
Abstract:
Polarization is a unique characteristic of transverse wave and is represented by Stokes parameters. Analysis of polarization states can reveal valuable information about the sources. In this paper, we propose a separable low-rank quaternion linear mixing model to polarized signals: we assume each column of the source factor matrix equals a column of polarized data matrix and refer to the correspon…
▽ More
Polarization is a unique characteristic of transverse wave and is represented by Stokes parameters. Analysis of polarization states can reveal valuable information about the sources. In this paper, we propose a separable low-rank quaternion linear mixing model to polarized signals: we assume each column of the source factor matrix equals a column of polarized data matrix and refer to the corresponding problem as separable quaternion matrix factorization (SQMF). We discuss some properties of the matrix that can be decomposed by SQMF. To determine the source factor matrix in quaternion space, we propose a heuristic algorithm called quaternion successive projection algorithm (QSPA) inspired by the successive projection algorithm. To guarantee the effectiveness of QSPA, a new normalization operator is proposed for the quaternion matrix. We use a block coordinate descent algorithm to compute nonnegative factor activation matrix in real number space. We test our method on the applications of polarization image representation and spectro-polarimetric imaging unmixing to verify its effectiveness.
△ Less
Submitted 28 July, 2022;
originally announced July 2022.
-
HessianFR: An Efficient Hessian-based Follow-the-Ridge Algorithm for Minimax Optimization
Authors:
Yihang Gao,
Huafeng Liu,
Michael K. Ng,
Mingjie Zhou
Abstract:
Wide applications of differentiable two-player sequential games (e.g., image generation by GANs) have raised much interest and attention of researchers to study efficient and fast algorithms. Most of the existing algorithms are developed based on nice properties of simultaneous games, i.e., convex-concave payoff functions, but are not applicable in solving sequential games with different settings.…
▽ More
Wide applications of differentiable two-player sequential games (e.g., image generation by GANs) have raised much interest and attention of researchers to study efficient and fast algorithms. Most of the existing algorithms are developed based on nice properties of simultaneous games, i.e., convex-concave payoff functions, but are not applicable in solving sequential games with different settings. Some conventional gradient descent ascent algorithms theoretically and numerically fail to find the local Nash equilibrium of the simultaneous game or the local minimax (i.e., local Stackelberg equilibrium) of the sequential game. In this paper, we propose the HessianFR, an efficient Hessian-based Follow-the-Ridge algorithm with theoretical guarantees. Furthermore, the convergence of the stochastic algorithm and the approximation of Hessian inverse are exploited to improve algorithm efficiency. A series of experiments of training generative adversarial networks (GANs) have been conducted on both synthetic and real-world large-scale image datasets (e.g. MNIST, CIFAR-10 and CelebA). The experimental results demonstrate that the proposed HessianFR outperforms baselines in terms of convergence and image generation quality.
△ Less
Submitted 23 May, 2022;
originally announced May 2022.
-
Deep neural networks for solving large linear systems arising from high-dimensional problems
Authors:
Yiqi Gu,
Michael K. Ng
Abstract:
This paper studies deep neural networks for solving extremely large linear systems arising from highdimensional problems. Because of the curse of dimensionality, it is expensive to store both the solution and right-hand side vector in such extremely large linear systems. Our idea is to employ a neural network to characterize the solution with much fewer parameters than the size of the solution und…
▽ More
This paper studies deep neural networks for solving extremely large linear systems arising from highdimensional problems. Because of the curse of dimensionality, it is expensive to store both the solution and right-hand side vector in such extremely large linear systems. Our idea is to employ a neural network to characterize the solution with much fewer parameters than the size of the solution under a matrix-free setting. We present an error analysis of the proposed method, indicating that the solution error is bounded by the condition number of the matrix and the neural network approximation error. Several numerical examples from partial differential equations, queueing problems, and probabilistic Boolean networks are presented to demonstrate that the solutions of linear systems can be learned quite accurately.
△ Less
Submitted 4 March, 2023; v1 submitted 1 April, 2022;
originally announced April 2022.
-
Color Image Inpainting via Robust Pure Quaternion Matrix Completion: Error Bound and Weighted Loss
Authors:
Junren Chen,
Michael K. Ng
Abstract:
In this paper, we study color image inpainting as a pure quaternion matrix completion problem. In the literature, the theoretical guarantee for quaternion matrix completion is not well-established. Our main aim is to propose a new minimization problem with an objective combining nuclear norm and a quadratic loss weighted among three channels. To fill the theoretical vacancy, we obtain the error bo…
▽ More
In this paper, we study color image inpainting as a pure quaternion matrix completion problem. In the literature, the theoretical guarantee for quaternion matrix completion is not well-established. Our main aim is to propose a new minimization problem with an objective combining nuclear norm and a quadratic loss weighted among three channels. To fill the theoretical vacancy, we obtain the error bound in both clean and corrupted regimes, which relies on some new results of quaternion matrices. A general Gaussian noise is considered in robust completion where all observations are corrupted. Motivated by the error bound, we propose to handle unbalanced or correlated noise via a cross-channel weight in the quadratic loss, with the main purpose of rebalancing noise level, or removing noise correlation. Extensive experimental results on synthetic and color image data are presented to confirm and demonstrate our theoretical findings.
△ Less
Submitted 26 October, 2022; v1 submitted 4 February, 2022;
originally announced February 2022.
-
Deep adaptive basis Galerkin method for high-dimensional evolution equations with oscillatory solutions
Authors:
Yiqi Gu,
Micheal K. Ng
Abstract:
In this paper, we study deep neural networks (DNNs) for solving high-dimensional evolution equations with oscillatory solutions. Different from deep least-squares methods that deal with time and space variables simultaneously, we propose a deep adaptive basis Galerkin (DABG) method, which employs the spectral-Galerkin method for the time variable of oscillatory solutions and the deep neural networ…
▽ More
In this paper, we study deep neural networks (DNNs) for solving high-dimensional evolution equations with oscillatory solutions. Different from deep least-squares methods that deal with time and space variables simultaneously, we propose a deep adaptive basis Galerkin (DABG) method, which employs the spectral-Galerkin method for the time variable of oscillatory solutions and the deep neural network method for high-dimensional space variables. The proposed method can lead to a linear system of differential equations having unknown DNNs that can be trained via the loss function. We establish a posterior estimates of the solution error, which is bounded by the minimal loss function and the term $O(N^{-m})$, where $N$ is the number of basis functions and $m$ characterizes the regularity of the e'quation. We also show that if the true solution is a Barron-type function, the error bound converges to zero as $M=O(N^p)$ approaches to infinity, where $M$ is the width of the used networks, and $p$ is a positive constant. Numerical examples, including high-dimensional linear evolution equations and the nonlinear Allen-Cahn equation, are presented to demonstrate the performance of the proposed DABG method is better than that of existing DNNs.
△ Less
Submitted 31 May, 2022; v1 submitted 29 December, 2021;
originally announced December 2021.
-
Punctual equivalence relations and their (punctual) complexity
Authors:
Nikolay Bazhenov,
Keng Meng Ng,
Luca San Mauro,
Andrea Sorbi
Abstract:
The complexity of equivalence relations has received much attention in the recent literature. The main tool for such endeavour is the following reducibility: given equivalence relations $R$ and $S$ on natural numbers, $R$ is computably reducible to $S$ if there is a computable function $f \colon ω\to ω$ that induces an injective map from $R$-equivalence classes to $S$-equivalence classes. In order…
▽ More
The complexity of equivalence relations has received much attention in the recent literature. The main tool for such endeavour is the following reducibility: given equivalence relations $R$ and $S$ on natural numbers, $R$ is computably reducible to $S$ if there is a computable function $f \colon ω\to ω$ that induces an injective map from $R$-equivalence classes to $S$-equivalence classes. In order to compare the complexity of equivalence relations which are computable, researchers considered also feasible variants of computable reducibility, such as the polynomial-time reducibility. In this work, we explore $\mathbf{Peq}$, the degree structure generated by primitive recursive reducibility on punctual equivalence relations (i.e., primitive recursive equivalence relations with domain $ω$). In contrast with all other known degree structures on equivalence relations, we show that $\mathbf{Peq}$ has much more structure: e.g., we show that it is a dense distributive lattice. On the other hand, we also offer evidence of the intricacy of $\mathbf{Peq}$, proving, e.g., that the structure is neither rigid nor homogeneous.
△ Less
Submitted 9 September, 2021;
originally announced September 2021.
-
Wasserstein Generative Adversarial Uncertainty Quantification in Physics-Informed Neural Networks
Authors:
Yihang Gao,
Michael K. Ng
Abstract:
In this paper, we study a physics-informed algorithm for Wasserstein Generative Adversarial Networks (WGANs) for uncertainty quantification in solutions of partial differential equations. By using groupsort activation functions in adversarial network discriminators, network generators are utilized to learn the uncertainty in solutions of partial differential equations observed from the initial/bou…
▽ More
In this paper, we study a physics-informed algorithm for Wasserstein Generative Adversarial Networks (WGANs) for uncertainty quantification in solutions of partial differential equations. By using groupsort activation functions in adversarial network discriminators, network generators are utilized to learn the uncertainty in solutions of partial differential equations observed from the initial/boundary data. Under mild assumptions, we show that the generalization error of the computed generator converges to the approximation error of the network with high probability, when the number of samples are sufficiently taken. According to our established error bound, we also find that our physics-informed WGANs have higher requirement for the capacity of discriminators than that of generators. Numerical results on synthetic examples of partial differential equations are reported to validate our theoretical results and demonstrate how uncertainty quantification can be obtained for solutions of partial differential equations and the distributions of initial/boundary data. However, the quality or the accuracy of the uncertainty quantification theory in all the points in the interior is still the theoretical vacancy, and required for further research.
△ Less
Submitted 9 August, 2022; v1 submitted 30 August, 2021;
originally announced August 2021.
-
Deep Ritz method for the spectral fractional Laplacian equation using the Caffarelli-Silvestre extension
Authors:
Yiqi Gu,
Micheal K. Ng
Abstract:
In this paper, we propose a novel method for solving high-dimensional spectral fractional Laplacian equations. Using the Caffarelli-Silvestre extension, the $d$-dimensional spectral fractional equation is reformulated as a regular partial differential equation of dimension $d+1$. We transform the extended equation as a minimal Ritz energy functional problem and search for its minimizer in a specia…
▽ More
In this paper, we propose a novel method for solving high-dimensional spectral fractional Laplacian equations. Using the Caffarelli-Silvestre extension, the $d$-dimensional spectral fractional equation is reformulated as a regular partial differential equation of dimension $d+1$. We transform the extended equation as a minimal Ritz energy functional problem and search for its minimizer in a special class of deep neural networks. Moreover, based on the approximation property of networks, we establish estimates on the error made by the deep Ritz method. Numerical results are reported to demonstrate the effectiveness of the proposed method for solving fractional Laplacian equations up to ten dimensions. Technically, in this method, we design a special network-based structure to adapt to the singularity and exponential decaying of the true solution. Also, A hybrid integration technique combining Monte Carlo method and sinc quadrature is developed to compute the loss function with higher accuracy.
△ Less
Submitted 29 December, 2021; v1 submitted 26 August, 2021;
originally announced August 2021.
-
Point-free Construction of Real Exponentiation
Authors:
Ming Ng,
Steven Vickers
Abstract:
We define a point-free construction of real exponentiation and logarithms, i.e.\ we construct the maps $\exp\colon (0, \infty)\times \mathbb{R} \rightarrow \!(0,\infty),\, (x, ζ) \mapsto x^ζ$ and $\log\colon (1,\infty)\times (0, \infty) \rightarrow\mathbb{R},\, (b, y) \mapsto \log_b(y)$, and we develop familiar algebraic rules for them. The point-free approach is constructive, and defines the poin…
▽ More
We define a point-free construction of real exponentiation and logarithms, i.e.\ we construct the maps $\exp\colon (0, \infty)\times \mathbb{R} \rightarrow \!(0,\infty),\, (x, ζ) \mapsto x^ζ$ and $\log\colon (1,\infty)\times (0, \infty) \rightarrow\mathbb{R},\, (b, y) \mapsto \log_b(y)$, and we develop familiar algebraic rules for them. The point-free approach is constructive, and defines the points of a space as models of a geometric theory, rather than as elements of a set - in particular, this allows geometric constructions to be applied to points living in toposes other than Set. Our geometric development includes new lifting and gluing techniques in point-free topology, which highlight how properties of $\mathbb{Q}$ determine properties of real exponentiation.
This work is motivated by our broader research programme of developing a version of adelic geometry via topos theory. In particular, we wish to construct the classifying topos of places of $\mathbb{Q}$, which will provide a geometric perspective into the subtle relationship between $\mathbb{R}$ and $\mathbb{Q}_p$, a question of longstanding number-theoretic interest.
△ Less
Submitted 1 August, 2022; v1 submitted 31 March, 2021;
originally announced April 2021.
-
Spectral analysis for preconditioning of multi-dimensional Riesz fractional diffusion equations
Authors:
Xin Huang,
Xue-Lei Lin,
Michael K. Ng,
Hai-Wei Sun
Abstract:
In this paper, we analyze the spectra of the preconditioned matrices arising from discretized multi-dimensional Riesz spatial fractional diffusion equations. The finite difference method is employed to approximate the multi-dimensional Riesz fractional derivatives, which will generate symmetric positive definite ill-conditioned multi-level Toeplitz matrices. The preconditioned conjugate gradient m…
▽ More
In this paper, we analyze the spectra of the preconditioned matrices arising from discretized multi-dimensional Riesz spatial fractional diffusion equations. The finite difference method is employed to approximate the multi-dimensional Riesz fractional derivatives, which will generate symmetric positive definite ill-conditioned multi-level Toeplitz matrices. The preconditioned conjugate gradient method with a preconditioner based on the sine transform is employed to solve the resulting linear system. Theoretically, we prove that the spectra of the preconditioned matrices are uniformly bounded in the open interval (1/2,3/2) and thus the preconditioned conjugate gradient method converges linearly. The proposed method can be extended to multi-level Toeplitz matrices generated by functions with zeros of fractional order. Our theoretical results fill in a vacancy in the literature. Numerical examples are presented to demonstrate our new theoretical results in the literature and show the convergence performance of the proposed preconditioner that is better than other existing preconditioners.
△ Less
Submitted 2 February, 2021;
originally announced February 2021.
-
A parallel-in-time two-sided preconditioning for all-at-once system from a non-local evolutionary equation with weakly singular kernel
Authors:
Xue-lei Lin,
Michael K. Ng,
Yajing Zhi
Abstract:
In this paper, we study a parallel-in-time (PinT) algorithm for all-at-once system from a non-local evolutionary equation with weakly singular kernel where the temporal term involves a non-local convolution with a weakly singular kernel and the spatial term is the usual Laplacian operator with variable coefficients. We propose to use a two-sided preconditioning technique for the all-at-once discre…
▽ More
In this paper, we study a parallel-in-time (PinT) algorithm for all-at-once system from a non-local evolutionary equation with weakly singular kernel where the temporal term involves a non-local convolution with a weakly singular kernel and the spatial term is the usual Laplacian operator with variable coefficients. We propose to use a two-sided preconditioning technique for the all-at-once discretization of the equation. Our preconditioner is constructed by replacing the variable diffusion coefficients with a constant coefficient to obtain a constant-coefficient all-at-once matrix. We split a square root of the constant Laplacian operator out of the constant-coefficient all-at-once matrix as a right preconditioner and take the remaining part as a left preconditioner, which constitutes our two-sided preconditioning. Exploiting the diagonalizability of the constant-Laplacian matrix and the triangular Toeplitz structure of the temporal discretization matrix, we obtain efficient representations of inverses of the right and the left preconditioners, because of which the iterative solution can be fast updated in a PinT manner. Theoretically, the condition number of the two-sided preconditioned matrix is proven to be uniformly bounded by a constant independent of the matrix size. To the best of our knowledge, for the non-local evolutionary equation with variable coefficients, this is the first attempt to develop a PinT preconditioning technique that has fast and exact implementation and that the corresponding preconditioned system has a uniformly bounded condition number. Numerical results are reported to confirm the efficiency of the proposed two-sided preconditioning technique.
△ Less
Submitted 30 January, 2021;
originally announced February 2021.
-
Low Rank Pure Quaternion Approximation for Pure Quaternion Matrices
Authors:
Guangjing Song,
Weiyang Ding,
Michael K. Ng
Abstract:
Quaternion matrices are employed successfully in many color image processing applications. In particular, a pure quaternion matrix can be used to represent red, green and blue channels of color images. A low-rank approximation for a pure quaternion matrix can be obtained by using the quaternion singular value decomposition. However, this approximation is not optimal in the sense that the resulting…
▽ More
Quaternion matrices are employed successfully in many color image processing applications. In particular, a pure quaternion matrix can be used to represent red, green and blue channels of color images. A low-rank approximation for a pure quaternion matrix can be obtained by using the quaternion singular value decomposition. However, this approximation is not optimal in the sense that the resulting low-rank approximation matrix may not be pure quaternion, i.e., the low-rank matrix contains real component which is not useful for the representation of a color image. The main contribution of this paper is to find an optimal rank-$r$ pure quaternion matrix approximation for a pure quaternion matrix (a color image). Our idea is to use a projection on a low-rank quaternion matrix manifold and a projection on a quaternion matrix with zero real component, and develop an alternating projections algorithm to find such optimal low-rank pure quaternion matrix approximation. The convergence of the projection algorithm can be established by showing that the low-rank quaternion matrix manifold and the zero real component quaternion matrix manifold has a non-trivial intersection point. Numerical examples on synthetic pure quaternion matrices and color images are presented to illustrate the projection algorithm can find optimal low-rank pure quaternion approximation for pure quaternion matrices or color images.
△ Less
Submitted 30 December, 2020;
originally announced December 2020.
-
Riemannian Conjugate Gradient Descent Method for Third-Order Tensor Completion
Authors:
Guang-Jing Song,
Xue-Zhong Wang,
Michael K. Ng
Abstract:
The goal of tensor completion is to fill in missing entries of a partially known tensor under a low-rank constraint. In this paper, we mainly study low rank third-order tensor completion problems by using Riemannian optimization methods on the smooth manifold. Here the tensor rank is defined to be a set of matrix ranks where the matrices are the slices of the transformed tensor obtained by applyin…
▽ More
The goal of tensor completion is to fill in missing entries of a partially known tensor under a low-rank constraint. In this paper, we mainly study low rank third-order tensor completion problems by using Riemannian optimization methods on the smooth manifold. Here the tensor rank is defined to be a set of matrix ranks where the matrices are the slices of the transformed tensor obtained by applying the Fourier-related transformation onto the tubes of the original tensor. We show that with suitable incoherence conditions on the underlying low rank tensor, the proposed Riemannian optimization method is guaranteed to converge and find such low rank tensor with a high probability. In addition, numbers of sample entries required for solving low rank tensor completion problem under different initialized methods are studied and derived. Numerical examples for both synthetic and image data sets are reported to demonstrate the proposed method is able to recover low rank tensors.
△ Less
Submitted 20 November, 2020;
originally announced November 2020.
-
Non-Local Robust Quaternion Matrix Completion for Color Images and Videos Inpainting
Authors:
Zhigang Jia,
Qiyu Jin,
Michael K. Ng,
Xile Zhao
Abstract:
The image nonlocal self-similarity (NSS) prior refers to the fact that a local patch often has many nonlocal similar patches to it across the image and has been widely applied in many recently proposed machining learning algorithms for image processing. However, there is no theoretical analysis on its working principle in the literature. In this paper, we discover a potential causality between NSS…
▽ More
The image nonlocal self-similarity (NSS) prior refers to the fact that a local patch often has many nonlocal similar patches to it across the image and has been widely applied in many recently proposed machining learning algorithms for image processing. However, there is no theoretical analysis on its working principle in the literature. In this paper, we discover a potential causality between NSS and low-rank property of color images, which is also available to grey images. A new patch group based NSS prior scheme is proposed to learn explicit NSS models of natural color images. The numerical low-rank property of patched matrices is also rigorously proved. The NSS-based QMC algorithm computes an optimal low-rank approximation to the high-rank color image, resulting in high PSNR and SSIM measures and particularly the better visual quality. A new tensor NSS-based QMC method is also presented to solve the color video inpainting problem based on quaternion tensor representation. The numerical experiments on color images and videos indicate the advantages of NSS-based QMC over the state-of-the-art methods.
△ Less
Submitted 13 May, 2022; v1 submitted 17 November, 2020;
originally announced November 2020.