-
Convex Submodular Minimization with Indicator Variables
Authors:
Andres Gomez,
Shaoning Han
Abstract:
We study a general class of convex submodular optimization problems with indicator variables. Many applications, such as the problem of inferring Markov random fields (MRFs) with a sparsity or robustness prior, can be naturally modeled in this form. We show that these problems can be reduced to binary submodular minimization problems, possibly after a suitable reformulation, and thus are strongly polynomially solvable. We also discuss the implication of our results in the case of quadratic objectives. Furthermore, we develop a parametric approach for computing the associated extreme bases under certain smoothness conditions. This leads to a fast solution method, whose efficiency is demonstrated through numerical experiments.
Submitted 7 July, 2025; v1 submitted 1 July, 2025;
originally announced July 2025.
-
The Error in a Smooth Weighted Prime Number Formula and Zero-free Regions for the Riemann Zeta Function
Authors:
Songlin Han
Abstract:
We study the error bound for a smooth weighted prime number theorem and its implications for the zero-free region of the Riemann zeta function, using the method of Pintz. We also give an application to the average number of smooth weighted Goldbach representations and generalize the result to the case of smooth weighted average k-Goldbach representations.
Submitted 26 May, 2025;
originally announced May 2025.
-
Rank-one convexification for quadratic optimization problems with step function penalties
Authors:
Soobin Choi,
Valentina Cepeda,
Andres Gomez,
Shaoning Han
Abstract:
We investigate convexification for convex quadratic optimization with step function penalties. Such problems can be cast as mixed-integer quadratic optimization problems, where binary variables are used to encode the non-convex step function. First, we derive the convex hull for the epigraph of a quadratic function defined by a rank-one matrix and step function penalties. Using this rank-one convexification, we develop copositive and semi-definite relaxations for general convex quadratic functions. Leveraging these findings, we construct convex formulations to the support vector machine problem with 0--1 loss and show that they yield robust estimators in settings with anomalies and outliers.
Submitted 22 April, 2025;
originally announced April 2025.
-
On Finite Time Span Estimators of Parameters for Ornstein-Uhlenbeck Processes
Authors:
Jun S. Han,
Nino Kordzakhia
Abstract:
We study the bias and the mean-squared error of the maximum likelihood estimators (MLE) of parameters associated with a two-parameter mean-reverting process for a finite time $T$. Using the likelihood ratio process, we derive the expressions for MLEs, then compute the bias and the MSE via the change of measure and Ito's formula. We apply the derived expressions to the general Ornstein-Uhlenbeck process, where the bias and the MSE are numerically computed through a joint moment-generating function of key functionals of the O-U process. A numerical study is provided to illustrate the behaviour of bias and the MSE for the MLE of the mean-reverting speed parameter.
Submitted 30 March, 2025;
originally announced March 2025.
-
Precoder Learning by Leveraging Unitary Equivariance Property
Authors:
Yilun Ge,
Shuyao Liao,
Shengqian Han,
Chenyang Yang
Abstract:
Incorporating mathematical properties of a wireless policy to be learned into the design of deep neural networks (DNNs) is effective for enhancing learning efficiency. The multi-user precoding policy in a multi-antenna system, which is the mapping from the channel matrix to the precoding matrix, possesses a permutation equivariance property, which has been harnessed to design the parameter sharing structure of the weight matrix of DNNs. In this paper, we study a stronger property than permutation equivariance, namely unitary equivariance, for precoder learning. We first show that a DNN with unitary equivariance designed by further introducing parameter sharing into a permutation equivariant DNN is unable to learn the optimal precoder. We proceed to develop a novel non-linear weighting process satisfying unitary equivariance and then construct a joint unitary and permutation equivariant DNN. Simulation results demonstrate that the proposed DNN not only outperforms existing learning methods in learning performance and generalizability but also reduces training complexity.
Submitted 12 March, 2025;
originally announced March 2025.
-
On families of strongly divisible modules of rank 2
Authors:
Seongjae Han,
Chol Park
Abstract:
Let $p$ be an odd prime, and $\mathbf{Q}_{p^f}$ the unramified extension of $\mathbf{Q}_p$ of degree $f$. In this paper, we reduce the problem of constructing strongly divisible modules for $2$-dimensional semi-stable non-crystalline representations of $\mathrm{Gal}(\overline{\mathbf{Q}}_p/\mathbf{Q}_{p^f})$ with Hodge--Tate weights in the Fontaine--Laffaille range to solving systems of linear equations and inequalities. We also determine the Breuil modules corresponding to the mod-$p$ reduction of the strongly divisible modules. We expect our method to produce at least one Galois-stable lattice in each such representation for general $f$. Moreover, when the mod-$p$ reduction is an extension of distinct characters, we further expect our method to provide the two non-homothetic lattices. As applications, we show that our approach recovers previously known results for $f=1$ and determine the mod-$p$ reduction of the semi-stable representations with some small Hodge--Tate weights when $f=2$.
Submitted 24 May, 2025; v1 submitted 5 March, 2025;
originally announced March 2025.
-
Convergence to superposition of boundary layer, rarefaction and shock for the 1D Navier-Stokes equations
Authors:
Sungho Han,
Moon-Jin Kang,
Jeongho Kim,
Nayeon Kim,
HyeonSeop Oh
Abstract:
We establish the asymptotic stability of solutions to the inflow problem for the one-dimensional barotropic Navier-Stokes equations in half space. When the boundary value is located at the subsonic regime, all the possible thirteen asymptotic patterns are classified in \cite{M01}. We consider the most complicated pattern, the superposition of the boundary layer solution, the 1-rarefaction wave, and the viscous 2-shock waves. In this superposition, the boundary layer is degenerate and large. We prove that, if the strengths of the rarefaction wave and shock wave are small, and if the initial data is a small perturbation of the superposition, then the solution asymptotically converges to the superposition up to a dynamical shift for the shock. As a corollary, our result implies the asymptotic stability for the simpler case where the superposition consists of the degenerate boundary layer solution and the viscous 2-shock. Therefore, we complete the study of the asymptotic stability of the inflow problem for the 1D barotropic Navier-Stokes equations for subsonic boundary values.
Submitted 12 February, 2025;
originally announced February 2025.
-
Time-asymptotic stability of composite wave for the one-dimensional compressible fluid of Korteweg type
Authors:
Sungho Han,
Jeongho Kim
Abstract:
We study the asymptotic stability of a composition of rarefaction and shock waves for the one-dimensional barotropic compressible fluid of Korteweg type, called the Navier-Stokes-Korteweg (NSK) system. Precisely, we show that the solution to the NSK system asymptotically converges to the composition of the rarefaction wave and shifted viscous-dispersive shock wave, under certain smallness assumptions on the initial perturbation and strength of the waves. Our method is based on the method of $a$-contraction with shift developed by Kang and Vasseur \cite{KV16}, which has been successfully applied to obtain contraction or stability of nonlinear waves for hyperbolic systems.
Submitted 3 February, 2025;
originally announced February 2025.
-
Growth Rate Gap for Stable Subgroups
Authors:
Suzhen Han,
Qing Liu
Abstract:
We prove that stable subgroups of Morse local-to-global groups exhibit a growth gap. That is, the growth rate of an infinite-index stable subgroup is strictly less than the growth rate of the ambient Morse local-to-global group. This generalizes a result of Cordes, Russell, Spriano, and Zalloum in the sense that we removed the additional torsion-free or residually finite assumptions. The Morse local-to-global groups are a very broad class of groups, including mapping class groups, CAT(0) groups, closed $3$-manifold groups, certain relatively hyperbolic groups, virtually solvable groups, etc.
Submitted 15 December, 2024;
originally announced December 2024.
-
Remarks on the digital-topological $k$-group structures and the development of the $AP_1$-$k$- and $AP_1^\ast$-$k$-group
Authors:
Sang-Eon Han
Abstract:
In the literature on digital-topological ($DT$-, for brevity) group structures on a digital image $(X,k)$, roughly speaking, two kinds of methods have been presented. Given a digital image $(X,k)$, the first one, called a $DT$-$k$-group, was established in 2022 \cite{H10} by using both the $G_{k^\ast}$- or $C_{k^\ast}$-adjacency \cite{H10} for the product $X^2:=X \times X$ and the $(G_{k^\ast},k)$- or $(C_{k^\ast},k)$-continuity for the multiplication $α:X^2 \to X$ \cite{H10}. The second one, with the name of $NP_i$-$DT$-groups, $i \in \{1,2\}$, was discussed in 2023 \cite{LS1} by using the $NP_i(k,k)$-adjacency for $X^2$ in \cite{B1} and the $(NP_i(k,k), k)$-continuities of the multiplication $α:X^2 \to X$, $i\in \{1,2\}$. However, due to some defects of the $NP_u(k_1,k_2, \cdots, k_v)$-adjacency in \cite{B1,B2}, the $AP_u(k_1,k_2, \cdots, k_v)$-adjacency was recently developed as an alternative to the $NP_u(k_1,k_2, \cdots, k_v)$-adjacency (see Section 4). Besides, we also develop an $AP_u^\ast(k_1,k_2, \cdots, k_v)$-adjacency. For a digital image $(X, k)$, in case an $AP_1(k,k)$-($AP_1$-, for simplicity) adjacency on $X^2$ exists, we formulate both an $AP_1$-$k$- and an $AP_1^\ast$-$k$-group. Then we show that an $AP_1^\ast$-$k$-group is equivalent to Han's $DT$-$k$-group based on both the $C_{k^\ast}$-adjacency on the product $X^2$ and the $(C_{k^\ast}, k)$-continuity for the multiplication $α_1^\prime:(X^2, C_{k^\ast}) \to (X,k)$.
Submitted 30 October, 2024;
originally announced October 2024.
-
The method of $a$-contraction with shifts used for long-time behavior toward viscous shock
Authors:
Sungho Han,
Moon-Jin Kang,
Hobin Lee
Abstract:
We revisit the method of $a$-contraction with shifts used for long-time behavior of barotropic Navier-Stokes flows perturbed from a Riemann shock. In applying the method of $a$-contraction with shifts, we do not employ the effective velocity variable $h$, even for higher order estimates. This approach would be important when handling the barotropic Navier-Stokes system with other effects, such as capillary and boundary effects.
Submitted 14 October, 2024;
originally announced October 2024.
-
A compounded random walk for space-fractional diffusion on finite domains
Authors:
Christopher N. Angstmann,
Daniel S. Han,
Bruce I. Henry,
Boris Z. Huang,
Zhuang Xu
Abstract:
We formulate a compounded random walk that is physically well defined on both finite and infinite domains, and samples space-dependent forces throughout jumps. The governing evolution equation for the walk limits to a space-fractional Fokker-Planck equation valid on bounded domains, and recovers the well known superdiffusive space-fractional diffusion equation on infinite domains. We describe methods for numerical approximation and Monte Carlo simulations and demonstrate excellent correspondence with analytical solutions. This compounded random walk, and its associated fractional Fokker-Planck equation, provides a major advance for modeling space-fractional diffusion through potential fields and on finite domains.
Submitted 11 February, 2025; v1 submitted 13 October, 2024;
originally announced October 2024.
-
Improving the Solution of Indefinite Quadratic Programs and Linear Programs with Complementarity Constraints by a Progressive MIP Method
Authors:
Xinyao Zhang,
Shaoning Han,
Jong-Shi Pang
Abstract:
Indefinite quadratic programs (QPs) are known to be very difficult to solve to global optimality, as are linear programs with linear complementarity constraints. Treating the former as a subclass of the latter, this paper presents a progressive mixed integer linear programming method for solving a general linear program with linear complementarity constraints (LPCC). Instead of solving the LPCC with a full set of integer variables expressing the complementarity conditions, the presented method solves a finite number of mixed integer subprograms by starting with a small fraction of integer variables and progressively increasing this fraction. After describing the PIP (for progressive integer programming) method and its various implementations, we demonstrate, via an extensive set of computational experiments, the superior performance of the progressive approach over the direct solution of the full-integer formulation of the LPCCs. It is also shown that the solution obtained at the termination of the PIP method is a local minimizer of the LPCC, a property that cannot be claimed by any known non-enumerative method for solving this nonconvex program. In all the experiments, the PIP method is initiated at a feasible solution of the LPCC obtained from a nonlinear programming solver and, with high likelihood, can successfully improve it. Thus, the PIP method can improve a stationary solution of an indefinite QP, something that is not likely to be achievable by a nonlinear programming method. Finally, some analysis is presented that provides a better understanding of the roles of the LPCC suboptimal solutions in the local optimality of the indefinite QP.
Submitted 15 March, 2025; v1 submitted 15 September, 2024;
originally announced September 2024.
-
Injectivity of modules over trusses
Authors:
Yongduo Wang,
Shujuan Han,
Dengke Jia,
Jian He,
Dejun Wu
Abstract:
As the dual notion of projective modules over trusses, injective modules over trusses are introduced. The Schanuel Lemmas on projective and injective modules over trusses are exhibited in this paper.
Submitted 11 September, 2024;
originally announced September 2024.
-
Terminal Soft Landing Guidance Law Using Analytic Gravity Turn Trajectory
Authors:
Seungyeop Han,
Byeong-Un Jo,
Koki Ho
Abstract:
This paper presents an innovative terminal landing guidance law that utilizes an analytic solution derived from the gravity turn trajectory. The characteristics of the derived solution are thoroughly investigated, and the solution is employed to generate a reference velocity vector that satisfies terminal landing conditions. A nonlinear control law is applied to effectively track the reference velocity vector within a finite time, and its robustness against disturbances is studied. Furthermore, the guidance law is expanded to incorporate ground collision avoidance by considering the shape of the gravity turn trajectory. The proposed method's fuel efficiency, robustness, and practicality are demonstrated through comprehensive numerical simulations, and its performance is compared with existing methods.
Submitted 2 September, 2024;
originally announced September 2024.
-
Analysis and Design of Satellite Constellation Spare Strategy Using Markov Chain
Authors:
Seungyeop Han,
Takumi Noro,
Koki Ho
Abstract:
This paper introduces an analysis and design method for an optimal spare management policy using a Markov chain for a large-scale satellite constellation. We propose an analysis methodology for spare strategies using a multi-echelon $(r,q)$ inventory control model with a Markov chain, and review two different spare strategies: direct resupply, which inserts spares directly into the constellation orbit using launch vehicles; and indirect resupply, which places spares into parking orbits before transferring them to the constellation orbit. Furthermore, we propose an optimization formulation utilizing the results of the proposed analysis method, and an optimal solution is found using a genetic algorithm.
Submitted 17 August, 2024;
originally announced August 2024.
-
Time Efficient Rate Feedback Tracking Controller with Slew Rate and Control Constraint
Authors:
Seungyeop Han,
Byeong-Un Jo,
Koki Ho
Abstract:
This paper proposes a time-efficient attitude-tracking controller considering the slew rate constraint and control constraint. The algorithm defines the sliding surface, which is the linear combination of command, body, and regulating angular velocity, and utilizes the sliding surface to derive the control command that guarantees finite time stability. The regulating rate, which is an angular velocity regulating the attitude error between the command and body frame, is defined along the instantaneous eigen-axis between the two frames to minimize the rotation angle. In addition, the regulating rate is shaped such that the slew rate constraint is satisfied while the time to regulation is minimized with consideration of the control constraint. Practical scenarios involving Earth observation satellites are used to validate the algorithm's performance.
Submitted 17 August, 2024;
originally announced August 2024.
-
Optimal Strip Attitude Command of Earth Observation Satellite using Differential Dynamic Programming
Authors:
Seungyeop Han,
Byeong-Un Jo,
Koki Ho
Abstract:
This paper addresses the optimal scan profile problem for strip imaging in an Earth observation satellite (EOS) equipped with a time-delay integration (TDI) camera. Modern TDI cameras can control image integration frequency during imaging operation, adding an additional degree of freedom (DOF) to the imaging operation. On the other hand, modern agile EOS is capable of imaging non-parallel ground targets, which require a substantial amount of angular velocity and angular acceleration during operation. We leverage this DOF to minimize various factors impacting image quality, such as angular velocity. Initially, we derive analytic expressions for angular velocity based on kinematic equations. These expressions are then used to formulate a constrained optimal control problem (OCP), which we solve using differential dynamic programming (DDP). We validate our approach through testing and comparison with reference methods across various practical scenarios. Simulation results demonstrate that our proposed method efficiently achieves near-optimal solutions without encountering non-convergence issues.
Submitted 17 August, 2024;
originally announced August 2024.
-
Generalized Gaussian Temporal Difference Error for Uncertainty-aware Reinforcement Learning
Authors:
Seyeon Kim,
Joonhun Lee,
Namhoon Cho,
Sungjun Han,
Wooseop Hwang
Abstract:
Conventional uncertainty-aware temporal difference (TD) learning often assumes a zero-mean Gaussian distribution for TD errors, leading to inaccurate error representations and compromised uncertainty estimation. We introduce a novel framework for generalized Gaussian error modeling in deep reinforcement learning to enhance the flexibility of error distribution modeling by incorporating an additional higher-order moment, particularly kurtosis, thereby improving the estimation and mitigation of data-dependent aleatoric uncertainty. We examine the influence of the shape parameter of the generalized Gaussian distribution (GGD) on aleatoric uncertainty and provide a closed-form expression that demonstrates an inverse relationship between uncertainty and the shape parameter. Additionally, we propose a theoretically grounded weighting scheme to address epistemic uncertainty by fully leveraging the GGD. We refine batch inverse variance weighting with bias reduction and kurtosis considerations, enhancing robustness. Experiments with policy gradient algorithms demonstrate significant performance gains.
Submitted 3 February, 2025; v1 submitted 5 August, 2024;
originally announced August 2024.
-
Finite Time Blowup of Integer- and Fractional-Order Time-Delayed Diffusion Equations
Authors:
Christopher N. Angstmann,
Stuart-James M. Burney,
Daniel S. Han,
Bruce I. Henry,
Boris Z. Huang,
Zhuang Xu
Abstract:
In this work, exact solutions are derived for an integer- and fractional-order time-delayed diffusion equation with arbitrary initial conditions. The solutions are obtained using Fourier transform methods in conjunction with the known properties of delay functions. It is observed that the solutions do not exhibit infinite speed of propagation for smooth initial conditions that are bounded and positive. Sufficient conditions on the initial condition are also established such that the finite time blowup of the solutions can be explicitly calculated. Examples are provided that highlight the contrasting behaviours of these exact solutions with the known dynamics of solutions to the standard diffusion equation.
Submitted 3 August, 2024; v1 submitted 12 June, 2024;
originally announced June 2024.
-
Robust Reward Design for Markov Decision Processes
Authors:
Shuo Wu,
Haoxiang Ma,
Jie Fu,
Shuo Han
Abstract:
The problem of reward design examines the interaction between a leader and a follower, where the leader aims to shape the follower's behavior to maximize the leader's payoff by modifying the follower's reward function. Current approaches to reward design rely on an accurate model of how the follower responds to reward modifications, which can be sensitive to modeling inaccuracies. To address this issue of sensitivity, we present a solution that offers robustness against uncertainties in modeling the follower, including 1) how the follower breaks ties in the presence of nonunique best responses, 2) inexact knowledge of how the follower perceives reward modifications, and 3) bounded rationality of the follower. Our robust solution is guaranteed to exist under mild conditions and can be obtained numerically by solving a mixed-integer linear program. Numerical experiments on multiple test cases demonstrate that our solution improves robustness compared to the standard approach without incurring significant additional computing costs.
Submitted 7 June, 2024;
originally announced June 2024.
-
Exact Solutions of Time-Delay Integer- and Fractional-Order Advection Equations
Authors:
Christopher N. Angstmann,
Stuart-James M. Burney,
Daniel S. Han,
Bruce I. Henry,
Zhuang Xu
Abstract:
Transport phenomena play a vital role in various fields of science and engineering. In this work, exact solutions are derived for advection equations with integer- and fractional-order time derivatives and a constant time-delay in the spatial derivative. Solutions are obtained, for arbitrary separable initial conditions, by incorporating recently introduced delay functions in a separation of variables approach. Examples are provided showing oscillatory and translatory behaviours that are fundamentally different to standard propagating wave solutions.
Submitted 23 September, 2024; v1 submitted 2 June, 2024;
originally announced June 2024.
-
Perfect basis theory for quantum Borcherds-Bozec algebras
Authors:
Zhaobing Fan,
Shaolong Han,
Seok-Jin Kang,
Young Rock Kim
Abstract:
In this paper, we develop the perfect basis theory for quantum Borcherds-Bozec algebras $U_{q}(\mathfrak g)$ and their irreducible highest weight modules $V(λ)$. We show that the lower perfect graph (resp. upper perfect graph) of every lower perfect basis (resp. upper perfect basis) of $U_{q}^{-}(\mathfrak g)$ (resp. $V(λ)$) is isomorphic to the crystal $B(\infty)$ (resp. $B(λ)$).
Submitted 9 May, 2024;
originally announced May 2024.
-
Real-time solution of quadratic optimization problems with banded matrices and indicator variables
Authors:
Andres Gomez,
Shaoning Han,
Leonardo Lozano
Abstract:
We consider mixed-integer quadratic optimization problems with banded matrices and indicator variables. These problems arise pervasively in statistical inference problems with time-series data, where the banded matrix captures the temporal relationship of the underlying process. In particular, the problem studied arises in monitoring problems, where the decision-maker wants to detect changes or anomalies. We propose to solve these problems using decision diagrams. In particular we show how to exploit the temporal dependencies to construct diagrams with size polynomial in the number of decision variables. We also describe how to construct the convex hull of the set under study from the decision diagrams, and how to deploy the method online to solve the problems in milliseconds via a shortest path algorithm.
Submitted 5 May, 2024;
originally announced May 2024.
-
A new Young wall realization of $B(λ)$ and $B(\infty)$
Authors:
Zhaobing Fan,
Shaolong Han,
Seok-Jin Kang,
Young Rock Kim
Abstract:
Using new combinatorics of Young walls, we give a new construction of the arbitrary level highest weight crystal $B(λ)$ for the quantum affine algebras of types $A^{(2)}_{2n}$, $D^{(2)}_{n+1}$, $A^{(2)}_{2n-1}$, $D^{(1)}_n$, $B^{(1)}_n$ and $C^{(1)}_n$. We show that the crystal consisting of reduced Young walls is isomorphic to the crystal $B(λ)$. Moreover, we provide a new realization of the crystal $B(\infty)$ in terms of reduced virtual Young walls and reduced extended Young walls.
Submitted 16 March, 2024;
originally announced March 2024.
-
Young wall models for the level 1 highest weight and Fock space crystals of $U_q(E_6^{(2)})$ and $U_q(F_4^{(1)})$
Authors:
Shaolong Han,
Yuanfeng Jin,
Seok-Jin Kang,
Duncan Laurie
Abstract:
In this paper we construct Young wall models for the level $1$ highest weight and Fock space crystals of quantum affine algebras in types $E_6^{(2)}$ and $F_4^{(1)}$. Our starting point in each case is a combinatorial realization for a certain level $1$ perfect crystal in terms of Young columns. Then using energy functions and affine energy functions we define the notions of reduced and proper Young walls, which model the highest weight and Fock space crystals respectively.
Submitted 24 February, 2024;
originally announced February 2024.
-
Robust SVD Made Easy: A fast and reliable algorithm for large-scale data analysis
Authors:
Sangil Han,
Kyoowon Kim,
Sungkyu Jung
Abstract:
The singular value decomposition (SVD) is a crucial tool in machine learning and statistical data analysis. However, it is highly susceptible to outliers in the data matrix. Existing robust SVD algorithms often sacrifice speed for robustness or fail in the presence of only a few outliers. This study introduces an efficient algorithm, called Spherically Normalized SVD, for robust SVD approximation that is highly insensitive to outliers, computationally scalable, and provides accurate approximations of singular vectors. The proposed algorithm achieves remarkable speed by utilizing only two applications of a standard reduced-rank SVD algorithm to appropriately scaled data, significantly outperforming competing algorithms in computation times. To assess the robustness of the approximated singular vectors and their subspaces against data contamination, we introduce new notions of breakdown points for matrix-valued input, including row-wise, column-wise, and block-wise breakdown points. Theoretical and empirical analyses demonstrate that our algorithm exhibits higher breakdown points compared to standard SVD and its modifications. We empirically validate the effectiveness of our approach in applications such as robust low-rank approximation and robust principal component analysis of high-dimensional microarray datasets. Overall, our study presents a highly efficient and robust solution for SVD approximation that overcomes the limitations of existing algorithms in the presence of outliers.
Submitted 15 February, 2024;
originally announced February 2024.
-
Long-time behavior towards viscous-dispersive shock for Navier-Stokes equations of Korteweg type
Authors:
Sungho Han,
Moon-Jin Kang,
Jeongho Kim,
Hobin Lee
Abstract:
We consider the so-called Navier-Stokes-Korteweg (NSK) equations for the dynamics of compressible barotropic viscous fluids with internal capillarity. We handle the time-asymptotic stability in 1D of the viscous-dispersive shock wave that is a traveling wave solution to NSK as a viscous-dispersive counterpart of a Riemann shock. More precisely, we prove that when the prescribed far-field states of NSK are connected by a single Hugoniot curve, then solutions of NSK tend to the viscous-dispersive shock wave as time goes to infinity. To obtain the convergence, we extend the theory of $a$-contraction with shifts, used for the Navier-Stokes equations, to the NSK system. The main difficulty in the analysis of NSK is due to the third-order derivative terms of the specific volume in the momentum equation. To resolve the problem, we introduce an auxiliary variable that is equivalent to the derivative of the specific volume.
Submitted 15 February, 2024;
originally announced February 2024.
-
Robust support vector machines via conic optimization
Authors:
Valentina Cepeda,
Andrés Gómez,
Shaoning Han
Abstract:
We consider the problem of learning support vector machines robust to uncertainty. It has been established in the literature that typical loss functions, including the hinge loss, are sensitive to data perturbations and outliers, thus performing poorly in the setting considered. In contrast, using the 0-1 loss or a suitable non-convex approximation results in robust estimators, at the expense of large computational costs. In this paper we use mixed-integer optimization techniques to derive a new loss function that better approximates the 0-1 loss compared with existing alternatives, while preserving the convexity of the learning problem. In our computational results, we show that the proposed estimator is competitive with the standard SVMs with the hinge loss in outlier-free regimes and better in the presence of outliers.
Submitted 2 February, 2024;
originally announced February 2024.
-
Convergence analysis of Lawson's iteration for the polynomial and rational minimax approximations
Authors:
Lei-Hong Zhang,
Shanheng Han
Abstract:
Lawson's iteration is a classical and effective method for solving the linear (polynomial) minimax approximation in the complex plane. Extension of Lawson's iteration for the rational minimax approximation with both computationally high efficiency and theoretical guarantee is challenging. A recent work [L.-H. Zhang, L. Yang, W. H. Yang and Y.-N. Zhang, A convex dual programming for the rational minimax approximation and Lawson's iteration, 2023, arXiv:2308.06991v1] reveals that Lawson's iteration can be viewed as a method for solving the dual problem of the original rational minimax approximation, and a new type of Lawson's iteration was proposed. Such a dual problem is guaranteed to obtain the original minimax solution under Ruttan's sufficient condition, and numerically, the proposed Lawson's iteration was observed to converge monotonically with respect to the dual objective function. In this paper, we perform theoretical convergence analysis for Lawson's iteration for both the linear and rational minimax approximations. In particular, we show that (i) for the linear minimax approximation, the near-optimal Lawson exponent $β$ in Lawson's iteration is $β=1$, and (ii) for the rational minimax approximation, the proposed Lawson's iteration converges monotonically with respect to the dual objective function for any sufficiently small $β>0$, and the limit approximant fulfills the complementary slackness: any node associated with positive weight either is an interpolation point or has a constant error.
Submitted 14 April, 2024; v1 submitted 1 January, 2024;
originally announced January 2024.
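As an illustration of the classical scheme this abstract refers to, here is a minimal sketch of Lawson's iteration for the real, discrete, linear (polynomial) minimax problem. It is not the authors' rational variant. The function name `lawson_polyfit`, the monomial basis, the sample grid, and the test function $|x|$ are assumptions chosen for the example; only the weight update with exponent $β$ (default $β=1$) reflects the iteration discussed above.

```python
import numpy as np

def lawson_polyfit(x, f, degree, beta=1.0, iters=200):
    """Classical Lawson iteration (hypothetical helper, real-valued data).

    Repeatedly solves a weighted least-squares problem and multiplies each
    weight by the current absolute error raised to the power beta.
    """
    x = np.asarray(x, dtype=float)
    f = np.asarray(f, dtype=float)
    V = np.vander(x, degree + 1, increasing=True)  # monomial basis (assumption)
    w = np.full(x.size, 1.0 / x.size)              # start from uniform weights
    coef = np.zeros(degree + 1)
    for _ in range(iters):
        sw = np.sqrt(w)
        # Weighted least squares: minimize sum_j w_j * (f_j - p(x_j))^2.
        coef, *_ = np.linalg.lstsq(V * sw[:, None], f * sw, rcond=None)
        err = np.abs(f - V @ coef)
        w_new = w * err**beta                      # Lawson weight update
        total = w_new.sum()
        if total == 0.0:                           # exact fit, nothing to reweight
            break
        w = w_new / total                          # keep weights summing to one
    max_err = float(np.max(np.abs(f - V @ coef)))
    return coef, w, max_err

# Usage: degree-2 approximation of |x| on a uniform grid over [-1, 1].
x = np.linspace(-1.0, 1.0, 201)
coef, w, max_err = lawson_polyfit(x, np.abs(x), degree=2)
print(coef, max_err)
```

Weights at nodes with small error decay towards zero, so the surviving weights concentrate on the nodes where the error is largest, which is the behaviour the complementary-slackness statement in the abstract describes for the rational case.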
-
A Stochastic Simulation Method for Fractional Order Compartment Models
Authors:
Christopher N. Angstmann,
Stuart-James M. Burney,
Bruce I. Henry,
Daniel S. Han,
Byron A. Jacobs,
Zhuang Xu
Abstract:
Our study focuses on fractional order compartment models derived from underlying physical stochastic processes, providing a more physically grounded approach compared to models that use the dynamical system approach by simply replacing integer-order derivatives with fractional order derivatives. In these models, inherent stochasticity becomes important, particularly when dealing with the dynamics of small populations far from the continuum limit of large particle numbers. The necessity for stochastic simulations arises from deviations of the mean states from those obtained from the governing equations in these scenarios. To address this, we introduce an exact stochastic simulation algorithm designed for fractional order compartment models, based on a semi-Markov process. We have considered a fractional order resusceptibility SIS model and a fractional order recovery SIR model as illustrative examples, highlighting significant disparities between deterministic and stochastic dynamics when the total population is small. Beyond its modeling applications, the algorithm presented serves as a versatile tool for solving fractional order differential equations via Monte Carlo simulations.
Submitted 26 June, 2024; v1 submitted 7 December, 2023;
originally announced December 2023.
-
Policy Learning with Distributional Welfare
Authors:
Yifan Cui,
Sukjin Han
Abstract:
In this paper, we explore optimal treatment allocation policies that target distributional welfare. Most literature on treatment choice has considered utilitarian welfare based on the conditional average treatment effect (ATE). While average welfare is intuitive, it may yield undesirable allocations especially when individuals are heterogeneous (e.g., with outliers) - the very reason individualized treatments were introduced in the first place. This observation motivates us to propose an optimal policy that allocates the treatment based on the conditional quantile of individual treatment effects (QoTE). Depending on the choice of the quantile probability, this criterion can accommodate a policymaker who is either prudent or negligent. The challenge of identifying the QoTE lies in its requirement for knowledge of the joint distribution of the counterfactual outcomes, which is not generally point-identified. We introduce minimax policies that are robust to this model uncertainty. A range of identifying assumptions can be used to yield more informative policies. For both stochastic and deterministic policies, we establish the asymptotic bound on the regret of implementing the proposed policies. The framework can be generalized to any setting where welfare is defined as a functional of the joint distribution of the potential outcomes.
Submitted 29 April, 2025; v1 submitted 27 November, 2023;
originally announced November 2023.
-
Essential concepts of digital topology (digital $k$-covering spaces and pseudo $k$-covering spaces)
Authors:
Sang-Eon Han
Abstract:
The present paper focuses on the notions of covering spaces, pseudo-covering spaces, and their equivalences.
We discuss something incorrectly mentioned in Boxer's papers and correct them. Indeed, Sections 4-6 (or 4-6) of \cite{B3} are redundant because they have some incorrect assertions on $(k_1,k_2)$-covering spaces or pseudo- $(k_1,k_2)$-covering spaces due to his misunderstanding on Han's papers \cite{H14,H16}.
In addition, many things in \cite{B3} are duplicated with some results in \cite{H18}.
In addition, since the papers \cite{P1,P2} also have some defects, we correct and improve them.
Submitted 24 September, 2023;
originally announced September 2023.
-
Digital $k$-continuity, digital $k$-isomorphism, local $k$-isomorphism, radius $2$-local $k$-isomorphism, and digital $k$-homotopy
Authors:
Sang-Eon Han
Abstract:
The present paper refers to the notions of digital continuity, digital $k$-isomorphism, local $k$-isomorphism, radius $2$-local $k$-isomorphism, and digital $k$-homotopy motivated by the Khalimsky's version.
We discuss something incorrectly mentioned in Boxer's papers and suggest some accurate information.
Submitted 6 September, 2023;
originally announced September 2023.
-
Essential concepts of digital topology\\ (digital $k$-connectivity and $k$-adjacencis for digital products)
Authors:
Sang-Eon Han
Abstract:
The paper refers to several concepts which are essential to studying digital objects from the viewpoint of digital topology: digital $k$-connectivity or digital $k$-adjacency, $C$-compatible and normal $k$-adjacency for a digital product.
Since L. Boxer has often mentioned the origins of these concepts in an inaccurate way, we discuss something incorrectly cited or mentioned in Boxer's papers according to the facts.
Submitted 4 September, 2023;
originally announced September 2023.
-
Counting double cosets with application to generic 3-manifolds
Authors:
Suzhen Han,
Wenyuan Yang,
Yanqing Zou
Abstract:
We study the growth of double cosets in the class of groups with contracting elements, including relatively hyperbolic groups, CAT(0) groups and mapping class groups among others. Generalizing a recent work of Gitik and Rips about hyperbolic groups, we prove that the double coset growth of two Morse subgroups of infinite index is comparable with the orbital growth function. The same result is further obtained for a more general class of subgroups whose limit sets are proper subsets in the entire limit set of the ambient group.
As an application, we confirm a conjecture of Maher that hyperbolic 3-manifolds are exponentially generic in the set of 3-manifolds built from Heegaard splitting using complexity in Teichmüller metric.
Submitted 30 December, 2023; v1 submitted 12 July, 2023;
originally announced July 2023.
-
Differential operator realization of braid group action on $\imath$quantum groups
Authors:
Zhaobing Fan,
Jicheng Geng,
Shaolong Han
Abstract:
We construct a unique braid group action on modified $q$-Weyl algebra $\mathbf A_q(S)$. Under this action, we give a realization of the braid group action on quasi-split $\imath$quantum groups $^{\imath}\mathbf U(S)$ of type $\mathrm{AIII}$. Furthermore, we directly construct a unique braid group action on polynomial ring $\mathbb P$ which is compatible with the braid group action on $\mathbf A_q(S)$ and $^{\imath}\mathbf U(S)$.
Submitted 7 July, 2023;
originally announced July 2023.
-
Pseudocovering and digital covering spaces
Authors:
Sang-Eon Han
Abstract:
The notions of a local $(k_0,k_1)$-isomorphism and a weakly local $(k_0,k_1)$-isomorphism play crucial roles in developing a digital $(k_0,k_1)$-covering space and a pseudo-$(k_0,k_1)$-covering space, respectively. In relation to the study of pseudo-$(k_0,k_1)$-covering spaces, since some works in the literature need to be refined and improved, the recent paper \cite{H10} improved them and corrected some mistakes that occurred in the literature. One of the important points is that the notion of a pseudo-$(k_0,k_1)$-covering map in \cite{H6,H9} was broadened in \cite{H10}. This new version is proved to be equivalent to a weakly local $(k_0,k_1)$-isomorphic surjection \cite{H10}. The present paper contains some of the work in \cite{H10}, and we only deal with $k$-connected digital images $(X, k)$.
Submitted 1 June, 2023;
originally announced June 2023.
-
Ideals and continuity for quantaloid-enriched categories
Authors:
Min Liu,
Shengwei Han,
Isar Stubbe
Abstract:
We study ideals in, and continuity of, quantaloid-enriched categories (Q-categories for short) as a 'many-valued and many-typed' generalization of domain theory. Abstractly, for any (saturated) class Phi of presheaves, we define and study the Phi-continuity of Q-categories. Concretely, we compute three examples of such saturated classes of presheaves - the class of flat ideals, the class of irreducible ideals and the class of conical ideals - which are proper generalizations of ideals in domain theory.
Submitted 27 April, 2023;
originally announced April 2023.
-
Solving Strongly Convex and Smooth Stackelberg Games Without Modeling the Follower
Authors:
Yansong Li,
Shuo Han
Abstract:
Stackelberg games have been widely used to model interactive decision-making problems in a variety of domains such as energy systems, transportation, cybersecurity, and human-robot interaction. However, existing algorithms for solving Stackelberg games often require knowledge of the follower's cost function or learning dynamics and may also require the follower to provide an exact best response, which can be difficult to obtain in practice. To circumvent this difficulty, we develop an algorithm that does not require knowledge of the follower's cost function or an exact best response, making it more applicable to real-world scenarios. Specifically, our algorithm only requires the follower to provide an approximately optimal action in response to the leader's action. The inexact best response is used in computing an approximate gradient of the leader's objective function, with which zeroth-order bilevel optimization can be applied to obtain an optimal action for the leader. Our algorithm is proved to converge at a linear rate to a neighborhood of the optimal point when the leader's cost function under the follower's best response is strongly convex and smooth.
Submitted 10 March, 2023;
originally announced March 2023.
-
Approaching epidemiological dynamics of COVID-19 with physics-informed neural networks
Authors:
Shuai Han,
Lukas Stelz,
Horst Stoecker,
Lingxiao Wang,
Kai Zhou
Abstract:
A physics-informed neural network (PINN) embedded with the susceptible-infected-removed (SIR) model is devised to understand the temporal evolution dynamics of infectious diseases. Firstly, the effectiveness of this approach is demonstrated on synthetic data as generated from the numerical solution of the susceptible-asymptomatic-infected-recovered-dead (SAIRD) model. Then, the method is applied to COVID-19 data reported for Germany and shows that it can accurately identify and predict virus spread trends. The results indicate that an incomplete physics-informed model can approach more complicated dynamics efficiently. Thus, the present work demonstrates the high potential of using machine learning methods, e.g., PINNs, to study and predict epidemic dynamics in combination with compartmental models.
Submitted 20 February, 2023; v1 submitted 17 February, 2023;
originally announced February 2023.
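For reference, the susceptible-infected-removed (SIR) model named in this abstract is usually written in the following textbook form. The symbols are standard notation assumed here, not taken from the paper: $β$ is the transmission rate, $γ$ the removal rate, and $N = S + I + R$ the constant total population.

$$\frac{dS}{dt} = -\frac{β S I}{N}, \qquad \frac{dI}{dt} = \frac{β S I}{N} - γ I, \qquad \frac{dR}{dt} = γ I.$$

Summing the three equations gives $\frac{d}{dt}(S + I + R) = 0$, so $N$ is conserved. A physics-informed network would typically penalize the residuals of such equations alongside the data misfit; how the paper weights or parameterizes these terms is not stated in the abstract.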
-
Data-Driven Distributionally Robust Electric Vehicle Balancing for Autonomous Mobility-on-Demand Systems under Demand and Supply Uncertainties
Authors:
Sihong He,
Zhili Zhang,
Shuo Han,
Lynn Pepin,
Guang Wang,
Desheng Zhang,
John Stankovic,
Fei Miao
Abstract:
Electric vehicles (EVs) are being rapidly adopted due to their economic and societal benefits. Autonomous mobility-on-demand (AMoD) systems also embrace this trend. However, the long charging time and high recharging frequency of EVs pose challenges to efficiently managing EV AMoD systems. The complicated dynamic charging and mobility process of EV AMoD systems makes the demand and supply uncertainties significant when designing vehicle balancing algorithms. In this work, we design a data-driven distributionally robust optimization (DRO) approach to balance EVs for both the mobility service and the charging process. The optimization goal is to minimize the worst-case expected cost under both passenger mobility demand uncertainties and EV supply uncertainties. We then propose a novel distributional uncertainty sets construction algorithm that guarantees the produced parameters are contained in desired confidence regions with a given probability. To solve the proposed DRO AMoD EV balancing problem, we derive an equivalent computationally tractable convex optimization problem. Based on real-world EV data of a taxi system, we show that with our solution the average total balancing cost is reduced by 14.49%, and the average mobility fairness and charging fairness are improved by 15.78% and 34.51%, respectively, compared to solutions that do not consider uncertainties.
Submitted 24 November, 2022;
originally announced November 2022.
-
Young wall construction of level-1 highest weight crystals over $U_q(D_4^{(3)})$ and $U_q(G_2^{(1)})$
Authors:
Zhaobing Fan,
Shaolong Han,
Seok-Jin Kang,
Yong-Su Shin
Abstract:
With the help of path realization and affine energy function, we give a Young wall construction of level-1 highest weight crystals $B(λ)$ over $U_{q}(G_{2}^{(1)})$ and $U_{q}(D_{4}^{(3)})$. Our construction is based on four different shapes of colored blocks, $\mathbf O$-block, $\mathbf I$-block, $\mathbf L$-block and $\mathbf{LL}$-block, obtained by cutting the unit cube in three different ways.
Submitted 25 February, 2023; v1 submitted 13 November, 2022;
originally announced November 2022.
-
High Order Schemes for Gradient Flow with Respect to a Metric
Authors:
Saem Han,
Selim Esedoglu,
Krishna Garikipati
Abstract:
New criteria for energy stability of multi-step, multi-stage, and mixed schemes are introduced in the context of evolution equations that arise as gradient flow with respect to a metric. These criteria are used to exhibit second and third order consistent, energy stable schemes, which are then demonstrated on several partial differential equations that arise as gradient flow with respect to the 2-Wasserstein metric.
Submitted 5 October, 2023; v1 submitted 13 November, 2022;
originally announced November 2022.
-
Crystal bases and canonical bases for quantum Borcherds-Bozec algebras
Authors:
Zhaobing Fan,
Shaolong Han,
Seok-Jin Kang,
Young Rock Kim
Abstract:
Let $U_{q}^{-}(\mathfrak g)$ be the negative half of a quantum Borcherds-Bozec algebra $U_{q}(\mathfrak g)$ and $V(λ)$ be the irreducible highest weight module with $λ\in P^{+}$. In this paper, we investigate the structures and properties of crystal bases and canonical bases of $U_{q}^{-}(\mathfrak g)$ and $V(λ)$, and the close connections between them. We first re-construct crystal basis theory with modified Kashiwara operators. While going through Kashiwara's grand-loop argument, we prove several important lemmas, which play crucial roles in the later developments of the paper. Next, based on the theory of canonical bases on quantum Borcherds-Bozec algebras, we introduce the notion of primitive canonical bases and prove that primitive canonical bases coincide with lower global bases.
Submitted 31 March, 2024; v1 submitted 5 November, 2022;
originally announced November 2022.
-
Reflection of Thought: Inversely Eliciting Numerical Reasoning in Language Models via Solving Linear Systems
Authors:
Fan Zhou,
Haoyu Dong,
Qian Liu,
Zhoujun Cheng,
Shi Han,
Dongmei Zhang
Abstract:
Numerical reasoning over natural language has been a long-standing goal for the research community. However, cutting-edge language models have struggled to generalize reliably to a broad range of numbers, although they have shown proficiency in reasoning over common and simple numbers. In this paper, we propose a novel method to elicit and exploit the numerical reasoning knowledge hidden in pre-trained language models using simple anchor numbers. Concretely, we first leverage simple numbers as anchors to probe the implicitly inferred arithmetic expressions from language models, and then explicitly apply the expressions to complex numbers to get the corresponding answers. To inversely elicit arithmetic expressions, we transform and formulate the task as an analytically solvable linear system. Experimental results on several numerical reasoning benchmarks demonstrate that our approach significantly improves the numerical reasoning capabilities of existing LMs. More importantly, our approach is training-free and works purely in the inference phase, making it highly portable and achieving consistent performance benefits across a variety of language models (GPT-3, T5, BART, etc.) in zero-shot, few-shot, and fine-tuning scenarios.
Submitted 10 October, 2022;
originally announced October 2022.
-
Convex Submodular Minimization with Indicator Variables
Authors:
Shaoning Han,
Andrés Gómez
Abstract:
We study a general class of convex submodular optimization problems with indicator variables. Many applications such as the problem of inferring Markov random fields (MRFs) with a sparsity or robustness prior can be naturally modeled in this form. We show that these problems can be reduced to binary submodular minimization problems, possibly after a suitable reformulation, and thus are strongly polynomially solvable. Furthermore, we develop a parametric approach for computing the associated extreme bases under certain smoothness conditions. This leads to a fast solution method, whose efficiency is demonstrated through numerical experiments.
Submitted 7 July, 2025; v1 submitted 27 September, 2022;
originally announced September 2022.
-
Large-time behavior of composite waves of viscous shocks for the barotropic Navier-Stokes equations
Authors:
Sungho Han,
Moon-Jin Kang,
Jeongho Kim
Abstract:
We study the large-time behavior of the 1D barotropic Navier-Stokes flow perturbed from Riemann data generating a composition of two shock waves with small amplitudes. We prove that the perturbed Navier-Stokes flow converges, uniformly in space, towards a composition of two viscous shock waves as time goes to infinity, up to dynamical shifts. In particular, the strengths of the two waves can be chosen independently. This is the first result for the convergence to a composite wave of two viscous shocks with independently small amplitudes.
Submitted 13 July, 2022;
originally announced July 2022.
-
Synthesizing Attack-Aware Control and Active Sensing Strategies under Reactive Sensor Attacks
Authors:
Sumukha Udupa,
Abhishek N. Kulkarni,
Shuo Han,
Nandi O. Leslie,
Charles A. Kamhoua,
Jie Fu
Abstract:
We consider the probabilistic planning problem for a defender (P1) who can jointly query the sensors and take control actions to reach a set of goal states while being aware of possible sensor attacks by an adversary (P2) who has perfect observations. To synthesize a provably-correct, attack-aware joint control and active sensing strategy for P1, we construct a stochastic game on a graph with augmented states that include the actual game state (known only to the attacker) and the belief of the defender about the game state (constructed by the attacker based on his knowledge of the defender's observations). We present an algorithm to compute a belief-based, randomized strategy for P1 that satisfies the reachability objective with probability one under the worst-case sensor attack carried out by an informed P2. We prove the correctness of the algorithm and illustrate it with an example.
Submitted 29 November, 2022; v1 submitted 28 March, 2022;
originally announced April 2022.
-
Accelerating Model-Free Policy Optimization Using Model-Based Gradient: A Composite Optimization Perspective
Authors:
Yansong Li,
Shuo Han
Abstract:
We develop an algorithm that combines model-based and model-free methods for solving a nonlinear optimal control problem with a quadratic cost in which the system model is given by a linear state-space model with a small additive nonlinear perturbation. We decompose the cost into a sum of two functions, one having an explicit form obtained from the approximate linear model, the other being a black-box model representing the unknown modeling error. The decomposition allows us to formulate the problem as a composite optimization problem. To solve the optimization problem, our algorithm performs gradient descent using the gradient obtained from the approximate linear model until backtracking line search fails, upon which the model-based gradient is compared with the exact gradient obtained from a model-free algorithm. The difference between the model gradient and the exact gradient is then used for compensating future gradient-based updates. Our algorithm is shown to decrease the number of function evaluations compared with traditional model-free methods both in theory and in practice.
Submitted 21 March, 2022;
originally announced March 2022.