-
Blowup of solutions for compressible viscoelastic fluid
Authors:
Sébastien Boyaval,
Na Wang,
Yuxi Hu
Abstract:
We prove finite-time blowup of classical solutions for the compressible Upper Convective Maxwell (UCM) viscoelastic fluid system. By establishing a key energy identity and adapting Sideris' method for compressible flows, we derive a Riccati-type inequality for a momentum functional. For initial data with compactly supported perturbations satisfying a sufficiently large condition, all classical sol…
▽ More
We prove finite-time blowup of classical solutions for the compressible Upper Convective Maxwell (UCM) viscoelastic fluid system. By establishing a key energy identity and adapting Sideris' method for compressible flows, we derive a Riccati-type inequality for a momentum functional. For initial data with compactly supported perturbations satisfying a sufficiently large condition, all classical solutions lose regularity in finite time. This constitutes the first rigorous blowup result for multidimensional compressible viscoelastic fluids.
△ Less
Submitted 5 September, 2025;
originally announced September 2025.
-
Generalized Golub-Kahan bidiagonalization for generalized saddle point systems
Authors:
Na-Na Wang,
Ji-Cheng Li
Abstract:
We consider the iterative solution of generalized saddle point systems. When the right bottom block is zero, Arioli [SIAM J. Matrix Anal. Appl., 34 (2013), pp. 571--592] proposed a CRAIG algorithm based on generalized Golub-Kahan Bidiagonalization (GKB) for the augmented systems with the leading block being symmetric and positive definite (SPD), and then Dumitrasc et al. [SIAM J. Matrix Anal. Appl…
▽ More
We consider the iterative solution of generalized saddle point systems. When the right bottom block is zero, Arioli [SIAM J. Matrix Anal. Appl., 34 (2013), pp. 571--592] proposed a CRAIG algorithm based on generalized Golub-Kahan Bidiagonalization (GKB) for the augmented systems with the leading block being symmetric and positive definite (SPD), and then Dumitrasc et al. [SIAM J. Matrix Anal. Appl., 46 (2025), pp. 370--392] extended the GKB for the case where the symmetry condition of the leading block no longer holds and then proposed nonsymmetric version of the CRAIG (nsCRAIG) algorithm. The CRAIG and nsCRAIG algorithms are theoretically equivalent to the Schur complement reduction (SCR) methods where the Conjugate Gradient (CG) method and the Full Orthogonalization Method (FOM) are applied to the associated Schur-complement equation, respectively. We extend the GKB and its nonsymmetric counterpart used separately in CRAIG and nsCRAIG algorithms for the case where the right bottom block of saddle point system is nonzero. On this basis, we propose CRAIG and nsCRAIG algorithms for the solution of the generalized saddle point problems with the leading block being SPD and nonsymmetric positive definite (NSPD), respectively. They are also theoretically equivalent to the SCR methods with inner CG and FOM iterations for the associated Schur-complement equation, respectively. Moreover, we give algorithm steps of the two new solvers and propose appropriate stopping criteria based on an estimate of the energy norm for the error and the residual norm. Numerical comparison with MINRES or GMRES highlights the advantages of our proposed strategies regarding its high computational efficiency and/or low memory requirements and the associated implications.
△ Less
Submitted 2 September, 2025;
originally announced September 2025.
-
Compressed Decentralized Momentum Stochastic Gradient Methods for Nonconvex Optimization
Authors:
Wei Liu,
Anweshit Panda,
Ujwal Pandey,
Christopher Brissette,
Yikang Shen,
George M. Slota,
Naigang Wang,
Jie Chen,
Yangyang Xu
Abstract:
In this paper, we design two compressed decentralized algorithms for solving nonconvex stochastic optimization under two different scenarios. Both algorithms adopt a momentum technique to achieve fast convergence and a message-compression technique to save communication costs. Though momentum acceleration and compressed communication have been used in literature, it is highly nontrivial to theoret…
▽ More
In this paper, we design two compressed decentralized algorithms for solving nonconvex stochastic optimization under two different scenarios. Both algorithms adopt a momentum technique to achieve fast convergence and a message-compression technique to save communication costs. Though momentum acceleration and compressed communication have been used in literature, it is highly nontrivial to theoretically prove the effectiveness of their composition in a decentralized algorithm that can maintain the benefits of both sides, because of the need to simultaneously control the consensus error, the compression error, and the bias from the momentum gradient.
For the scenario where gradients are bounded, our proposal is a compressed decentralized adaptive method. To the best of our knowledge, this is the first decentralized adaptive stochastic gradient method with compressed communication. For the scenario of data heterogeneity without bounded gradients, our proposal is a compressed decentralized heavy-ball method, which applies a gradient tracking technique to address the challenge of data heterogeneity. Notably, both methods achieve an optimal convergence rate, and they can achieve linear speed up and adopt topology-independent algorithmic parameters within a certain regime of the user-specified error tolerance. Superior empirical performance is observed over state-of-the-art methods on training deep neural networks (DNNs) and Transformers.
△ Less
Submitted 6 August, 2025;
originally announced August 2025.
-
Complexity Analysis of Convex Majorization Schemes for Nonconvex Constrained Optimization
Authors:
Nuozhou Wang,
Junyu Zhang,
Shuzhong Zhang
Abstract:
We introduce and study various algorithms for solving nonconvex minimization with inequality constraints, based on the construction of convex surrogate envelopes that majorize the objective and the constraints. In the case where the objective and constraint functions are gradient Hölderian continuous, the surrogate functions can be readily constructed and the solution method can be efficiently imp…
▽ More
We introduce and study various algorithms for solving nonconvex minimization with inequality constraints, based on the construction of convex surrogate envelopes that majorize the objective and the constraints. In the case where the objective and constraint functions are gradient Hölderian continuous, the surrogate functions can be readily constructed and the solution method can be efficiently implemented. The surrogate envelopes are extended to the settings where the second-order information is available, and the convex subproblems are further represented by Dikin ellipsoids using the self-concordance of the convex surrogate constraints. Iteration complexities have been developed for both convex and nonconvex optimization models. The numerical results show promising potential of the proposed approaches.
△ Less
Submitted 10 June, 2025;
originally announced June 2025.
-
DiaBlo: Diagonal Blocks Are Sufficient For Finetuning
Authors:
Selcuk Gurses,
Aozhong Zhang,
Yanxia Deng,
Xun Dong,
Xin Li,
Naigang Wang,
Penghang Yin,
Zi Yang
Abstract:
Finetuning is a critical step for adapting large language models (LLMs) to domain-specific downstream tasks. To mitigate the substantial computational and memory costs of full-model fine-tuning, Parameter-Efficient Finetuning (PEFT) methods have been proposed to update only a small subset of model parameters. However, performance gaps between PEFT approaches and full-model fine-tuning still exist.…
▽ More
Finetuning is a critical step for adapting large language models (LLMs) to domain-specific downstream tasks. To mitigate the substantial computational and memory costs of full-model fine-tuning, Parameter-Efficient Finetuning (PEFT) methods have been proposed to update only a small subset of model parameters. However, performance gaps between PEFT approaches and full-model fine-tuning still exist. In this work, we present DiaBlo, a simple yet effective PEFT approach that updates only the diagonal blocks of selected model weight matrices. Unlike Low Rank Adaptation (LoRA) and its variants, DiaBlo eliminates the need for low rank matrix products, thereby avoiding the reliance on auxiliary initialization schemes or customized optimization strategies to improve convergence. This design leads to stable and robust convergence while maintaining comparable memory efficiency and training speed to LoRA. We conduct extensive experiments across a range of tasks, including commonsense reasoning, arithmetic reasoning, code generation, and safety alignment, to evaluate the effectiveness and efficiency of DiaBlo. Across these benchmarks, DiaBlo demonstrates strong and consistent performance while maintaining high memory efficiency and fast finetuning speed. Codes are available at https://github.com/ziyangjoy/DiaBlo.
△ Less
Submitted 3 June, 2025;
originally announced June 2025.
-
Vintage-Based Formulations in Multi-Year Investment Modelling for Energy Systems
Authors:
Ni Wang,
Germán Morales-España
Abstract:
This paper reviews two established formulations for modelling multi-year energy investments: the simple method, which aggregates all capacity regardless of commissioning year, and the vintage method, which explicitly tracks investments by year to capture differences in technical parameters over time. While the vintage method improves modelling fidelity, it significantly increases model size. To ad…
▽ More
This paper reviews two established formulations for modelling multi-year energy investments: the simple method, which aggregates all capacity regardless of commissioning year, and the vintage method, which explicitly tracks investments by year to capture differences in technical parameters over time. While the vintage method improves modelling fidelity, it significantly increases model size. To address this, we propose a novel compact formulation that maintains the ability to represent year-specific characteristics while reducing the dimensionality of the model. The proposed compact formulation is implemented in the open-source model TulipaEnergyModel.jl and offers a tractable alternative for detailed long-term energy system planning.
△ Less
Submitted 1 May, 2025;
originally announced May 2025.
-
Discounting Approaches in Multi-Year Investment Modelling for Energy Systems
Authors:
Ni Wang,
Diego A. Tejada-Arango
Abstract:
This paper reviews discounting approaches for modeling multi-year energy investments, focusing on total versus annualised cost formulations. We discuss how time value of money is handled, and how salvage value and milestone-year weighting can address mismatches between asset lifetimes and model horizons. These methods are implemented in the open-source TulipaEnergyModel to support transparent and…
▽ More
This paper reviews discounting approaches for modeling multi-year energy investments, focusing on total versus annualised cost formulations. We discuss how time value of money is handled, and how salvage value and milestone-year weighting can address mismatches between asset lifetimes and model horizons. These methods are implemented in the open-source TulipaEnergyModel to support transparent and tractable long-term energy system planning.
△ Less
Submitted 30 April, 2025;
originally announced April 2025.
-
A sharp-interface approach for simulating solid-state dewetting of thin films with double-bubble structure
Authors:
Meng Li,
Nan Wang,
Ruofan Zhao,
Chunjie Zhou
Abstract:
We develop a sharp-interface model for solid-state dewetting of double-bubble thin films using an energy variational approach based on a newly proposed interfacial energy. This model characterizes the dynamic evolution of interfaces in double-bubble thin films, a process primarily governed by surface diffusion and junction/contact points migration, and fundamentally distinct from the behavior obse…
▽ More
We develop a sharp-interface model for solid-state dewetting of double-bubble thin films using an energy variational approach based on a newly proposed interfacial energy. This model characterizes the dynamic evolution of interfaces in double-bubble thin films, a process primarily governed by surface diffusion and junction/contact points migration, and fundamentally distinct from the behavior observed in a single thin film. Subsequently, a structure-preserving parametric finite element approximation is developed for the sharp-interface model, which can preserve both area conservation and energy stability. Extensive numerical experiments are presented to demonstrate the convergence, structure-preserving properties, and superior mesh quality of the proposed method. Additionally, we investigate several specific evolution processes, including the equilibrium shapes of double-bubble thin films and the pinch-off dynamics of long islands.
△ Less
Submitted 4 March, 2025; v1 submitted 2 March, 2025;
originally announced March 2025.
-
Novel Optimization Techniques for Parameter Estimation
Authors:
Chenyu Wu,
Nuozhou Wang,
Casey Garner,
Kevin Leder,
Shuzhong Zhang
Abstract:
In this paper, we introduce a new optimization algorithm that is well suited for solving parameter estimation problems. We call our new method cubic regularized Newton with affine scaling (CRNAS). In contrast to so-called first-order methods which rely solely on the gradient of the objective function, our method utilizes the Hessian of the objective. As a result it is able to focus on points satis…
▽ More
In this paper, we introduce a new optimization algorithm that is well suited for solving parameter estimation problems. We call our new method cubic regularized Newton with affine scaling (CRNAS). In contrast to so-called first-order methods which rely solely on the gradient of the objective function, our method utilizes the Hessian of the objective. As a result it is able to focus on points satisfying the second-order optimality conditions, as opposed to first-order methods that simply converge to critical points. This is an important feature in parameter estimation problems where the objective function is often non-convex and as a result there can be many critical points making it is near impossible to identify the global minimum. An important feature of parameter estimation in mathematical models of biological systems is that the parameters are constrained by either physical constraints or prior knowledge. We use an affine scaling approach to handle a wide class of constraints. We establish that CRNAS identifies a point satisfying $ε$-approximate second-order optimality conditions within $O(ε^{-3/2})$ iterations. Finally, we compare CRNAS with MATLAB's optimization solver fmincon on three different test problems. These test problems all feature mixtures of heterogeneous populations, a problem setting that CRNAS is particularly well-suited for. Our numerical simulations show CRNAS has favorable performance, performing comparable if not better than fmincon in accuracy and computational cost for most of our examples.
△ Less
Submitted 4 July, 2024;
originally announced July 2024.
-
Field Knowledge as a Dual to Distributed Knowledge: A Characterization by Weighted Modal Logic
Authors:
Xiaolong Liang,
Yì N. Wáng
Abstract:
The study of group knowledge concepts such as mutual, common, and distributed knowledge is well established within the discipline of epistemic logic. In this work, we incorporate epistemic abilities of agents to refine the formal definition of distributed knowledge and introduce a formal characterization of field knowledge. We propose that field knowledge serves as a dual to distributed knowledge.…
▽ More
The study of group knowledge concepts such as mutual, common, and distributed knowledge is well established within the discipline of epistemic logic. In this work, we incorporate epistemic abilities of agents to refine the formal definition of distributed knowledge and introduce a formal characterization of field knowledge. We propose that field knowledge serves as a dual to distributed knowledge. Our approach utilizes epistemic logics with various group knowledge constructs, interpreted through weighted models. We delve into the eight logics that stem from these considerations, explore their relative expressivity and develop sound and complete axiomatic systems.
△ Less
Submitted 30 June, 2024;
originally announced July 2024.
-
Global solutions and uniform convergence stability for compressible Navier-Stokes equations with oldroyd-type constitutive law
Authors:
Na Wang,
Sébastien Boyaval,
Yuxi Hu
Abstract:
We consider one dimensional isentropic compressible Navier-Stokes equations with Oldroyd-type constitutive law. By establishing uniform a priori estimates (with respect to relaxation time), we show global existence of smooth solutions with small initial data. Moreover, we get global-in-time convergence of the system towards the classical isentropic compressible Navier-Stokes equations.
We consider one dimensional isentropic compressible Navier-Stokes equations with Oldroyd-type constitutive law. By establishing uniform a priori estimates (with respect to relaxation time), we show global existence of smooth solutions with small initial data. Moreover, we get global-in-time convergence of the system towards the classical isentropic compressible Navier-Stokes equations.
△ Less
Submitted 17 September, 2025; v1 submitted 21 June, 2024;
originally announced June 2024.
-
Structure-preserving weighted BDF2 methods for Anisotropic Cahn-Hilliard model: uniform/variable-time-steps
Authors:
Meng Li,
Jingjiang Bi,
Nan Wang
Abstract:
In this paper, we innovatively develop uniform/variable-time-step weighted and shifted BDF2 (WSBDF2) methods for the anisotropic Cahn-Hilliard (CH) model, combining the scalar auxiliary variable (SAV) approach with two types of stabilized techniques. Using the concept of $G$-stability, the uniform-time-step WSBDF2 method is theoretically proved to be energy-stable. Due to the inapplicability of th…
▽ More
In this paper, we innovatively develop uniform/variable-time-step weighted and shifted BDF2 (WSBDF2) methods for the anisotropic Cahn-Hilliard (CH) model, combining the scalar auxiliary variable (SAV) approach with two types of stabilized techniques. Using the concept of $G$-stability, the uniform-time-step WSBDF2 method is theoretically proved to be energy-stable. Due to the inapplicability of the relevant G-stability properties, another technique is adopted in this work to demonstrate the energy stability of the variable-time-step WSBDF2 method. In addition, the two numerical schemes are all mass-conservative.Finally, numerous numerical simulations are presented to demonstrate the stability and accuracy of these schemes.
△ Less
Submitted 15 June, 2024; v1 submitted 20 April, 2024;
originally announced April 2024.
-
Structure-preserving, weighted implicit-explicit schemes for multi-phase incompressible Navier-Stokes/Darcy coupled nonlocal Allen-Cahn model
Authors:
Meng Li,
Ke Wang,
Nan Wang
Abstract:
A multitude of substances exist as mixtures comprising multiple chemical components in the natural world. These substances undergo morphological changes under external influences. the phase field model coupled with fluid flow, the dynamic movement and evolution of the phase interface intricately interact with the fluid motion. This article focuses on the N-component models that couple the conserva…
▽ More
A multitude of substances exist as mixtures comprising multiple chemical components in the natural world. These substances undergo morphological changes under external influences. the phase field model coupled with fluid flow, the dynamic movement and evolution of the phase interface intricately interact with the fluid motion. This article focuses on the N-component models that couple the conservative Allen-Cahn equation with two types of incompressible fluid flow systems: the Navier-Stokes equation and the Darcy equation. By utilizing the scalar auxiliary variable method and the projection method, we innovatively construct two types of structure-preserving weighted implicit-explicit schemes for the coupled models, resulting in fully decoupled linear systems and second-order accuracy in time. The schemes are proved to be mass-conservative. In addition, with the application of $G$-norm inspired by the idea of $G$-stability, we rigorously establish its unconditional energy stability. Finally, the performance of the proposed scheme is verified by some numerical simulations.
△ Less
Submitted 20 March, 2024;
originally announced March 2024.
-
Epistemic Logic over Similarity Graphs: Common, Distributed and Mutual Knowledge
Authors:
Xiaolong Liang,
Yì N. Wáng
Abstract:
In this paper, we delve into the study of epistemic logics, interpreted through similarity models based on weighted graphs. We explore eight languages that extend the traditional epistemic language by incorporating modalities of common, distributed, and mutual knowledge. The concept of individual knowledge is redefined under these similarity models. It is no longer just a matter of personal knowle…
▽ More
In this paper, we delve into the study of epistemic logics, interpreted through similarity models based on weighted graphs. We explore eight languages that extend the traditional epistemic language by incorporating modalities of common, distributed, and mutual knowledge. The concept of individual knowledge is redefined under these similarity models. It is no longer just a matter of personal knowledge, but is now enriched and understood as knowledge under the individual's epistemic ability. Common knowledge is presented as higher-order knowledge that is universally known to any degree, a definition that aligns with existing literature. We reframe distributed knowledge as a form of knowledge acquired by collectively leveraging the abilities of a group of agents. In contrast, mutual knowledge is defined as the knowledge obtained through the shared abilities of a group. We then focus on the resulting logics, examining their relative expressivity, semantic correspondence to the classical epistemic logic, proof systems and the computational complexity associated with the model checking problem and the satisfiability/validity problem. This paper offers significant insights into the logical analysis and understanding of these enriched forms of knowledge, contributing to the broader discourse on epistemic logic.
△ Less
Submitted 30 September, 2023;
originally announced October 2023.
-
Tulipa Energy Model: Mathematical Formulation
Authors:
Diego A. Tejada-Arango,
Germán Morales-España,
Lauren Clisby,
Ni Wang,
Abel S. Siqueira,
Ali Subayu,
Laurent Soucasse,
Zhi Gao
Abstract:
Tulipa Energy Model aims to optimise the investment and operation of the electricity market, considering its coupling with other sectors, such as hydrogen and heat, that can also be electrified. The problem is analysed from the perspective of a central planner who determines the expansion plan that is most beneficial for the system as a whole, either by maximising social welfare or by minimising t…
▽ More
Tulipa Energy Model aims to optimise the investment and operation of the electricity market, considering its coupling with other sectors, such as hydrogen and heat, that can also be electrified. The problem is analysed from the perspective of a central planner who determines the expansion plan that is most beneficial for the system as a whole, either by maximising social welfare or by minimising total costs. The formulation provides a general description of the objective function and constraints in the optimisation model based on the concept of energy assets representing any element in the model. The model uses subsets and specific methods to determine the constraints that apply to a particular technology or network, allowing more flexibility in the code to consider new technologies and constraints with different levels of detail in the future.
△ Less
Submitted 14 September, 2023;
originally announced September 2023.
-
Statistical analysis for a penalized EM algorithm in high-dimensional mixture linear regression model
Authors:
Ning Wang,
Xin Zhang,
Qing Mai
Abstract:
The expectation-maximization (EM) algorithm and its variants are widely used in statistics. In high-dimensional mixture linear regression, the model is assumed to be a finite mixture of linear regression and the number of predictors is much larger than the sample size. The standard EM algorithm, which attempts to find the maximum likelihood estimator, becomes infeasible for such model. We devise a…
▽ More
The expectation-maximization (EM) algorithm and its variants are widely used in statistics. In high-dimensional mixture linear regression, the model is assumed to be a finite mixture of linear regression and the number of predictors is much larger than the sample size. The standard EM algorithm, which attempts to find the maximum likelihood estimator, becomes infeasible for such model. We devise a group lasso penalized EM algorithm and study its statistical properties. Existing theoretical results of regularized EM algorithms often rely on dividing the sample into many independent batches and employing a fresh batch of sample in each iteration of the algorithm. Our algorithm and theoretical analysis do not require sample-splitting, and can be extended to multivariate response cases. The proposed methods also have encouraging performances in numerical studies.
△ Less
Submitted 21 July, 2023;
originally announced July 2023.
-
Geometric Extended State Observer on SE(3) with Fast Finite-Time Stability: Theory and Validation on a Rotorcraft Aerial Vehicle
Authors:
Ningshan Wang,
Reza Hamrah,
Amit K. Sanyal,
Mark N. Glauser
Abstract:
This article presents an extended state observer for vehicle modeled as a rigid body in three-dimensional translational and rotational motions. The extended state observer is applicable to a rotorcraft aerial vehicle with a fixed plane of rotors, modeled as an under-actuated system on the tangent bundle of the six-dimensional Lie group of rigid body motions, SE(3). The extended state observer is d…
▽ More
This article presents an extended state observer for vehicle modeled as a rigid body in three-dimensional translational and rotational motions. The extended state observer is applicable to a rotorcraft aerial vehicle with a fixed plane of rotors, modeled as an under-actuated system on the tangent bundle of the six-dimensional Lie group of rigid body motions, SE(3). The extended state observer is designed to estimate the resultant external disturbance force and disturbance torque acting on the vehicle. It guarantees stable convergence of disturbance estimation errors in finite time when the disturbances are constant and finite time convergence to a bounded neighborhood of zero errors for time-varying disturbances. This extended state observer design is based on a Hölder-continuous fast finite time stable differentiator that is similar to the super-twisting algorithm, to obtain fast convergence. Numerical simulations are conducted to validate the proposed extended state observer. The proposed extended state observer is compared with other existing research to show its advantages. A set of experimental results implementing disturbance rejection control using feedback of disturbance estimates from the extended state observer is also presented.
△ Less
Submitted 17 July, 2023;
originally announced July 2023.
-
Optimal Hypothesis Testing Based on Information Theory
Authors:
Dazhuan Xu,
Nan Wang
Abstract:
There has a major problem in the current theory of hypothesis testing in which no unified indicator to evaluate the goodness of various test methods since the cost function or utility function usually relies on the specific application scenario, resulting in no optimal hypothesis testing method. In this paper, the problem of optimal hypothesis testing is investigated based on information theory. W…
▽ More
There has a major problem in the current theory of hypothesis testing in which no unified indicator to evaluate the goodness of various test methods since the cost function or utility function usually relies on the specific application scenario, resulting in no optimal hypothesis testing method. In this paper, the problem of optimal hypothesis testing is investigated based on information theory. We propose an information-theoretic framework of hypothesis testing consisting of five parts: test information (TI) is proposed to evaluate the hypothesis testing, which depends on the a posteriori probability distribution function of hypotheses and independent of specific test methods; accuracy with the unit of bit is proposed to evaluate the degree of validity of specific test methods; the sampling a posteriori (SAP) probability test method is presented, which makes stochastic selections on the hypotheses according to the a posteriori probability distribution of the hypotheses; the probability of test failure is defined to reflect the probability of the failed decision is made; test theorem is proved that all accuracy lower than the TI is achievable. Specifically, for every accuracy lower than TI, there exists a test method with the probability of test failure tending to zero. Conversely, there is no test method whose accuracy is more than TI. Numerical simulations are performed to demonstrate that the SAP test is asymptotically optimal. In addition, the results show that the accuracy of the SAP test and the existing test methods, such as the maximum a posteriori probability, expected a posteriori probability, and median a posteriori probability tests, are not more than TI.
△ Less
Submitted 15 June, 2023; v1 submitted 15 June, 2023;
originally announced June 2023.
-
Geometric Active Disturbance Rejection Control of Rotorcraft on $SE(3)$ with Fast Finite-Time Stability
Authors:
Ningshan Wang,
Reza Hamrah,
Amit K. Sanyal,
Mark N. Glauser
Abstract:
This article presents a tracking control framework enhanced by an extended state observer for a rotorcraft aerial vehicle modeled as a rigid body in three-dimensional translational and rotational motions. The system is considered as an underactuated system on the tangent bundle of the six-dimensional Lie group of rigid body motions, $SE(3)$. The extended state observer is designed to estimate the…
▽ More
This article presents a tracking control framework enhanced by an extended state observer for a rotorcraft aerial vehicle modeled as a rigid body in three-dimensional translational and rotational motions. The system is considered as an underactuated system on the tangent bundle of the six-dimensional Lie group of rigid body motions, $SE(3)$. The extended state observer is designed to estimate the resultant external disturbance force and disturbance torque acting on the vehicle. It guarantees stable convergence of disturbance estimation errors in finite time when the disturbances are constant and finite time convergence to a bounded neighborhood of zero errors for time-varying disturbances. This extended state observer design is based on a Hölder-continuous fast finite time stable differentiator that is similar to the super-twisting algorithm, to obtain fast convergence. A tracking control scheme that uses the estimated disturbances from extended state observer for disturbance rejection, is designed to achieve fast finite-time stable tracking control. Numerical simulations are conducted to validate the proposed extended state observer and tracking control scheme with disturbance rejection. The proposed extended state observer is compared with other existing research to show its supremacy.
△ Less
Submitted 13 June, 2023;
originally announced June 2023.
-
Efficient First Order Method for Saddle Point Problems with Higher Order Smoothness
Authors:
Nuozhou Wang,
Junyu Zhang,
Shuzhong Zhang
Abstract:
This paper studies the complexity of finding approximate stationary points for the smooth nonconvex-strongly-concave (NC-SC) saddle point problem: $\min_x\max_yf(x,y)$. Under the standard first-order smoothness conditions where $f$ is $\ell$-smooth in both arguments and $μ_y$-strongly concave in $y$, existing literature shows that the optimal complexity for first-order methods to obtain an $ε$-sta…
▽ More
This paper studies the complexity of finding approximate stationary points for the smooth nonconvex-strongly-concave (NC-SC) saddle point problem: $\min_x\max_yf(x,y)$. Under the standard first-order smoothness conditions where $f$ is $\ell$-smooth in both arguments and $μ_y$-strongly concave in $y$, existing literature shows that the optimal complexity for first-order methods to obtain an $ε$-stationary point is $\tilde{O}\big(\sqrt{κ_y}\ellε^{-2}\big)$, where $κ_y=\ell/μ_y$ is the condition number. However, when $Φ(x):=\max_y f(x,y)$ has $L_2$-Lipschitz continuous Hessian in addition, we derive a first-order algorithm with an $\tilde{O}\big(\sqrt{κ_y}\ell^{1/2}L_2^{1/4}ε^{-7/4}\big)$ complexity by designing an accelerated proximal point algorithm enhanced with the "Convex Until Proven Guilty" technique. Moreover, an improved $Ω\big(\sqrt{κ_y}\ell^{3/7}L_2^{2/7}ε^{-12/7}\big)$ lower bound for first-order method is also derived for sufficiently small $ε$. As a result, given the second-order smoothness of the problem, the complexity of our method improves the state-of-the-art result by a factor of $\tilde{O}\big(\big(\frac{\ell^2}{L_2ε}\big)^{1/4}\big)$, while almost matching the lower bound except for a small $\tilde{O}\big(\big(\frac{\ell^2}{L_2ε}\big)^{1/28}\big)$ factor.
△ Less
Submitted 7 December, 2024; v1 submitted 24 April, 2023;
originally announced April 2023.
-
The asymptotic formulae of sums of two smooth squares for divisor function
Authors:
Nanxiang Wang,
Haobo Dai
Abstract:
A natural number $n$ is $y$-smooth if the greatest prime factor of $n$ does not exceed $y$. Let $s_{1}$ and $s_{2}$ are $y$-smooth numbers. We consider sums of smooth squares of the binary Titchmarsh divisor problem and give asymptotic formulae for $\sum_{s_{1}^{2}+s_{2}^{2}\le x}τ(s_{1}^{2}+s_{2}^{2}+1)$ for $(\log x)^{K}\le y<x^{\frac{1}{2}}$, where $K$ is large enough.
A natural number $n$ is $y$-smooth if the greatest prime factor of $n$ does not exceed $y$. Let $s_{1}$ and $s_{2}$ are $y$-smooth numbers. We consider sums of smooth squares of the binary Titchmarsh divisor problem and give asymptotic formulae for $\sum_{s_{1}^{2}+s_{2}^{2}\le x}τ(s_{1}^{2}+s_{2}^{2}+1)$ for $(\log x)^{K}\le y<x^{\frac{1}{2}}$, where $K$ is large enough.
△ Less
Submitted 12 June, 2023; v1 submitted 15 March, 2023;
originally announced March 2023.
-
3D Bosons, 3-Jack polynomials and affine Yangian of ${\mathfrak{gl}}(1)$
Authors:
Na Wang,
Ke Wu
Abstract:
3D (3 dimensional) Young diagrams are a generalization of 2D Young diagrams. In this paper, We consider 3D Bosons and 3-Jack polynomials. We associate three parameters $h_1,h_2,h_3$ to $y,x,z$-axis respectively. 3-Jack polynomials are polynomials of $P_{n,j},
n\geq j$ with coefficients in $\mathbb C(h_1,h_2,h_3)$, which are the generalization of Schur functions and Jack polynomials to 3D case. S…
▽ More
3D (3 dimensional) Young diagrams are a generalization of 2D Young diagrams. In this paper, We consider 3D Bosons and 3-Jack polynomials. We associate three parameters $h_1,h_2,h_3$ to $y,x,z$-axis respectively. 3-Jack polynomials are polynomials of $P_{n,j},
n\geq j$ with coefficients in $\mathbb C(h_1,h_2,h_3)$, which are the generalization of Schur functions and Jack polynomials to 3D case. Similar to Schur functions, 3-Jack polynomials can also be determined by the vertex operators and the Pieri formulas.
△ Less
Submitted 11 December, 2022;
originally announced December 2022.
-
An Accelerated Variance Reduced Extra-Point Approach to Finite-Sum VI and Optimization
Authors:
Kevin Huang,
Nuozhou Wang,
Shuzhong Zhang
Abstract:
In this paper, we develop stochastic variance reduced algorithms for solving a class of finite-sum hemivariational inequality (HVI) problem. In this HVI problem, the associated function is assumed to be differentiable, and both the vector mapping and the function are of finite-sum structure. We propose two algorithms to solve the cases when the vector mapping is either merely monotone or strongly…
▽ More
In this paper, we develop stochastic variance reduced algorithms for solving a class of finite-sum hemivariational inequality (HVI) problem. In this HVI problem, the associated function is assumed to be differentiable, and both the vector mapping and the function are of finite-sum structure. We propose two algorithms to solve the cases when the vector mapping is either merely monotone or strongly monotone, while the function is assumed to be convex. We show how to apply variance reduction in the proposed algorithms when such an HVI problem has a finite-sum structure, and the resulting accelerated gradient complexities can match the best bound established for finite-sum VI problem, as well as the bound given by the direct Katyusha for finite-sum optimization respectively, in terms of the corresponding parameters such as (gradient) Lipschitz constants and the sizes of the finite-sums. We demonstrate the application of our algorithms through solving a finite-sum constrained finite-sum optimization problem and provide preliminary numerical results.
△ Less
Submitted 11 September, 2025; v1 submitted 6 November, 2022;
originally announced November 2022.
-
Devaney Chaos on a Set-valued Map and Its Inverse Limit
Authors:
Yingcui Zhao,
Lidong Wang,
Nan Wang
Abstract:
We study relationships between a set-valued map and its inverse limits about the notion of periodic point set, transitivity, sensitivity and Devaney chaos. Density of periodic point set of a set-valued map and its inverse limits implies each other. Sensitivity of a set-valued map and its inverse limits does not imply each other. Transitivity and Devaney chaos of generalized inverse limits implies…
▽ More
We study relationships between a set-valued map and its inverse limits about the notion of periodic point set, transitivity, sensitivity and Devaney chaos. Density of periodic point set of a set-valued map and its inverse limits implies each other. Sensitivity of a set-valued map and its inverse limits does not imply each other. Transitivity and Devaney chaos of generalized inverse limits implies the corresponding property of a set-valued map.
△ Less
Submitted 24 October, 2022;
originally announced October 2022.
-
A Gradient Complexity Analysis for Minimizing the Sum of Strongly Convex Functions with Varying Condition Numbers
Authors:
Nuozhou Wang,
Shuzhong Zhang
Abstract:
A popular approach to minimize a finite-sum of convex functions is stochastic gradient descent (SGD) and its variants. Fundamental research questions associated with SGD include: (i) To find a lower bound on the number of times that the gradient oracle of each individual function must be assessed in order to find an $ε$-minimizer of the overall objective; (ii) To design algorithms which guarantee…
▽ More
A popular approach to minimize a finite-sum of convex functions is stochastic gradient descent (SGD) and its variants. Fundamental research questions associated with SGD include: (i) To find a lower bound on the number of times that the gradient oracle of each individual function must be assessed in order to find an $ε$-minimizer of the overall objective; (ii) To design algorithms which guarantee to find an $ε$-minimizer of the overall objective in expectation at no more than a certain number of times (in terms of $1/ε$) that the gradient oracle of each functions needs to be assessed (i.e., upper bound). If these two bounds are at the same order of magnitude, then the algorithms may be called optimal. Most existing results along this line of research typically assume that the functions in the objective share the same condition number. In this paper, the first model we study is the problem of minimizing the sum of finitely many strongly convex functions whose condition numbers are all different. We propose an SGD method for this model and show that it is optimal in gradient computations, up to a logarithmic factor. We then consider a constrained separate block optimization model, and present lower and upper bounds for its gradient computation complexity. Next, we propose to solve the Fenchel dual of the constrained block optimization model via the SGD we introduced earlier, and show that it yields a lower iteration complexity than solving the original model by the ADMM-type approach. Finally, we extend the analysis to the general composite convex optimization model, and obtain gradient-computation complexity results under certain conditions.
△ Less
Submitted 12 August, 2022;
originally announced August 2022.
-
P-adic incomplete gamma functions and Artin-Hasse-type series
Authors:
Xiaojian Li,
Jay Reiter,
Shiang Tang,
Napoleon Wang,
Jin Yi
Abstract:
We define and study a $p$-adic analogue of the incomplete gamma function related to Morita's $p$-adic gamma function. We also discuss a combinatorial identity related to the Artin-Hasse series, which is a special case of the exponential principle in combinatorics. From this we deduce a curious $p$-adic property of $|\mathrm{Hom} (G,S_n)|$ for a topologically finitely generated group $G$, using a c…
▽ More
We define and study a $p$-adic analogue of the incomplete gamma function related to Morita's $p$-adic gamma function. We also discuss a combinatorial identity related to the Artin-Hasse series, which is a special case of the exponential principle in combinatorics. From this we deduce a curious $p$-adic property of $|\mathrm{Hom} (G,S_n)|$ for a topologically finitely generated group $G$, using a characterization of $p$-adic continuity for certain functions $f \colon \mathbb Z_{>0} \to \mathbb Q_p$ due to O'Desky-Richman. In the end, we give an exposition of some standard properties of the Artin-Hasse series.
△ Less
Submitted 28 November, 2022; v1 submitted 24 July, 2022;
originally announced July 2022.
-
An energy system model for mixed bilateral and pool markets
Authors:
Ni Wang,
Remco A. Verzijlbergh,
Petra W. Heijnen,
Paulien M. Herder
Abstract:
Investments into renewable energy are increasing rapidly around the world. Energy system models are able to provide insights into optimal investment capacities and thus are widely used to aid the long-term investment decision-making under an electricity market environment. Existing energy system models, however, fail to consider bilateral electricity markets while in reality, these constitute a ma…
▽ More
Investments into renewable energy are increasing rapidly around the world. Energy system models are able to provide insights into optimal investment capacities and thus are widely used to aid the long-term investment decision-making under an electricity market environment. Existing energy system models, however, fail to consider bilateral electricity markets while in reality, these constitute a major part of all energy trades. In this paper, we propose an improved energy system model that endogenously considers mixed bilateral and pool markets. In this model, we also introduce three externality cost items that account for the social cost of technologies, carbon taxes/renewable energy subsidies, and the bilateral product differentiation in the bilateral market, respectively. We start with an equilibrium problem formulation for different market players and next, an equivalent optimization problem is presented. Then, a case study of the pan-European market to reach 95\% emission reduction in 2050 is conducted to demonstrate the model. Different scenarios are constructed to showcase two different usages of product differentiation in the bilateral market, i.e., willingness to pay and exogenous costs. Our main conclusion is that the inclusion of mixed bilateral and pool markets into our enriched energy system model significantly changed the optimal investment capacities, compared to benchmark results from the existing, conventional energy system model. This shows that the inclusion of the bilateral market is of key importance in future investment considerations. Our model is the first of its kind to include this important and realistic bilateral market in energy system models.
△ Less
Submitted 5 January, 2022;
originally announced January 2022.
-
The bipartite Turan number and spectral extremum for linear forests
Authors:
Ming-Zhu Chen,
Ning Wang,
Long-Tu Yuan,
Xiao-Dong Zhang
Abstract:
The bipartite Turán number of a graph $H$, denoted by $ex(m,n; H)$, is the maximum number of edges in any bipartite graph $G=(X,Y; E)$ with $|X|=m$ and $|Y|=n$ which does not contain $H$ as a subgraph. In this paper, we determined $ex(m,n; F_{\ell})$ for arbitrary $\ell$ and appropriately large $n$ with comparing to $m$ and $\ell$, where $F_\ell$ is a linear forest which consists of $\ell$ vertex…
▽ More
The bipartite Turán number of a graph $H$, denoted by $ex(m,n; H)$, is the maximum number of edges in any bipartite graph $G=(X,Y; E)$ with $|X|=m$ and $|Y|=n$ which does not contain $H$ as a subgraph. In this paper, we determined $ex(m,n; F_{\ell})$ for arbitrary $\ell$ and appropriately large $n$ with comparing to $m$ and $\ell$, where $F_\ell$ is a linear forest which consists of $\ell$ vertex disjoint paths. Moreover, the extremal graphs have been characterized. Furthermore, these results are used to obtain the maximum spectral radius of bipartite graphs which does not contain $F_{\ell}$ as a subgraph and characterize all extremal graphs which attain the maximum spectral radius.
△ Less
Submitted 2 January, 2022;
originally announced January 2022.
-
Privacy-Preserving Energy Storage Sharing with Blockchain and Secure Multi-Party Computation
Authors:
Nan Wang,
Sid Chi-Kin Chau,
Yue Zhou
Abstract:
Energy storage provides an effective way of shifting temporal energy demands and supplies, which enables significant cost reduction under time-of-use energy pricing plans. Despite its promising benefits, the cost of present energy storage remains expensive, presenting a major obstacle to practical deployment. A more viable solution to improve the cost-effectiveness is by sharing energy storage, su…
▽ More
Energy storage provides an effective way of shifting temporal energy demands and supplies, which enables significant cost reduction under time-of-use energy pricing plans. Despite its promising benefits, the cost of present energy storage remains expensive, presenting a major obstacle to practical deployment. A more viable solution to improve the cost-effectiveness is by sharing energy storage, such as community sharing, cloud energy storage and peer-to-peer sharing. However, revealing private energy demand data to an external energy storage operator may compromise user privacy, and is susceptible to data misuses and breaches. In this paper, we explore a novel approach to support energy storage sharing with privacy protection, based on privacy-preserving blockchain and secure multi-party computation. We present an integrated solution to enable privacy-preserving energy storage sharing, such that energy storage service scheduling and cost-sharing can be attained without the knowledge of individual users' demands. It also supports auditing and verification by the grid operator via blockchain. Furthermore, our privacy-preserving solution can safeguard against a dishonest majority of users, who may collude in cheating, without requiring a trusted third-party. We implemented our solution as a smart contract on real-world Ethereum blockchain platform, and provide empirical evaluation in this paper.
△ Less
Submitted 2 November, 2021;
originally announced November 2021.
-
An improved Liouville type theorem for Beltrami flows
Authors:
Na Wang,
Zhibing Zhang
Abstract:
In this note, we improved the Liouville type theorem for the Beltrami flows. Two different methods are used to prove it. One is the monotonicity method, and the other is proof by contradiction. The conditions that we proposed on Beltrami flows are significantly weaker than previously known conditions.
In this note, we improved the Liouville type theorem for the Beltrami flows. Two different methods are used to prove it. One is the monotonicity method, and the other is proof by contradiction. The conditions that we proposed on Beltrami flows are significantly weaker than previously known conditions.
△ Less
Submitted 21 April, 2022; v1 submitted 18 July, 2021;
originally announced July 2021.
-
The asymmetric particle population density method for simulation of coupled noisy oscillators
Authors:
Ningyuan Wang,
Daniel B Forger
Abstract:
A wide variety of biological phenomena can be modeled by the collective activity of a population of individual units. A common strategy for simulating such a system, the population density approach, is to take the macroscopic limit and update its population density function. However, in many cases, the coupling between the units and noise gives rise to complex behaviors challenging to existing pop…
▽ More
A wide variety of biological phenomena can be modeled by the collective activity of a population of individual units. A common strategy for simulating such a system, the population density approach, is to take the macroscopic limit and update its population density function. However, in many cases, the coupling between the units and noise gives rise to complex behaviors challenging to existing population density approach methods. To address these challenges, we develop the asymmetric particle population density (APPD) method that efficiently and accurately simulates such populations consist of coupled elements. The APPD is well-suited for a parallel implementation. We compare the performance of the method against direct Monte-Carlo simulation and verify its accuracy by applying it to the well-studied Hodgkin-Huxley model, with a range of challenging scenarios. We find that our method can accurately reproduce complex macroscopic behaviors such as inhibitory coupling-induced clustering and noise-induced firing while being faster than the direct simulation.
△ Less
Submitted 14 June, 2021;
originally announced June 2021.
-
A Prosumer-Centric Framework for Concurrent Generation and Transmission Planning -- Part II
Authors:
Ni Wang,
Remco Verzijlbergh,
Petra Heijnen,
Paulien Herder
Abstract:
We propose a framework where generation and transmission capacities are planned concurrently in market environments with a focus on the prosumers. This paper is a continuation of Part I and presents numerical results from three archetypal case studies. Following the proposed framework, optimal planning decisions are shown in all the cases. Furthermore, in case study I, no-regret planning decisions…
▽ More
We propose a framework where generation and transmission capacities are planned concurrently in market environments with a focus on the prosumers. This paper is a continuation of Part I and presents numerical results from three archetypal case studies. Following the proposed framework, optimal planning decisions are shown in all the cases. Furthermore, in case study I, no-regret planning decisions considering the uncertainties in future electricity market designs are discussed. In case study II, we look at a situation where the social resistance of wind energy prevails and the prosumers choose to not invest in wind energy. This preference results in an increased system cost, and thus may harm other prosumers. The framework is used as a negotiation simulator to deal with this problem. Case study III presents numerical results for a mixed bilateral/pool market. The case studies utilize realistic data from the Dutch power system and the European power system to provide policy-relevant results that aid their decarbonization in various market environments.
△ Less
Submitted 21 May, 2021;
originally announced May 2021.
-
A Prosumer-Centric Framework for Concurrent Generation and Transmission Planning -- Part I
Authors:
Ni Wang,
Remco Verzijlbergh,
Petra Heijnen,
Paulien Herder
Abstract:
The growing share of proactive actors in the electricity markets calls for more attention on prosumers and more support for their decision-making under decentralized electricity markets. In view of the changing paradigm, it is crucial to study the long-term planning under the decentralized and prosumer-centric markets to unravel the effects of such markets on the planning decisions. In the first p…
▽ More
The growing share of proactive actors in the electricity markets calls for more attention on prosumers and more support for their decision-making under decentralized electricity markets. In view of the changing paradigm, it is crucial to study the long-term planning under the decentralized and prosumer-centric markets to unravel the effects of such markets on the planning decisions. In the first part of the two-part paper, we propose a prosumer-centric framework for concurrent generation and transmission planning. Here, three planning models are presented where a peer-to-peer market with product differentiation, a pool market and a mixed bilateral/pool market and their associated trading costs are explicitly modeled, respectively. To fully reveal the individual costs and benefits, we start by formulating the optimization problems of various actors, i.e. prosumers, transmission system operator, energy market operator and carbon market operator. Moreover, to enable decentralized planning where the privacy of the prosumers is preserved, distributed optimization algorithms are presented based on the corresponding centralized optimization problems.
△ Less
Submitted 21 May, 2021;
originally announced May 2021.
-
Unconditional energy dissipation and error estimates of the SAV Fourier spectral method for nonlinear fractional generalized wave equation
Authors:
Nan Wang,
Meng Li,
Chengming Huang
Abstract:
In this paper, we consider a second-order scalar auxiliary variable (SAV) Fourier spectral method to solve the nonlinear fractional generalized wave equation. Unconditional energy conservation or dissipation properties of the fully discrete scheme are first established. Next, we utilize the temporal-spatial error splitting argument to obtain unconditional optimal error estimate of the fully discre…
▽ More
In this paper, we consider a second-order scalar auxiliary variable (SAV) Fourier spectral method to solve the nonlinear fractional generalized wave equation. Unconditional energy conservation or dissipation properties of the fully discrete scheme are first established. Next, we utilize the temporal-spatial error splitting argument to obtain unconditional optimal error estimate of the fully discrete scheme, which overcomes time-step restrictions caused by strongly nonlinear system, or the restrictions that the nonlinear term needs to satisfy the assumption of global Lipschitz condition in all previous works for fractional undamped or damped wave equations. Finally, some numerical experiments are presented to confirm our theoretical analysis.
△ Less
Submitted 4 May, 2021;
originally announced May 2021.
-
Optimal Lockdown Policy for Covid-19: A Modelling Study
Authors:
Yuting Fu,
Haitao Xiang,
Hanqing Jin,
Ning Wang
Abstract:
As the COVID19 spreads across the world, prevention measures are becoming the essential weapons to combat the pandemic in the period of crisis. The lockdown measure is the most controversial one as it imposes an overwhelming impact on our economy and society. Especially when and how to enforce the lockdown measures are the most challenging questions considering both economic and epidemiological co…
▽ More
As the COVID19 spreads across the world, prevention measures are becoming the essential weapons to combat the pandemic in the period of crisis. The lockdown measure is the most controversial one as it imposes an overwhelming impact on our economy and society. Especially when and how to enforce the lockdown measures are the most challenging questions considering both economic and epidemiological costs. In this paper, we extend the classic SIR model to find optimal decision making to balance between economy and people's health during the outbreak of COVID-19. In our model, we intend to solve a two phases optimization problem: policymakers control the lockdown rate to maximize the overall welfare of the society; people in different health statuses take different decisions on their working hours and consumption to maximize their utility. We develop a novel method to estimate parameters for the model through various additional sources of data. We use the Cournot equilibrium to model people's behavior and also consider the cost of death in order to leverage between economic and epidemic costs. The analysis of simulation results provides scientific suggestions for policymakers to make critical decisions on when to start the lockdown and how strong it should be during the whole period of the outbreak. Although the model is originally proposed for the COVID19 pandemic, it can be generalized to address similar problems to control the outbreak of other infectious diseases with lockdown measures.
△ Less
Submitted 31 January, 2021;
originally announced February 2021.
-
Average size of Selmer group in large q limit
Authors:
Sun Woo Park,
Niudun Wang
Abstract:
In this paper, we prove a function field-analogue of Poonen-Rains heuristics on the average size of $p$-Selmer group. Let $E$ be an elliptic curve defined over $\mathbb{Z}[t]$. Then $E$ is also defined over $\mathbb{F}_q$ for any $q$ of prime power. We show that for large enough $q$, the average size of the $p$-Selmer groups over the family of quadratic twists of $E$ over $\mathbb{F}_q[t]$ is equa…
▽ More
In this paper, we prove a function field-analogue of Poonen-Rains heuristics on the average size of $p$-Selmer group. Let $E$ be an elliptic curve defined over $\mathbb{Z}[t]$. Then $E$ is also defined over $\mathbb{F}_q$ for any $q$ of prime power. We show that for large enough $q$, the average size of the $p$-Selmer groups over the family of quadratic twists of $E$ over $\mathbb{F}_q[t]$ is equal to $p+1$ for all but finitely many primes $p$. Namely, if we twist the curve in $\mathbb{F}_q[t]$ by polynomials of fixed degree $n$ and let both $n$ and $q$ approach to infinity, then the average rank of $p$-Selmer group converges to $p+1$.
△ Less
Submitted 3 February, 2021; v1 submitted 31 January, 2021;
originally announced February 2021.
-
Deep-learning based discovery of partial differential equations in integral form from sparse and noisy data
Authors:
Hao Xu,
Dongxiao Zhang,
Nanzhe Wang
Abstract:
Data-driven discovery of partial differential equations (PDEs) has attracted increasing attention in recent years. Although significant progress has been made, certain unresolved issues remain. For example, for PDEs with high-order derivatives, the performance of existing methods is unsatisfactory, especially when the data are sparse and noisy. It is also difficult to discover heterogeneous parame…
▽ More
Data-driven discovery of partial differential equations (PDEs) has attracted increasing attention in recent years. Although significant progress has been made, certain unresolved issues remain. For example, for PDEs with high-order derivatives, the performance of existing methods is unsatisfactory, especially when the data are sparse and noisy. It is also difficult to discover heterogeneous parametric PDEs where heterogeneous parameters are embedded in the partial differential operators. In this work, a new framework combining deep-learning and integral form is proposed to handle the above-mentioned problems simultaneously, and improve the accuracy and stability of PDE discovery. In the framework, a deep neural network is firstly trained with observation data to generate meta-data and calculate derivatives. Then, a unified integral form is defined, and the genetic algorithm is employed to discover the best structure. Finally, the value of parameters is calculated, and whether the parameters are constants or variables is identified. Numerical experiments proved that our proposed algorithm is more robust to noise and more accurate compared with existing methods due to the utilization of integral form. Our proposed algorithm is also able to discover PDEs with high-order derivatives or heterogeneous parameters accurately with sparse and noisy data.
△ Less
Submitted 24 November, 2020;
originally announced November 2020.
-
Filtering cohomology of ordinary and Lagrangian Grassmannians
Authors:
The 2020 Polymath Jr. REU "q-binomials,
the Grassmannian group",
:,
Huda Ahmed,
Rasiel Chishti,
Yu-Cheng Chiu,
Galen Dorpalen-Barry,
Jeremy Ellis,
David Fang,
Michael Feigen,
Jonathan Feigert,
Mabel González,
Dylan Harker,
Jiaye Wei,
Bhavna Joshi,
Gandhar Kulkarni,
Kapil Lad,
Zhen Liu,
Ma Mingyang,
Lance Myers,
Arjun Nigam,
Tudor Popescu,
Victor Reiner,
Zijian Rong,
Eunice Sukarto
, et al. (9 additional authors not shown)
Abstract:
This paper studies, for a positive integer $m$, the subalgebra of the cohomology ring of the complex Grassmannians generated by the elements of degree at most $m$. We build in two ways upon a conjecture for the Hilbert series of this subalgebra due to Reiner and Tudose. The first reinterprets it in terms of the operation of $k$-conjugation, suggesting two conjectural bases for the subalgebras that…
▽ More
This paper studies, for a positive integer $m$, the subalgebra of the cohomology ring of the complex Grassmannians generated by the elements of degree at most $m$. We build in two ways upon a conjecture for the Hilbert series of this subalgebra due to Reiner and Tudose. The first reinterprets it in terms of the operation of $k$-conjugation, suggesting two conjectural bases for the subalgebras that would imply their conjecture. The second introduces an analogous conjecture for the cohomology of Lagrangian Grassmannians.
△ Less
Submitted 12 September, 2021; v1 submitted 5 November, 2020;
originally announced November 2020.
-
A Lagrangian Dual-based Theory-guided Deep Neural Network
Authors:
Miao Rong,
Dongxiao Zhang,
Nanzhe Wang
Abstract:
The theory-guided neural network (TgNN) is a kind of method which improves the effectiveness and efficiency of neural network architectures by incorporating scientific knowledge or physical information. Despite its great success, the theory-guided (deep) neural network possesses certain limits when maintaining a tradeoff between training data and domain knowledge during the training process. In th…
▽ More
The theory-guided neural network (TgNN) is a kind of method which improves the effectiveness and efficiency of neural network architectures by incorporating scientific knowledge or physical information. Despite its great success, the theory-guided (deep) neural network possesses certain limits when maintaining a tradeoff between training data and domain knowledge during the training process. In this paper, the Lagrangian dual-based TgNN (TgNN-LD) is proposed to improve the effectiveness of TgNN. We convert the original loss function into a constrained form with fewer items, in which partial differential equations (PDEs), engineering controls (ECs), and expert knowledge (EK) are regarded as constraints, with one Lagrangian variable per constraint. These Lagrangian variables are incorporated to achieve an equitable tradeoff between observation data and corresponding constraints, in order to improve prediction accuracy, and conserve time and computational resources adjusted by an ad-hoc procedure. To investigate the performance of the proposed method, the original TgNN model with a set of optimized weight values adjusted by ad-hoc procedures is compared on a subsurface flow problem, with their L2 error, R square (R2), and computational time being analyzed. Experimental results demonstrate the superiority of the Lagrangian dual-based TgNN.
△ Less
Submitted 23 August, 2020;
originally announced August 2020.
-
Deep-Learning based Inverse Modeling Approaches: A Subsurface Flow Example
Authors:
Nanzhe Wang,
Haibin Chang,
Dongxiao Zhang
Abstract:
Deep-learning has achieved good performance and shown great potential for solving forward and inverse problems. In this work, two categories of innovative deep-learning based inverse modeling methods are proposed and compared. The first category is deep-learning surrogate-based inversion methods, in which the Theory-guided Neural Network (TgNN) is constructed as a deep-learning surrogate for probl…
▽ More
Deep-learning has achieved good performance and shown great potential for solving forward and inverse problems. In this work, two categories of innovative deep-learning based inverse modeling methods are proposed and compared. The first category is deep-learning surrogate-based inversion methods, in which the Theory-guided Neural Network (TgNN) is constructed as a deep-learning surrogate for problems with uncertain model parameters. By incorporating physical laws and other constraints, the TgNN surrogate can be constructed with limited simulation runs and accelerate the inversion process significantly. Three TgNN surrogate-based inversion methods are proposed, including the gradient method, the iterative ensemble smoother (IES), and the training method. The second category is direct-deep-learning-inversion methods, in which TgNN constrained with geostatistical information, named TgNN-geo, is proposed for direct inverse modeling. In TgNN-geo, two neural networks are introduced to approximate the respective random model parameters and the solution. Since the prior geostatistical information can be incorporated, the direct-inversion method based on TgNN-geo works well, even in cases with sparse spatial measurements or imprecise prior statistics. Although the proposed deep-learning based inverse modeling methods are general in nature, and thus applicable to a wide variety of problems, they are tested with several subsurface flow problems. It is found that satisfactory results are obtained with a high efficiency. Moreover, both the advantages and disadvantages are further analyzed for the proposed two categories of deep-learning based inversion methods.
△ Less
Submitted 28 July, 2020;
originally announced July 2020.
-
Simpler completeness proofs for modal logics with intersection
Authors:
Yì N. Wáng,
Thomas Ågotnes
Abstract:
There has been a significant interest in extending various modal logics with intersection, the most prominent examples being epistemic and doxastic logics with distributed knowledge. Completeness proofs for such logics tend to be complicated, in particular on model classes such as S5 like in standard epistemic logic, mainly due to the undefinability of intersection of modalities in standard modal…
▽ More
There has been a significant interest in extending various modal logics with intersection, the most prominent examples being epistemic and doxastic logics with distributed knowledge. Completeness proofs for such logics tend to be complicated, in particular on model classes such as S5 like in standard epistemic logic, mainly due to the undefinability of intersection of modalities in standard modal logics. A standard proof method for the S5 case was outlined in [8] and later explicated in more detail in [13], using an "unraveling-folding method" case to achieve a treelike model to deal with the problem of undefinability. This method, however, is not easily adapted to other logics, due to the level of detail and reliance on S5. In this paper we propose a simpler proof technique by building a treelike canonical model directly, which avoids the complications in the processes of unraveling and folding. We demonstrate the technique by showing completeness of the normal modal logics K, D, T, B, S4 and S5 extended with intersection modalities. Furthermore, these treelike canonical models are compatible with Fischer-Ladner-style closures, and we combine the methods to show the completeness of the mentioned logics further extended with transitive closure of union modalities known from PDL or epistemic logic. Some of these completeness results are new.
△ Less
Submitted 5 April, 2020;
originally announced April 2020.
-
A Stage-wise Decision Framework for Transportation Network Resilience Planning
Authors:
Weili Zhang,
Naiyu Wang,
Charles Nicholsonc,
Mohammad Hadikhan Tehrani
Abstract:
This study introduces a comprehensive stage-wise decision framework to support resilience planning for roadway networks regarding pre-disaster mitigation (Stage I), post-disaster emergency response (Stage II) and long-term recovery (Stage III). Three decision metrics are first defined, each based on a derivation of the number of independent pathways (IPW) within a roadway system, to measure the pe…
▽ More
This study introduces a comprehensive stage-wise decision framework to support resilience planning for roadway networks regarding pre-disaster mitigation (Stage I), post-disaster emergency response (Stage II) and long-term recovery (Stage III). Three decision metrics are first defined, each based on a derivation of the number of independent pathways (IPW) within a roadway system, to measure the performance of a network in term of its robustness, redundancy, and recoverability, respectively. Using the three IPW-based decision metrics, a stage-wise decision process is then formulated as a stochastic multi-objective optimization problem, which includes a project ranking mechanism to identify pre-disaster network retrofit projects in Phase I, a prioritization approach for temporary repairs to facilitate immediate post-disaster emergency responses in Phase II, and a methodology for scheduling network-wide repairs during the long-term recovery of the roadway system in Phase III. Finally, this stage-wise decision framework is applied to the roadway network of Shelby County, TN, USA subjected to seismic hazards, to illustrate its implementation in supporting community network resilience planning.
△ Less
Submitted 11 August, 2018;
originally announced August 2018.
-
A spectral penalty method for two-sided fractional differential equations with general boundary conditions
Authors:
Nan Wang,
Zhiping Mao,
Chengming Huang,
George Em Karniadakis
Abstract:
We consider spectral approximations to the conservative form of the two-sided Riemann-Liouville (R-L) and Caputo fractional differential equations (FDEs) with nonhomogeneous Dirichlet (fractional and classical, respectively) and Neumann (fractional) boundary conditions. In particular, we develop a spectral penalty method (SPM) by using the Jacobi poly-fractonomial approximation for the conservativ…
▽ More
We consider spectral approximations to the conservative form of the two-sided Riemann-Liouville (R-L) and Caputo fractional differential equations (FDEs) with nonhomogeneous Dirichlet (fractional and classical, respectively) and Neumann (fractional) boundary conditions. In particular, we develop a spectral penalty method (SPM) by using the Jacobi poly-fractonomial approximation for the conservative R-L FDEs while using the polynomial approximation for the conservative Caputo FDEs. We establish the well-posedness of the corresponding weak problems and analyze sufficient conditions for the coercivity of the SPM for different types of fractional boundary value problems. This analysis allows us to estimate the proper values of the penalty parameters at boundary points. We present several numerical examples to verify the theory and demonstrate the high accuracy of SPM, both for stationary and time dependent FDEs. Moreover, we compare the results against a Petrov-Galerkin spectral tau method (PGS-$τ$, an extension of [Z. Mao, G.E. Karniadakis, SIAM J. Numer. Anal., 2018]) and demonstrate the superior accuracy of SPM for all cases considered.
△ Less
Submitted 12 July, 2018;
originally announced July 2018.
-
Complete regular dessins and skew-morphisms of cyclic groups
Authors:
Yan-Quan Feng,
Kan Hu,
Roman Nedela,
Martin Skoviera,
Na-Er Wang
Abstract:
A dessin is a 2-cell embedding of a connected $2$-coloured bipartite graph into an orientable closed surface. A dessin is regular if its group of orientation- and colour-preserving automorphisms acts regularly on the edges. In this paper we study regular dessins whose underlying graph is a complete bipartite graph $K_{m,n}$, called $(m,n)$-complete regular dessins. The purpose is to establish a ra…
▽ More
A dessin is a 2-cell embedding of a connected $2$-coloured bipartite graph into an orientable closed surface. A dessin is regular if its group of orientation- and colour-preserving automorphisms acts regularly on the edges. In this paper we study regular dessins whose underlying graph is a complete bipartite graph $K_{m,n}$, called $(m,n)$-complete regular dessins. The purpose is to establish a rather surprising correspondence between $(m,n)$-complete regular dessins and pairs of skew-morphisms of cyclic groups. A skew-morphism of a finite group $A$ is a bijection $\varphi\colon A\to A$ that satisfies the identity $\varphi(xy)=\varphi(x)\varphi^{π(x)}(y)$ for some function $π\colon A\to\mathbb{Z}$ and fixes the neutral element of~$A$. We show that every $(m,n)$-complete regular dessin $\mathcal{D}$ determines a pair of reciprocal skew-morphisms of the cyclic groups $\mathbb{Z}_n$ and $\mathbb{Z}_m$.
Conversely, $\mathcal{D}$ can be reconstructed from such a reciprocal pair.
As a consequence, we prove that complete regular dessins, exact bicyclic groups with a distinguished pair of generators, and pairs of reciprocal skew-morphisms of cyclic groups are all in one-to-one correspondence. Finally, we apply the main result to determining all pairs of integers $m$ and $n$ for which there exists, up to interchange of colours, exactly one $(m,n)$-complete regular dessin. We show that the latter occurs precisely when every group expressible as a product of cyclic groups of order $m$ and $n$ is abelian, which eventually comes down to the condition $\gcd(m,φ(n))=\gcd(φ(m),n)=1$, where $φ$ is Euler's totient function.
△ Less
Submitted 18 June, 2018;
originally announced June 2018.
-
Smooth skew-morphisms of the dihedral groups
Authors:
Naer Wang,
Kan Hu,
Kai Yuan,
Junyang Zhang
Abstract:
A skew-morphism $\varphi$ of a finite group $A$ is a permutation on $A$ such that $\varphi(1)=1$ and $\varphi(xy)=\varphi(x)\varphi^{π(x)}(y)$ for all $x,y\in A$ where $π:A\to\mathbb{Z}_{|\varphi|}$ is an integer function. A skew-morphism is smooth if $π(\varphi(x))=π(x)$ for all $x\in A$. The concept of smooth skew-morphisms is a generalization of that of $t$-balanced skew-morphisms. The aim of t…
▽ More
A skew-morphism $\varphi$ of a finite group $A$ is a permutation on $A$ such that $\varphi(1)=1$ and $\varphi(xy)=\varphi(x)\varphi^{π(x)}(y)$ for all $x,y\in A$ where $π:A\to\mathbb{Z}_{|\varphi|}$ is an integer function. A skew-morphism is smooth if $π(\varphi(x))=π(x)$ for all $x\in A$. The concept of smooth skew-morphisms is a generalization of that of $t$-balanced skew-morphisms. The aim of the paper is to develop a general theory of smooth skew-morphisms. As an application we classify smooth skew-morphisms of the dihedral groups.
△ Less
Submitted 18 June, 2018;
originally announced June 2018.
-
Regular dessins uniquely determined by a nilpotent automorphism group
Authors:
Naer Wang,
Roman Nedela,
Kan Hu
Abstract:
It is well known that the automorphism group of a regular dessin is a two-generator finite group, and the isomorphism classes of regular dessins with automorphism groups isomorphic to a given finite group $G$ are in one-to-one correspondence with the orbits of the action of $\Aut(G)$ on the ordered generating pairs of $G$. If there is only one orbit, then up to isomorphism the regular dessin is un…
▽ More
It is well known that the automorphism group of a regular dessin is a two-generator finite group, and the isomorphism classes of regular dessins with automorphism groups isomorphic to a given finite group $G$ are in one-to-one correspondence with the orbits of the action of $\Aut(G)$ on the ordered generating pairs of $G$. If there is only one orbit, then up to isomorphism the regular dessin is uniquely determined by the group $G$ and it is called uniquely regular. In the paper we investigate the classification of uniquely regular dessins with a nilpotent automorphism group. The problem is reduced to the classification of finite maximally automorphic $p$-groups $G$, i.e., the order of the automorphism group of $G$ attains Hall's upper bound. Maximally automorphic $p$-groups of nilpotency class three are classified.
△ Less
Submitted 12 June, 2018;
originally announced June 2018.
-
Nilpotent groups of class two which underly a unique regular dessin
Authors:
Kan Hu,
Roman Nedela,
Naer Wang
Abstract:
A dessin is an embedding of connected bipartite graph into an oriented closed surface. A dessin is regular if its group of colour- and orientation-preserving automorphisms acts transitively on the edges. In the present paper regular dessins with a nilpotent automorphism group are investigated, and attention are paid on those with the highest level of external symmetry. Depending on the algebraic t…
▽ More
A dessin is an embedding of connected bipartite graph into an oriented closed surface. A dessin is regular if its group of colour- and orientation-preserving automorphisms acts transitively on the edges. In the present paper regular dessins with a nilpotent automorphism group are investigated, and attention are paid on those with the highest level of external symmetry. Depending on the algebraic theory of dessins and using group-theoretical methods, we present a classification of nilpotent groups of class two which underly a unique regular dessin.
△ Less
Submitted 12 June, 2018;
originally announced June 2018.
-
Complete regular dessins of odd prime power order
Authors:
Kan Hu,
Naer Wang,
Roman Nedela
Abstract:
A dessin is a $2$-cell embedding of a connected $2$-coloured bipartite graph into an orientable closed surface. A dessin is regular if its group of colour- and orientation-preserving automorphisms acts regularly on the edges. In this paper we employ group-theoretic method to determine and enumerate the isomorphism classes of regular dessins with the complete bipartite underlying graphs of odd prim…
▽ More
A dessin is a $2$-cell embedding of a connected $2$-coloured bipartite graph into an orientable closed surface. A dessin is regular if its group of colour- and orientation-preserving automorphisms acts regularly on the edges. In this paper we employ group-theoretic method to determine and enumerate the isomorphism classes of regular dessins with the complete bipartite underlying graphs of odd prime power order.
△ Less
Submitted 12 June, 2018;
originally announced June 2018.
-
Algebraic formulas for the structure constants in symmetric functions
Authors:
Na Wang,
Ke Wu
Abstract:
Littlewood-Richardson rule gives the decomposition formula for the multiplication of two Schur functions, while the decomposition formula for the multiplication of two Hall-Littlewood functions or two universal characters is also given by the combinatorial method. In this paper, using the vertex operator realizations of these symmetric functions, we construct the algebraic forms of these decomposi…
▽ More
Littlewood-Richardson rule gives the decomposition formula for the multiplication of two Schur functions, while the decomposition formula for the multiplication of two Hall-Littlewood functions or two universal characters is also given by the combinatorial method. In this paper, using the vertex operator realizations of these symmetric functions, we construct the algebraic forms of these decomposition formulas.
△ Less
Submitted 31 January, 2018;
originally announced February 2018.
-
Neighborhood selection with application to social networks
Authors:
Nana Wang,
Wolfgang Polonik
Abstract:
The topic of this paper is modeling and analyzing dependence in stochastic social networks. Using a latent variable block model allows the analysis of dependence between blocks via the analysis of a latent graphical model. Our approach to the analysis of the graphical model then is based on the idea underlying the neighborhood selection scheme put forward by Meinshausen and Bühlmann (2006). Howeve…
▽ More
The topic of this paper is modeling and analyzing dependence in stochastic social networks. Using a latent variable block model allows the analysis of dependence between blocks via the analysis of a latent graphical model. Our approach to the analysis of the graphical model then is based on the idea underlying the neighborhood selection scheme put forward by Meinshausen and Bühlmann (2006). However, because of the latent nature of our model, estimates have to be used in lieu of the unobserved variables. This leads to a novel analysis of graphical models under uncertainty, in the spirit of Rosenbaum et al. (2010), or Belloni et al. (2017). Lasso-based selectors, and a class of Dantzig-type selectors are studied.
△ Less
Submitted 23 August, 2018; v1 submitted 16 November, 2017;
originally announced November 2017.