-
Strong Uniqueness by Kraichnan Transport Noise for the 2D Boussinesq Equations with Zero Viscosity
Authors:
Shuaijie Jiao,
Dejun Luo
Abstract:
We investigate the inviscid 2D Boussinesq equations driven by rough transport noise of Kraichnan type with regularity index $α\in (0,1/2)$. For all $1<p<\infty$, we establish the existence and uniqueness of probabilistic strong solutions for all $L^p$ initial vorticity and $L^2$ initial temperature, under the parameter constraint $0<α< 1-1/(p\wedge 2)$. The key ingredient is the anomalous regulari…
▽ More
We investigate the inviscid 2D Boussinesq equations driven by rough transport noise of Kraichnan type with regularity index $α\in (0,1/2)$. For all $1<p<\infty$, we establish the existence and uniqueness of probabilistic strong solutions for all $L^p$ initial vorticity and $L^2$ initial temperature, under the parameter constraint $0<α< 1-1/(p\wedge 2)$. The key ingredient is the anomalous regularity due to the noise proven by Coghi and Maurelli \cite{CogMau} who dealt with stochastic 2D Euler equations. Combining techniques from analysis and probability, we demonstrate how the additional regularity from noise compensates the singularity due to the nonlinear parts and coupled terms.
△ Less
Submitted 29 April, 2025; v1 submitted 27 April, 2025;
originally announced April 2025.
-
The broken sample problem revisited: Proof of a conjecture by Bai-Hsing and high-dimensional extensions
Authors:
Simiao Jiao,
Yihong Wu,
Jiaming Xu
Abstract:
We revisit the classical broken sample problem: Two samples of i.i.d. data points $\mathbf{X}=\{X_1,\cdots, X_n\}$ and $\mathbf{Y}=\{Y_1,\cdots,Y_m\}$ are observed without correspondence with $m\leq n$. Under the null hypothesis, $\mathbf{X}$ and $\mathbf{Y}$ are independent. Under the alternative hypothesis, $\mathbf{Y}$ is correlated with a random subsample of $\mathbf{X}$, in the sense that…
▽ More
We revisit the classical broken sample problem: Two samples of i.i.d. data points $\mathbf{X}=\{X_1,\cdots, X_n\}$ and $\mathbf{Y}=\{Y_1,\cdots,Y_m\}$ are observed without correspondence with $m\leq n$. Under the null hypothesis, $\mathbf{X}$ and $\mathbf{Y}$ are independent. Under the alternative hypothesis, $\mathbf{Y}$ is correlated with a random subsample of $\mathbf{X}$, in the sense that $(X_{π(i)},Y_i)$'s are drawn independently from some bivariate distribution for some latent injection $π:[m] \to [n]$. Originally introduced by DeGroot, Feder, and Goel (1971) to model matching records in census data, this problem has recently gained renewed interest due to its applications in data de-anonymization, data integration, and target tracking. Despite extensive research over the past decades, determining the precise detection threshold has remained an open problem even for equal sample sizes ($m=n$). Assuming $m$ and $n$ grow proportionally, we show that the sharp threshold is given by a spectral and an $L_2$ condition of the likelihood ratio operator, resolving a conjecture of Bai and Hsing (2005) in the positive. These results are extended to high dimensions and settle the sharp detection thresholds for Gaussian and Bernoulli models.
△ Less
Submitted 18 March, 2025;
originally announced March 2025.
-
Sparse $H_\infty$ Controller for Networked Control Systems: Non-Structured and Optimal Structured Design
Authors:
Zhaohua Yang,
Pengyu Wang,
Haishan Zhang,
Shiyue Jia,
Nachuan Yang,
Yuxing Zhong,
Ling Shi
Abstract:
This paper provides a comprehensive analysis of the design of optimal structured and sparse $H_\infty$ controllers for continuous-time linear time-invariant (LTI) systems. Three problems are considered. First, designing the sparsest $H_\infty$ controller, which minimizes the sparsity of the controller while satisfying the given performance requirements. Second, designing a sparsity-promoting…
▽ More
This paper provides a comprehensive analysis of the design of optimal structured and sparse $H_\infty$ controllers for continuous-time linear time-invariant (LTI) systems. Three problems are considered. First, designing the sparsest $H_\infty$ controller, which minimizes the sparsity of the controller while satisfying the given performance requirements. Second, designing a sparsity-promoting $H_\infty$ controller, which balances system performance and controller sparsity. Third, designing a $H_\infty$ controller subject to a structural constraint, which enhances system performance with a specified sparsity pattern. For each problem, we adopt a linearization technique that transforms the original nonconvex problem into a convex semidefinite programming (SDP) problem. Subsequently, we design an iterative linear matrix inequality (ILMI) algorithm for each problem, which ensures guaranteed convergence. We further characterize the first-order optimality using the Karush-Kuhn-Tucker (KKT) conditions and prove that any limit point of the solution sequence generated by the ILMI algorithm is a stationary point. For the first and second problems, we validate that our algorithms can reduce the number of non-zero elements and thus the communication burden through several numerical simulations. For the third problem, we refine the solutions obtained in existing literature, demonstrating that our approaches achieve significant improvements.
△ Less
Submitted 6 May, 2025; v1 submitted 1 November, 2024;
originally announced November 2024.
-
Multidimensional analogues of the refined versions of Bohr inequalities involving Schwarz mappings
Authors:
Shanshan Jia,
Ming-Sheng Liu,
Saminathan Ponnusamy
Abstract:
Our first aim of this article is to establish several new versions of refined Bohr inequalities for bounded analytic functions in the unit disk involving Schwarz functions. Secondly, %as applications of these results, we obtain several new multidimensional analogues of the refined Bohr inequalities for bounded holomorphic mappings on the unit ball in a complex Banach space involving higher dimensi…
▽ More
Our first aim of this article is to establish several new versions of refined Bohr inequalities for bounded analytic functions in the unit disk involving Schwarz functions. Secondly, %as applications of these results, we obtain several new multidimensional analogues of the refined Bohr inequalities for bounded holomorphic mappings on the unit ball in a complex Banach space involving higher dimensional Schwarz mappings. All the results are proved to be sharp.
△ Less
Submitted 16 September, 2024;
originally announced September 2024.
-
On the pathwise uniqueness of stochastic 2D Euler equations with Kraichnan noise and $L^p$-data
Authors:
Shuaijie Jiao,
Dejun Luo
Abstract:
In the recent work [arXiv:2308.03216], Coghi and Maurelli proved pathwise uniqueness of solutions to the vorticity form of stochastic 2D Euler equation, with Kraichnan transport noise and initial data in $L^1\cap L^p$ for $p>3/2$. The aim of this note is to remove the constraint on $p$, showing that pathwise uniqueness holds for all $L^1\cap L^p$ initial data with arbitrary $p>1$.
In the recent work [arXiv:2308.03216], Coghi and Maurelli proved pathwise uniqueness of solutions to the vorticity form of stochastic 2D Euler equation, with Kraichnan transport noise and initial data in $L^1\cap L^p$ for $p>3/2$. The aim of this note is to remove the constraint on $p$, showing that pathwise uniqueness holds for all $L^1\cap L^p$ initial data with arbitrary $p>1$.
△ Less
Submitted 11 June, 2024;
originally announced June 2024.
-
Well-posedness of stochastic mSQG equations with Kraichnan noise and $L^p$ data
Authors:
Shuaijie Jiao,
Dejun Luo
Abstract:
We consider stochastic mSQG (modified Surface Quasi-Geostrophic) equations with multiplicative transport noise of Kraichnan type, and $L^p$-initial conditions. Inspired by the recent work of Coghi and Maurelli [arXiv:2308.03216], we show weak existence and pathwise uniqueness of solutions to the equations for suitable choices of parameters in the nonlinearity, the noise and the integrability of in…
▽ More
We consider stochastic mSQG (modified Surface Quasi-Geostrophic) equations with multiplicative transport noise of Kraichnan type, and $L^p$-initial conditions. Inspired by the recent work of Coghi and Maurelli [arXiv:2308.03216], we show weak existence and pathwise uniqueness of solutions to the equations for suitable choices of parameters in the nonlinearity, the noise and the integrability of initial data.
△ Less
Submitted 30 May, 2024; v1 submitted 2 May, 2024;
originally announced May 2024.
-
Clustered Switchback Designs for Experimentation Under Spatio-temporal Interference
Authors:
Su Jia,
Nathan Kallus,
Christina Lee Yu
Abstract:
We consider experimentation in the presence of non-stationarity, inter-unit (spatial) interference, and carry-over effects (temporal interference), where we wish to estimate the global average treatment effect (GATE), the difference between average outcomes having exposed all units at all times to treatment or to control. We suppose spatial interference is described by a graph, where a unit's outc…
▽ More
We consider experimentation in the presence of non-stationarity, inter-unit (spatial) interference, and carry-over effects (temporal interference), where we wish to estimate the global average treatment effect (GATE), the difference between average outcomes having exposed all units at all times to treatment or to control. We suppose spatial interference is described by a graph, where a unit's outcome depends on its neighborhood's treatments, and that temporal interference is described by an MDP, where the transition kernel under either treatment (action) satisfies a rapid mixing condition. We propose a clustered switchback design, where units are grouped into clusters and time steps are grouped into blocks, and each whole cluster-block combination is assigned a single random treatment. Under this design, we show that for graphs that admit good clustering, a truncated Horvitz-Thompson estimator achieves a $\tilde O(1/NT)$ mean squared error (MSE), matching the lower bound up to logarithmic terms for sparse graphs. Our results simultaneously generalize the results from \citet{hu2022switchback,ugander2013graph} and \citet{leung2022rate}. Simulation studies validate the favorable performance of our approach.
△ Less
Submitted 26 March, 2025; v1 submitted 24 December, 2023;
originally announced December 2023.
-
Smooth Non-Stationary Bandits
Authors:
Su Jia,
Qian Xie,
Nathan Kallus,
Peter I. Frazier
Abstract:
In many applications of online decision making, the environment is non-stationary and it is therefore crucial to use bandit algorithms that handle changes. Most existing approaches are designed to protect against non-smooth changes, constrained only by total variation or Lipschitzness over time. However, in practice, environments often change {\em smoothly}, so such algorithms may incur higher-tha…
▽ More
In many applications of online decision making, the environment is non-stationary and it is therefore crucial to use bandit algorithms that handle changes. Most existing approaches are designed to protect against non-smooth changes, constrained only by total variation or Lipschitzness over time. However, in practice, environments often change {\em smoothly}, so such algorithms may incur higher-than-necessary regret. We study a non-stationary bandits problem where each arm's mean reward sequence can be embedded into a $β$-Hölder function, i.e., a function that is $(β-1)$-times Lipschitz-continuously differentiable. The non-stationarity becomes more smooth as $β$ increases. When $β=1$, this corresponds to the non-smooth regime, where \cite{besbes2014stochastic} established a minimax regret of $\tilde Θ(T^{2/3})$. We show the first separation between the smooth (i.e., $β\ge 2$) and non-smooth (i.e., $β=1$) regimes by presenting a policy with $\tilde O(k^{4/5} T^{3/5})$ regret on any $k$-armed, $2$-Hölder instance. We complement this result by showing that the minimax regret on the $β$-Hölder family of instances is $Ω(T^{(β+1)/(2β+1)})$ for any integer $β\ge 1$. This matches our upper bound for $β=2$ up to logarithmic factors. Furthermore, we validated the effectiveness of our policy through a comprehensive numerical study using real-world click-through rate data.
△ Less
Submitted 17 November, 2024; v1 submitted 29 January, 2023;
originally announced January 2023.
-
LU decomposition and Toeplitz decomposition of a neural network
Authors:
Yucong Liu,
Simiao Jiao,
Lek-Heng Lim
Abstract:
It is well-known that any matrix $A$ has an LU decomposition. Less well-known is the fact that it has a 'Toeplitz decomposition' $A = T_1 T_2 \cdots T_r$ where $T_i$'s are Toeplitz matrices. We will prove that any continuous function $f : \mathbb{R}^n \to \mathbb{R}^m$ has an approximation to arbitrary accuracy by a neural network that takes the form…
▽ More
It is well-known that any matrix $A$ has an LU decomposition. Less well-known is the fact that it has a 'Toeplitz decomposition' $A = T_1 T_2 \cdots T_r$ where $T_i$'s are Toeplitz matrices. We will prove that any continuous function $f : \mathbb{R}^n \to \mathbb{R}^m$ has an approximation to arbitrary accuracy by a neural network that takes the form $L_1 σ_1 U_1 σ_2 L_2 σ_3 U_2 \cdots L_r σ_{2r-1} U_r$, i.e., where the weight matrices alternate between lower and upper triangular matrices, $σ_i(x) := σ(x - b_i)$ for some bias vector $b_i$, and the activation $σ$ may be chosen to be essentially any uniformly continuous nonpolynomial function. The same result also holds with Toeplitz matrices, i.e., $f \approx T_1 σ_1 T_2 σ_2 \cdots σ_{r-1} T_r$ to arbitrary accuracy, and likewise for Hankel matrices. A consequence of our Toeplitz result is a fixed-width universal approximation theorem for convolutional neural networks, which so far have only arbitrary width versions. Since our results apply in particular to the case when $f$ is a general neural network, we may regard them as LU and Toeplitz decompositions of a neural network. The practical implication of our results is that one may vastly reduce the number of weight parameters in a neural network without sacrificing its power of universal approximation. We will present several experiments on real data sets to show that imposing such structures on the weight matrices sharply reduces the number of training parameters with almost no noticeable effect on test accuracy.
△ Less
Submitted 25 November, 2022;
originally announced November 2022.
-
Continuous factorization of the identity matrix
Authors:
Yuying Dai,
Ankush Hore,
Siqi Jiao,
Tianxu Lan,
Pavlos Motakis
Abstract:
We investigate conditions under which the identity matrix $I_n$ can be continuously factorized through a continuous $N\times N$ matrix function $A$ with domain in $\mathbb{R}$. We study the relationship of the dimension $N$, the diagonal entries of $A$, and the norm of $A$ to the dimension $n$ and the norms of the matrices that witness the factorization of $I_n$ through $A$.
We investigate conditions under which the identity matrix $I_n$ can be continuously factorized through a continuous $N\times N$ matrix function $A$ with domain in $\mathbb{R}$. We study the relationship of the dimension $N$, the diagonal entries of $A$, and the norm of $A$ to the dimension $n$ and the norms of the matrices that witness the factorization of $I_n$ through $A$.
△ Less
Submitted 16 October, 2019; v1 submitted 2 September, 2019;
originally announced September 2019.
-
Finite Element Methods For Wave Propagation With Debye Polarization In Nonlinear Dielectric Materials
Authors:
Qiumei Huang,
Shanghui Jia,
Fei Xu,
Zhongwen Xu,
Changhui Yao
Abstract:
In this paper, we consider the wave propagation with
Debye polarization in nonlinear dielectric materials. For this model, the Rother's method is employed to derive the well-posedness of the electric fields and the existence of the polarized fields by monotonicity theorem as well as the boundedness of the two fields are established. Then, the time errors are derived for the semi-discrete solutio…
▽ More
In this paper, we consider the wave propagation with
Debye polarization in nonlinear dielectric materials. For this model, the Rother's method is employed to derive the well-posedness of the electric fields and the existence of the polarized fields by monotonicity theorem as well as the boundedness of the two fields are established. Then, the time errors are derived for the semi-discrete solutions by the order $O(Δt)$.
Subsequently, decoupled the full-discrete scheme of the Euler in time and Raviart-Thomas-N$\acute{e}$d$\acute{e}$lec element $k\geq 2$ in spatial is established. Based on the truncated error, we present the convergent analysis with the order $O(Δt+h^s) $ under the technique of a-prior $L^\infty$ assumption. For the $k=1$, we employ the superconvergence technique to ensure the a-prior $L^\infty$ assumption. In the end, we give some numerical examples to demonstrate our theories.
△ Less
Submitted 5 December, 2017;
originally announced December 2017.
-
A Full Multigrid Method for Nonlinear Eigenvalue Problems
Authors:
Shanghui Jia,
Hehu Xie,
Manting Xie,
Fei Xu
Abstract:
This paper is to introduce a type of full multigrid method for the nonlinear eigenvalue problem. The main idea is to transform the solution of nonlinear eigenvalue problem into a series of solutions of the corresponding linear boundary value problems on the sequence of finite element spaces and nonlinear eigenvalue problems on the coarsest finite element space. The linearized boundary value proble…
▽ More
This paper is to introduce a type of full multigrid method for the nonlinear eigenvalue problem. The main idea is to transform the solution of nonlinear eigenvalue problem into a series of solutions of the corresponding linear boundary value problems on the sequence of finite element spaces and nonlinear eigenvalue problems on the coarsest finite element space. The linearized boundary value problems are solved by some multigrid iterations. Besides the multigrid iteration, all other efficient iteration methods for solving boundary value problems can serve as the linear problem solver. We will prove that the computational work of this new scheme is truly optimal, the same as solving the linear corresponding boundary value problem. In this case, this type of iteration scheme certainly improves the overfull efficiency of solving nonlinear eigenvalue problems. Some numerical experiments are presented to validate the efficiency of the new method.
△ Less
Submitted 16 February, 2015;
originally announced February 2015.