Search | arXiv e-print repository

arXiv:2504.19153 [pdf, ps, other]

Strong Uniqueness by Kraichnan Transport Noise for the 2D Boussinesq Equations with Zero Viscosity

Abstract: We investigate the inviscid 2D Boussinesq equations driven by rough transport noise of Kraichnan type with regularity index $α\in (0,1/2)$. For all $1<p<\infty$, we establish the existence and uniqueness of probabilistic strong solutions for all $L^p$ initial vorticity and $L^2$ initial temperature, under the parameter constraint $0<α< 1-1/(p\wedge 2)$. The key ingredient is the anomalous regulari… ▽ More We investigate the inviscid 2D Boussinesq equations driven by rough transport noise of Kraichnan type with regularity index $α\in (0,1/2)$. For all $1<p<\infty$, we establish the existence and uniqueness of probabilistic strong solutions for all $L^p$ initial vorticity and $L^2$ initial temperature, under the parameter constraint $0<α< 1-1/(p\wedge 2)$. The key ingredient is the anomalous regularity due to the noise proven by Coghi and Maurelli \cite{CogMau} who dealt with stochastic 2D Euler equations. Combining techniques from analysis and probability, we demonstrate how the additional regularity from noise compensates the singularity due to the nonlinear parts and coupled terms. △ Less

Submitted 29 April, 2025; v1 submitted 27 April, 2025; originally announced April 2025.

Comments: 34 pages, we have corrected a typo in (1.3)

arXiv:2503.14619 [pdf, ps, other]

The broken sample problem revisited: Proof of a conjecture by Bai-Hsing and high-dimensional extensions

Authors: Simiao Jiao, Yihong Wu, Jiaming Xu

Abstract: We revisit the classical broken sample problem: Two samples of i.i.d. data points $\mathbf{X}=\{X_1,\cdots, X_n\}$ and $\mathbf{Y}=\{Y_1,\cdots,Y_m\}$ are observed without correspondence with $m\leq n$. Under the null hypothesis, $\mathbf{X}$ and $\mathbf{Y}$ are independent. Under the alternative hypothesis, $\mathbf{Y}$ is correlated with a random subsample of $\mathbf{X}$, in the sense that… ▽ More We revisit the classical broken sample problem: Two samples of i.i.d. data points $\mathbf{X}=\{X_1,\cdots, X_n\}$ and $\mathbf{Y}=\{Y_1,\cdots,Y_m\}$ are observed without correspondence with $m\leq n$. Under the null hypothesis, $\mathbf{X}$ and $\mathbf{Y}$ are independent. Under the alternative hypothesis, $\mathbf{Y}$ is correlated with a random subsample of $\mathbf{X}$, in the sense that $(X_{π(i)},Y_i)$'s are drawn independently from some bivariate distribution for some latent injection $π:[m] \to [n]$. Originally introduced by DeGroot, Feder, and Goel (1971) to model matching records in census data, this problem has recently gained renewed interest due to its applications in data de-anonymization, data integration, and target tracking. Despite extensive research over the past decades, determining the precise detection threshold has remained an open problem even for equal sample sizes ($m=n$). Assuming $m$ and $n$ grow proportionally, we show that the sharp threshold is given by a spectral and an $L_2$ condition of the likelihood ratio operator, resolving a conjecture of Bai and Hsing (2005) in the positive. These results are extended to high dimensions and settle the sharp detection thresholds for Gaussian and Bernoulli models. △ Less

Submitted 18 March, 2025; originally announced March 2025.

Comments: 35 pages, 3 figures

arXiv:2411.00370 [pdf, ps, other]

Sparse $H_\infty$ Controller for Networked Control Systems: Non-Structured and Optimal Structured Design

Authors: Zhaohua Yang, Pengyu Wang, Haishan Zhang, Shiyue Jia, Nachuan Yang, Yuxing Zhong, Ling Shi

Abstract: This paper provides a comprehensive analysis of the design of optimal structured and sparse $H_\infty$ controllers for continuous-time linear time-invariant (LTI) systems. Three problems are considered. First, designing the sparsest $H_\infty$ controller, which minimizes the sparsity of the controller while satisfying the given performance requirements. Second, designing a sparsity-promoting… ▽ More This paper provides a comprehensive analysis of the design of optimal structured and sparse $H_\infty$ controllers for continuous-time linear time-invariant (LTI) systems. Three problems are considered. First, designing the sparsest $H_\infty$ controller, which minimizes the sparsity of the controller while satisfying the given performance requirements. Second, designing a sparsity-promoting $H_\infty$ controller, which balances system performance and controller sparsity. Third, designing a $H_\infty$ controller subject to a structural constraint, which enhances system performance with a specified sparsity pattern. For each problem, we adopt a linearization technique that transforms the original nonconvex problem into a convex semidefinite programming (SDP) problem. Subsequently, we design an iterative linear matrix inequality (ILMI) algorithm for each problem, which ensures guaranteed convergence. We further characterize the first-order optimality using the Karush-Kuhn-Tucker (KKT) conditions and prove that any limit point of the solution sequence generated by the ILMI algorithm is a stationary point. For the first and second problems, we validate that our algorithms can reduce the number of non-zero elements and thus the communication burden through several numerical simulations. For the third problem, we refine the solutions obtained in existing literature, demonstrating that our approaches achieve significant improvements. △ Less

Submitted 6 May, 2025; v1 submitted 1 November, 2024; originally announced November 2024.

arXiv:2409.10091 [pdf, ps, other]

Multidimensional analogues of the refined versions of Bohr inequalities involving Schwarz mappings

Authors: Shanshan Jia, Ming-Sheng Liu, Saminathan Ponnusamy

Abstract: Our first aim of this article is to establish several new versions of refined Bohr inequalities for bounded analytic functions in the unit disk involving Schwarz functions. Secondly, %as applications of these results, we obtain several new multidimensional analogues of the refined Bohr inequalities for bounded holomorphic mappings on the unit ball in a complex Banach space involving higher dimensi… ▽ More Our first aim of this article is to establish several new versions of refined Bohr inequalities for bounded analytic functions in the unit disk involving Schwarz functions. Secondly, %as applications of these results, we obtain several new multidimensional analogues of the refined Bohr inequalities for bounded holomorphic mappings on the unit ball in a complex Banach space involving higher dimensional Schwarz mappings. All the results are proved to be sharp. △ Less

Submitted 16 September, 2024; originally announced September 2024.

Comments: 25 pages; It is with a journal

MSC Class: Primary: 30A10; 30C45; 30C62; Secondary: 30C75

arXiv:2406.07167 [pdf, ps, other]

On the pathwise uniqueness of stochastic 2D Euler equations with Kraichnan noise and $L^p$-data

Authors: Shuaijie Jiao, Dejun Luo

Abstract: In the recent work [arXiv:2308.03216], Coghi and Maurelli proved pathwise uniqueness of solutions to the vorticity form of stochastic 2D Euler equation, with Kraichnan transport noise and initial data in $L^1\cap L^p$ for $p>3/2$. The aim of this note is to remove the constraint on $p$, showing that pathwise uniqueness holds for all $L^1\cap L^p$ initial data with arbitrary $p>1$. In the recent work [arXiv:2308.03216], Coghi and Maurelli proved pathwise uniqueness of solutions to the vorticity form of stochastic 2D Euler equation, with Kraichnan transport noise and initial data in $L^1\cap L^p$ for $p>3/2$. The aim of this note is to remove the constraint on $p$, showing that pathwise uniqueness holds for all $L^1\cap L^p$ initial data with arbitrary $p>1$. △ Less

Submitted 11 June, 2024; originally announced June 2024.

Comments: 10 pages

arXiv:2405.01045 [pdf, ps, other]

Well-posedness of stochastic mSQG equations with Kraichnan noise and $L^p$ data

Authors: Shuaijie Jiao, Dejun Luo

Abstract: We consider stochastic mSQG (modified Surface Quasi-Geostrophic) equations with multiplicative transport noise of Kraichnan type, and $L^p$-initial conditions. Inspired by the recent work of Coghi and Maurelli [arXiv:2308.03216], we show weak existence and pathwise uniqueness of solutions to the equations for suitable choices of parameters in the nonlinearity, the noise and the integrability of in… ▽ More We consider stochastic mSQG (modified Surface Quasi-Geostrophic) equations with multiplicative transport noise of Kraichnan type, and $L^p$-initial conditions. Inspired by the recent work of Coghi and Maurelli [arXiv:2308.03216], we show weak existence and pathwise uniqueness of solutions to the equations for suitable choices of parameters in the nonlinearity, the noise and the integrability of initial data. △ Less

Submitted 30 May, 2024; v1 submitted 2 May, 2024; originally announced May 2024.

Comments: 33 pages. We have updated the relation of $β_N$ and $β_L$ in Lemma 2.2, following Proposition 2.7 in arXiv:2308.03216v2. Moreover, we have simplified the statements of Theorem 1.4, covering slightly wider range of parameters

arXiv:2312.15574 [pdf, other]

Clustered Switchback Designs for Experimentation Under Spatio-temporal Interference

Authors: Su Jia, Nathan Kallus, Christina Lee Yu

Abstract: We consider experimentation in the presence of non-stationarity, inter-unit (spatial) interference, and carry-over effects (temporal interference), where we wish to estimate the global average treatment effect (GATE), the difference between average outcomes having exposed all units at all times to treatment or to control. We suppose spatial interference is described by a graph, where a unit's outc… ▽ More We consider experimentation in the presence of non-stationarity, inter-unit (spatial) interference, and carry-over effects (temporal interference), where we wish to estimate the global average treatment effect (GATE), the difference between average outcomes having exposed all units at all times to treatment or to control. We suppose spatial interference is described by a graph, where a unit's outcome depends on its neighborhood's treatments, and that temporal interference is described by an MDP, where the transition kernel under either treatment (action) satisfies a rapid mixing condition. We propose a clustered switchback design, where units are grouped into clusters and time steps are grouped into blocks, and each whole cluster-block combination is assigned a single random treatment. Under this design, we show that for graphs that admit good clustering, a truncated Horvitz-Thompson estimator achieves a $\tilde O(1/NT)$ mean squared error (MSE), matching the lower bound up to logarithmic terms for sparse graphs. Our results simultaneously generalize the results from \citet{hu2022switchback,ugander2013graph} and \citet{leung2022rate}. Simulation studies validate the favorable performance of our approach. △ Less

Submitted 26 March, 2025; v1 submitted 24 December, 2023; originally announced December 2023.

arXiv:2301.12366 [pdf, other]

Smooth Non-Stationary Bandits

Authors: Su Jia, Qian Xie, Nathan Kallus, Peter I. Frazier

Abstract: In many applications of online decision making, the environment is non-stationary and it is therefore crucial to use bandit algorithms that handle changes. Most existing approaches are designed to protect against non-smooth changes, constrained only by total variation or Lipschitzness over time. However, in practice, environments often change {\em smoothly}, so such algorithms may incur higher-tha… ▽ More In many applications of online decision making, the environment is non-stationary and it is therefore crucial to use bandit algorithms that handle changes. Most existing approaches are designed to protect against non-smooth changes, constrained only by total variation or Lipschitzness over time. However, in practice, environments often change {\em smoothly}, so such algorithms may incur higher-than-necessary regret. We study a non-stationary bandits problem where each arm's mean reward sequence can be embedded into a $β$-Hölder function, i.e., a function that is $(β-1)$-times Lipschitz-continuously differentiable. The non-stationarity becomes more smooth as $β$ increases. When $β=1$, this corresponds to the non-smooth regime, where \cite{besbes2014stochastic} established a minimax regret of $\tilde Θ(T^{2/3})$. We show the first separation between the smooth (i.e., $β\ge 2$) and non-smooth (i.e., $β=1$) regimes by presenting a policy with $\tilde O(k^{4/5} T^{3/5})$ regret on any $k$-armed, $2$-Hölder instance. We complement this result by showing that the minimax regret on the $β$-Hölder family of instances is $Ω(T^{(β+1)/(2β+1)})$ for any integer $β\ge 1$. This matches our upper bound for $β=2$ up to logarithmic factors. Furthermore, we validated the effectiveness of our policy through a comprehensive numerical study using real-world click-through rate data. △ Less

Submitted 17 November, 2024; v1 submitted 29 January, 2023; originally announced January 2023.

Comments: Accepted by ICML 2023

arXiv:2211.13935 [pdf, other]

LU decomposition and Toeplitz decomposition of a neural network

Authors: Yucong Liu, Simiao Jiao, Lek-Heng Lim

Abstract: It is well-known that any matrix $A$ has an LU decomposition. Less well-known is the fact that it has a 'Toeplitz decomposition' $A = T_1 T_2 \cdots T_r$ where $T_i$'s are Toeplitz matrices. We will prove that any continuous function $f : \mathbb{R}^n \to \mathbb{R}^m$ has an approximation to arbitrary accuracy by a neural network that takes the form… ▽ More It is well-known that any matrix $A$ has an LU decomposition. Less well-known is the fact that it has a 'Toeplitz decomposition' $A = T_1 T_2 \cdots T_r$ where $T_i$'s are Toeplitz matrices. We will prove that any continuous function $f : \mathbb{R}^n \to \mathbb{R}^m$ has an approximation to arbitrary accuracy by a neural network that takes the form $L_1 σ_1 U_1 σ_2 L_2 σ_3 U_2 \cdots L_r σ_{2r-1} U_r$, i.e., where the weight matrices alternate between lower and upper triangular matrices, $σ_i(x) := σ(x - b_i)$ for some bias vector $b_i$, and the activation $σ$ may be chosen to be essentially any uniformly continuous nonpolynomial function. The same result also holds with Toeplitz matrices, i.e., $f \approx T_1 σ_1 T_2 σ_2 \cdots σ_{r-1} T_r$ to arbitrary accuracy, and likewise for Hankel matrices. A consequence of our Toeplitz result is a fixed-width universal approximation theorem for convolutional neural networks, which so far have only arbitrary width versions. Since our results apply in particular to the case when $f$ is a general neural network, we may regard them as LU and Toeplitz decompositions of a neural network. The practical implication of our results is that one may vastly reduce the number of weight parameters in a neural network without sacrificing its power of universal approximation. We will present several experiments on real data sets to show that imposing such structures on the weight matrices sharply reduces the number of training parameters with almost no noticeable effect on test accuracy. △ Less

Submitted 25 November, 2022; originally announced November 2022.

Comments: 14 pages, 3 figures

MSC Class: 68T07; 41A30; 41A46; 15B05

arXiv:1909.00807 [pdf, ps, other]

doi 10.2140/involve.2020.13.149

Continuous factorization of the identity matrix

Authors: Yuying Dai, Ankush Hore, Siqi Jiao, Tianxu Lan, Pavlos Motakis

Abstract: We investigate conditions under which the identity matrix $I_n$ can be continuously factorized through a continuous $N\times N$ matrix function $A$ with domain in $\mathbb{R}$. We study the relationship of the dimension $N$, the diagonal entries of $A$, and the norm of $A$ to the dimension $n$ and the norms of the matrices that witness the factorization of $I_n$ through $A$. We investigate conditions under which the identity matrix $I_n$ can be continuously factorized through a continuous $N\times N$ matrix function $A$ with domain in $\mathbb{R}$. We study the relationship of the dimension $N$, the diagonal entries of $A$, and the norm of $A$ to the dimension $n$ and the norms of the matrices that witness the factorization of $I_n$ through $A$. △ Less

Submitted 16 October, 2019; v1 submitted 2 September, 2019; originally announced September 2019.

Comments: 14 pages

MSC Class: 15A23; 46B07

Journal ref: Involve 13 (2020) 149-164

arXiv:1712.01987 [pdf, ps, other]

Finite Element Methods For Wave Propagation With Debye Polarization In Nonlinear Dielectric Materials

Authors: Qiumei Huang, Shanghui Jia, Fei Xu, Zhongwen Xu, Changhui Yao

Abstract: In this paper, we consider the wave propagation with Debye polarization in nonlinear dielectric materials. For this model, the Rother's method is employed to derive the well-posedness of the electric fields and the existence of the polarized fields by monotonicity theorem as well as the boundedness of the two fields are established. Then, the time errors are derived for the semi-discrete solutio… ▽ More In this paper, we consider the wave propagation with Debye polarization in nonlinear dielectric materials. For this model, the Rother's method is employed to derive the well-posedness of the electric fields and the existence of the polarized fields by monotonicity theorem as well as the boundedness of the two fields are established. Then, the time errors are derived for the semi-discrete solutions by the order $O(Δt)$. Subsequently, decoupled the full-discrete scheme of the Euler in time and Raviart-Thomas-N$\acute{e}$d$\acute{e}$lec element $k\geq 2$ in spatial is established. Based on the truncated error, we present the convergent analysis with the order $O(Δt+h^s) $ under the technique of a-prior $L^\infty$ assumption. For the $k=1$, we employ the superconvergence technique to ensure the a-prior $L^\infty$ assumption. In the end, we give some numerical examples to demonstrate our theories. △ Less

Submitted 5 December, 2017; originally announced December 2017.

Comments: we consider the numerical analysisof wave propagation with Debye polarization in nonlinear dielectric materials. This will be submitted to Journal of Scentific computing

MSC Class: 65N30; 65N15; 35J25

arXiv:1502.04657 [pdf, ps, other]

doi 10.1007/s11425-015-0234-x

A Full Multigrid Method for Nonlinear Eigenvalue Problems

Authors: Shanghui Jia, Hehu Xie, Manting Xie, Fei Xu

Abstract: This paper is to introduce a type of full multigrid method for the nonlinear eigenvalue problem. The main idea is to transform the solution of nonlinear eigenvalue problem into a series of solutions of the corresponding linear boundary value problems on the sequence of finite element spaces and nonlinear eigenvalue problems on the coarsest finite element space. The linearized boundary value proble… ▽ More This paper is to introduce a type of full multigrid method for the nonlinear eigenvalue problem. The main idea is to transform the solution of nonlinear eigenvalue problem into a series of solutions of the corresponding linear boundary value problems on the sequence of finite element spaces and nonlinear eigenvalue problems on the coarsest finite element space. The linearized boundary value problems are solved by some multigrid iterations. Besides the multigrid iteration, all other efficient iteration methods for solving boundary value problems can serve as the linear problem solver. We will prove that the computational work of this new scheme is truly optimal, the same as solving the linear corresponding boundary value problem. In this case, this type of iteration scheme certainly improves the overfull efficiency of solving nonlinear eigenvalue problems. Some numerical experiments are presented to validate the efficiency of the new method. △ Less

Submitted 16 February, 2015; originally announced February 2015.

Comments: 15 Pages, 4 Figures. arXiv admin note: substantial text overlap with arXiv:1409.7944

MSC Class: 65N30; 65N25; 65L15; 65B99

Showing 1–12 of 12 results for author: Jia, S