Showing 1–50 of 251 results for author: Wu, L

Searching in archive math.
  1. arXiv:2505.10366  [pdf, ps, other]

    math.OC

    Arbitrarily Small Execution-Time Certificate: What was Missed in Analog Optimization

    Authors: Liang Wu, Ambrose Adegbege, Yongduan Song, Richard D. Braatz

    Abstract: Numerical optimization (solving optimization problems using digital computers) currently dominates, but has three major drawbacks: high energy consumption, poor scalability, and lack of an execution time certificate. To address these challenges, this article explores the recent resurgence of analog computers, proposing a novel paradigm of arbitrarily small execution-time-certified analog optimizat…

    Submitted 15 May, 2025; originally announced May 2025.

    Comments: 16 pages

  2. arXiv:2505.01645  [pdf, ps, other]

    math.NT

    Note on a sum involving the divisor function

    Authors: Liuying Wu

    Abstract: Let $d(n)$ be the divisor function and denote by $[t]$ the integral part of the real number $t$. In this paper, we prove that $$\sum_{n\leq x^{1/c}}d\left(\left[\frac{x}{n^c}\right]\right)=d_cx^{1/c}+\mathcal{O}_{\varepsilon,c} \left(x^{\max\{(2c+2)/(2c^2+5c+2),5/(5c+6)\}+\varepsilon}\right),$$ where $d_c=\sum_{k\geq1}d(k)\left(\frac{1}{k^{1/c}}-\frac{1}{(k+1)^{1/c}}\right)$ is a constant. This re…

    Submitted 2 May, 2025; originally announced May 2025.

  3. arXiv:2504.16827  [pdf, ps, other]

    math.CA

    Endpoint boundedness of singular integrals: CMO space associated to Schrödinger operators

    Authors: Xueting Han, Ji Li, Liangchuan Wu

    Abstract: Let $ \mathcal{L} = -Δ + V $ be a Schrödinger operator acting on $ L^2(\mathbb{R}^n) $, where the nonnegative potential $ V $ belongs to the reverse Hölder class $ RH_q $ for some $ q \geq n/2 $. This article is primarily concerned with the study of endpoint boundedness for classical singular integral operators in the context of the space $ \mathrm{CMO}_{\mathcal{L}}(\mathbb{R}^n) $, consisting of…

    Submitted 23 April, 2025; originally announced April 2025.

    MSC Class: 42B20; 42B25; 42B35

  4. arXiv:2504.08467  [pdf, ps, other]

    math.PR math-ph

    Quasi-stationarity of the Dyson Brownian Motion With Collisions

    Authors: Arnaud Guillin, Boris Nectoux, Liming Wu

    Abstract: In this work, we investigate the ergodic behavior of a system of particles, subject to collisions, before it exits a fixed subdomain of its state space. This system is composed of several one-dimensional ordered Brownian particles interacting through electrostatic repulsion, which is usually referred to as the (generalized) Dyson Brownian motion. The starting points of our analysis are the work [E…

    Submitted 11 April, 2025; originally announced April 2025.

  5. arXiv:2503.19845  [pdf, ps, other]

    math.DS math-ph math.SP

    The fibered rotation number for ergodic symplectic cocycles and its applications: I. Gap Labelling Theorem

    Authors: Xianzhe Li, Li Wu

    Abstract: Let $ (Θ,T,μ) $ be an ergodic topological dynamical system. The fibered rotation number for cocycles in $ Θ\times \mathrm{SL}(2,\mathbb{R}) $, acting on $ Θ\times \mathbb{R}\mathbb{P}^1 $, is well-defined and has wide applications in the study of the spectral theory of Schrödinger operators. In this paper, we will provide its natural generalization for higher-dimensional cocycles in…

    Submitted 25 March, 2025; originally announced March 2025.

    Comments: 24 pages, comments are welcome

  6. arXiv:2502.19002  [pdf, other]

    cs.LG cs.AI math.OC stat.ML

    The Sharpness Disparity Principle in Transformers for Accelerating Language Model Pre-Training

    Authors: Jinbo Wang, Mingze Wang, Zhanpeng Zhou, Junchi Yan, Weinan E, Lei Wu

    Abstract: Transformers consist of diverse building blocks, such as embedding layers, normalization layers, self-attention mechanisms, and point-wise feedforward networks. Thus, understanding the differences and interactions among these blocks is important. In this paper, we uncover a clear Sharpness Disparity across these blocks, which emerges early in training and intriguingly persists throughout the train…

    Submitted 26 February, 2025; originally announced February 2025.

    Comments: 23 pages

  7. arXiv:2502.17884  [pdf, ps, other]

    math.CA

    The fractional Riesz transform and their commutator in Dunkl setting

    Authors: Yanping Chen, Xueting Han, Liangchuan Wu

    Abstract: In this paper, we study the boundedness of the fractional Riesz transforms in the Dunkl setting. Moreover, we establish necessary and sufficient conditions for the boundedness of their commutator with respect to the central BMO space associated with the Euclidean metric and the BMO space associated with the Dunkl metric, respectively. Based on this, we further characterize the compactness of the commu…

    Submitted 25 February, 2025; originally announced February 2025.

    MSC Class: 42B35 (Primary) 43A85; 42B20 (Secondary)

  8. arXiv:2502.07738  [pdf, other]

    eess.SY math.OC

    EIQP: Execution-time-certified and Infeasibility-detecting QP Solver

    Authors: Liang Wu, Wei Xiao, Richard D. Braatz

    Abstract: Solving real-time quadratic programming (QP) is a ubiquitous task in control engineering, such as in model predictive control and control barrier function-based QP. In such real-time scenarios, certifying that the employed QP algorithm can either return a solution within a predefined level of optimality or detect QP infeasibility before the predefined sampling time is a pressing requirement. This…

    Submitted 14 February, 2025; v1 submitted 11 February, 2025; originally announced February 2025.

    Comments: 14 pages, 3 figures

  9. arXiv:2502.06322  [pdf, ps, other]

    math.CA

    The uniform quantitive weighted boundedness of fractional Marcinkiewicz integral and its commutator

    Authors: Huoxiong Wu, Lin Wu

    Abstract: Suppose that $Ω\in L^{\infty}(\mathbb{S} ^{n-1})$ is homogeneous of degree zero with mean value zero. Then we consider a fractional type Marcinkiewicz integral operator $$μ_{Ω,β}f(x) = \left ( \int_{0}^{\infty } \left | \int_{\left | x-y \right |\le t }^{} \frac{Ω(x-y)}{\left | x-y \right |^{n-1-β} } f(y)dy \right | ^{2}\frac{dt}{t^3} \right )^{\frac{1}{2} },\quad 0<β<n.$$ Our main contribution is…

    Submitted 10 February, 2025; originally announced February 2025.

    MSC Class: 42B20; 42B25

  10. arXiv:2502.04676  [pdf, ps, other]

    math.AP

    Refined regularity for nonlocal elliptic equations and applications

    Authors: Wenxiong Chen, Congming Li, Leyun Wu, Zhouping Xin

    Abstract: In this paper, we establish refined regularity estimates for nonnegative solutions to the fractional Poisson equation $$ (-Δ)^s u(x) =f(x),\,\, x\in B_1(0). $$ Specifically, we derive Hölder, Schauder, and Ln-Lipschitz regularity estimates for any nonnegative solution $u$, provided that only the local $L^\infty$ norm of $u$ is bounded. These estimates stand in sharp contrast to the existing…

    Submitted 7 February, 2025; originally announced February 2025.

  11. arXiv:2501.14475  [pdf, other]

    math.NA

    Point Cloud Neural Operator for Parametric PDEs on Complex and Variable Geometries

    Authors: Chenyu Zeng, Yanshu Zhang, Jiayi Zhou, Yuhan Wang, Zilin Wang, Yuhao Liu, Lei Wu, Daniel Zhengyu Huang

    Abstract: Surrogate models are critical for accelerating computationally expensive simulations in science and engineering, particularly for solving parametric partial differential equations (PDEs). Developing practical surrogate models poses significant challenges, particularly in handling geometrically complex and variable domains, which are often discretized as point clouds. In this work, we systematicall…

    Submitted 15 April, 2025; v1 submitted 24 January, 2025; originally announced January 2025.

    Comments: 45 pages, 19 figures

  12. A real-time battle situation intelligent awareness system based on Meta-learning & RNN

    Authors: Yuchun Li, Zihan Lin, Xize Wang, Chunyang Liu, Liaoyuan Wu, Fang Zhang

    Abstract: In modern warfare, real-time and accurate battle situation analysis is crucial for making strategic and tactical decisions. The proposed real-time battle situation intelligent awareness system (BSIAS) aims at meta-learning analysis and stepwise RNN (recurrent neural network) modeling, where the former carries out the basic processing and analysis of battlefield data, which includes multi-steps suc…

    Submitted 23 January, 2025; originally announced January 2025.

  13. arXiv:2501.11032  [pdf, ps, other]

    math.FA math.DS

    A formula of local Maslov index and applications

    Authors: Li Wu, Chaofeng Zhu

    Abstract: In this paper, we explicitly express the local Maslov index by a Maslov index in the finite-dimensional case without symplectic reduction. Then we calculate the Maslov index for the path of pairs of Lagrangian subspaces in triangular form. In particular, we get the Maslov-type index of a given symplectic path in triangular form. As applications, we calculate the splitting numbers of the symplectic matri…

    Submitted 26 January, 2025; v1 submitted 19 January, 2025; originally announced January 2025.

    Comments: 62 pages

    MSC Class: Primary 53D12; Secondary 58J30

  14. arXiv:2501.05622  [pdf, ps, other]

    math.AG

    Poincaré polynomials of moduli spaces of one-dimensional sheaves on the projective plane

    Authors: Shuai Guo, Longting Wu, with an appendix by Miguel Moreira

    Abstract: Let $M_β$ denote the moduli space of stable one-dimensional sheaves on a del Pezzo surface $S$, supported on curves of class $β$ with Euler characteristic one. We show that the divisibility property of the Poincaré polynomial of $M_β$, proposed by Choi-van Garrel-Katz-Takahashi, follows from Bousseau's conjectural refined sheaves/Gromov-Witten correspondence. Since this correspondence is known for…

    Submitted 9 March, 2025; v1 submitted 9 January, 2025; originally announced January 2025.

    Comments: We add an appendix by M. Moreira where a more general conjecture concerning the higher range Betti numbers of the moduli of one-dimensional sheaves on $\mathbb{P}^2$ is presented, along with another conjecture that involves refinements from the perverse/Chern filtration. Conjecture 1.9 has been made more precise. 37 pages. Comments are welcome!

  15. arXiv:2501.05200  [pdf, other]

    math.OC

    On Coordinated Drone-Courier Logistics for Intra-city Express Services

    Authors: Shuiwang Chen, Kai Wang, Lingxiao Wu, Wei Qi

    Abstract: Problem definition: Drones, despite being acknowledged as a transformative force in the city logistics sector, are unable to execute the \textit{last-meter delivery} (unloading goods directly to customers' doorsteps) due to airspace restrictions and safety concerns. To leverage advancements and overcome the limitations of drones in providing intra-city express services, we introduce a coordinated…

    Submitted 9 January, 2025; originally announced January 2025.

  16. arXiv:2501.04951  [pdf, ps, other]

    math.OA math.CA

    Weighted norm estimates of noncommutative Calderón-Zygmund operators

    Authors: Wenfei Fan, Yong Jiao, Lian Wu, Dejian Zhou

    Abstract: This paper is devoted to studying weighted endpoint estimates of operator-valued singular integrals. Our main results include a weighted weak-type $(1,1)$ estimate of noncommutative maximal Calderón-Zygmund operators, the corresponding version for square functions, and a weighted $H_1- L_1$ type inequality. All these results are obtained under the condition that the weight belongs to the Muckenhoupt…

    Submitted 8 January, 2025; originally announced January 2025.

    Comments: 35 pages

    MSC Class: 46L52 (Primary) 42B20 (Secondary)

  17. arXiv:2412.07561  [pdf, ps, other]

    math.AP

    The $L_q$ Minkowski problem for $\mathbf{p}$-harmonic measure

    Authors: Hai Li, Longyu Wu, Baocheng Zhu

    Abstract: In this paper, we consider an extremal problem associated with the solution to a boundary value problem. Our main focus is on establishing a variational formula for a functional related to the $\mathbf{p}$-harmonic measure, from which a new measure is derived. This further motivates us to study the Minkowski problem for this new measure. As a main result, we prove the existence of solutions to the…

    Submitted 10 December, 2024; originally announced December 2024.

    Comments: 28

  18. arXiv:2411.17216  [pdf, ps, other]

    math.PR

    Large deviations of the empirical measures of a strong-Feller Markov process inside a subset and quasi-ergodic distribution

    Authors: Arnaud Guillin, Boris Nectoux, Liming Wu

    Abstract: In this work, we establish, for a strong Feller process, the large deviation principle for the occupation measure conditioned not to exit a given subregion. The rate function vanishes only at a unique measure, which is the so-called quasi-ergodic distribution of the process in this subregion. In addition, we show that the rate function is the Dirichlet form in the particular case when the process…

    Submitted 26 November, 2024; originally announced November 2024.

  19. arXiv:2411.16774  [pdf, ps, other]

    math.GM

    A Note on a Recent Attempt to Prove the Irrationality of $ζ(5)$

    Authors: Keyu Chen, Wei He, Yixin He, Yuxiang Huang, Yanyang Li, Quanyu Tang, Lei Wu, Shenhao Xu, Shuo Yang, Zijun Yu

    Abstract: Recently Shekhar Suman [arXiv: 2407.07121v6 [math.GM] 3 Aug 2024] made an attempt to prove the irrationality of $ζ(5)$. But unfortunately the proof is not correct. In this note, we discuss the fallacy in the proof.

    Submitted 9 January, 2025; v1 submitted 25 November, 2024; originally announced November 2024.

    Comments: 5 pages, just a note

    MSC Class: Primary 11J72; Secondary 11M06

  20. arXiv:2411.13099  [pdf, ps, other]

    math.PR

    Long time behavior of killed Feynman-Kac semigroups with singular Schrödinger potentials

    Authors: Arnaud Guillin, D I Lu, Boris Nectoux, Liming Wu

    Abstract: In this work, we investigate the compactness and the long time behavior of killed Feynman-Kac semigroups of various processes arising from statistical physics with very general singular Schrödinger potentials. The processes we consider cover a large class of processes used in statistical physics, with strong links with quantum mechanics and (local or not) Schrödinger operators (including e.g.…

    Submitted 20 November, 2024; originally announced November 2024.

  21. arXiv:2410.11474  [pdf, other]

    cs.LG math.OC stat.ML

    How Transformers Get Rich: Approximation and Dynamics Analysis

    Authors: Mingze Wang, Ruoxi Yu, Weinan E, Lei Wu

    Abstract: Transformers have demonstrated exceptional in-context learning capabilities, yet the theoretical understanding of the underlying mechanisms remains limited. A recent work (Elhage et al., 2021) identified a ``rich'' in-context mechanism known as induction head, contrasting with ``lazy'' $n$-gram models that overlook long-range dependencies. In this work, we provide both approximation and dynamics a…

    Submitted 29 January, 2025; v1 submitted 15 October, 2024; originally announced October 2024.

    Comments: 47 pages

  22. arXiv:2410.04743  [pdf, other]

    eess.SY cs.LG math.OC

    Smart energy management: process structure-based hybrid neural networks for optimal scheduling and economic predictive control in integrated systems

    Authors: Long Wu, Xunyuan Yin, Lei Pan, Jinfeng Liu

    Abstract: Integrated energy systems (IESs) are complex systems consisting of diverse operating units spanning multiple domains. To address their operational challenges, we propose a physics-informed hybrid time-series neural network (NN) surrogate to predict the dynamic performance of IESs across multiple time scales. This neural network-based modeling approach develops time-series multi-layer perceptrons (ML…

    Submitted 7 October, 2024; originally announced October 2024.

  23. arXiv:2410.03418  [pdf, ps, other]

    math.AP math.CA

    Hölder regularity and Liouville Theorem for the Schrödinger equation with certain critical potentials, and applications to Dirichlet problems

    Authors: Bo Li, Ji Li, Liangchuan Wu

    Abstract: Let $(X,d,μ)$ be a metric measure space satisfying a doubling property with the upper/lower dimension $Q\ge n>1$, and admitting an $L^2$-Poincaré inequality. In this article, we establish the Hölder continuity and a Liouville-type theorem for the (elliptic-type) Schrödinger equation $$\mathbb L u(x,t)=-\partial^2_{t}u(x,t)+\mathcal L u(x,t)+V(x)u(x,t)=0,\quad x\in X,\, t\in\mathbb R, $$ where…

    Submitted 4 October, 2024; originally announced October 2024.

    Comments: 39 pages, no figures

    MSC Class: 35J10; 42B35; 43A85

  24. arXiv:2410.01164  [pdf, ps, other]

    math.CA

    On maximal functions generated by Hörmander-type spectral multipliers

    Authors: Peng Chen, Xixi Lin, Liangchuan Wu, Lixin Yan

    Abstract: Let $(X,d,μ)$ be a metric space with doubling measure and $L$ be a nonnegative self-adjoint operator on $L^2(X)$ whose heat kernel satisfies the Gaussian upper bound. We assume that there exists an $L$-harmonic function $h$ such that the semigroup $\exp(-tL)$, after applying the Doob transform related to $h$, satisfies the upper and lower Gaussian estimates. In this paper we apply the Doob transfo…

    Submitted 1 October, 2024; originally announced October 2024.

    Comments: 37 pages

    MSC Class: 42B15; 42B25; 47F10

  25. arXiv:2409.08506  [pdf, other]

    math.FA

    Dynamical Sampling in Shift-Invariant Spaces Associated with multi-dimensional Special Affine Fourier Transform

    Authors: Meng Ning, Li-Ping Wu, Qing-yue Zhang, Bei Liu

    Abstract: The Special Affine Fourier Transformation (SAFT), which generalizes several well-known unitary transformations, has been demonstrated to be a valuable tool in signal processing and optics. In this paper, we explore the multivariate dynamical sampling problem in shift-invariant spaces associated with the multi-dimensional SAFT. Specifically, we derive a sufficient and necessary condition under which a…

    Submitted 12 September, 2024; originally announced September 2024.

    Comments: 22 pages, 11 figures

    MSC Class: 94A20; 94A12; 42C15

  26. arXiv:2408.13560  [pdf, ps, other]

    math.AG

    Bernstein-Sato ideals

    Authors: Nero Budur, Robin van der Veer, Lei Wu, Peng Zhou

    Abstract: In this paper, we review several results on the zero loci of Bernstein-Sato ideals related to singularities of hypersurfaces. This is an exposition for the Frontiers of Science Awards in Mathematics presenting results from one of our articles, with history, motivation, and further developments.

    Submitted 24 August, 2024; originally announced August 2024.

  27. arXiv:2408.09505  [pdf, other]

    q-fin.MF econ.TH math.DS

    Periodic Trading Activities in Financial Markets: Mean-field Liquidation Game with Major-Minor Players

    Authors: Yufan Chen, Lan Wu, Renyuan Xu, Ruixun Zhang

    Abstract: Motivated by recent empirical findings on the periodic phenomenon of aggregated market volumes in equity markets, we aim to understand the causes and consequences of periodic trading activities through a game-theoretic perspective, examining market interactions among different types of participants. Specifically, we introduce a new mean-field liquidation game involving major and minor traders, whe…

    Submitted 18 August, 2024; originally announced August 2024.

    Comments: 62 pages; 9 figures

    MSC Class: 91-XX; 91Gxx; 65Cxx; 39A50; 49N80; 91A16

  28. arXiv:2408.00036  [pdf, ps, other]

    math.AP

    A complementary result on a singular mean field equation with a sign-changing potential function

    Authors: Lina Wu

    Abstract: In this note, we study the singular mean field equation defined on a Riemann surface with a sign-changing potential function. We prove that if some singular sources happen to be placed on the zero-level curve of the potential function, an a priori estimate can still be obtained. As a consequence of this estimate, existence and multiplicity results can still be obtained based on the topology of the manifol…

    Submitted 31 July, 2024; originally announced August 2024.

  29. arXiv:2407.17895  [pdf, ps, other]

    math.AP

    Global Well-Posedness of Contact Lines: 2D Navier-Stokes Flow

    Authors: Yan Guo, Ian Tice, Lei Wu, Yunrui Zheng

    Abstract: Based on the global a priori estimates in [Guo-Tice, J. Eur. Math. Soc. (2024)], we establish the well-posedness of a viscous fluid model satisfying the dynamic law for the contact line \begin{equation*} \mathscr{W}(\p_tζ(\pm\ell,t))=[\![γ]\!]\mpσ\frac{\p_1ζ}{(1+|\p_1ζ|^2)^{1/2}}(\pm\ell,t) \end{equation*} in a 2D domain, where $ζ(x_1,t)$ is a free surface with two contact points $ζ(\pm\ell,t)$,…

    Submitted 30 July, 2024; v1 submitted 25 July, 2024; originally announced July 2024.

  30. arXiv:2407.04498  [pdf, ps, other]

    math.AP

    Global dynamics for the generalized chemotaxis-Navier-Stokes system in $\mathbb{R}^3$

    Authors: Qingyou He, Ling-Yun Shou, Leyun Wu

    Abstract: We consider the chemotaxis-Navier-Stokes system with generalized fluid dissipation in $\mathbb{R}^3$: \begin{eqnarray*} \begin{cases} \partial_t n+u\cdot \nabla n=Δn- \nabla \cdot (χ(c)n \nabla c),\\ \partial_t c+u \cdot \nabla c=Δc-nf(c),\\ \partial_t u +u \cdot \nabla u+\nabla P=-(-Δ)^αu-n\nabla φ,\\ \nabla \cdot u=0, \end{cases} \end{eqnarray*} which describes the motion of swimming bacte…

    Submitted 7 August, 2024; v1 submitted 5 July, 2024; originally announced July 2024.

    Comments: 39 pages

  31. arXiv:2405.20763  [pdf, other]

    cs.LG math.OC stat.ML

    Improving Generalization and Convergence by Enhancing Implicit Regularization

    Authors: Mingze Wang, Jinbo Wang, Haotian He, Zilin Wang, Guanhua Huang, Feiyu Xiong, Zhiyu Li, Weinan E, Lei Wu

    Abstract: In this work, we propose an Implicit Regularization Enhancement (IRE) framework to accelerate the discovery of flat solutions in deep learning, thereby improving generalization and convergence. Specifically, IRE decouples the dynamics of flat and sharp directions, which boosts the sharpness reduction along flat directions while maintaining the training stability in sharp directions. We show that I…

    Submitted 31 October, 2024; v1 submitted 31 May, 2024; originally announced May 2024.

    Comments: 44 pages, accepted by NeurIPS 2024

  32. arXiv:2405.01791  [pdf, ps, other]

    math.AP

    Global-in-time maximal regularity for the Cauchy problem of the heat equation in BMO and applications

    Authors: Xuan Thinh Duong, Ji Li, Liangchuan Wu, Lixin Yan

    Abstract: In this article, we establish global-in-time maximal regularity for the Cauchy problem of the classical heat equation $\partial_t u(x,t)-Δu(x,t)=f(x,t)$ with $u(x,0)=0$ in a certain $\rm BMO$ setting, which improves the local-in-time result initially proposed by Ogawa and Shimizu in \cite{OS, OS2}. In further developing our method originally formulated for the heat equation, we obtain analogous gl…

    Submitted 2 May, 2024; originally announced May 2024.

    Comments: 32 pages

    MSC Class: 42B35; 35K15; 42B37

  33. arXiv:2404.02366  [pdf, ps, other]

    math.AP

    The modified Korteweg--de Vries limit of the Ablowitz--Ladik system

    Authors: Rowan Killip, Zhimeng Ouyang, Monica Visan, Lei Wu

    Abstract: For slowly-varying initial data, solutions to the Ablowitz-Ladik system have been proven to converge to solutions of the cubic Schrödinger equation. In this paper we show that in the continuum limit, solutions to the Ablowitz-Ladik system with $H^1$ initial data may also converge to solutions of the modified Korteweg--de Vries equation. To exhibit this new limiting behavior, it suffices that the i…

    Submitted 2 April, 2024; originally announced April 2024.

    Comments: 27 pages

  34. arXiv:2404.00438  [pdf, other]

    cs.DC cs.AI cs.LG math.OC stat.ML

    Communication Efficient Distributed Training with Distributed Lion

    Authors: Bo Liu, Lemeng Wu, Lizhang Chen, Kaizhao Liang, Jiaxu Zhu, Chen Liang, Raghuraman Krishnamoorthi, Qiang Liu

    Abstract: The Lion optimizer has been a promising competitor to AdamW for training large AI models, with advantages in memory, computation, and sample efficiency. In this paper, we introduce Distributed Lion, an innovative adaptation of Lion for distributed training environments. Leveraging the sign operator in Lion, our Distributed Lion only requires communicating binary or lower-precision vectors be…

    Submitted 30 March, 2024; originally announced April 2024.

    Comments: 22 pages

  35. arXiv:2404.00329  [pdf, ps, other]

    math.CA

    Schatten classes and commutators in the two weight setting, II. Riesz transforms

    Authors: Michael Lacey, Ji Li, Brett D. Wick, Liangchuan Wu

    Abstract: We characterize the Schatten class $S^p$ of the commutator of Riesz transforms $[b,R_j]$ in $\mathbb R^n$ ($j=1,\ldots, n$) in the two weight setting for $n< p<\infty$, by introducing the condition that the symbol $b$ is in Besov spaces associated with the given two weights. At the critical index $p=n$, the commutator $[b,R_j]$ belongs to Schatten class $S^{n}$ if and only if $b$ is a constant,…

    Submitted 13 December, 2024; v1 submitted 30 March, 2024; originally announced April 2024.

    Comments: This is an update of V2: typos fixed, more explanations added

  36. arXiv:2403.18235  [pdf, other]

    eess.SY math.OC

    A Parallel Vector-form $LDL^\top$ Decomposition for Accelerating Execution-time-certified $\ell_1$-penalty Soft-constrained MPC

    Authors: Liang Wu, Liwei Zhou, Richard D. Braatz

    Abstract: Handling possible infeasibility and providing an execution time certificate are two pressing requirements of real-time Model Predictive Control (MPC). To meet these two requirements simultaneously, this paper proposes an $\ell_1$-penalty soft-constrained MPC formulation that is globally feasible and solvable with an execution time certificate using our proposed algorithm. This paper proves for the…

    Submitted 8 August, 2024; v1 submitted 26 March, 2024; originally announced March 2024.

    Comments: 11 pages

  37. arXiv:2403.17471  [pdf, ps, other]

    math.PR

    Generalized Langevin and Nosé-Hoover processes absorbed at the boundary of a metastable domain

    Authors: Arnaud Guillin, D I Lu, Boris Nectoux, Liming Wu

    Abstract: In this paper, we prove in a very weak regularity setting existence and uniqueness of quasi-stationary distributions as well as exponential convergence towards the quasi-stationary distribution for the generalized Langevin and the Nosé-Hoover processes, two processes which are widely used in molecular dynamics. The case of singular potentials is considered. With the techniques used in this work,…

    Submitted 26 March, 2024; originally announced March 2024.

  38. arXiv:2402.07193  [pdf, other]

    cs.LG math.OC stat.ML

    Parameter Symmetry and Noise Equilibrium of Stochastic Gradient Descent

    Authors: Liu Ziyin, Mingze Wang, Hongchao Li, Lei Wu

    Abstract: Symmetries are prevalent in deep learning and can significantly influence the learning dynamics of neural networks. In this paper, we examine how exponential symmetries -- a broad subclass of continuous symmetries present in the model architecture or loss function -- interplay with stochastic gradient descent (SGD). We first prove that gradient noise creates a systematic motion (a ``Noether flow'')…

    Submitted 6 November, 2024; v1 submitted 11 February, 2024; originally announced February 2024.

    Comments: NeurIPS camera ready

  39. arXiv:2401.04653  [pdf, other]

    math.OC eess.SY

    Time-certified Input-constrained NMPC via Koopman Operator

    Authors: Liang Wu, Krystian Ganko, Richard D. Braatz

    Abstract: Determining solving-time certificates of nonlinear model predictive control (NMPC) implementations is a pressing requirement when deploying NMPC in production environments. Such a certificate guarantees that the NMPC controller returns a solution before the next sampling time. However, NMPC formulations produce nonlinear programs (NLPs) for which it is very difficult to derive their solving-time c…

    Submitted 26 February, 2024; v1 submitted 9 January, 2024; originally announced January 2024.

    Comments: 6 pages, submitted to 8th IFAC Conference on Nonlinear Model Predictive Control NMPC 2024

  40. arXiv:2311.15221  [pdf, other]

    cs.IT cs.LG eess.SP math.OC math.ST stat.ML

    The Local Landscape of Phase Retrieval Under Limited Samples

    Authors: Kaizhao Liu, Zihao Wang, Lei Wu

    Abstract: In this paper, we present a fine-grained analysis of the local landscape of phase retrieval under the regime of limited samples. Specifically, we aim to ascertain the minimal sample size required to guarantee a benign local landscape surrounding global minima in high dimensions. Let $n$ and $d$ denote the sample size and input dimension, respectively. We first explore the local convexity and estab…

    Submitted 11 October, 2024; v1 submitted 26 November, 2023; originally announced November 2023.

    Comments: 47 pages, 5 figures. Accepted by IEEE Transactions on Information Theory

  41. arXiv:2311.14387  [pdf, other]

    cs.LG math.OC

    Achieving Margin Maximization Exponentially Fast via Progressive Norm Rescaling

    Authors: Mingze Wang, Zeping Min, Lei Wu

    Abstract: In this work, we investigate the margin-maximization bias exhibited by gradient-based algorithms in classifying linearly separable data. We present an in-depth analysis of the specific properties of the velocity field associated with (normalized) gradients, focusing on their role in margin maximization. Inspired by this analysis, we propose a novel algorithm called Progressive Rescaling Gradient D…

    Submitted 25 December, 2024; v1 submitted 24 November, 2023; originally announced November 2023.

    Comments: 37 pages, accepted by ICML 2024

  42. PFA and the definability of the nonstationary ideal

    Authors: Stefan Hoffelner, Paul Larson, Ralf Schindler, Liuzhen Wu

    Abstract: We produce, relative to a ${\sf ZFC}$ model with a supercompact cardinal, a ${\sf ZFC}$ model of the Proper Forcing Axiom in which the nonstationary ideal on $ω_1$ is $Π_1$-definable in a parameter from $H_{\aleph_2}$.

    Submitted 20 October, 2023; originally announced October 2023.

  43. arXiv:2309.13588  [pdf, ps, other]

    math.RA

    A new class of partial orders

    Authors: Huihui Zhu, Liyun Wu

    Abstract: Let $R$ be a unital $*$-ring. For any $a,w,b\in R$, we apply the defined $w$-core inverse to define a new class of partial orders in $R$, called the $w$-core partial order. Suppose $a,b\in R$ are $w$-core invertible. We say that $a$ is below $b$ under the $w$-core partial order, denoted by $a\overset{\tiny{\textcircled{\#}}}\leq_w b$, if…

    Submitted 24 September, 2023; originally announced September 2023.

    MSC Class: 15A09

  44. arXiv:2309.00756  [pdf, other]

    stat.AP math.OC

    Learning Risk Preferences in Markov Decision Processes: an Application to the Fourth Down Decision in the National Football League

    Authors: Nathan Sandholtz, Lucas Wu, Martin Puterman, Timothy C. Y. Chan

    Abstract: For decades, National Football League (NFL) coaches' observed fourth down decisions have been largely inconsistent with prescriptions based on statistical models. In this paper, we develop a framework to explain this discrepancy using an inverse optimization approach. We model the fourth down decision and the subsequent sequence of plays in a game as a Markov decision process (MDP), the dynamics o…

    Submitted 15 August, 2024; v1 submitted 1 September, 2023; originally announced September 2023.

    Comments: 22 pages, 12 figures

  45. arXiv:2307.13287  [pdf, other]

    math.OC math.NA

    Finding the spectral radius of a nonnegative irreducible symmetric tensor via DC programming

    Authors: Xueli Bai, Dong-Hui Li, Lei Wu, Jiefeng Xu

    Abstract: The Perron-Frobenius theorem says that the spectral radius of an irreducible nonnegative tensor is the unique positive eigenvalue corresponding to a positive eigenvector. With this in mind, the purpose of this paper is to find the spectral radius and its corresponding positive eigenvector of an irreducible nonnegative symmetric tensor. By transferring the eigenvalue problem into an equivalent prob…

    Submitted 25 July, 2023; originally announced July 2023.

    MSC Class: 15A18; 15A69; 90C90

  46. arXiv:2307.03879  [pdf, ps, other]

    math.DG

    A direct approach to sharp Li-Yau Estimates on closed manifolds with negative Ricci lower bound

    Authors: Xingyu Song, Ling Wu, Meng Zhu

    Abstract: Recently, Qi S. Zhang [26] has derived a sharp Li-Yau estimate for positive solutions of the heat equation on closed Riemannian manifolds with the Ricci curvature bounded below by a negative constant. The proof is based on an integral iteration argument which utilizes Hamilton's gradient estimate, heat kernel Gaussian bounds and parabolic Harnack inequality. In this paper, we show that the sharp…

    Submitted 24 August, 2023; v1 submitted 7 July, 2023; originally announced July 2023.

    Comments: 14 pages

  47. arXiv:2306.15079  [pdf, ps, other]

    math.OC cs.AI cs.CC eess.SY math.NA

    A direct optimization algorithm for input-constrained MPC

    Authors: Liang Wu, Richard D. Braatz

    Abstract: Providing an execution time certificate is a pressing requirement when deploying Model Predictive Control (MPC) in real-time embedded systems such as microcontrollers. Real-time MPC requires that its worst-case (maximum) execution time be theoretically guaranteed to be smaller than the sampling time in closed-loop. This technical note considers input-constrained MPC problems and exploits the…

    Submitted 30 March, 2024; v1 submitted 26 June, 2023; originally announced June 2023.

    Comments: 8 pages, Resubmitted to IEEE TAC

  48. arXiv:2306.13261  [pdf, ps, other]

    math.DG

    Heat kernel estimate for the Laplace-Beltrami operator under Bakry-Émery Ricci curvature condition and applications

    Authors: Xingyu Song, Ling Wu, Meng Zhu

    Abstract: We establish a Gaussian upper bound of the heat kernel for the Laplace-Beltrami operator on complete Riemannian manifolds with Bakry-Émery Ricci curvature bounded below. As applications, we first prove an $L^1$-Liouville property for non-negative subharmonic functions when the potential function of the Bakry-Émery Ricci curvature tensor is of at most quadratic growth. Then we derive lower bounds of…

    Submitted 26 June, 2023; v1 submitted 22 June, 2023; originally announced June 2023.

    Comments: 33 pages. arXiv admin note: text overlap with arXiv:1401.6155 by other authors

  49. arXiv:2306.07485  [pdf, other]

    cs.LG math.OC

    Learning Unnormalized Statistical Models via Compositional Optimization

    Authors: Wei Jiang, Jiayu Qin, Lingyu Wu, Changyou Chen, Tianbao Yang, Lijun Zhang

    Abstract: Learning unnormalized statistical models (e.g., energy-based models) is computationally challenging due to the complexity of handling the partition function. To eschew this complexity, noise-contrastive estimation (NCE) has been proposed by formulating the objective as the logistic loss of the real data and the artificial noise. However, as found in previous works, NCE may perform poorly in many t…

    Submitted 12 June, 2023; originally announced June 2023.

  50. arXiv:2306.02833  [pdf, ps, other]

    stat.ML cs.LG math.ST

    The $L^\infty$ Learnability of Reproducing Kernel Hilbert Spaces

    Authors: Hongrui Chen, Jihao Long, Lei Wu

    Abstract: In this work, we analyze the learnability of reproducing kernel Hilbert spaces (RKHS) under the $L^\infty$ norm, which is critical for understanding the performance of kernel methods and random feature models in safety- and security-critical applications. Specifically, we relate the $L^\infty$ learnability of an RKHS to the spectrum decay of the associated kernel and both lower bounds and upper boun…

    Submitted 5 June, 2023; originally announced June 2023.

    Comments: 20 pages