Skip to main content

Showing 1–50 of 149 results for author: Lu, S

Searching in archive math. Search in all archives.
.
  1. arXiv:2505.10287  [pdf, ps, other

    math.AP

    Pogorelov type interior $C^2$ estimate for Hessian quotient equation and its application

    Authors: Siyuan Lu, Yi-Lin Tsai

    Abstract: In this paper, we derive a Pogorelov type interior $C^2$ estimate for the Hessian quotient equation $\frac{σ_n}{σ_k}\left( D^2u\right) =f$. As an application, we show that convex viscosity solutions are regular for $k\leq n-3$ if $u\in C^{1,α}$ with $α>1-\frac{2}{n-k}$ or $u\in W^{2,p}$ with $p\geq\frac{(n-1)(n-k)}{2}$. Both exponents are sharp in view of the example in arXiv:2401.12229.

    Submitted 15 May, 2025; originally announced May 2025.

    MSC Class: 35J60

  2. arXiv:2504.19481  [pdf, ps, other

    math.NA

    Preasymptotic error estimates of higher-order EEM for the time-harmonic Maxwell equations with large wave number

    Authors: Shuaishuai Lu, Haijun Wu

    Abstract: The time-harmonic Maxwell equations with impedance boundary condition and large wave number are discretized using the second-type Nédélec's edge element method (EEM). Preasymptotic error bounds are derived, showing that, under the mesh condition $κ^{2p+1}h^{2p}$ being sufficiently small, the error of the EEM of order $p$ in the energy norm is bounded by… ▽ More

    Submitted 28 April, 2025; originally announced April 2025.

  3. arXiv:2503.23564  [pdf, other

    math.OC

    A Class of Optimal Directed Graphs for Network Synchronization

    Authors: Susie Lu, Ji Liu

    Abstract: In a paper by Nishikawa and Motter, a quantity called the normalized spread of the Laplacian eigenvalues is used to measure the synchronizability of certain network dynamics. Through simulations, and without theoretical validation, it is conjectured that among all simple directed graphs with a fixed number of vertices and arcs, the optimal value of this quantity is achieved if the Laplacian spectr… ▽ More

    Submitted 30 March, 2025; originally announced March 2025.

  4. arXiv:2502.15406  [pdf, ps, other

    math.AP

    Stability for an inverse flux and an inverse boundary coefficient problems

    Authors: Mourad Choulli, Shuai Lu, Hiroshi Takase

    Abstract: We establish both Lipschitz and logarithmic stability estimates for an inverse flux problem and subsequently apply these results to an inverse boundary coefficient problem. Furthermore, we demonstrate how the stability inequalities derived for the inverse boundary coefficient problem can be utilized in solving an inverse corrosion problem. This involves determining the unknown corrosion coefficien… ▽ More

    Submitted 21 February, 2025; originally announced February 2025.

  5. arXiv:2502.13478  [pdf, ps, other

    math.PR

    Stochastic tamed 3D Navier-Stokes equations with locally weak monotonicity coefficients: existence, uniqueness and averaging principle

    Authors: Shuaishuai Lu, Xue Yang, Yong Li

    Abstract: This paper investigates the stochastic tamed 3D Navier-Stokes equations with locally weak monotonicity coefficients in the whole space as well as in the three-dimensional torus, which play a crucial role in turbulent flows analysis. A significant issue is addressed in this work, specifically, the reduced regularity of the coefficients and the inapplicability of Gronwall's lemma complicates the est… ▽ More

    Submitted 19 February, 2025; originally announced February 2025.

  6. arXiv:2502.03819  [pdf, ps, other

    math.NA math.FA

    Interpolation and inverse problems in spectral Barron spaces

    Authors: Shuai Lu, Peter Mathé

    Abstract: Spectral Barron spaces, which quantify the absolute value of weighted Fourier coefficients of a function, have gained considerable attention due to their capability for universal approximation across certain function classes. By establishing a connection between these spaces and a specific positive linear operator, we investigate the interpolation and scaling relationships among diverse spectral B… ▽ More

    Submitted 6 February, 2025; originally announced February 2025.

    MSC Class: 65J20; 68T07; 41A05

  7. arXiv:2501.00808  [pdf, other

    math.GT math.CO math.DG

    The moduli space of HCMU surfaces

    Authors: Sicheng Lu, Bin Xu

    Abstract: HCMU surfaces are compact Riemann surfaces equipped with an extremal Kähler metric and a finite number of singularities. Research on these surfaces was initiated by E. Calabi and X.-X. Chen over thirty years ago. We provide a detailed description of the geometric structure of HCMU surfaces, building on the classical football decomposition introduced by Chen-Chen-Wu. From this perspective, most HCM… ▽ More

    Submitted 1 January, 2025; originally announced January 2025.

    Comments: 51 pages, 14 figures

    MSC Class: 32G15; 58E11 (Primary) 57M15; 30F30 (Secondary)

  8. arXiv:2501.00748  [pdf, ps, other

    math.AP

    Stable inversion of potential in nonlinear wave equations with cubic nonlinearity

    Authors: Xi Chen, Shuai Lu, Ruochong Zhang

    Abstract: This paper investigates inverse potential problems of wave equations with cubic nonlinearity. We develop a methodology for establishing stability estimates for inversion of lower order coefficients. The new ingredients of our approach include trilinear approximations of nonlinear response operators, symbol estimates of distorted plane waves, and lower order symbol calculus.

    Submitted 20 January, 2025; v1 submitted 1 January, 2025; originally announced January 2025.

    Comments: The exponents of the error bounds in (1.4)-(1.5) are slightly improved

  9. arXiv:2412.16683  [pdf, other

    math.DS

    Dynamical Behaviors of the Gradient Flows for In-Context Learning

    Authors: Songtao Lu, Yingdong Lu, Tomasz Nowicki

    Abstract: We derive the system of differential equations for the gradient flow characterizing the training process of linear in-context learning in full generality. Next, we explore the geometric structure of the gradient flows in two instances, including identifying its invariants, optimum, and saddle points. This understanding allows us to quantify the behavior of the two gradient flows under the full gen… ▽ More

    Submitted 21 December, 2024; originally announced December 2024.

  10. arXiv:2412.05857  [pdf, ps, other

    math.CO

    On primality and atomicity of numerical power monoids

    Authors: Anay Aggarwal, Felix Gotti, Susie Lu

    Abstract: In the first part of this paper, we establish a variation of a recent result by Bienvenu and Geroldinger on the (almost) non-existence of absolute irreducibles in (restricted) power monoids of numerical monoids: we argue the (almost) non-existence of primal elements in the same class of power monoids. The second part of this paper, devoted to the study of the atomic density of… ▽ More

    Submitted 8 December, 2024; originally announced December 2024.

    Comments: 17 pages

  11. arXiv:2412.00576  [pdf, ps, other

    math.AP

    A simple proof of curvature estimates for the n-1 Hessian equation

    Authors: Siyuan Lu, Yi-Lin Tsai

    Abstract: In [Amer. J. Math. 141 (2019), no. 5, 1281-1315], Ren and Wang proved the curvature estimates for the $n-1$ curvature equation. The purpose of this note is to give a simple proof of their theorem.

    Submitted 30 November, 2024; originally announced December 2024.

    Comments: 13 pages

    MSC Class: 58C05 (Primary) 58J05; 35J60 (Secondary)

  12. arXiv:2411.14166  [pdf, other

    math.OC cs.LG stat.ML

    SPARKLE: A Unified Single-Loop Primal-Dual Framework for Decentralized Bilevel Optimization

    Authors: Shuchen Zhu, Boao Kong, Songtao Lu, Xinmeng Huang, Kun Yuan

    Abstract: This paper studies decentralized bilevel optimization, in which multiple agents collaborate to solve problems involving nested optimization structures with neighborhood communications. Most existing literature primarily utilizes gradient tracking to mitigate the influence of data heterogeneity, without exploring other well-known heterogeneity-correction techniques such as EXTRA or Exact Diffusion.… ▽ More

    Submitted 17 December, 2024; v1 submitted 21 November, 2024; originally announced November 2024.

    Comments: 74 pages, the Thirty-Eighth Annual Conference on Neural Information Processing Systems (2024)

  13. arXiv:2407.18365  [pdf, other

    cs.LG cs.AI cs.DC math.OC

    FADAS: Towards Federated Adaptive Asynchronous Optimization

    Authors: Yujia Wang, Shiqiang Wang, Songtao Lu, Jinghui Chen

    Abstract: Federated learning (FL) has emerged as a widely adopted training paradigm for privacy-preserving machine learning. While the SGD-based FL algorithms have demonstrated considerable success in the past, there is a growing trend towards adopting adaptive federated optimization methods, particularly for training large-scale models. However, the conventional synchronous aggregation design poses a signi… ▽ More

    Submitted 25 July, 2024; originally announced July 2024.

    Comments: Accepted by ICML 2024

  14. arXiv:2407.08496  [pdf, ps, other

    math.DG math.GT

    Convergences of Combinatorial Ricci Flows to Degenerated Circle Packings in Hyperbolic Background Geometry

    Authors: Guangming Hu, Sicheng Lu, Dong Tan, Youliang Zhong, Puchun Zhou

    Abstract: This paper investigates a kind of degenerated circle packings in hyperbolic background geometry. A main problem is whether a prescribed total geodesic curvature data can be realized by a degenerated circle packing or not. We fully characterize the sufficient and necessary conditions and show the uniqueness. Furthermore, we introduce the combinatoral Ricci flow to find the desired degenerated circl… ▽ More

    Submitted 11 July, 2024; originally announced July 2024.

    Comments: 36 pages, 9 figures

    MSC Class: 52C26; 57M50

  15. arXiv:2407.06784  [pdf, other

    math.NA

    Preasymptotic error estimates of EEM and CIP-EEM for the time-harmonic Maxwell equations with large wave number

    Authors: Shuaishuai Lu, Haijun Wu

    Abstract: Preasymptotic error estimates are derived for the linear edge element method (EEM) and the linear $\boldsymbol{H}(\boldsymbol{\mathrm{curl}})$-conforming interior penalty edge element method (CIP-EEM) for the time-harmonic Maxwell equations with large wave number. It is shown that under the mesh condition that $κ^3 h^2$ is sufficiently small, the errors of the solutions to both methods are bounded… ▽ More

    Submitted 9 July, 2024; originally announced July 2024.

  16. arXiv:2407.05078  [pdf, ps, other

    math.NA

    Function and derivative approximation by shallow neural networks

    Authors: Yuanyuan Li, Shuai Lu

    Abstract: We investigate a Tikhonov regularization scheme specifically tailored for shallow neural networks within the context of solving a classic inverse problem: approximating an unknown function and its derivatives within a unit cubic domain based on noisy measurements. The proposed Tikhonov regularization scheme incorporates a penalty term that takes three distinct yet intricately related network (semi… ▽ More

    Submitted 14 December, 2024; v1 submitted 6 July, 2024; originally announced July 2024.

    MSC Class: 65D15; 65F22; 65J20

  17. arXiv:2407.04909  [pdf, ps, other

    math.PR

    The weak averaging principle of stochastic functional partial differential equations with H$\ddot{\text{o}}$lder continuous coefficients and infinite delay

    Authors: Shuaishuai Lu, Xue Yang, Yong Li

    Abstract: In this paper, we establish the weak averaging principle for stochastic functional partial differential equations (in short, SFPDEs) with H$\ddot{\text{o}}$lder continuous coefficients and infinite delay by a new generalized coupling approach. Firstly, we rigorously establish the existence and uniqueness of weak solutions for a specific class of finite-dimensional systems by the generalized coupli… ▽ More

    Submitted 28 March, 2025; v1 submitted 5 July, 2024; originally announced July 2024.

  18. Forward and backward problems for coupled subdiffusion systems

    Authors: Dian Feng, Yikan Liu, Shuai Lu

    Abstract: In this article, we investigate both forward and backward problems for coupled systems of time-fractional diffusion equations, encompassing scenarios of strong coupling. For the forward problem, we establish the well-posedness of the system, leveraging the eigensystem of the corresponding elliptic system as the foundation. When considering the backward problem, specifically the determination of in… ▽ More

    Submitted 3 February, 2025; v1 submitted 30 June, 2024; originally announced July 2024.

    Comments: 26 pages, 7 figures

    MSC Class: 35R11; 35K58; 35B44

  19. arXiv:2406.10511  [pdf, other

    cs.DC cs.AR cs.PF math.NA

    Efficient Hardware Accelerator Based on Medium Granularity Dataflow for SpTRSV

    Authors: Qian Chen, Xiaofeng Yang, Shengli Lu

    Abstract: Sparse triangular solve (SpTRSV) is widely used in various domains. Numerous studies have been conducted using CPUs, GPUs, and specific hardware accelerators, where dataflows can be categorized into coarse and fine granularity. Coarse dataflows offer good spatial locality but suffer from low parallelism, while fine dataflows provide high parallelism but disrupt the spatial structure, leading to in… ▽ More

    Submitted 17 March, 2025; v1 submitted 15 June, 2024; originally announced June 2024.

    Journal ref: IEEE Trans. Very Large Scale Integr. (VLSI) Syst. 33 (2025) 807-820

  20. arXiv:2405.18858  [pdf, other

    math.OC

    Distributed Bilevel Optimization with Communication Compression

    Authors: Yutong He, Jie Hu, Xinmeng Huang, Songtao Lu, Bin Wang, Kun Yuan

    Abstract: Stochastic bilevel optimization tackles challenges involving nested optimization structures. Its fast-growing scale nowadays necessitates efficient distributed algorithms. In conventional distributed bilevel methods, each worker must transmit full-dimensional stochastic gradients to the server every iteration, leading to significant communication overhead and thus hindering efficiency and scalabil… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

  21. arXiv:2405.06938  [pdf, ps, other

    math.PR math.DS

    Stochastic functional partial differential equations with monotone coefficients: Poisson stability measures, exponential mixing and limit theorems

    Authors: Shuaishuai Lu, Xue Yang, Yong Li

    Abstract: This paper examines Poisson stable (including stationary, periodic, almost periodic, Levitan almost periodic, Bohr almost automorphic, pseudo-periodic, Birkhoff recurrent, pseudo-recurrent, etc.) measures and limit theorems for stochastic functional partial differential equations(SFPDEs) with monotone coefficients. We first show the existence and uniqueness of entrance measure $μ_{t}$ for SFPDEs b… ▽ More

    Submitted 11 May, 2024; originally announced May 2024.

  22. arXiv:2405.06223  [pdf, ps, other

    math.PR

    McKean-Vlasov SPDEs with coefficients exhibiting locally weak monotonicity: existence, uniqueness, ergodicity, exponential mixing and limit theorems

    Authors: Shuaishuai Lu, Xue Yang, Yong Li

    Abstract: This paper investigates the existence and uniqueness of solutions, as well as the ergodicity and exponential mixing to invariant measures, and limit theorems for a class of McKean-Vlasov SPDEs with locally weak monotonicity. In particular, for a class of weak monotonicity conditions, including H$\ddot{\text{o}}$lder continuity, we rigorously establish the existence and uniqueness of weak solutions… ▽ More

    Submitted 9 March, 2025; v1 submitted 9 May, 2024; originally announced May 2024.

  23. arXiv:2404.07230  [pdf, ps, other

    math.GM cs.AI

    Interval-valued fuzzy soft $β$-covering approximation spaces

    Authors: Shizhan Lu

    Abstract: The concept of interval-valued fuzzy soft $β$-covering approximation spaces (IFS$β$CASs) is introduced to combine the theories of soft sets, rough sets and interval-valued fuzzy sets, and some fundamental propositions concerning interval-valued fuzzy soft $β$-neighborhoods and soft $β$-neighborhoods of IFS$β$CASs are explored. And then four kinds of interval-valued fuzzy soft $β$-coverings based f… ▽ More

    Submitted 3 April, 2024; originally announced April 2024.

    Comments: 12 pages

  24. arXiv:2403.15745  [pdf, other

    math.OC

    Fast Consensus Topology Design via Minimizing Laplacian Energy

    Authors: Susie Lu, Ji Liu

    Abstract: This paper characterizes the graphical properties of an optimal topology with minimal Laplacian energy under the constraint of fixed numbers of vertices and edges, and devises an algorithm to construct such connected optimal graphs. These constructed graphs possess maximum vertex and edge connectivity, and more importantly, exhibit large algebraic connectivity of an optimal order provided they are… ▽ More

    Submitted 23 March, 2024; originally announced March 2024.

  25. arXiv:2402.03167  [pdf, other

    math.OC cs.LG stat.ML

    Decentralized Bilevel Optimization: A Perspective from Transient Iteration Complexity

    Authors: Boao Kong, Shuchen Zhu, Songtao Lu, Xinmeng Huang, Kun Yuan

    Abstract: Stochastic bilevel optimization (SBO) is becoming increasingly essential in machine learning due to its versatility in handling nested structures. To address large-scale SBO, decentralized approaches have emerged as effective paradigms in which nodes communicate with immediate neighbors without a central server, thereby improving communication efficiency and enhancing algorithmic robustness. Howev… ▽ More

    Submitted 31 March, 2025; v1 submitted 5 February, 2024; originally announced February 2024.

    Comments: 59 pages, 7 figures

  26. arXiv:2401.15945  [pdf, ps, other

    math.NA

    Regularization of linear inverse problems with irregular noise using embedding operators

    Authors: Xinyan Li, Simon Hubmer, Shuai Lu, Ronny Ramlau

    Abstract: In this paper, we investigate regularization of linear inverse problems with irregular noise. In particular, we consider the case that the noise can be preprocessed by certain adjoint embedding operators. By introducing the consequent preprocessed problem, we provide convergence analysis for general regularization schemes under standard assumptions. Furthermore, for a special case of Tikhonov regu… ▽ More

    Submitted 29 January, 2024; originally announced January 2024.

    Comments: 23 pages, 2 figures

  27. arXiv:2401.12229  [pdf, ps, other

    math.AP

    Interior $C^2$ estimate for Hessian quotient equation in general dimension

    Authors: Siyuan Lu

    Abstract: In this paper, we study the interior $C^2$ regularity problem for the Hessian quotient equation $\left(\frac{σ_n}{σ_k}\right)(D^2u)=f$. We give a complete answer to this longstanding problem: for $k=n-1,n-2$, we establish an interior $C^2$ estimate; for $k\leq n-3$, we show that interior $C^2$ estimate fails by finding a singular solution.

    Submitted 17 January, 2024; originally announced January 2024.

    Comments: arXiv admin note: text overlap with arXiv:2311.05835

    MSC Class: 35J60; 35J15; 35J96

  28. arXiv:2312.01807  [pdf, other

    math.GT math.CV math.DG

    Moduli Space of Dihedral Spherical Surfaces and Measured Foliations

    Authors: Sicheng Lu, Bin Xu

    Abstract: Cone spherical surfaces are orientable Riemannian surfaces with constant curvature one and a finite set of conical singularities. A subset of these surfaces, referred to as dihedral surfaces, is characterized by their monodromy groups, which notably preserve a pair of antipodal points on the unit two-sphere within three-dimensional Euclidean space. On each dihedral surface, we define a pair of tra… ▽ More

    Submitted 2 April, 2024; v1 submitted 4 December, 2023; originally announced December 2023.

    Comments: 40 pages, 15 figures. All comments are welcome! Small modifications on propositions, descriptions, figures and fonts in v2

    MSC Class: 58D27; 53C12; 30F45; 30F30

  29. arXiv:2311.05835  [pdf, ps, other

    math.AP

    Interior $C^2$ estimate for Hessian quotient equation in dimension three

    Authors: Siyuan Lu

    Abstract: In this paper, we establish an interior $C^2$ estimate for the Hessian quotient equation $\left(\frac{σ_3}{σ_1}\right)(D^2u)=f$ in dimension three. A crucial ingredient in our proof is a Jacobi inequality.

    Submitted 9 November, 2023; originally announced November 2023.

  30. arXiv:2310.20709  [pdf, other

    math.RT

    Quadratic Differentials as Stability Conditions of Graded Skew-gentle Algebras

    Authors: Suiqi Lu, Yu Qiu, Dongjian Wu

    Abstract: We prove that the principal component of the exchange graph of hearts of a graded skew-gentle algebra can be identified with the corresponding exchange graph of S-graphs, using the geometric models and $\operatorname{Int}=\operatorname{dim}\operatorname{Hom}$ formula in Qiu-Zhang-Zhou. Using the same argument in Bridgeland-Smith, Barbieri-Möller-Qiu-So and Christ-Haiden-Qiu, we extend this identif… ▽ More

    Submitted 31 October, 2023; originally announced October 2023.

  31. arXiv:2310.06784  [pdf, ps, other

    math.AG math.CV

    Finiteness of pointed maps to moduli spaces of polarized varieties

    Authors: Ariyan Javanpeykar, Steven Lu, Ruiran Sun, Kang Zuo

    Abstract: We prove a finiteness result for pointed maps to the base space of a family of polarized varieties with maximal variation in moduli. A key ingredient is a new criterion for the rigidity of pointed maps.

    Submitted 10 October, 2023; originally announced October 2023.

    Comments: 15 pages. Comments welcome

  32. arXiv:2309.12425  [pdf, other

    stat.ME math.ST

    Principal Stratification with Continuous Post-Treatment Variables: Nonparametric Identification and Semiparametric Estimation

    Authors: Sizhu Lu, Zhichao Jiang, Peng Ding

    Abstract: Post-treatment variables often complicate causal inference. They appear in many scientific problems, including noncompliance, truncation by death, mediation, and surrogate endpoint evaluation. Principal stratification is a strategy to address these challenges by adjusting for the potential values of the post-treatment variables, defined as the principal strata. It allows for characterizing treatme… ▽ More

    Submitted 3 April, 2024; v1 submitted 21 September, 2023; originally announced September 2023.

  33. arXiv:2306.02422  [pdf, other

    math.OC cs.LG

    A Generalized Alternating Method for Bilevel Learning under the Polyak-Łojasiewicz Condition

    Authors: Quan Xiao, Songtao Lu, Tianyi Chen

    Abstract: Bilevel optimization has recently regained interest owing to its applications in emerging machine learning fields such as hyperparameter optimization, meta-learning, and reinforcement learning. Recent results have shown that simple alternating (implicit) gradient-based algorithms can match the convergence rate of single-level gradient descent (GD) when addressing bilevel problems with a strongly c… ▽ More

    Submitted 5 October, 2023; v1 submitted 4 June, 2023; originally announced June 2023.

    Comments: Camera ready version

  34. arXiv:2305.14686  [pdf, other

    math.NA

    Harmonic Measures and Numerical Computation of Cauchy Problems for Laplace Equations

    Authors: Yu Chen, Jin Cheng, Shuai Lu, Masahiro Yamamoto

    Abstract: It is well known that Cauchy problem for Laplace equations is an ill-posed problem in Hadamard's sense. Small deviations in Cauchy data may lead to large errors in the solutions. It is observed that if a bound is imposed on the solution, there exists a conditional stability estimate. This gives a reasonable way to construct stable algorithms. However, it is impossible to have good results at all p… ▽ More

    Submitted 23 May, 2023; originally announced May 2023.

  35. arXiv:2302.14252  [pdf, other

    math.OC

    Compressed Decentralized Proximal Stochastic Gradient Method for Nonconvex Composite Problems with Heterogeneous Data

    Authors: Yonggui Yan, Jie Chen, Pin-Yu Chen, Xiaodong Cui, Songtao Lu, Yangyang Xu

    Abstract: We first propose a decentralized proximal stochastic gradient tracking method (DProxSGT) for nonconvex stochastic composite problems, with data heterogeneously distributed on multiple workers in a decentralized connected network. To save communication cost, we then extend DProxSGT to a compressed method by compressing the communicated information. Both methods need only $\mathcal{O}(1)$ samples pe… ▽ More

    Submitted 27 February, 2023; originally announced February 2023.

  36. arXiv:2302.12409  [pdf, ps, other

    math.DG math.AP

    Curvature estimates for semi-convex solutions of Hessian equations in hyperbolic space

    Authors: Siyuan Lu

    Abstract: In this paper, we establish a curvature estimate for semi-convex solutions of Hessian equations in hyperbolic space. We also obtain a curvature estimate for admissible solutions to prescribed curvature measure type problem in hyperbolic space. A crucial ingredient in both estimates is a concavity inequality for Hessian operator.

    Submitted 23 February, 2023; originally announced February 2023.

  37. arXiv:2302.02922  [pdf, other

    cs.LG cs.AI eess.SP math.OC

    Joint Edge-Model Sparse Learning is Provably Efficient for Graph Neural Networks

    Authors: Shuai Zhang, Meng Wang, Pin-Yu Chen, Sijia Liu, Songtao Lu, Miao Liu

    Abstract: Due to the significant computational challenge of training large-scale graph neural networks (GNNs), various sparse learning techniques have been exploited to reduce memory and storage costs. Examples include \textit{graph sparsification} that samples a subgraph to reduce the amount of data aggregation and \textit{model sparsification} that prunes the neural network to reduce the number of trainab… ▽ More

    Submitted 6 February, 2023; originally announced February 2023.

    Journal ref: The Eleventh International Conference on Learning Representations, 2023

  38. arXiv:2301.07875  [pdf, ps, other

    math.AP

    Increasing stability of a linearized inverse boundary value problem for a nonlinear Schrödinger equation on transversally anisotropic manifolds

    Authors: Shuai Lu, Jian Zhai

    Abstract: We consider the problem of recovering a nonlinear potential function in a nonlinear Schrödinger equation on transversally anisotropic manifolds from the linearized Dirichlet-to-Neumann map at a large wavenumber. By calibrating the complex geometric optics (CGO) solutions according to the wavenumber, we prove the increasing stability of recovering the coefficient of a cubic term as the wavenumber b… ▽ More

    Submitted 18 January, 2023; originally announced January 2023.

  39. arXiv:2212.09513  [pdf, other

    math.OC cs.LG math.NA

    Stochastic Inexact Augmented Lagrangian Method for Nonconvex Expectation Constrained Optimization

    Authors: Zichong Li, Pin-Yu Chen, Sijia Liu, Songtao Lu, Yangyang Xu

    Abstract: Many real-world problems not only have complicated nonconvex functional constraints but also use a large number of data points. This motivates the design of efficient stochastic methods on finite-sum or expectation constrained problems. In this paper, we design and analyze stochastic inexact augmented Lagrangian methods (Stoc-iALM) to solve problems involving a nonconvex composite (i.e. smooth+non… ▽ More

    Submitted 19 December, 2022; originally announced December 2022.

  40. Increasing stability of the first order linearized inverse Schrödinger potential problem with integer power type nonlinearities

    Authors: Sen Zou, Shuai Lu, Boxi Xu

    Abstract: We investigate the increasing stability of the inverse Schrödinger potential problem with integer power type nonlinearities at a large wavenumber. By considering the first order linearized system with respect to the unknown potential function, a combination formula of the first order linearization is proposed, which provides a Lipschitz type stability for the recovery of the Fourier coefficients o… ▽ More

    Submitted 24 November, 2022; originally announced November 2022.

    Comments: 37 pages, 8 figures

    MSC Class: 35J25; 65N20

    Journal ref: SIAM Journal on Applied Mathematics, 84(4), 1868-1889, 2024

  41. arXiv:2207.13499  [pdf, other

    math.NA math.ST

    On a Dynamic Variant of the Iteratively Regularized Gauss-Newton Method with Sequential Data

    Authors: Neil K. Chada, Marco A. Iglesias, Shuai Lu, Frank Werner

    Abstract: For numerous parameter and state estimation problems, assimilating new data as they become available can help produce accurate and fast inference of unknown quantities. While most existing algorithms for solving those kind of ill-posed inverse problems can only be used with a single instance of the observed data, in this work we propose a new framework that enables existing algorithms to invert mu… ▽ More

    Submitted 27 July, 2022; originally announced July 2022.

  42. arXiv:2207.13283  [pdf, other

    cs.LG math.OC stat.ML

    INTERACT: Achieving Low Sample and Communication Complexities in Decentralized Bilevel Learning over Networks

    Authors: Zhuqing Liu, Xin Zhang, Prashant Khanduri, Songtao Lu, Jia Liu

    Abstract: In recent years, decentralized bilevel optimization problems have received increasing attention in the networking and machine learning communities thanks to their versatility in modeling decentralized learning problems over peer-to-peer networks (e.g., multi-agent meta-learning, multi-agent reinforcement learning, personalized training, and Byzantine-resilient learning). However, for decentralized… ▽ More

    Submitted 5 October, 2022; v1 submitted 27 July, 2022; originally announced July 2022.

  43. arXiv:2207.05650  [pdf, other

    math.OC cs.AI cs.LG math.NA

    A Single-Loop Gradient Descent and Perturbed Ascent Algorithm for Nonconvex Functional Constrained Optimization

    Authors: Songtao Lu

    Abstract: Nonconvex constrained optimization problems can be used to model a number of machine learning problems, such as multi-class Neyman-Pearson classification and constrained Markov decision processes. However, such kinds of problems are challenging because both the objective and constraints are possibly nonconvex, so it is difficult to balance the reduction of the loss value and reduction of constrain… ▽ More

    Submitted 2 December, 2024; v1 submitted 12 July, 2022; originally announced July 2022.

    Comments: This work was published in the Proceedings of the Thirty-Ninth International Conference on Machine Learning (ICML 2022)

  44. arXiv:2206.13482  [pdf, other

    cs.LG math.OC stat.ML

    Understanding Benign Overfitting in Gradient-Based Meta Learning

    Authors: Lisha Chen, Songtao Lu, Tianyi Chen

    Abstract: Meta learning has demonstrated tremendous success in few-shot learning with limited supervised data. In those settings, the meta model is usually overparameterized. While the conventional statistical learning theory suggests that overparameterized models tend to overfit, empirical evidence reveals that overparameterized meta learning methods still work well -- a phenomenon often called "benign ove… ▽ More

    Submitted 9 November, 2022; v1 submitted 27 June, 2022; originally announced June 2022.

  45. arXiv:2206.00161  [pdf, ps, other

    math.DG math.AP

    On the asymptotic Plateau problem in hyperbolic space

    Authors: Siyuan Lu

    Abstract: In this paper, we solve the asymptotic Plateau problem in hyperbolic space for constant $σ_{n-1}$ curvature, i.e. the existence of a complete hypersurface in $\mathbb{H}^{n+1}$ satisfying $σ_{n-1}(κ)=σ\in (0,n)$ with a prescribed asymptotic boundary $Γ$. The key ingredient is the curvature estimates. Previously, this is only known for $σ_0<σ<n$, where $σ_0$ is a positive constant.

    Submitted 12 February, 2023; v1 submitted 31 May, 2022; originally announced June 2022.

  46. arXiv:2205.14413  [pdf

    cs.GT math.OC

    Discrimination-Based Double Auction for Maximizing Social Welfare in the Electricity and Heating Market Considering Privacy Preservation

    Authors: Lu Wang, Wei Gu, Shuai Lu, Haifeng Qiu, Zhi Wu

    Abstract: This paper proposes a doubled-sided auction mechanism with price discrimination for social welfare (SW) maximization in the electricity and heating market. In this mechanism, energy service providers (ESPs) submit offers and load aggregators (LAs) submit bids to an energy trading center (ETC) to maximize their utility; in turn, the selfless ETC as an auctioneer leverages dis-criminatory price weig… ▽ More

    Submitted 28 May, 2022; originally announced May 2022.

  47. arXiv:2204.05420  [pdf, ps, other

    math.AP math.DG

    On the Dirichlet problem for Lagrangian phase equation with critical and supercritical phase

    Authors: Siyuan Lu

    Abstract: In this paper, we solve the Dirichlet problem for Lagrangian phase equation with critical and supercritical phase. A crucial ingredient is the interior $C^2$ estimate. Our result is sharp in the sense that there exist singular solutions in the subcritical phase case.

    Submitted 12 February, 2023; v1 submitted 11 April, 2022; originally announced April 2022.

  48. arXiv:2203.01924  [pdf, other

    cs.LG math.OC

    Min-Max Bilevel Multi-objective Optimization with Applications in Machine Learning

    Authors: Alex Gu, Songtao Lu, Parikshit Ram, Lily Weng

    Abstract: We consider a generic min-max multi-objective bilevel optimization problem with applications in robust machine learning such as representation learning and hyperparameter optimization. We design MORBiT, a novel single-loop gradient descent-ascent bilevel optimization algorithm, to solve the generic problem and present a novel analysis showing that MORBiT converges to the first-order stationary poi… ▽ More

    Submitted 7 March, 2023; v1 submitted 3 March, 2022; originally announced March 2022.

    Comments: 43 pages, 3 figures, ICLR 2023 version

  49. arXiv:2202.06303  [pdf, other

    math.OC eess.SY

    On the Exactness of an Energy-efficient Train Control model based on Convex Optimization

    Authors: Shaofeng Lu, Minling Feng, Kunpeng Wu

    Abstract: In this paper, we demonstrate the exactness proof for the energy-efficient train control (EETC) model based on convex optimization. The proof of exactness shows that the convex optimization model will share the same optimization results with the initial model on which the convex relaxations are conducted. We first show how the relaxation on the initial non-convex model is conducted and provide ana… ▽ More

    Submitted 13 February, 2022; originally announced February 2022.

    Comments: 11 pages and 4 figures

  50. arXiv:2202.06217  [pdf, ps, other

    math.CT

    The double contravariant powerset monad in the Goguen category of fuzzy sets

    Authors: Sijia Lu, Dexue Zhang

    Abstract: A monad is constructed in the Goguen category of fuzzy sets valued in a unital quantale, which is an analog of the double contravariant powerset monad in the category of sets. With help of this monad it is proved that the Goguen category of fuzzy sets is dually monadic over itself.

    Submitted 3 August, 2022; v1 submitted 13 February, 2022; originally announced February 2022.

    Comments: 21 pages

    MSC Class: 03E72; 18C15; 18C20