Skip to main content

Showing 1–50 of 205 results for author: Xia, L

Searching in archive math. Search in all archives.
.
  1. arXiv:2506.22401  [pdf, ps, other

    cs.LG math.OC

    Exploration from a Primal-Dual Lens: Value-Incentivized Actor-Critic Methods for Sample-Efficient Online RL

    Authors: Tong Yang, Bo Dai, Lin Xiao, Yuejie Chi

    Abstract: Online reinforcement learning (RL) with complex function approximations such as transformers and deep neural networks plays a significant role in the modern practice of artificial intelligence. Despite its popularity and importance, balancing the fundamental trade-off between exploration and exploitation remains a long-standing challenge; in particular, we are still in lack of efficient and practi… ▽ More

    Submitted 27 June, 2025; originally announced June 2025.

  2. arXiv:2505.11782  [pdf, ps, other

    math.CO

    Multiplicative and mining property for stability numbers of graphs

    Authors: Metrose Metsidik, Lixiao Xiao

    Abstract: $f$-vertex stability number $vs_f(G)=\min\{|X|: X\subseteq V(G) \enspace \text{and} \enspace f(G-X)\neq f(G)\}$, and $f$-edge stability number is defined similarly by setting $X\subseteq E(G)$. In this paper, for multiplicative and mining invariant $f$, we give some general bounds for $f$-vertex/edge stability numbers of graphs and some results about the relations between the $f$-vertex/edge stabi… ▽ More

    Submitted 16 May, 2025; originally announced May 2025.

  3. arXiv:2505.01825  [pdf, ps, other

    math.ST

    Asymptotic representations for Spearman's footrule correlation coefficient

    Authors: Liqi Xia, Sami Ullah, Li Guan

    Abstract: In order to address the theoretical challenges arising from the dependence structure of ranks in Spearman's footrule correlation coefficient, we propose two asymptotic representations under the null hypothesis of independence. The first representation simplifies the dependence structure by replacing empirical distribution functions with their population counterparts. The second representation leve… ▽ More

    Submitted 3 May, 2025; originally announced May 2025.

  4. arXiv:2504.10262  [pdf, ps, other

    math.RT

    Whittaker modules for $U_q(\mathfrak{sl}_3)$

    Authors: Xiangqian Guo, Xuewen Liu, Limeng Xia

    Abstract: In this paper, we study the Whittaker modules for the quantum enveloping algebra $U_q(\sl_3)$ with respect to a fixed Whittaker function. We construct the universal Whittaker module, find all its Whittaker vectors and investigate the submodules generated by subsets of Whittaker vectors and corresponding quotient modules. We also find Whittaker vectors and determine the irreducibility of these quot… ▽ More

    Submitted 14 April, 2025; originally announced April 2025.

    MSC Class: 2010: 17B37; 20G42

  5. arXiv:2503.23198  [pdf, ps, other

    math.DG

    An isoperimetric type inequality in De Sitter space

    Authors: Ling Xiao

    Abstract: In this paper, we prove an optimal isoperimetric inequality for spacelike, compact, star-shaped, and $2$-convex hypersurfaces in de Sitter space.

    Submitted 29 March, 2025; originally announced March 2025.

  6. arXiv:2503.23194  [pdf, ps, other

    math.DG

    Closed minimal hypersurfaces in $\mathbb S^5$ with constant $S$ and $A_3$

    Authors: Joel Spruck, Ling Xiao

    Abstract: In this paper, we prove that a closed minimally immersed hypersurface $M^4\subset\mathbb S^5$ with constant $S:=\sum\limits_{i=1}^4λ_i^2$ and $A_3:=\sum\limits_{i=1}^4λ_i^3$ whose scalar curvature $R_M$ is nonnegative must be isoparametric. Moreover, $S$ can only be $0, 4,$ and $12.$ That is $M^4$ is either an equatorial $4$-sphere, a clifford torus, or a Cartan's minimal hypersurface.

    Submitted 29 March, 2025; originally announced March 2025.

  7. arXiv:2503.22779  [pdf, ps, other

    cs.MA cs.GT cs.LG math.OC

    Policy Optimization and Multi-agent Reinforcement Learning for Mean-variance Team Stochastic Games

    Authors: Junkai Hu, Li Xia

    Abstract: We study a long-run mean-variance team stochastic game (MV-TSG), where each agent shares a common mean-variance objective for the system and takes actions independently to maximize it. MV-TSG has two main challenges. First, the variance metric is neither additive nor Markovian in a dynamic setting. Second, simultaneous policy updates of all agents lead to a non-stationary environment for each indi… ▽ More

    Submitted 12 June, 2025; v1 submitted 28 March, 2025; originally announced March 2025.

  8. arXiv:2503.15748  [pdf, other

    cs.LG math.OC

    PARQ: Piecewise-Affine Regularized Quantization

    Authors: Lisa Jin, Jianhao Ma, Zechun Liu, Andrey Gromov, Aaron Defazio, Lin Xiao

    Abstract: We develop a principled method for quantization-aware training (QAT) of large-scale machine learning models. Specifically, we show that convex, piecewise-affine regularization (PAR) can effectively induce the model parameters to cluster towards discrete values. We minimize PAR-regularized loss functions using an aggregate proximal stochastic gradient method (AProx) and prove that it has last-itera… ▽ More

    Submitted 19 March, 2025; originally announced March 2025.

  9. arXiv:2502.09780  [pdf, ps, other

    cs.LG cs.AI cs.GT math.OC

    Incentivize without Bonus: Provably Efficient Model-based Online Multi-agent RL for Markov Games

    Authors: Tong Yang, Bo Dai, Lin Xiao, Yuejie Chi

    Abstract: Multi-agent reinforcement learning (MARL) lies at the heart of a plethora of applications involving the interaction of a group of agents in a shared unknown environment. A prominent framework for studying MARL is Markov games, with the goal of finding various notions of equilibria in a sample-efficient manner, such as the Nash equilibrium (NE) and the coarse correlated equilibrium (CCE). However,… ▽ More

    Submitted 13 February, 2025; originally announced February 2025.

  10. arXiv:2411.06793  [pdf, other

    math.OC

    Service Deployment in the On-Demand Economy: Employees, Contractors, or Both?

    Authors: Lijian Lu, Xin Weng, Li Xiao

    Abstract: The recent advancements in mobile/data technology have fostered a widespread adoption of on-demand or gig service platforms. The increasingly available data and independent contractors have enabled these platforms to design customized services and a cost-efficient workforce to effectively match demand and supply. In practice, a diverse landscape of the workforce has been observed: some rely solely… ▽ More

    Submitted 11 November, 2024; originally announced November 2024.

    Comments: 60 pages, 23 figures

    MSC Class: 90B22

  11. arXiv:2409.12006  [pdf, ps, other

    math.MG

    Quasihyperbolic metric and Gromov hyperbolic spaces I

    Authors: Hongjun Liu, Ling Xia, Shasha Yan

    Abstract: In this paper, we introduce the concepts of short arc and length map in quasihyperbolic metric spaces, and obtain some geometric characterizations of Gromov hyperbolicity for quasihyperbolic metric spaces in terms of the properties of short arc and length map.

    Submitted 9 November, 2024; v1 submitted 18 September, 2024; originally announced September 2024.

    Comments: arXiv admin note: text overlap with arXiv:1706.05494 by other authors

  12. arXiv:2409.10315  [pdf, other

    math.ST

    Consistent complete independence test in high dimensions based on Chatterjee correlation coefficient

    Authors: Liqi Xia, Ruiyuan Cao, Jiang Du, Jun Dai

    Abstract: In this article, we consider the complete independence test of high-dimensional data. Based on Chatterjee coefficient, we pioneer the development of quadratic test and extreme value test which possess good testing performance for oscillatory data, and establish the corresponding large sample properties under both null hypotheses and alternative hypotheses. In order to overcome the shortcomings of… ▽ More

    Submitted 16 September, 2024; originally announced September 2024.

  13. arXiv:2408.05377  [pdf, other

    math.CO

    More results on stack-sorting for set partitions

    Authors: Samanyu Ganesh, Lanxuan Xia, Bole Ying

    Abstract: Let a sock be an element of an ordered finite alphabet A and a sequence of these elements be a sock sequence. In 2023, Xia introduced a deterministic version of Defant and Kravitz's stack-sorting map by defining the $φ_σ$ and $φ_{\overlineσ}$ pattern-avoidance stack-sorting maps for sock sequences. Xia showed that the $φ_{aba}$ map is the only one that eventually sorts all set partitions; in this… ▽ More

    Submitted 9 August, 2024; originally announced August 2024.

  14. arXiv:2407.04358  [pdf, other

    math.OC cs.LG

    An Adaptive Stochastic Gradient Method with Non-negative Gauss-Newton Stepsizes

    Authors: Antonio Orvieto, Lin Xiao

    Abstract: We consider the problem of minimizing the average of a large number of smooth but possibly non-convex functions. In the context of most machine learning applications, each loss function is non-negative and thus can be expressed as the composition of a square and its real-valued square root. This reformulation allows us to apply the Gauss-Newton method, or the Levenberg-Marquardt method when adding… ▽ More

    Submitted 5 July, 2024; originally announced July 2024.

  15. arXiv:2406.00624  [pdf, other

    math.NT

    Iwasawa's main conjecture for Rankin-Selberg motives in the anticyclotomic case

    Authors: Yifeng Liu, Yichao Tian, Liang Xiao

    Abstract: In this article, we study the Iwasawa theory for cuspidal automorphic representations of $\mathrm{GL}(n)\times\mathrm{GL}(n+1)$ over CM fields along anticyclotomic directions, in the framework of the Gan--Gross--Prasad conjecture for unitary groups. We prove one-side divisibility of the corresponding Iwasawa main conjecture: when the global root number is $1$, the $p$-adic $L$-function belongs to… ▽ More

    Submitted 25 December, 2024; v1 submitted 2 June, 2024; originally announced June 2024.

    Comments: v3: 103 pages; Sec 5.2 rewritten; add Sec 5.7. arXiv admin note: text overlap with arXiv:2211.06673 by other authors

    MSC Class: 11F33; 11G05; 11G18; 11G40; 11R34

  16. arXiv:2405.15074  [pdf, other

    stat.ML cs.LG math.OC math.PR math.ST

    4+3 Phases of Compute-Optimal Neural Scaling Laws

    Authors: Elliot Paquette, Courtney Paquette, Lechao Xiao, Jeffrey Pennington

    Abstract: We consider the solvable neural scaling model with three parameters: data complexity, target complexity, and model-parameter-count. We use this neural scaling model to derive new predictions about the compute-limited, infinite-data scaling law regime. To train the neural scaling model, we run one-pass stochastic gradient descent on a mean-squared loss. We derive a representation of the loss curves… ▽ More

    Submitted 18 April, 2025; v1 submitted 23 May, 2024; originally announced May 2024.

  17. arXiv:2403.19284  [pdf, ps, other

    math.AP

    Existence of solutions for a class of Kirchhoff-type equations with indefinite potential

    Authors: Linlian Xiao, Jiaqian Yuan, Jian Zhou, Yunshun Wu

    Abstract: In this paper, we consider the existence of solutions of the following Kirchhoff-type problem \[ \left\{ \begin{array} [c]{ll} -\left(a+b\int_{\mathbb{R}^3}|\nabla u|^2dx\right)Δu+ V(x)u=f(x,u),~{\rm{in}}~ \mathbb{R}^{3},\\ u\in H^1(\mathbb{R}^3), \end{array} \right. \] where $a,b$ are postive constants, and the potential $V(x)$ is continuous and indefinite in sign. Under some suitable… ▽ More

    Submitted 28 March, 2024; originally announced March 2024.

  18. arXiv:2401.07373  [pdf, ps, other

    math.AP

    Non-convexity of level sets for $k$-Hessian equations in convex ring

    Authors: Zhizhang Wang, Ling Xiao

    Abstract: In this paper we construct explicit examples that show the sublevel sets of the solution of a $k$-Hessian equation defined on a convex ring do not have to be convex.

    Submitted 6 February, 2024; v1 submitted 14 January, 2024; originally announced January 2024.

  19. arXiv:2401.00680  [pdf, ps, other

    math.QA

    Whittaker modules and hyperbolic Toda lattices

    Authors: Limeng Xia

    Abstract: Let $\sg$ be a complex finite-dimensional simple Lie algebra and let $\sg_l$ be the corresponding generalized Takiff algebra. This paper studies the affine variety $\ssf+\sb_l$ where $\ssf$ is similar to a principal nilpotent element of $\sg$ and $\sb_l$ is a subalgebra corresponding to the Borel subalgebra $\sb$ of $\sg$. Inspired by Kostant's work then we deal with two questions. One of them is… ▽ More

    Submitted 1 January, 2024; originally announced January 2024.

    Comments: 45 pages

  20. arXiv:2312.15295  [pdf, other

    stat.ML cs.LG math.OC

    AdamL: A fast adaptive gradient method incorporating loss function

    Authors: Lu Xia, Stefano Massei

    Abstract: Adaptive first-order optimizers are fundamental tools in deep learning, although they may suffer from poor generalization due to the nonuniform gradient scaling. In this work, we propose AdamL, a novel variant of the Adam optimizer, that takes into account the loss function information to attain better generalization results. We provide sufficient conditions that together with the Polyak-Lojasiewi… ▽ More

    Submitted 23 December, 2023; originally announced December 2023.

  21. arXiv:2312.01586  [pdf, ps, other

    math.OC eess.SY

    On the Maximization of Long-Run Reward CVaR for Markov Decision Processes

    Authors: Li Xia, Zhihui Yu, Peter W. Glynn

    Abstract: This paper studies the optimization of Markov decision processes (MDPs) from a risk-seeking perspective, where the risk is measured by conditional value-at-risk (CVaR). The objective is to find a policy that maximizes the long-run CVaR of instantaneous rewards over an infinite horizon across all history-dependent randomized policies. By establishing two optimality inequalities of opposing directio… ▽ More

    Submitted 3 December, 2023; originally announced December 2023.

    Comments: Risk-seeking optimization of CVaR in MDP

  22. arXiv:2308.10506  [pdf, other

    math.OC

    A relaxation method for binary optimizations on constrained Stiefel manifold

    Authors: Lianghai Xiao, Yitian Qian, Shaohua Pan

    Abstract: This paper focuses on a class of binary orthogonal optimization problems frequently arising in semantic hashing. Consider that this class of problems may have an empty feasible set, rendering them not well-defined. We introduce an equivalent model involving a restricted Stiefel manifold and a matrix box set, and then investigate its penalty problems induced by the $\ell_1$-distance from the box se… ▽ More

    Submitted 7 July, 2024; v1 submitted 21 August, 2023; originally announced August 2023.

  23. arXiv:2307.14608  [pdf, ps, other

    math.RT math.QA math.RA

    Smooth modules over the N=1 Bondi-Metzner-Sachs superalgebra

    Authors: Dong Liu, Yufeng Pei, Limeng Xia, Kaiming Zhao

    Abstract: In this paper, we present a determinant formula for the contravariant form on Verma modules over the N=1 Bondi-Metzner-Sachs (BMS) superalgebra. This formula establishes a necessary and sufficient condition for the irreducibility of the Verma modules. We then introduce and characterize a class of simple smooth modules that generalize both Verma and Whittaker modules over the N=1 BMS superalgebra.… ▽ More

    Submitted 26 July, 2023; originally announced July 2023.

    Comments: Latex 27pages, comments are welcome!

    MSC Class: 17B65; 17B68; 17B69; 17B70; 81R10

  24. arXiv:2305.16662  [pdf, ps, other

    math.RT math-ph math.QA math.RA

    Simple smooth modules over the superconformal current algebra

    Authors: Dong Liu, Yufeng Pei, Limeng Xia, Kaiming Zhao

    Abstract: In this paper, we classify simple smooth modules over the superconformal current algebra $\frak g$. More precisely, we first classify simple smooth modules over the Heisenberg-Clifford algebra, and then prove that any simple smooth $\frak g$-module is a tensor product of such modules for the super Virasoro algebra and the Heisenberg-Clifford algebra, or an induced module from a simple module over… ▽ More

    Submitted 28 May, 2023; v1 submitted 26 May, 2023; originally announced May 2023.

    Comments: Latex, 30pages, comments are welcome!

    MSC Class: 17B65; 17B68; 17B70

  25. arXiv:2303.12613  [pdf, other

    math.ST cs.IT

    Noisy recovery from random linear observations: Sharp minimax rates under elliptical constraints

    Authors: Reese Pathak, Martin J. Wainwright, Lin Xiao

    Abstract: Estimation problems with constrained parameter spaces arise in various settings. In many of these problems, the observations available to the statistician can be modelled as arising from the noisy realization of the image of a random linear operator; an important special case is random design regression. We derive sharp rates of estimation for arbitrary compact elliptical parameter sets and demons… ▽ More

    Submitted 22 March, 2023; originally announced March 2023.

    Comments: 53 pages, 2 figures

  26. arXiv:2302.13710  [pdf, ps, other

    math.OC cs.LG

    Global Algorithms for Mean-Variance Optimization in Markov Decision Processes

    Authors: Li Xia, Shuai Ma

    Abstract: Dynamic optimization of mean and variance in Markov decision processes (MDPs) is a long-standing challenge caused by the failure of dynamic programming. In this paper, we propose a new approach to find the globally optimal policy for combined metrics of steady-state mean and variance in an infinite-horizon undiscounted MDP. By introducing the concepts of pseudo mean and pseudo variance, we convert… ▽ More

    Submitted 27 February, 2023; originally announced February 2023.

    Comments: A breakthrough to develop globally optimal algorithms to solve the steady-state mean-variance MDP problem

  27. arXiv:2302.07697  [pdf, ps, other

    math.NT

    Slopes of modular forms and geometry of eigencurves

    Authors: Ruochuan Liu, Nha Xuan Truong, Liang Xiao, Bin Zhao

    Abstract: Under a stronger genericity condition, we prove the local analogue of ghost conjecture of Bergdall and Pollack. As applications, we deduce in this case (a) a folklore conjecture of Breuil--Buzzard--Emerton on the crystalline slopes of Kisin's crystabelian deformation spaces, (b) Gouvea's $\lfloor\frac{k-1}{p+1}\rfloor$-conjecture on slopes of modular forms, and (c) the finiteness of irreducible co… ▽ More

    Submitted 26 January, 2025; v1 submitted 15 February, 2023; originally announced February 2023.

    Comments: 117 pages; we significantly improve the exposition following the referee's suggestions

    MSC Class: 11F33; 11F85

  28. arXiv:2301.09511  [pdf, other

    stat.ML cs.LG math.NA math.OC

    On the Convergence of the Gradient Descent Method with Stochastic Fixed-point Rounding Errors under the Polyak-Lojasiewicz Inequality

    Authors: Lu Xia, Michiel E. Hochstenbach, Stefano Massei

    Abstract: When training neural networks with low-precision computation, rounding errors often cause stagnation or are detrimental to the convergence of the optimizers; in this paper we study the influence of rounding errors on the convergence of the gradient descent method for problems satisfying the Polyak-\Lojasiewicz inequality. Within this context, we show that, in contrast, biased stochastic rounding e… ▽ More

    Submitted 18 January, 2025; v1 submitted 23 January, 2023; originally announced January 2023.

  29. arXiv:2212.02106  [pdf, ps, other

    math.RT math-ph math.RA

    $U(\frak h)$-free modules over the Lie algebras of differential operators

    Authors: Munayim Dilxat, Shoulan Gao, Dong Liu, Limeng Xia

    Abstract: In this paper, we consider some non-weight modules over the Lie algebra of Weyl type. First, we determine the modules whose restriction to $U(\frak h)$ are free of rank $1$ over the Lie algebra of differential operators on the circle. Then we determine the necessary and sufficient conditions for the tensor products of quasi-finite highest weight modules and $U(\frak h)$-free modules to be irreduci… ▽ More

    Submitted 5 December, 2022; originally announced December 2022.

    Comments: Latex, 17 pages

    MSC Class: 17B10; 17B65; 17B68

    Journal ref: Published in Mathematics, 2022, 10(10), 1728

  30. arXiv:2211.02500  [pdf, ps, other

    math.QA

    Heisenberg double of the generalized quantum euclidean group and its representations

    Authors: Limeng Xia

    Abstract: The generalized quantum Euclidean group $\oq(\frak{b}_{m,n})$ is a natural generalization of the quantum Euclidean group $\oq(\frak{b}_{1,1})$. The Heisenberg double $\od(\frak{b}_{m,n})$ of $\oq(\frak{b}_{m,n})$ is the smash product of $\oq(\frak{b}_{m,n})$ with its Hopf dual $\ou(\frak{b}_{m,n})$. In this paper, we study the weight modules, the prime spectrum and the automorphism group of the He… ▽ More

    Submitted 4 November, 2022; originally announced November 2022.

    Comments: 11pages. comments are welcome

  31. arXiv:2211.01160  [pdf, other

    cs.IR cs.LG math.OC

    A Profit-Maximizing Strategy for Advertising on the e-Commerce Platforms

    Authors: Lianghai Xiao, Yixing Zhao, Jiwei Chen

    Abstract: The online advertising management platform has become increasingly popular among e-commerce vendors/advertisers, offering a streamlined approach to reach target customers. Despite its advantages, configuring advertising strategies correctly remains a challenge for online vendors, particularly those with limited resources. Ineffective strategies often result in a surge of unproductive ``just lookin… ▽ More

    Submitted 21 August, 2023; v1 submitted 30 October, 2022; originally announced November 2022.

    Comments: Online advertising campaigns

  32. arXiv:2210.08740  [pdf, ps, other

    math.OC cs.AI

    Risk-Sensitive Markov Decision Processes with Long-Run CVaR Criterion

    Authors: Li Xia, Peter W. Glynn

    Abstract: CVaR (Conditional Value at Risk) is a risk metric widely used in finance. However, dynamically optimizing CVaR is difficult since it is not a standard Markov decision process (MDP) and the principle of dynamic programming fails. In this paper, we study the infinite-horizon discrete-time MDP with a long-run CVaR criterion, from the view of sensitivity-based optimization. By introducing a pseudo CVa… ▽ More

    Submitted 17 October, 2022; originally announced October 2022.

    Comments: 33 pages, 7 figures, 4 tables. A risk-sensitive MDP methodology for optimizing long-run CVaR, which is extensive to data-driven learning scenarios

  33. arXiv:2210.01400  [pdf, ps, other

    cs.LG cs.AI math.OC

    Linear Convergence of Natural Policy Gradient Methods with Log-Linear Policies

    Authors: Rui Yuan, Simon S. Du, Robert M. Gower, Alessandro Lazaric, Lin Xiao

    Abstract: We consider infinite-horizon discounted Markov decision processes and study the convergence rates of the natural policy gradient (NPG) and the Q-NPG methods with the log-linear policy class. Using the compatible function approximation framework, both methods with log-linear policies can be written as inexact versions of the policy mirror descent (PMD) method. We show that both methods attain linea… ▽ More

    Submitted 21 February, 2023; v1 submitted 4 October, 2022; originally announced October 2022.

    Comments: This version adds a table of comparison for the literature review. The paper is published as a conference paper at ICLR 2023

  34. arXiv:2210.01050  [pdf, ps, other

    cs.GT cs.AI cs.LG math.OC

    Faster Last-iterate Convergence of Policy Optimization in Zero-Sum Markov Games

    Authors: Shicong Cen, Yuejie Chi, Simon S. Du, Lin Xiao

    Abstract: Multi-Agent Reinforcement Learning (MARL) -- where multiple agents learn to interact in a shared dynamic environment -- permeates across a wide range of critical applications. While there has been substantial progress on understanding the global convergence of policy optimization methods in single-agent RL, designing and analysis of efficient policy optimization algorithms in the MARL setting pres… ▽ More

    Submitted 3 October, 2022; v1 submitted 3 October, 2022; originally announced October 2022.

  35. arXiv:2209.15224  [pdf, other

    stat.ML cs.LG math.ST stat.ME

    Robust Unsupervised Multi-task and Transfer Learning on Gaussian Mixture Models

    Authors: Ye Tian, Haolei Weng, Lucy Xia, Yang Feng

    Abstract: Unsupervised learning has been widely used in many real-world applications. One of the simplest and most important unsupervised learning models is the Gaussian mixture model (GMM). In this work, we study the multi-task learning problem on GMMs, which aims to leverage potentially similar GMM parameter structures among tasks to obtain improved learning performance compared to single-task learning. W… ▽ More

    Submitted 2 August, 2024; v1 submitted 30 September, 2022; originally announced September 2022.

    Comments: 162 pages, 15 figures, 2 tables

  36. arXiv:2208.00982  [pdf, other

    math.NA math.DS q-bio.PE

    Dominant Eigenvalue-Eigenvector Pair Estimation via Graph Infection

    Authors: Kaiyuan Yang, Li Xia, Y. C. Tay

    Abstract: We present a novel method to estimate the dominant eigenvalue and eigenvector pair of any non-negative real matrix via graph infection. The key idea in our technique lies in approximating the solution to the first-order matrix ordinary differential equation (ODE) with the Euler method. Graphs, which can be weighted, directed, and with loops, are first converted to its adjacency matrix A. Then by a… ▽ More

    Submitted 7 May, 2023; v1 submitted 1 August, 2022; originally announced August 2022.

    Comments: Research paper accepted by Proc. 16th International Conference on Graph Transformation (ICGT 2023), Leicester, UK. Extended abstract accepted by the Graph Signal Processing (GSP) Workshop 2023, Oxford, UK. GitHub source code: https://github.com/FeynmanDNA/Dominant_EigenPair_Est_Graph_Infection

  37. arXiv:2207.05673  [pdf, ps, other

    math.DG

    Generalized Minkowski inequality via degenerate Hessian equations on exterior domains

    Authors: Ling Xiao

    Abstract: In this paper, we prove a generalized Minkowski inequality holds for any smooth, $(k-1)$-convex, starshaped domain $Ω.$ Our proof relies on the solvability of the degenerate $k$-Hessian equation on the exterior domain $\mathbb R^n\setminusΩ.$

    Submitted 12 July, 2022; originally announced July 2022.

  38. arXiv:2207.04552  [pdf, ps, other

    math.DG

    Entire $σ_k$ curvature flow in Minkowski space

    Authors: Zhizhang Wang, Ling Xiao

    Abstract: In this paper, we study the $σ_k$ curvature flow of noncompact spacelike hypersurfaces in Minkowski space. We prove that if the initial hypersurface satisfies certain conditions, then the flow exists for all time. Moreover, we show that after rescaling, the flow converges to a self-expander.

    Submitted 10 July, 2022; originally announced July 2022.

  39. arXiv:2207.04432  [pdf, ps, other

    math.RT

    Simple weight modules for Yangian $\operatorname{Y}(\mathfrak{sl}_{2})$

    Authors: Yikun Zhou, Yilan Tan, Limeng Xia

    Abstract: Let $\mathfrak{g}$ be a finite-dimensional simple Lie algebra over $\mathbb{C}$. A $\operatorname{Y}(\mathfrak{g})$-module is said to be weight if it is a weight $\mathfrak{g}$-module. We give a complete classification of simple weight modules for $\operatorname{Y}(\mathfrak{sl}_2)$ which admits a one-dimensional weight space. We prove that there are four classes of such modules: finite, highest w… ▽ More

    Submitted 4 August, 2022; v1 submitted 10 July, 2022; originally announced July 2022.

    Comments: 14 pages, 2 figures

    MSC Class: 20G42; 81R50

  40. arXiv:2206.15372  [pdf, ps, other

    math.NT

    A local analogue of the ghost conjecture of Bergdall-Pollack

    Authors: Ruochuan Liu, Nha Xuan Truong, Liang Xiao, Bin Zhao

    Abstract: We formulate a local analogue of the ghost conjecture of Bergdall and Pollack, which essentially relies purely on the representation theory of GL_2(Q_p). We further study the combinatorial properties of the ghost series as well as its Newton polygon, in particular, giving a characterization of the vertices of the Newton polygon and proving an integrality result of the slopes. In a forthcoming sequ… ▽ More

    Submitted 29 November, 2022; v1 submitted 30 June, 2022; originally announced June 2022.

    Comments: We change several notations from the first version. We change 'weak arithmetic module' to 'K_p-projective augmented module' in Definition 2.3 and change 'arithmetic p-adic forms' to 'abstract p-adic forms' in section 2.4. We also add an example (5.13) to explain the meaning of the function Δdefined in 5.1

    MSC Class: 11F33 (primary); 11F85 (secondary)

    Journal ref: Peking Math. J. 7 (2024), no. 1, 247-344

  41. arXiv:2206.06900  [pdf, other

    cs.LG math.OC stat.ML

    Grad-GradaGrad? A Non-Monotone Adaptive Stochastic Gradient Method

    Authors: Aaron Defazio, Baoyu Zhou, Lin Xiao

    Abstract: The classical AdaGrad method adapts the learning rate by dividing by the square root of a sum of squared gradients. Because this sum on the denominator is increasing, the method can only decrease step sizes over time, and requires a learning rate scaling hyper-parameter to be carefully tuned. To overcome this restriction, we introduce GradaGrad, a method in the same family that naturally grows or… ▽ More

    Submitted 14 June, 2022; originally announced June 2022.

  42. arXiv:2205.06853  [pdf, ps, other

    math.DG

    Entire self-expanders for power of $σ_k$ curvature flow in Minkowski space

    Authors: Zhizhang Wang, Ling Xiao

    Abstract: In [19], we prove that if an entire, spacelike, convex hypersurface $\mathcal{M}_{u_0}$ has bounded principal curvatures, then the $σ_k^{1/α}$ (power of $σ_k$) curvature flow starting from $\mathcal{M}_{u_0}$ admits a smooth convex solution $u$ for $t>0.$ Moreover, after rescaling, the flow converges to a convex self-expander $\tilde{\mathcal{M}}=\{(x, \tilde{u}(x))\mid x\in\mathbb{R}^n\}$ that sa… ▽ More

    Submitted 13 May, 2022; originally announced May 2022.

  43. arXiv:2205.06849  [pdf, ps, other

    math.DG

    Entire convex curvature flow in Minkowski space

    Authors: Zhizhang Wang, Ling Xiao

    Abstract: In this paper, we study fully nonlinear curvature flows of noncompact spacelike hypersurfaces in Minkowski space. We prove that if the initial hypersurface satisfies certain conditions, then the flow exists for all time. Moreover, we show that after rescaling the flow converges to the future timelike hyperboloid, which is a self-expander.

    Submitted 13 May, 2022; originally announced May 2022.

  44. arXiv:2204.13169  [pdf, other

    cs.LG cs.DC math.OC stat.ML

    FedShuffle: Recipes for Better Use of Local Work in Federated Learning

    Authors: Samuel Horváth, Maziar Sanjabi, Lin Xiao, Peter Richtárik, Michael Rabbat

    Abstract: The practice of applying several local updates before aggregation across clients has been empirically shown to be a successful approach to overcoming the communication bottleneck in Federated Learning (FL). Such methods are usually implemented by having clients perform one or more epochs of local training per round while randomly reshuffling their finite dataset in each epoch. Data imbalance, wher… ▽ More

    Submitted 27 September, 2022; v1 submitted 27 April, 2022; originally announced April 2022.

    Comments: Published in Transactions on Machine Learning Research (09/2022)

  45. arXiv:2204.03809  [pdf, other

    cs.LG cs.DC math.OC

    Federated Learning with Partial Model Personalization

    Authors: Krishna Pillutla, Kshitiz Malik, Abdelrahman Mohamed, Michael Rabbat, Maziar Sanjabi, Lin Xiao

    Abstract: We consider two federated learning algorithms for training partially personalized models, where the shared and personal parameters are updated either simultaneously or alternately on the devices. Both algorithms have been proposed in the literature, but their convergence properties are not fully understood, especially for the alternating variant. We provide convergence analyses of both algorithms… ▽ More

    Submitted 15 August, 2022; v1 submitted 7 April, 2022; originally announced April 2022.

    Journal ref: ICML 2022: 17716-17758

  46. arXiv:2202.12276  [pdf, other

    cs.LG math.NA stat.ML

    On the influence of stochastic roundoff errors and their bias on the convergence of the gradient descent method with low-precision floating-point computation

    Authors: Lu Xia, Stefano Massei, Michiel E. Hochstenbach, Barry Koren

    Abstract: When implementing the gradient descent method in low precision, the employment of stochastic rounding schemes helps to prevent stagnation of convergence caused by the vanishing gradient effect. Unbiased stochastic rounding yields zero bias by preserving small updates with probabilities proportional to their relative magnitudes. This study provides a theoretical explanation for the stagnation of th… ▽ More

    Submitted 25 February, 2023; v1 submitted 24 February, 2022; originally announced February 2022.

  47. arXiv:2201.07443  [pdf, ps, other

    math.OC cs.LG

    On the Convergence Rates of Policy Gradient Methods

    Authors: Lin Xiao

    Abstract: We consider infinite-horizon discounted Markov decision problems with finite state and action spaces and study the convergence rates of the projected policy gradient method and a general class of policy mirror descent methods, all with direct parametrization in the policy space. First, we develop a theory of weak gradient-mapping dominance and use it to prove sharper sublinear convergence rate of… ▽ More

    Submitted 6 March, 2022; v1 submitted 19 January, 2022; originally announced January 2022.

    Comments: This version removed a mistake and related comments in the previous version of the paper. Specifically, Theorem 1 of the previous version (arXiv:2201.07443v1) state that weighted value function is both quasi-concave and quasi-convex, which is wrong. Fortunately this mistake does not affect rest of the results that are contained in this version

  48. arXiv:2201.05737  [pdf, other

    math.OC cs.AI

    A unified algorithm framework for mean-variance optimization in discounted Markov decision processes

    Authors: Shuai Ma, Xiaoteng Ma, Li Xia

    Abstract: This paper studies the risk-averse mean-variance optimization in infinite-horizon discounted Markov decision processes (MDPs). The involved variance metric concerns reward variability during the whole process, and future deviations are discounted to their present values. This discounted mean-variance optimization yields a reward function dependent on a discounted mean, and this dependency renders… ▽ More

    Submitted 14 January, 2022; originally announced January 2022.

  49. arXiv:2112.00329  [pdf, other

    stat.ME math.ST

    Non-splitting Neyman-Pearson Classifiers

    Authors: Jingming Wang, Lucy Xia, Zhigang Bao, Xin Tong

    Abstract: The Neyman-Pearson (NP) binary classification paradigm constrains the more severe type of error (e.g., the type I error) under a preferred level while minimizing the other (e.g., the type II error). This paradigm is suitable for applications such as severe disease diagnosis, fraud detection, among others. A series of NP classifiers have been developed to guarantee the type I error control with hig… ▽ More

    Submitted 4 June, 2022; v1 submitted 1 December, 2021; originally announced December 2021.

  50. Error bound and exact penalty method for optimization problems with nonnegative orthogonal constraint

    Authors: Yitian Qian, Shaohua Pan, Lianghai Xiao

    Abstract: This paper is concerned with a class of optimization problems with the nonnegative orthogonal constraint, in which the objective function is $L$-smooth on an open set containing the Stiefel manifold ${\rm St}(n,r)$. We derive a locally Lipschitzian error bound for the feasible points without zero rows when $n>r>1$, and when $n>r=1$ or $n=r$ achieve a global Lipschitzian error bound. Then, we show… ▽ More

    Submitted 4 February, 2025; v1 submitted 5 November, 2021; originally announced November 2021.

    Comments: 34 pages, and 6 figures