Skip to main content

Showing 1–26 of 26 results for author: Li, C J

Searching in archive cs. Search in all archives.
.
  1. arXiv:2509.23895  [pdf, ps, other

    cs.CV cs.AI

    Preserving Cross-Modal Stability for Visual Unlearning in Multimodal Scenarios

    Authors: Jinghan Xu Yuyang Zhang Qixuan Cai Jiancheng Chen Keqiu Li

    Abstract: Visual modality is the most vulnerable to privacy leakage in real-world multimodal applications like autonomous driving with visual and radar data; Machine unlearning removes specific training data from pre-trained models to address privacy leakage, however, existing methods fail to preserve cross-modal knowledge and maintain intra-class structural stability of retain data, leading to reduced over… ▽ More

    Submitted 28 September, 2025; originally announced September 2025.

    Comments: 9 pages,4 figures

  2. arXiv:2506.18003  [pdf, ps, other

    cs.AR

    AMD Versal Implementations of FAM and SSCA Estimators

    Authors: Carol Jingyi Li, Ruilin Wu, Philip H. W. Leong

    Abstract: Cyclostationary analysis is widely used in signal processing, particularly in the analysis of human-made signals, and spectral correlation density (SCD) is often used to characterise cyclostationarity. Unfortunately, for real-time applications, even utilising the fast Fourier transform (FFT), the high computational complexity associated with estimating the SCD limits its applicability. In this wor… ▽ More

    Submitted 22 June, 2025; originally announced June 2025.

  3. arXiv:2506.06958  [pdf, ps, other

    cs.CY cs.AI cs.MA

    Position: Simulating Society Requires Simulating Thought

    Authors: Chance Jiajie Li, Jiayi Wu, Zhenze Mo, Ao Qu, Yuhan Tang, Kaiya Ivy Zhao, Yulu Gan, Jie Fan, Jiangbo Yu, Jinhua Zhao, Paul Liang, Luis Alonso, Kent Larson

    Abstract: Simulating society with large language models (LLMs), we argue, requires more than generating plausible behavior; it demands cognitively grounded reasoning that is structured, revisable, and traceable. LLM-based agents are increasingly used to emulate individual and group behavior, primarily through prompting and supervised fine-tuning. Yet they often lack internal coherence, causal reasoning, and… ▽ More

    Submitted 26 September, 2025; v1 submitted 7 June, 2025; originally announced June 2025.

    Comments: To appear in NeurIPS 2025 (Position Paper Track)

  4. arXiv:2407.10955   

    stat.ML cs.LG math.OC

    Enhancing Stochastic Optimization for Statistical Efficiency Using ROOT-SGD with Diminishing Stepsize

    Authors: Chris Junchi Li

    Abstract: In this paper, we revisit \textsf{ROOT-SGD}, an innovative method for stochastic optimization to bridge the gap between stochastic optimization and statistical efficiency. The proposed method enhances the performance and reliability of \textsf{ROOT-SGD} by integrating a carefully designed \emph{diminishing stepsize strategy}. This approach addresses key challenges in optimization, providing robust… ▽ More

    Submitted 16 July, 2024; v1 submitted 15 July, 2024; originally announced July 2024.

    Comments: arXiv admin note: Author list truncated. This submission has been withdrawn by arXiv administrators as the other author was added without their knowledge or consent

  5. arXiv:2405.04566  [pdf, other

    cs.LG cs.DC stat.ML

    Fast Decentralized Gradient Tracking for Federated Minimax Optimization with Local Updates

    Authors: Chris Junchi Li

    Abstract: Federated learning (FL) for minimax optimization has emerged as a powerful paradigm for training models across distributed nodes/clients while preserving data privacy and model robustness on data heterogeneity. In this work, we delve into the decentralized implementation of federated minimax optimization by proposing \texttt{K-GT-Minimax}, a novel decentralized minimax optimization algorithm that… ▽ More

    Submitted 7 May, 2024; originally announced May 2024.

  6. arXiv:2405.00914  [pdf, other

    math.OC cs.LG stat.ML

    Accelerated Fully First-Order Methods for Bilevel and Minimax Optimization

    Authors: Chris Junchi Li

    Abstract: We present in this paper novel accelerated fully first-order methods in \emph{Bilevel Optimization} (BLO). Firstly, for BLO under the assumption that the lower-level functions admit the typical strong convexity assumption, the \emph{(Perturbed) Restarted Accelerated Fully First-order methods for Bilevel Approximation} (\texttt{PRAF${}^2$BA}) algorithm leveraging \emph{fully} first-order oracles is… ▽ More

    Submitted 9 July, 2024; v1 submitted 1 May, 2024; originally announced May 2024.

  7. arXiv:2404.14358  [pdf, other

    math.OC cs.LG

    A General Continuous-Time Formulation of Stochastic ADMM and Its Variants

    Authors: Chris Junchi Li

    Abstract: Stochastic versions of the alternating direction method of multiplier (ADMM) and its variants play a key role in many modern large-scale machine learning problems. In this work, we introduce a unified algorithmic framework called generalized stochastic ADMM and investigate their continuous-time analysis. The generalized framework widely includes many stochastic ADMM variants such as standard, line… ▽ More

    Submitted 22 April, 2024; originally announced April 2024.

  8. arXiv:2307.00126  [pdf, other

    math.OC cs.LG stat.ML

    Accelerating Inexact HyperGradient Descent for Bilevel Optimization

    Authors: Haikuo Yang, Luo Luo, Chris Junchi Li, Michael I. Jordan

    Abstract: We present a method for solving general nonconvex-strongly-convex bilevel optimization problems. Our method -- the \emph{Restarted Accelerated HyperGradient Descent} (\texttt{RAHGD}) method -- finds an $ε$-first-order stationary point of the objective with $\tilde{\mathcal{O}}(κ^{3.25}ε^{-1.75})$ oracle complexity, where $κ$ is the condition number of the lower-level objective and $ε$ is the desir… ▽ More

    Submitted 30 June, 2023; originally announced July 2023.

  9. arXiv:2210.17550  [pdf, other

    math.OC cs.GT cs.LG stat.ML

    Nesterov Meets Optimism: Rate-Optimal Separable Minimax Optimization

    Authors: Chris Junchi Li, Angela Yuan, Gauthier Gidel, Quanquan Gu, Michael I. Jordan

    Abstract: We propose a new first-order optimization algorithm -- AcceleratedGradient-OptimisticGradient (AG-OG) Descent Ascent -- for separable convex-concave minimax optimization. The main idea of our algorithm is to carefully leverage the structure of the minimax problem, performing Nesterov acceleration on the individual component and optimistic gradient on the coupling component. Equipped with proper re… ▽ More

    Submitted 14 August, 2023; v1 submitted 31 October, 2022; originally announced October 2022.

    Comments: 44 pages. This version matches the camera-ready that appeared at ICML 2023 under the same title

  10. arXiv:2209.15634  [pdf, other

    cs.LG cs.AI stat.ML

    A General Framework for Sample-Efficient Function Approximation in Reinforcement Learning

    Authors: Zixiang Chen, Chris Junchi Li, Angela Yuan, Quanquan Gu, Michael I. Jordan

    Abstract: With the increasing need for handling large state and action spaces, general function approximation has become a key technique in reinforcement learning (RL). In this paper, we propose a general framework that unifies model-based and model-free RL, and an Admissible Bellman Characterization (ABC) class that subsumes nearly all Markov Decision Process (MDP) models in the literature for tractable RL… ▽ More

    Submitted 30 September, 2022; originally announced September 2022.

  11. arXiv:2208.05363  [pdf, ps, other

    cs.LG cs.AI cs.GT math.OC stat.ML

    Learning Two-Player Mixture Markov Games: Kernel Function Approximation and Correlated Equilibrium

    Authors: Chris Junchi Li, Dongruo Zhou, Quanquan Gu, Michael I. Jordan

    Abstract: We consider learning Nash equilibria in two-player zero-sum Markov Games with nonlinear function approximation, where the action-value function is approximated by a function in a Reproducing Kernel Hilbert Space (RKHS). The key challenge is how to do exploration in the high-dimensional function space. We propose a novel online learning algorithm to find a Nash equilibrium by minimizing the duality… ▽ More

    Submitted 10 August, 2022; originally announced August 2022.

    Comments: 42 pages

  12. arXiv:2206.08573  [pdf, ps, other

    math.OC cs.CC cs.GT cs.LG

    Optimal Extragradient-Based Bilinearly-Coupled Saddle-Point Optimization

    Authors: Simon S. Du, Gauthier Gidel, Michael I. Jordan, Chris Junchi Li

    Abstract: We consider the smooth convex-concave bilinearly-coupled saddle-point problem, $\min_{\mathbf{x}}\max_{\mathbf{y}}~F(\mathbf{x}) + H(\mathbf{x},\mathbf{y}) - G(\mathbf{y})$, where one has access to stochastic first-order oracles for $F$, $G$ as well as the bilinear coupling function $H$. Building upon standard stochastic extragradient analysis for variational inequalities, we present a stochastic… ▽ More

    Submitted 11 August, 2022; v1 submitted 17 June, 2022; originally announced June 2022.

    Comments: More polishing and clarifications; 36 pages

  13. arXiv:2112.14738  [pdf, other

    stat.ML cs.LG math.OC

    Nonconvex Stochastic Scaled-Gradient Descent and Generalized Eigenvector Problems

    Authors: Chris Junchi Li, Michael I. Jordan

    Abstract: Motivated by the problem of online canonical correlation analysis, we propose the \emph{Stochastic Scaled-Gradient Descent} (SSGD) algorithm for minimizing the expectation of a stochastic function over a generic Riemannian manifold. SSGD generalizes the idea of projected stochastic gradient descent and allows the use of scaled stochastic gradients instead of stochastic gradients. In the special ca… ▽ More

    Submitted 23 January, 2022; v1 submitted 29 December, 2021; originally announced December 2021.

    Comments: Minor typographical updates

  14. arXiv:2107.00464  [pdf, other

    math.OC cs.GT cs.LG stat.ML

    On the Convergence of Stochastic Extragradient for Bilinear Games using Restarted Iteration Averaging

    Authors: Chris Junchi Li, Yaodong Yu, Nicolas Loizou, Gauthier Gidel, Yi Ma, Nicolas Le Roux, Michael I. Jordan

    Abstract: We study the stochastic bilinear minimax optimization problem, presenting an analysis of the same-sample Stochastic ExtraGradient (SEG) method with constant step size, and presenting variations of the method that yield favorable convergence. In sharp contrasts with the basic SEG method whose last iterate only contracts to a fixed neighborhood of the Nash equilibrium, SEG augmented with iteration a… ▽ More

    Submitted 8 April, 2022; v1 submitted 30 June, 2021; originally announced July 2021.

    Comments: Camera-ready version appeared at AISTATS 2022; short version appeared at NeurIPS OPT 2021 Workshop

  15. arXiv:2012.14415  [pdf, other

    cs.LG math.OC stat.ML

    Stochastic Approximation for Online Tensorial Independent Component Analysis

    Authors: Chris Junchi Li, Michael I. Jordan

    Abstract: Independent component analysis (ICA) has been a popular dimension reduction tool in statistical machine learning and signal processing. In this paper, we present a convergence analysis for an online tensorial ICA algorithm, by viewing the problem as a nonconvex stochastic approximation problem. For estimating one component, we provide a dynamics-based analysis to prove that our online tensorial IC… ▽ More

    Submitted 29 July, 2021; v1 submitted 28 December, 2020; originally announced December 2020.

    Comments: To appear in Conference on Learning Theory (COLT), 2021

  16. arXiv:2008.12690  [pdf, ps, other

    math.OC cs.LG stat.ML

    ROOT-SGD: Sharp Nonasymptotics and Near-Optimal Asymptotics in a Single Algorithm

    Authors: Chris Junchi Li, Wenlong Mou, Martin J. Wainwright, Michael I. Jordan

    Abstract: We study the problem of solving strongly convex and smooth unconstrained optimization problems using stochastic first-order algorithms. We devise a novel algorithm, referred to as Recursive One-Over-T SGD (ROOT-SGD), based on an easily implementable, recursive averaging of past stochastic gradients. We prove that it simultaneously achieves state-of-the-art performance in both a finite-sample, nona… ▽ More

    Submitted 17 September, 2024; v1 submitted 28 August, 2020; originally announced August 2020.

    Comments: some corrections

    Journal ref: published at COLT 2022

  17. arXiv:2004.04719  [pdf, ps, other

    stat.ML cs.LG math.OC math.ST

    On Linear Stochastic Approximation: Fine-grained Polyak-Ruppert and Non-Asymptotic Concentration

    Authors: Wenlong Mou, Chris Junchi Li, Martin J. Wainwright, Peter L. Bartlett, Michael I. Jordan

    Abstract: We undertake a precise study of the asymptotic and non-asymptotic properties of stochastic approximation procedures with Polyak-Ruppert averaging for solving a linear system $\bar{A} θ= \bar{b}$. When the matrix $\bar{A}$ is Hurwitz, we prove a central limit theorem (CLT) for the averaged iterates with fixed step size and number of iterations going to infinity. The CLT characterizes the exact asym… ▽ More

    Submitted 9 April, 2020; originally announced April 2020.

  18. arXiv:2003.03532  [pdf, ps, other

    math.OC cs.LG stat.ML

    Stochastic Modified Equations for Continuous Limit of Stochastic ADMM

    Authors: Xiang Zhou, Huizhuo Yuan, Chris Junchi Li, Qingyun Sun

    Abstract: Stochastic version of alternating direction method of multiplier (ADMM) and its variants (linearized ADMM, gradient-based ADMM) plays a key role for modern large scale machine learning problems. One example is the regularized empirical risk minimization problem. In this work, we put different variants of stochastic ADMM into a unified form, which includes standard, linearized and gradient-based AD… ▽ More

    Submitted 7 March, 2020; originally announced March 2020.

    MSC Class: 37N40; 65K99 ACM Class: G.1.6

  19. arXiv:1904.12627  [pdf, other

    cs.CV cs.LG stat.ML

    Catch Me If You Can

    Authors: Antoine Viscardi, Casey Juanxi Li, Thomas Hollis

    Abstract: As advances in signature recognition have reached a new plateau of performance at around 2% error rate, it is interesting to investigate alternative approaches. The approach detailed in this paper looks at using Variational Auto-Encoders (VAEs) to learn a latent space representation of genuine signatures. This is then used to pass unlabelled signatures such that only the genuine ones will successf… ▽ More

    Submitted 18 April, 2019; originally announced April 2019.

  20. arXiv:1812.11377  [pdf, other

    cs.LG cs.CR stat.ML

    Hessian-Aware Zeroth-Order Optimization for Black-Box Adversarial Attack

    Authors: Haishan Ye, Zhichao Huang, Cong Fang, Chris Junchi Li, Tong Zhang

    Abstract: Zeroth-order optimization is an important research topic in machine learning. In recent years, it has become a key tool in black-box adversarial attack to neural network based image classifiers. However, existing zeroth-order optimization algorithms rarely extract second-order information of the model function. In this paper, we utilize the second-order information of the objective function and pr… ▽ More

    Submitted 20 March, 2019; v1 submitted 29 December, 2018; originally announced December 2018.

  21. arXiv:1809.02495   

    math.PR cs.LG

    A note on concentration inequality for vector-valued martingales with weak exponential-type tails

    Authors: Chris Junchi Li

    Abstract: We present novel martingale concentration inequalities for martingale differences with finite Orlicz-$ψ_α$ norms. Such martingale differences with weak exponential-type tails scatters in many statistical applications and can be heavier than sub-exponential distributions. In the case of one dimension, we prove in general that for a sequence of scalar-valued supermartingale difference, the tail boun… ▽ More

    Submitted 17 March, 2020; v1 submitted 6 September, 2018; originally announced September 2018.

    Comments: This short note has been merged and integrated into a follow-up work arXiv:2003.03532. Communications on v2 are still welcome

  22. arXiv:1808.09645  [pdf, other

    stat.ML cs.LG

    Diffusion Approximations for Online Principal Component Estimation and Global Convergence

    Authors: Chris Junchi Li, Mengdi Wang, Han Liu, Tong Zhang

    Abstract: In this paper, we propose to adopt the diffusion approximation tools to study the dynamics of Oja's iteration which is an online stochastic gradient descent method for the principal component analysis. Oja's iteration maintains a running estimate of the true principal component from streaming data and enjoys less temporal and spatial complexities. We show that the Oja's iteration for the top eigen… ▽ More

    Submitted 29 August, 2018; originally announced August 2018.

    Comments: Appeared in NIPS 2017

  23. arXiv:1808.09642  [pdf, other

    stat.ML cs.LG

    Online ICA: Understanding Global Dynamics of Nonconvex Optimization via Diffusion Processes

    Authors: Chris Junchi Li, Zhaoran Wang, Han Liu

    Abstract: Solving statistical learning problems often involves nonconvex optimization. Despite the empirical success of nonconvex statistical optimization methods, their global dynamics, especially convergence to the desirable local minima, remain less well understood in theory. In this paper, we propose a new analytic paradigm based on diffusion processes to characterize the global dynamics of nonconvex st… ▽ More

    Submitted 29 August, 2018; originally announced August 2018.

    Comments: Appeared in NIPS 2016

  24. arXiv:1807.01695  [pdf, other

    math.OC cs.LG stat.ML

    SPIDER: Near-Optimal Non-Convex Optimization via Stochastic Path Integrated Differential Estimator

    Authors: Cong Fang, Chris Junchi Li, Zhouchen Lin, Tong Zhang

    Abstract: In this paper, we propose a new technique named \textit{Stochastic Path-Integrated Differential EstimatoR} (SPIDER), which can be used to track many deterministic quantities of interest with significantly reduced computational cost. We apply SPIDER to two tasks, namely the stochastic first-order and zeroth-order methods. For stochastic first-order method, combining SPIDER with normalized gradient… ▽ More

    Submitted 17 October, 2018; v1 submitted 4 July, 2018; originally announced July 2018.

  25. arXiv:1705.07562  [pdf, other

    stat.ML cs.LG

    On the diffusion approximation of nonconvex stochastic gradient descent

    Authors: Wenqing Hu, Chris Junchi Li, Lei Li, Jian-Guo Liu

    Abstract: We study the Stochastic Gradient Descent (SGD) method in nonconvex optimization problems from the point of view of approximating diffusion processes. We prove rigorously that the diffusion process can approximate the SGD algorithm weakly using the weak form of master equation for probability evolution. In the small step size regime and the presence of omnidirectional noise, our weak approximating… ▽ More

    Submitted 3 March, 2018; v1 submitted 22 May, 2017; originally announced May 2017.

  26. arXiv:1702.08134  [pdf, other

    cs.LG math.OC stat.ML

    Dropping Convexity for More Efficient and Scalable Online Multiview Learning

    Authors: Zhehui Chen, Lin F. Yang, Chris J. Li, Tuo Zhao

    Abstract: Multiview representation learning is very popular for latent factor analysis. It naturally arises in many data analysis, machine learning, and information retrieval applications to model dependent structures among multiple data sources. For computational convenience, existing approaches usually formulate the multiview representation learning as convex optimization problems, where global optima can… ▽ More

    Submitted 15 September, 2019; v1 submitted 26 February, 2017; originally announced February 2017.

    Comments: A preliminary version appears in ICML 2017