Showing 1–50 of 53 results for author: Cai, H

Searching in archive math.
  1. arXiv:2505.18414 [pdf, ps, other]

    cs.LG cs.IT math.OC stat.ML

    A Dual Basis Approach for Structured Robust Euclidean Distance Geometry

    Authors: Chandra Kundu, Abiy Tasissa, HanQin Cai

    Abstract: Euclidean Distance Matrix (EDM), which consists of pairwise squared Euclidean distances of a given point configuration, finds many applications in modern machine learning. This paper considers the setting where only a set of anchor nodes is used to collect the distances between themselves and the rest. In the presence of potential outliers, it results in a structured partial observation on EDM wit…

    Submitted 23 May, 2025; originally announced May 2025.

  2. arXiv:2504.19635 [pdf, other]

    cs.MA cs.LG math.OC stat.ML

    Diffusion Stochastic Learning Over Adaptive Competing Networks

    Authors: Yike Zhao, Haoyuan Cai, Ali H. Sayed

    Abstract: This paper studies a stochastic dynamic game between two competing teams, each consisting of a network of collaborating agents. Unlike fully cooperative settings, where all agents share a common objective, each team in this game aims to minimize its own distinct objective. In the adversarial setting, their objectives could be conflicting as in zero-sum games. Throughout the competition, agents sha…

    Submitted 28 April, 2025; originally announced April 2025.

  3. arXiv:2501.07528 [pdf, other]

    math.AG math.AC

    Plus-pure thresholds of some cusp-like singularities in mixed characteristic

    Authors: Hanlin Cai, Suchitra Pande, Eamon Quinlan-Gallego, Karl Schwede, Kevin Tucker

    Abstract: Log-canonical and $F$-pure thresholds of pairs in equal characteristic admit an analog in the recent theory of singularities in mixed characteristic, which is known as the plus-pure threshold. In this paper we study plus-pure thresholds for singularities of the form $p^a + x^b \in {\bf Z}_p [[ x ]]$, showing that in a number of cases this plus-pure threshold agrees with the $F$-pure threshold of t…

    Submitted 13 January, 2025; originally announced January 2025.

    Comments: 15 pages, 1 table, 1 figure

  4. arXiv:2501.03974 [pdf, ps, other]

    math.NT math.AG

    Characterizing perfectoid covers of abelian varieties

    Authors: Rebecca Bellovin, Hanlin Cai, Sean Howe, Tongmu He

    Abstract: We give a simple characterization of all perfectoid profinite étale covers of abelian varieties in terms of the Hodge-Tate filtration on the $p$-adic Tate module. We also compute the geometric Sen morphism for all profinite $p$-adic Lie torsors over an abelian variety, and combine this with our characterization to prove a conjecture of Rodríguez Camargo on perfectoidness of $p$-adic Lie torsors in…

    Submitted 7 January, 2025; originally announced January 2025.

    Comments: 22 pages + two appendices and references. Comments welcome!

  5. arXiv:2501.00677 [pdf, other]

    cs.LG cs.CV cs.IT math.NA stat.ML

    Deeply Learned Robust Matrix Completion for Large-scale Low-rank Data Recovery

    Authors: HanQin Cai, Chandra Kundu, Jialin Liu, Wotao Yin

    Abstract: Robust matrix completion (RMC) is a widely used machine learning tool that simultaneously tackles two critical issues in low-rank data analysis: missing data entries and extreme outliers. This paper proposes a novel scalable and learnable non-convex approach, coined Learned Robust Matrix Completion (LRMC), for large-scale RMC problems. LRMC enjoys low computational complexity with linear convergen…

    Submitted 31 December, 2024; originally announced January 2025.

    Comments: arXiv admin note: substantial text overlap with arXiv:2110.05649

  6. arXiv:2412.10664 [pdf, other]

    cs.LG cs.IT math.OC stat.ML

    Structured Sampling for Robust Euclidean Distance Geometry

    Authors: Chandra Kundu, Abiy Tasissa, HanQin Cai

    Abstract: This paper addresses the problem of estimating the positions of points from distance measurements corrupted by sparse outliers. Specifically, we consider a setting with two types of nodes: anchor nodes, for which exact distances to each other are known, and target nodes, for which complete but corrupted distance measurements to the anchors are available. To tackle this problem, we propose a novel…

    Submitted 17 February, 2025; v1 submitted 13 December, 2024; originally announced December 2024.

    Journal ref: 59th Annual Conference on Information Sciences and Systems, 2025

  7. arXiv:2410.16826 [pdf, ps, other]

    math.OC cs.LG

    Guarantees of a Preconditioned Subgradient Algorithm for Overparameterized Asymmetric Low-rank Matrix Recovery

    Authors: Paris Giampouras, HanQin Cai, Rene Vidal

    Abstract: In this paper, we focus on a matrix factorization-based approach to recover low-rank {\it asymmetric} matrices from corrupted measurements. We propose an {\it Overparameterized Preconditioned Subgradient Algorithm (OPSA)} and provide, for the first time in the literature, linear convergence rates independent of the rank of the sought asymmetric matrix in the presence of gross corruptions. Our work…

    Submitted 29 May, 2025; v1 submitted 22 October, 2024; originally announced October 2024.

    Journal ref: International Conference on Machine Learning, 2025

  8. arXiv:2410.06376 [pdf, other]

    math.OC cs.LG

    Riemannian Optimization for Non-convex Euclidean Distance Geometry with Global Recovery Guarantees

    Authors: Chandler Smith, HanQin Cai, Abiy Tasissa

    Abstract: The problem of determining the configuration of points from partial distance information, known as the Euclidean Distance Geometry (EDG) problem, is fundamental to many tasks in the applied sciences. In this paper, we propose two algorithms grounded in the Riemannian optimization framework to address the EDG problem. Our approach formulates the problem as a low-rank matrix completion task over the…

    Submitted 8 October, 2024; originally announced October 2024.

    Comments: 38 pages, 4 figures, 5 tables

  9. arXiv:2406.13041 [pdf, other]

    cs.LG math.OC

    Accelerated Stochastic Min-Max Optimization Based on Bias-corrected Momentum

    Authors: Haoyuan Cai, Sulaiman A. Alghunaim, Ali H. Sayed

    Abstract: Lower-bound analyses for nonconvex strongly-concave minimax optimization problems have shown that stochastic first-order algorithms require at least $\mathcal{O}(\varepsilon^{-4})$ oracle complexity to find an $\varepsilon$-stationary point. Some works indicate that this complexity can be improved to $\mathcal{O}(\varepsilon^{-3})$ when the loss gradient is Lipschitz continuous. The question of ac…

    Submitted 13 May, 2025; v1 submitted 18 June, 2024; originally announced June 2024.

  10. arXiv:2406.11092 [pdf, other]

    cs.LG math.NA stat.ML

    Guaranteed Sampling Flexibility for Low-tubal-rank Tensor Completion

    Authors: Bowen Su, Juntao You, HanQin Cai, Longxiu Huang

    Abstract: While Bernoulli sampling is extensively studied in tensor completion, t-CUR sampling approximates low-tubal-rank tensors via lateral and horizontal subtensors. However, both methods lack sufficient flexibility for diverse practical applications. To address this, we introduce Tensor Cross-Concentrated Sampling (t-CCS), a novel and straightforward sampling model that advances the matrix cross-concen…

    Submitted 16 June, 2024; originally announced June 2024.

  11. arXiv:2406.07409 [pdf, other]

    stat.ML cs.IT cs.LG eess.SP math.OC

    Accelerating Ill-conditioned Hankel Matrix Recovery via Structured Newton-like Descent

    Authors: HanQin Cai, Longxiu Huang, Xiliang Lu, Juntao You

    Abstract: This paper studies the robust Hankel recovery problem, which simultaneously removes the sparse outliers and fulfills missing entries from the partial observation. We propose a novel non-convex algorithm, coined Hankel Structured Newton-Like Descent (HSNLD), to tackle the robust Hankel recovery problem. HSNLD is highly efficient with linear convergence, and its convergence rate is independent of th…

    Submitted 10 April, 2025; v1 submitted 11 June, 2024; originally announced June 2024.

    MSC Class: 15A29; 15A83; 47B35; 90C17; 90C26; 90C53

  12. arXiv:2402.19147 [pdf, other]

    math.NA

    Efficient quaternion CUR method for low-rank approximation to quaternion matrix

    Authors: Peng-Ling Wu, Kit Ian Kou, Hongmin Cai, Zhaoyuan Yu

    Abstract: The low-rank quaternion matrix approximation has been successfully applied in many applications involving signal processing and color image processing. However, the cost of quaternion models for generating low-rank quaternion matrix approximation is sometimes considerable due to the computation of the quaternion singular value decomposition (QSVD), which limits their application to real large-scal…

    Submitted 29 February, 2024; originally announced February 2024.

  13. arXiv:2401.15566 [pdf, other]

    stat.ML cs.IT cs.LG math.OC

    On the Robustness of Cross-Concentrated Sampling for Matrix Completion

    Authors: HanQin Cai, Longxiu Huang, Chandra Kundu, Bowen Su

    Abstract: Matrix completion is one of the crucial tools in modern data science research. Recently, a novel sampling model for matrix completion coined cross-concentrated sampling (CCS) has caught much attention. However, the robustness of the CCS model against sparse outliers remains unclear in the existing studies. In this paper, we aim to answer this question by exploring a novel Robust CCS Completion pro…

    Submitted 27 January, 2024; originally announced January 2024.

    Comments: 58th Annual Conference of Information Sciences and Systems

    Journal ref: 58th Annual Conference on Information Sciences and Systems, 2024

  14. arXiv:2401.14585 [pdf, other]

    cs.LG math.OC

    Diffusion Stochastic Optimization for Min-Max Problems

    Authors: Haoyuan Cai, Sulaiman A. Alghunaim, Ali H. Sayed

    Abstract: The optimistic gradient method is useful in addressing minimax optimization problems. Motivated by the observation that the conventional stochastic version suffers from the need for a large batch size on the order of $\mathcal{O}(\varepsilon^{-2})$ to achieve an $\varepsilon$-stationary solution, we introduce and analyze a new formulation termed Diffusion Stochastic Same-Sample Optimistic Gradient…

    Submitted 25 January, 2024; originally announced January 2024.

  15. arXiv:2401.05517 [pdf, other]

    stat.ME econ.EM math.ST

    On Efficient Inference of Causal Effects with Multiple Mediators

    Authors: Haoyu Wei, Hengrui Cai, Chengchun Shi, Rui Song

    Abstract: This paper provides robust estimators and efficient inference of causal effects involving multiple interacting mediators. Most existing works either impose a linear model assumption among the mediators or are restricted to handle conditionally independent mediators given the exposure. To overcome these limitations, we define causal and individual mediation effects in a general setting, and employ…

    Submitted 10 January, 2024; originally announced January 2024.

    MSC Class: 62A09; 62G05; 62G35

  16. arXiv:2308.09885 [pdf, other]

    math.CO

    One-element Extensions of Hyperplane Arrangements

    Authors: Hang Cai, Houshan Fu, Suijie Wang

    Abstract: We classify one-element extensions of a hyperplane arrangement by the induced adjoint arrangement. Based on the classification, several kinds of combinatorial invariants including Whitney polynomials, characteristic polynomials, Whitney numbers and face numbers, are constants on those strata associated with the induced adjoint arrangement, and also order-preserving with respect to the intersection…

    Submitted 18 August, 2023; originally announced August 2023.

    Comments: 19 pages, 6 figures

    MSC Class: 52C35

  17. arXiv:2307.10620 [pdf, other]

    cs.CV math.NA

    Quaternion tensor left ring decomposition and application for color image inpainting

    Authors: Jifei Miao, Kit Ian Kou, Hongmin Cai, Lizhi Liu

    Abstract: In recent years, tensor networks have emerged as powerful tools for solving large-scale optimization problems. One of the most promising tensor networks is the tensor ring (TR) decomposition, which achieves circular dimensional permutation invariance in the model through the utilization of the trace operation and equitable treatment of the latent cores. On the other hand, more recently, quaternion…

    Submitted 16 September, 2023; v1 submitted 20 July, 2023; originally announced July 2023.

  18. arXiv:2305.18577 [pdf, other]

    cs.LG math.OC stat.ML

    Towards Constituting Mathematical Structures for Learning to Optimize

    Authors: Jialin Liu, Xiaohan Chen, Zhangyang Wang, Wotao Yin, HanQin Cai

    Abstract: Learning to Optimize (L2O), a technique that utilizes machine learning to learn an optimization algorithm automatically from data, has gained arising attention in recent years. A generic L2O approach parameterizes the iterative update rule and learns the update direction as a black-box network. While the generic approach is widely applicable, the learned model can overfit and may not generalize we…

    Submitted 29 May, 2023; originally announced May 2023.

    Comments: ICML 2023

  19. arXiv:2305.05134 [pdf, ps, other]

    econ.TH cs.AI math.OC

    To AI or not to AI, to Buy Local or not to Buy Local: A Mathematical Theory of Real Price

    Authors: Huan Cai, Catherine Xu, Weiyu Xu

    Abstract: In the past several decades, the world's economy has become increasingly globalized. On the other hand, there are also ideas advocating the practice of ``buy local'', by which people buy locally produced goods and services rather than those produced farther away. In this paper, we establish a mathematical theory of real price that determines the optimal global versus local spending of an agent whi…

    Submitted 8 May, 2023; originally announced May 2023.

    Comments: 16 pages, 3 figures

    MSC Class: 65

  20. arXiv:2305.04080 [pdf, other]

    math.NA cs.LG

    Robust Tensor CUR Decompositions: Rapid Low-Tucker-Rank Tensor Recovery with Sparse Corruption

    Authors: HanQin Cai, Zehan Chao, Longxiu Huang, Deanna Needell

    Abstract: We study the tensor robust principal component analysis (TRPCA) problem, a tensorial extension of matrix robust principal component analysis (RPCA), that aims to split the given tensor into an underlying low-rank component and a sparse outlier component. This work proposes a fast algorithm, called Robust Tensor CUR Decompositions (RTCUR), for large-scale non-convex TRPCA problems under the Tucker…

    Submitted 10 October, 2023; v1 submitted 6 May, 2023; originally announced May 2023.

    MSC Class: 68P20; 68W20; 68W25; 68Q25; 65F30

    Journal ref: SIAM Journal on Imaging Sciences 17 (1), 225-247, 2024

  21. arXiv:2304.11535 [pdf, other]

    math.AP

    Lipschitz optimal transport metric for a wave system modeling nematic liquid crystals

    Authors: Hong Cai, Geng Chen, Yannan Shen

    Abstract: In this paper, we study the Lipschitz continuous dependence of conservative Hölder continuous weak solutions to a variational wave system derived from a model for nematic liquid crystals. Since the solution of this system generally forms finite time cusp singularity, the solution flow is not Lipschitz continuous under the Sobolev metric used in the existence and uniqueness theory. We establish a F…

    Submitted 23 April, 2023; originally announced April 2023.

    Comments: arXiv admin note: text overlap with arXiv:2007.15201

  22. Non-convex approaches for low-rank tensor completion under tubal sampling

    Authors: Zheng Tan, Longxiu Huang, HanQin Cai, Yifei Lou

    Abstract: Tensor completion is an important problem in modern data analysis. In this work, we investigate a specific sampling strategy, referred to as tubal sampling. We propose two novel non-convex tensor completion frameworks that are easy to implement, named tensor $L_1$-$L_2$ (TL12) and tensor completion via CUR (TCCUR). We test the efficiency of both methods on synthetic data and a color image inpainti…

    Submitted 17 March, 2023; originally announced March 2023.

    Journal ref: IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 1-5. IEEE, 2023

  23. arXiv:2212.14580 [pdf, ps, other]

    stat.ML cs.LG math.ST stat.ME

    Heterogeneous Synthetic Learner for Panel Data

    Authors: Ye Shen, Runzhe Wan, Hengrui Cai, Rui Song

    Abstract: In the new era of personalization, learning the heterogeneous treatment effect (HTE) becomes an inevitable trend with numerous applications. Yet, most existing HTE estimation methods focus on independently and identically distributed observations and cannot handle the non-stationarity and temporal dependency in the common panel data setting. The treatment evaluators developed for panel data, on th…

    Submitted 29 January, 2023; v1 submitted 30 December, 2022; originally announced December 2022.

  24. arXiv:2209.04046 [pdf, ps, other]

    math.AC math.AG math.NT

    Perfectoid signature, perfectoid Hilbert-Kunz multiplicity, and an application to local fundamental groups

    Authors: Hanlin Cai, Seungsu Lee, Linquan Ma, Karl Schwede, Kevin Tucker

    Abstract: We define a (perfectoid) mixed characteristic version of $F$-signature and Hilbert-Kunz multiplicity by utilizing the perfectoidization functor of Bhatt-Scholze and Faltings' normalized length (also developed in the work of Gabber-Ramero). We show that these definitions coincide with the classical theory in equal characteristic $p > 0$. We prove that a ring is regular if and only if either its per…

    Submitted 4 February, 2025; v1 submitted 8 September, 2022; originally announced September 2022.

    Comments: 75 pages, minor correction to the statement and proof of Proposition 6.3.1, Lemma 7.10 added, proof of Theorem 7.11 has been reorganized

    MSC Class: 14G45; 13A35; 14F35; 14B05; 13D22; 14F18; 13C20; 14C20; 14D10; 11G99

  25. Matrix Completion with Cross-Concentrated Sampling: Bridging Uniform Sampling and CUR Sampling

    Authors: HanQin Cai, Longxiu Huang, Pengyu Li, Deanna Needell

    Abstract: While uniform sampling has been widely studied in the matrix completion literature, CUR sampling approximates a low-rank matrix via row and column samples. Unfortunately, both sampling models lack flexibility for various circumstances in real-world applications. In this work, we propose a novel and easy-to-implement sampling strategy, coined Cross-Concentrated Sampling (CCS). By bridging uniform s…

    Submitted 21 March, 2023; v1 submitted 20 August, 2022; originally announced August 2022.

    Journal ref: IEEE Transactions on Pattern Analysis and Machine Intelligence, 45(8): 10100-10113, 2023

  26. arXiv:2206.09042 [pdf, other]

    stat.ML cs.LG math.NA

    Riemannian CUR Decompositions for Robust Principal Component Analysis

    Authors: Keaton Hamm, Mohamed Meskini, HanQin Cai

    Abstract: Robust Principal Component Analysis (PCA) has received massive attention in recent years. It aims to recover a low-rank matrix and a sparse matrix from their sum. This paper proposes a novel nonconvex Robust PCA algorithm, coined Riemannian CUR (RieCUR), which utilizes the ideas of Riemannian optimization and robust CUR decompositions. This algorithm has the same computational complexity as Iterat…

    Submitted 17 June, 2022; originally announced June 2022.

    Journal ref: ICML workshop on Topological, Algebraic and Geometric Learning (2022): 152-160

  27. arXiv:2204.03316 [pdf, ps, other]

    cs.IT math.OC

    Structured Gradient Descent for Fast Robust Low-Rank Hankel Matrix Completion

    Authors: HanQin Cai, Jian-Feng Cai, Juntao You

    Abstract: We study the robust matrix completion problem for the low-rank Hankel matrix, which detects the sparse corruptions caused by extreme outliers while we try to recover the original Hankel matrix from the partial observation. In this paper, we explore the convenient Hankel structure and propose a novel non-convex algorithm, coined Hankel Structured Gradient Descent (HSGD), for large-scale robust Hank…

    Submitted 19 March, 2023; v1 submitted 7 April, 2022; originally announced April 2022.

    Journal ref: SIAM Journal on Scientific Computing, 45(3): A1172-A1198, 2023

  28. arXiv:2111.08885 [pdf, other]

    stat.ME cs.LG math.ST stat.ML

    Jump Interval-Learning for Individualized Decision Making

    Authors: Hengrui Cai, Chengchun Shi, Rui Song, Wenbin Lu

    Abstract: An individualized decision rule (IDR) is a decision function that assigns each individual a given treatment based on his/her observed characteristics. Most of the existing works in the literature consider settings with binary or finitely many treatment options. In this paper, we focus on the continuous treatment setting and propose a jump interval-learning to develop an individualized interval-val…

    Submitted 28 January, 2023; v1 submitted 16 November, 2021; originally announced November 2021.

  29. arXiv:2110.15501 [pdf, other]

    stat.ML cs.LG math.ST stat.ME

    Doubly Robust Interval Estimation for Optimal Policy Evaluation in Online Learning

    Authors: Ye Shen, Hengrui Cai, Rui Song

    Abstract: Evaluating the performance of an ongoing policy plays a vital role in many areas such as medicine and economics, to provide crucial instructions on the early-stop of the online experiment and timely feedback from the environment. Policy evaluation in online learning thus attracts increasing attention by inferring the mean outcome of the optimal policy (i.e., the value) in real-time. Yet, such a pr…

    Submitted 2 August, 2024; v1 submitted 28 October, 2021; originally announced October 2021.

  30. arXiv:2110.05649 [pdf, other]

    cs.LG cs.CV cs.IT math.NA

    Learned Robust PCA: A Scalable Deep Unfolding Approach for High-Dimensional Outlier Detection

    Authors: HanQin Cai, Jialin Liu, Wotao Yin

    Abstract: Robust principal component analysis (RPCA) is a critical tool in modern machine learning, which detects outliers in the task of low-rank matrix reconstruction. In this paper, we propose a scalable and learnable non-convex approach for high-dimensional RPCA problems, which we call Learned Robust PCA (LRPCA). LRPCA is highly efficient, and its free parameters can be effectively learned to optimize v…

    Submitted 11 October, 2021; originally announced October 2021.

    Comments: NeurIPS 2021

    Journal ref: Advances in Neural Information Processing Systems 34 (2021): 16977-16989

  31. Curvature-Aware Derivative-Free Optimization

    Authors: Bumsu Kim, HanQin Cai, Daniel McKenzie, Wotao Yin

    Abstract: The paper discusses derivative-free optimization (DFO), which involves minimizing a function without access to gradients or directional derivatives, only function evaluations. Classical DFO methods, which mimic gradient-based methods, such as Nelder-Mead and direct search have limited scalability for high-dimensional problems. Zeroth-order methods have been gaining popularity due to the demands of…

    Submitted 12 April, 2023; v1 submitted 27 September, 2021; originally announced September 2021.

    Comments: 31 pages, 9 figures

    MSC Class: 49M15; 65K05; 68Q25; 90C56

    Journal ref: Journal of Scientific Computing, 103(43): 1-28, 2025

  32. arXiv:2108.10448 [pdf, other]

    cs.LG cs.CV eess.IV math.OC

    Fast Robust Tensor Principal Component Analysis via Fiber CUR Decomposition

    Authors: HanQin Cai, Zehan Chao, Longxiu Huang, Deanna Needell

    Abstract: We study the problem of tensor robust principal component analysis (TRPCA), which aims to separate an underlying low-multilinear-rank tensor and a sparse outlier tensor from their sum. In this work, we propose a fast non-convex algorithm, coined Robust Tensor CUR (RTCUR), for large-scale TRPCA problems. RTCUR considers a framework of alternating projections and utilizes the recently developed tens…

    Submitted 23 August, 2021; originally announced August 2021.

    Comments: Accepted to Workshop on Robust Subspace Learning and Applications in Computer Vision, International Conference on Computer Vision (ICCV) 2021

    Journal ref: Proceedings of the IEEE/CVF International Conference on Computer Vision Workshops, pages 189-197, 2021

  33. arXiv:2107.08724 [pdf, other]

    stat.ME math.ST

    Estimation of high-dimensional change-points under a group sparsity structure

    Authors: Hanqing Cai, Tengyao Wang

    Abstract: Change-points are a routine feature of 'big data' observed in the form of high-dimensional data streams. In many such data streams, the component series possess group structures and it is natural to assume that changes only occur in a small number of all groups. We propose a new change point procedure, called 'groupInspect', that exploits the group sparsity structure to estimate a projection direc…

    Submitted 19 July, 2021; originally announced July 2021.

    Comments: 25 pages, 6 figures

  34. arXiv:2104.10573 [pdf, other]

    stat.ME math.ST stat.AP stat.ML

    GEAR: On Optimal Decision Making with Auxiliary Data

    Authors: Hengrui Cai, Rui Song, Wenbin Lu

    Abstract: Personalized optimal decision making, finding the optimal decision rule (ODR) based on individual characteristics, has attracted increasing attention recently in many fields, such as education, economics, and medicine. Current ODR methods usually require the primary outcome of interest in samples for assessing treatment effects, namely the experimental sample. However, in many studies, treatments…

    Submitted 21 April, 2021; originally announced April 2021.

  35. arXiv:2104.10554 [pdf, other]

    stat.ME math.ST stat.AP stat.ML

    Calibrated Optimal Decision Making with Multiple Data Sources and Limited Outcome

    Authors: Hengrui Cai, Wenbin Lu, Rui Song

    Abstract: We consider the optimal decision-making problem in a primary sample of interest with multiple auxiliary sources available. The outcome of interest is limited in the sense that it is only observed in the primary sample. In reality, such multiple data sources may belong to heterogeneous studies and thus cannot be combined directly. This paper proposes a new framework to handle heterogeneous samples…

    Submitted 21 September, 2022; v1 submitted 21 April, 2021; originally announced April 2021.

  36. arXiv:2103.11037 [pdf, other]

    math.NA cs.IT cs.LG eess.IV

    Mode-wise Tensor Decompositions: Multi-dimensional Generalizations of CUR Decompositions

    Authors: HanQin Cai, Keaton Hamm, Longxiu Huang, Deanna Needell

    Abstract: Low rank tensor approximation is a fundamental tool in modern machine learning and data science. In this paper, we study the characterization, perturbation analysis, and an efficient sampling strategy for two primary tensor CUR approximations, namely Chidori and Fiber CUR. We characterize exact tensor CUR decompositions for low multilinear rank tensors. We also present theoretical error bounds of…

    Submitted 25 June, 2021; v1 submitted 19 March, 2021; originally announced March 2021.

    Journal ref: The Journal of Machine Learning Research 22.185 (2021): 1-36

  37. arXiv:2102.10707 [pdf, other]

    math.OC cs.AI cs.LG stat.ML

    A Zeroth-Order Block Coordinate Descent Algorithm for Huge-Scale Black-Box Optimization

    Authors: HanQin Cai, Yuchen Lou, Daniel McKenzie, Wotao Yin

    Abstract: We consider the zeroth-order optimization problem in the huge-scale setting, where the dimension of the problem is so large that performing even basic vector operations on the decision variables is infeasible. In this paper, we propose a novel algorithm, coined ZO-BCD, that exhibits favorable overall query complexity and has a much smaller per-iteration computational complexity. In addition, we di…

    Submitted 11 June, 2021; v1 submitted 21 February, 2021; originally announced February 2021.

    Comments: Accepted to ICML 2021

    Journal ref: Proceedings of the 38th International Conference on Machine Learning, PMLR 139:1193-1203, 2021

  38. arXiv:2010.07422 [pdf, other]

    stat.ML cs.AI cs.IT cs.LG math.NA math.OC

    Rapid Robust Principal Component Analysis: CUR Accelerated Inexact Low Rank Estimation

    Authors: HanQin Cai, Keaton Hamm, Longxiu Huang, Jiaqi Li, Tao Wang

    Abstract: Robust principal component analysis (RPCA) is a widely used tool for dimension reduction. In this work, we propose a novel non-convex algorithm, coined Iterated Robust CUR (IRCUR), for solving RPCA problems, which dramatically improves the computational efficiency in comparison with the existing algorithms. IRCUR achieves this acceleration by employing CUR decomposition when updating the low rank…

    Submitted 7 February, 2021; v1 submitted 14 October, 2020; originally announced October 2020.

    Journal ref: IEEE Signal Processing Letters, 28 (2021): 116-120

  39. arXiv:2010.02479 [pdf, other]

    math.OC cs.AI cs.LG

    A One-bit, Comparison-Based Gradient Estimator

    Authors: HanQin Cai, Daniel Mckenzie, Wotao Yin, Zhenliang Zhang

    Abstract: We study zeroth-order optimization for convex functions where we further assume that function evaluations are unavailable. Instead, one only has access to a $\textit{comparison oracle}$, which given two points $x$ and $y$ returns a single bit of information indicating which point has larger function value, $f(x)$ or $f(y)$. By treating the gradient as an unknown signal to be recovered, we show how…

    Submitted 23 April, 2022; v1 submitted 6 October, 2020; originally announced October 2020.

    Journal ref: Applied and Computational Harmonic Analysis, 60 (2022): 242-266

  40. arXiv:2008.06946 [pdf, ps, other]

    math.AP

    Uniqueness of Dissipative Solution for Camassa-Holm Equation with Peakon-Antipeakon Initial Data

    Authors: Hong Cai, Geng Chen, Hongwei Mei

    Abstract: We give a proof for the uniqueness of dissipative solution for the Camassa-Holm equation with some peakon-antipeakon initial data following Dafermos' earlier result in [5] on the Hunter-Saxton equation. Our result shows that two existing global existence frameworks, through the vanishing viscosity method by Xin-Zhang in [11] and the transformation of coordinate method for dissipative solutions by B…

    Submitted 21 October, 2020; v1 submitted 16 August, 2020; originally announced August 2020.

  41. arXiv:2007.15201 [pdf, other]

    math.AP

    A Finsler type Lipschitz optimal transport metric for a quasilinear wave equation

    Authors: Hong Cai, Geng Chen, Yannan Shen

    Abstract: We consider the global well-posedness of weak energy conservative solution to a general quasilinear wave equation through variational principle, where the solution may form finite time cusp singularity, when energy concentrates. As a main result in this paper, we construct a Finsler type optimal transport metric, then prove that the solution flow is Lipschitz under this metric. We also prove a gen…

    Submitted 14 August, 2020; v1 submitted 29 July, 2020; originally announced July 2020.

    Comments: Add some references from version 2

  42. Uniqueness of conservative solutions to a one-dimensional general quasilinear wave equation through variational principle

    Authors: Hong Cai, Geng Chen, Yi Du, Yannan Shen

    Abstract: In this paper, we prove the uniqueness of energy conservative Hölder continuous weak solution to a general quasilinear wave equation by the analysis of characteristics. This result has no restriction on the size of solutions, i.e. it is a large data result.

    Submitted 14 August, 2020; v1 submitted 29 July, 2020; originally announced July 2020.

    Comments: Change a reference from version 2

  43. arXiv:2003.13001  [pdf, other]

    math.OC cs.LG

    Zeroth-Order Regularized Optimization (ZORO): Approximately Sparse Gradients and Adaptive Sampling

    Authors: HanQin Cai, Daniel Mckenzie, Wotao Yin, Zhenliang Zhang

    Abstract: We consider the problem of minimizing a high-dimensional objective function, which may include a regularization term, using (possibly noisy) evaluations of the function. Such optimization is also called derivative-free, zeroth-order, or black-box optimization. We propose a new $\textbf{Z}$eroth-$\textbf{O}$rder $\textbf{R}$egularized $\textbf{O}$ptimization method, dubbed ZORO. When the underlying…

    Submitted 30 November, 2021; v1 submitted 29 March, 2020; originally announced March 2020.

    Journal ref: SIAM Journal on Optimization 32, no. 2 (2022): 687-714

  44. arXiv:2001.06753  [pdf, other]

    math.AP

    Singularity formation for radially symmetric expanding wave of Compressible Euler Equations

    Authors: Hong Cai, Geng Chen, Tian-Yi Wang

    Abstract: In this paper, for compressible Euler equations in multiple space dimensions, we prove the break-down of classical solutions with a large class of initial data by tracking the propagation of radially symmetric expanding wave including compression. The singularity formation corresponds to the finite time shock formation. We also provide some new global sup-norm estimates on velocity and densit…

    Submitted 18 January, 2020; originally announced January 2020.

  45. arXiv:1910.05859  [pdf, other]

    cs.IT cs.LG eess.SP math.OC

    Accelerated Structured Alternating Projections for Robust Spectrally Sparse Signal Recovery

    Authors: HanQin Cai, Jian-Feng Cai, Tianming Wang, Guojian Yin

    Abstract: Consider a spectrally sparse signal $\boldsymbol{x}$ that consists of $r$ complex sinusoids with or without damping. We study the robust recovery problem for the spectrally sparse signal under the fully observed setting, which is about recovering $\boldsymbol{x}$ and a sparse corruption vector $\boldsymbol{s}$ from their sum $\boldsymbol{z}=\boldsymbol{x}+\boldsymbol{s}$. In this paper, we exploit…

    Submitted 16 January, 2021; v1 submitted 13 October, 2019; originally announced October 2019.

    Journal ref: IEEE Transactions on Signal Processing, 69 (2021): 809-821

  46. arXiv:1909.07836  [pdf, ps, other]

    math.ST

    Two-Sample Test Based on Classification Probability

    Authors: Haiyan Cai, Bryan Goggin, Qingtang Jiang

    Abstract: Robust classification algorithms have been developed in recent years with great success. We take advantage of this development and recast the classical two-sample test problem in the framework of classification. Based on the estimates of classification probabilities from a classifier trained from the samples, a test statistic is proposed. We explain why such a test can be a powerful test and compa…

    Submitted 17 September, 2019; originally announced September 2019.

  47. arXiv:1905.11446  [pdf, other]

    math.AP math.DS

    Fisher-KPP dynamics in diffusive Rosenzweig-MacArthur and Holling-Tanner models

    Authors: Hong Cai, Anna Ghazaryan, Vahagn Manukian

    Abstract: We prove the existence of traveling fronts in diffusive Rosenzweig-MacArthur and Holling-Tanner population models and investigate their relation with fronts in a scalar Fisher-KPP equation. More precisely, we prove the existence of fronts in a Rosenzweig-MacArthur predator-prey model in two situations: when the prey diffuses at a rate much smaller than that of the predator and when both the pred…

    Submitted 27 May, 2019; originally announced May 2019.

    MSC Class: 92D25; 35B25; 35K57; 35B36

    Journal ref: Math. Model. Nat. Phenom., 14 4 (2019) 404

  48. arXiv:1812.11364  [pdf, other]

    eess.SP math.NA

    Adaptive Synchrosqueezing Transform with a Time-Varying Parameter for Non-stationary Signal Separation

    Authors: Lin Li, Haiyan Cai, Qingtang Jiang

    Abstract: The continuous wavelet transform (CWT) is a linear time-frequency representation and a powerful tool for analyzing non-stationary signals. The synchrosqueezing transform (SST) is a special type of the reassignment method which not only enhances the energy concentration of CWT in the time-frequency plane, but also separates the components of multicomponent signals. The "bump wavelet" and Morlet's w…

    Submitted 26 September, 2019; v1 submitted 29 December, 2018; originally announced December 2018.

    Comments: arXiv admin note: text overlap with arXiv:1812.11292

  49. arXiv:1812.11033  [pdf, ps, other]

    math.NA

    Analysis of Adaptive Short-time Fourier Transform-based Synchrosqueezing Transform

    Authors: Haiyan Cai, Qingtang Jiang, Lin Li, Bruce W. Suter

    Abstract: Recently the study of modeling a non-stationary signal as a superposition of amplitude and frequency-modulated Fourier-like oscillatory modes has been a very active research area. The synchrosqueezing transform (SST) is a powerful method for instantaneous frequency estimation and component separation of non-stationary multicomponent signals. The short-time Fourier transform-based SST (FSST for sho…

    Submitted 28 December, 2018; originally announced December 2018.

  50. arXiv:1711.05519  [pdf, other]

    cs.IT cs.LG math.NA math.OC

    Accelerated Alternating Projections for Robust Principal Component Analysis

    Authors: HanQin Cai, Jian-Feng Cai, Ke Wei

    Abstract: We study robust PCA for the fully observed setting, which is about separating a low rank matrix $\boldsymbol{L}$ and a sparse matrix $\boldsymbol{S}$ from their sum $\boldsymbol{D}=\boldsymbol{L}+\boldsymbol{S}$. In this paper, a new algorithm, dubbed accelerated alternating projections, is introduced for robust PCA which significantly improves the computational efficiency of the existing alternat…

    Submitted 10 February, 2019; v1 submitted 15 November, 2017; originally announced November 2017.

    Journal ref: Journal of Machine Learning Research, 20 (2019): 685-717