Skip to main content

Showing 1–50 of 72 results for author: Cheng, G

Searching in archive math. Search in all archives.
.
  1. arXiv:2403.18216  [pdf, other

    stat.ML cs.CY cs.LG math.ST

    Minimax Optimal Fair Classification with Bounded Demographic Disparity

    Authors: Xianli Zeng, Guang Cheng, Edgar Dobriban

    Abstract: Mitigating the disparate impact of statistical machine learning methods is crucial for ensuring fairness. While extensive research aims to reduce disparity, the effect of using a \emph{finite dataset} -- as opposed to the entire population -- remains unclear. This paper explores the statistical foundations of fair binary classification with two protected groups, focusing on controlling demographic… ▽ More

    Submitted 26 March, 2024; originally announced March 2024.

  2. arXiv:2403.12187  [pdf, ps, other

    stat.ML cs.LG math.ST

    Approximation of RKHS Functionals by Neural Networks

    Authors: Tian-Yi Zhou, Namjoon Suh, Guang Cheng, Xiaoming Huo

    Abstract: Motivated by the abundance of functional data such as time series and images, there has been a growing interest in integrating such data into neural networks and learning maps from function spaces to R (i.e., functionals). In this paper, we study the approximation of functionals on reproducing kernel Hilbert spaces (RKHS's) using neural networks. We establish the universality of the approximation… ▽ More

    Submitted 18 March, 2024; originally announced March 2024.

  3. arXiv:2401.07187  [pdf, other

    stat.ML cs.LG math.ST

    A Survey on Statistical Theory of Deep Learning: Approximation, Training Dynamics, and Generative Models

    Authors: Namjoon Suh, Guang Cheng

    Abstract: In this article, we review the literature on statistical theories of neural networks from three perspectives: approximation, training dynamics and generative models. In the first part, results on excess risks for neural networks are reviewed in the nonparametric framework of regression (and classification in Appendix~{\color{blue}B}). These results rely on explicit constructions of neural networks… ▽ More

    Submitted 16 September, 2024; v1 submitted 13 January, 2024; originally announced January 2024.

    Comments: 38 pages, 2 figures. Invited for review in Annual Review of Statistics and Its Application

  4. arXiv:2401.01770  [pdf, other

    math.OC

    Legendre-Moment Transform for Linear Ensemble Control and Computation

    Authors: Xin Ning, Gong Cheng, Wei Zhang, Jr-Shin Li

    Abstract: Ensemble systems, pervasive in diverse scientific and engineering domains, pose challenges to existing control methods due to their massive scale and underactuated nature. This paper presents a dynamic moment approach to addressing theoretical and computational challenges in systems-theoretic analysis and control design for linear ensemble systems. We introduce the Legendre-moments and Legendre-mo… ▽ More

    Submitted 3 January, 2024; originally announced January 2024.

    MSC Class: 93B05; 93B28; 93B51

  5. arXiv:2301.03126  [pdf, other

    stat.ME math.ST

    Statistical Inference for Ultrahigh Dimensional Location Parameter Based on Spatial Median

    Authors: Guanghui Cheng, Liuhua Peng, Changliang Zou

    Abstract: Motivated by the widely used geometric median-of-means estimator in machine learning, this paper studies statistical inference for ultrahigh dimensionality location parameter based on the sample spatial median under a general multivariate model, including simultaneous confidence intervals construction, global tests, and multiple testing with false discovery rate control. To achieve these goals, we… ▽ More

    Submitted 8 January, 2023; originally announced January 2023.

  6. arXiv:2301.00841  [pdf, other

    stat.ML cs.CR cs.LG math.ST

    Ranking Differential Privacy

    Authors: Shirong Xu, Will Wei Sun, Guang Cheng

    Abstract: Rankings are widely collected in various real-life scenarios, leading to the leakage of personal information such as users' preferences on videos or news. To protect rankings, existing works mainly develop privacy protection on a single ranking within a set of ranking or pairwise comparisons of a ranking under the $ε$-differential privacy. This paper proposes a novel notion called $ε$-ranking diff… ▽ More

    Submitted 2 January, 2023; originally announced January 2023.

    Comments: 59 pages, 8 figures

    MSC Class: 62F07

  7. arXiv:2210.17070  [pdf, ps, other

    cs.LG cs.CR math.OC stat.ML

    Private optimization in the interpolation regime: faster rates and hardness results

    Authors: Hilal Asi, Karan Chadha, Gary Cheng, John Duchi

    Abstract: In non-private stochastic convex optimization, stochastic gradient methods converge much faster on interpolation problems -- problems where there exists a solution that simultaneously minimizes all of the sample losses -- than on non-interpolating ones; we show that generally similar improvements are impossible in the private setting. However, when the functions exhibit quadratic growth around the… ▽ More

    Submitted 31 October, 2022; originally announced October 2022.

    Comments: published at ICML 2022; 25 pages

  8. arXiv:2207.01602  [pdf, other

    math.ST

    Minimax Optimal Deep Neural Network Classifiers Under Smooth Decision Boundary

    Authors: Tianyang Hu, Ruiqi Liu, Zuofeng Shang, Guang Cheng

    Abstract: Deep learning has gained huge empirical successes in large-scale classification problems. In contrast, there is a lack of statistical understanding about deep learning methods, particularly in the minimax optimality perspective. For instance, in the classical smooth decision boundary setting, existing deep neural network (DNN) approaches are rate-suboptimal, and it remains elusive how to construct… ▽ More

    Submitted 4 July, 2022; originally announced July 2022.

  9. arXiv:2202.12119  [pdf, ps, other

    cs.LG math.ST

    Optimal Convergence Rates of Deep Convolutional Neural Networks: Additive Ridge Functions

    Authors: Zhiying Fang, Guang Cheng

    Abstract: Convolutional neural networks have shown impressive abilities in many applications, especially those related to the classification tasks. However, for the regression problem, the abilities of convolutional structures have not been fully understood, and further investigation is needed. In this paper, we consider the mean squared error analysis for deep convolutional neural networks. We show that, f… ▽ More

    Submitted 20 January, 2023; v1 submitted 24 February, 2022; originally announced February 2022.

  10. arXiv:2201.08507  [pdf, other

    cs.LG math.OC math.ST

    Decentralized Sparse Linear Regression via Gradient-Tracking: Linear Convergence and Statistical Guarantees

    Authors: Marie Maros, Gesualdo Scutari, Ying Sun, Guang Cheng

    Abstract: We study sparse linear regression over a network of agents, modeled as an undirected graph and no server node. The estimation of the $s$-sparse parameter is formulated as a constrained LASSO problem wherein each agent owns a subset of the $N$ total observations. We analyze the convergence rate and statistical guarantees of a distributed projected gradient tracking-based algorithm under high-dimens… ▽ More

    Submitted 26 December, 2024; v1 submitted 20 January, 2022; originally announced January 2022.

    Comments: The order of the first three authors is alphabetic. Final revised version

  11. arXiv:2112.14301  [pdf, other

    math.OC eess.SY

    On Uniform Ensemble Controllability of Diagonalizable Linear Ensemble Systems

    Authors: Wei Miao, Gong Cheng, Jr-Shin Li

    Abstract: In this paper, we study uniform ensemble controllability (UEC) of linear ensemble systems defined in an infinite-dimensional space through finite-dimensional settings. Specifically, with the help of the Stone-Weierstrass theorem for modules, we provide an algebraic framework for examining UEC of linear ensemble systems with diagonalizable drift vector fields through checking the controllability of… ▽ More

    Submitted 28 December, 2021; originally announced December 2021.

  12. arXiv:2108.07313  [pdf, other

    cs.LG cs.DC math.OC stat.ML

    Federated Asymptotics: a model to compare federated learning algorithms

    Authors: Gary Cheng, Karan Chadha, John Duchi

    Abstract: We propose an asymptotic framework to analyze the performance of (personalized) federated learning algorithms. In this new framework, we formulate federated learning as a multi-criterion objective, where the goal is to minimize each client's loss using information from all of the clients. We analyze a linear regression model where, for a given client, we may theoretically compare the performance o… ▽ More

    Submitted 18 February, 2022; v1 submitted 16 August, 2021; originally announced August 2021.

    Comments: 42 pages (11 main pages, 2 reference pages, 29 appendix pages), 13 figures

  13. arXiv:2108.03706  [pdf, other

    stat.ML cs.AI cs.LG math.ST

    Online Bootstrap Inference For Policy Evaluation in Reinforcement Learning

    Authors: Pratik Ramprasad, Yuantong Li, Zhuoran Yang, Zhaoran Wang, Will Wei Sun, Guang Cheng

    Abstract: The recent emergence of reinforcement learning has created a demand for robust statistical inference methods for the parameter estimates computed using these algorithms. Existing methods for statistical inference in online learning are restricted to settings involving independently sampled observations, while existing statistical inference methods in reinforcement learning (RL) are limited to the… ▽ More

    Submitted 28 June, 2022; v1 submitted 8 August, 2021; originally announced August 2021.

    Comments: To Appear in Journal of the American Statistical Association

  14. arXiv:2102.10080  [pdf, other

    stat.ME math.ST stat.ML

    Distributed Bootstrap for Simultaneous Inference Under High Dimensionality

    Authors: Yang Yu, Shih-Kang Chao, Guang Cheng

    Abstract: We propose a distributed bootstrap method for simultaneous inference on high-dimensional massive data that are stored and processed with many machines. The method produces an $\ell_\infty$-norm confidence region based on a communication-efficient de-biased lasso, and we propose an efficient cross-validation approach to tune the method at every iteration. We theoretically prove a lower bound on the… ▽ More

    Submitted 14 June, 2022; v1 submitted 19 February, 2021; originally announced February 2021.

    Comments: To appear in JMLR. arXiv admin note: text overlap with arXiv:2002.08443

  15. arXiv:2101.02696  [pdf, other

    math.OC cs.LG stat.ML

    Accelerated, Optimal, and Parallel: Some Results on Model-Based Stochastic Optimization

    Authors: Karan Chadha, Gary Cheng, John C. Duchi

    Abstract: We extend the Approximate-Proximal Point (aProx) family of model-based methods for solving stochastic convex optimization problems, including stochastic subgradient, proximal point, and bundle methods, to the minibatch and accelerated setting. To do so, we propose specific model-based algorithms and an acceleration scheme for which we provide non-asymptotic convergence guarantees, which are order-… ▽ More

    Submitted 7 January, 2021; originally announced January 2021.

    Comments: 24 pages, 17 figures

  16. arXiv:2012.13760  [pdf, ps, other

    stat.ML cs.AI cs.LG math.OC

    Variance Reduction on General Adaptive Stochastic Mirror Descent

    Authors: Wenjie Li, Zhanyu Wang, Yichen Zhang, Guang Cheng

    Abstract: In this work, we investigate the idea of variance reduction by studying its properties with general adaptive mirror descent algorithms in nonsmooth nonconvex finite-sum optimization problems. We propose a simple yet generalized framework for variance reduced adaptive mirror descent algorithms named SVRAMD and provide its convergence analysis in both the nonsmooth nonconvex problem and the P-L cond… ▽ More

    Submitted 29 August, 2021; v1 submitted 26 December, 2020; originally announced December 2020.

    Comments: NeurIPS 2020 OPT workshop

    Journal ref: Machine Learning (2022)

  17. arXiv:2012.13669  [pdf, other

    math.ST math.PR

    Power Iteration for Tensor PCA

    Authors: Jiaoyang Huang, Daniel Z. Huang, Qing Yang, Guang Cheng

    Abstract: In this paper, we study the power iteration algorithm for the spiked tensor model, as introduced in [44]. We give necessary and sufficient conditions for the convergence of the power iteration algorithm. When the power iteration algorithm converges, for the rank one spiked tensor model, we show the estimators for the spike strength and linear functionals of the signal are asymptotically Gaussian;… ▽ More

    Submitted 25 December, 2020; originally announced December 2020.

    Comments: Draft version, comments are welcome!

  18. arXiv:2011.07439  [pdf, other

    stat.ML cs.LG math.ST

    Efficient Variational Inference for Sparse Deep Learning with Theoretical Guarantee

    Authors: Jincheng Bai, Qifan Song, Guang Cheng

    Abstract: Sparse deep learning aims to address the challenge of huge storage consumption by deep neural networks, and to recover the sparse structure of target functions. Although tremendous empirical successes have been achieved, most sparse deep learning algorithms are lacking of theoretical support. On the other hand, another line of works have proposed theoretical frameworks that are computationally inf… ▽ More

    Submitted 14 November, 2020; originally announced November 2020.

    Comments: Accepted to NeurIPS 2020

  19. arXiv:2010.12887  [pdf, ps, other

    stat.ML cs.LG math.ST

    Nearly Optimal Variational Inference for High Dimensional Regression with Shrinkage Priors

    Authors: Jincheng Bai, Qifan Song, Guang Cheng

    Abstract: We propose a variational Bayesian (VB) procedure for high-dimensional linear model inferences with heavy tail shrinkage priors, such as student-t prior. Theoretically, we establish the consistency of the proposed VB method and prove that under the proper choice of prior specifications, the contraction rate of the VB posterior is nearly optimal. It justifies the validity of VB inference as an alter… ▽ More

    Submitted 24 October, 2020; originally announced October 2020.

  20. arXiv:2009.03430  [pdf, ps, other

    math.OC

    Combinatorics-Based Approaches to Controllability Characterization for Bilinear Systems

    Authors: Gong Cheng, Wei Zhang, Jr-Shin Li

    Abstract: The control of bilinear systems has attracted considerable attention in the field of systems and control for decades, owing to their prevalence in diverse applications across science and engineering disciplines. Although much work has been conducted on analyzing controllability properties, the mostly used tool remains the Lie algebra rank condition. In this paper, we develop alternative approaches… ▽ More

    Submitted 7 September, 2020; originally announced September 2020.

    Comments: Keywords: Bilinear systems, Lie groups, graph theory, symmetric groups, representation theory, Cartan decomposition

    MSC Class: 93B05; 93A15; 93C10; 34K35

  21. arXiv:2008.07107  [pdf, other

    math.ST stat.ME

    Sparse Confidence Sets for Normal Mean Models

    Authors: Yang Ning, Guang Cheng

    Abstract: In this paper, we propose a new framework to construct confidence sets for a $d$-dimensional unknown sparse parameter $θ$ under the normal mean model $X\sim N(θ,σ^2I)$. A key feature of the proposed confidence set is its capability to account for the sparsity of $θ$, thus named as {\em sparse} confidence set. This is in sharp contrast with the classical methods, such as Bonferroni confidence inter… ▽ More

    Submitted 17 August, 2020; originally announced August 2020.

  22. arXiv:2007.06285  [pdf, ps, other

    math.FA

    A Gaussian version of Littlewood's theorem on random power series

    Authors: Guozheng Cheng, Xiang Fang, Kunyu Guo, Chao Liu

    Abstract: We prove a Littlewood-type theorem on random analytic functions for not necessarily independent Gaussian processes. We show that if we randomize a function in the Hardy space $H^2(\dd)$ by a Gaussian process whose covariance matrix $K$ induces a bounded operator on $l^2$, then the resulting random function is almost surely in $H^p(\dd)$ for any $p>0$. The case $K=\text{Id}$, the identity operator,… ▽ More

    Submitted 26 March, 2021; v1 submitted 13 July, 2020; originally announced July 2020.

  23. arXiv:2004.14954  [pdf, ps, other

    math.ST cs.LG stat.ML

    On Deep Instrumental Variables Estimate

    Authors: Ruiqi Liu, Zuofeng Shang, Guang Cheng

    Abstract: The endogeneity issue is fundamentally important as many empirical applications may suffer from the omission of explanatory variables, measurement error, or simultaneous causality. Recently, \cite{hllt17} propose a "Deep Instrumental Variable (IV)" framework based on deep neural networks to address endogeneity, demonstrating superior performances than existing approaches. The aim of this paper is… ▽ More

    Submitted 30 April, 2020; originally announced April 2020.

  24. arXiv:1911.08082  [pdf, other

    math.NA cs.LG

    Low rank tensor completion with sparse regularization in a transformed domain

    Authors: Ping-Ping Wang, Liang Li, Guang-Hui Cheng

    Abstract: Tensor completion is a challenging problem with various applications. Many related models based on the low-rank prior of the tensor have been proposed. However, the low-rank prior may not be enough to recover the original tensor from the observed incomplete tensor. In this paper, we prose a tensor completion method by exploiting both the low-rank and sparse prior of tensor. Specifically, the tenso… ▽ More

    Submitted 18 November, 2019; originally announced November 2019.

    Comments: 18 pages, 8 figures

  25. arXiv:1910.04355  [pdf, other

    math.ST

    Adaptive Variational Bayesian Inference for Sparse Deep Neural Network

    Authors: Jincheng Bai, Qifan Song, Guang Cheng

    Abstract: In this work, we focus on variational Bayesian inference on the sparse Deep Neural Network (DNN) modeled under a class of spike-and-slab priors. Given a pre-specified sparse DNN structure, the corresponding variational posterior contraction rate is characterized that reveals a trade-off between the variational error and the approximation error, which are both determined by the network structural c… ▽ More

    Submitted 2 August, 2020; v1 submitted 9 October, 2019; originally announced October 2019.

  26. arXiv:1909.01464  [pdf, other

    stat.ML cs.LG math.ST

    Rates of Convergence for Large-scale Nearest Neighbor Classification

    Authors: Xingye Qiao, Jiexin Duan, Guang Cheng

    Abstract: Nearest neighbor is a popular class of classification methods with many desirable properties. For a large data set which cannot be loaded into the memory of a single machine due to computation, communication, privacy, or ownership limitations, we consider the divide and conquer scheme: the entire data set is divided into small subsamples, on which nearest neighbor predictions are made, and then a… ▽ More

    Submitted 30 October, 2019; v1 submitted 3 September, 2019; originally announced September 2019.

    Comments: Camera ready version for NeurIPS

  27. arXiv:1906.02389  [pdf, other

    math.ST

    Enhancing Multi-model Inference with Natural Selection

    Authors: Ching-Wei Cheng, Guang Cheng

    Abstract: Multi-model inference covers a wide range of modern statistical applications such as variable selection, model confidence set, model averaging and variable importance. The performance of multi-model inference depends on the availability of candidate models, whose quality has been rarely studied in literature. In this paper, we study genetic algorithm (GA) in order to obtain high-quality candidate… ▽ More

    Submitted 5 June, 2019; originally announced June 2019.

  28. arXiv:1812.10013  [pdf, other

    math.ST

    Optimal False Discovery Control of Minimax Estimator

    Authors: Qifan Song, Guang Cheng

    Abstract: Two major research tasks lie at the heart of high dimensional data analysis: accurate parameter estimation and correct support recovery. The existing literature mostly aims for either the best parameter estimation or the best model selection result, however little has been done to understand the potential interaction between the estimation precision and the selection behavior. In this work, our mi… ▽ More

    Submitted 23 June, 2022; v1 submitted 24 December, 2018; originally announced December 2018.

  29. arXiv:1812.05005  [pdf, other

    math.ST

    Distributed Nearest Neighbor Classification

    Authors: Jiexin Duan, Xingye Qiao, Guang Cheng

    Abstract: Nearest neighbor is a popular nonparametric method for classification and regression with many appealing properties. In the big data era, the sheer volume and spatial/temporal disparity of big data may prohibit centrally processing and storing the data. This has imposed considerable hurdle for nearest neighbor predictions since the entire training data must be memorized. One effective way to overc… ▽ More

    Submitted 12 December, 2018; originally announced December 2018.

  30. arXiv:1811.10197  [pdf, ps, other

    math.ST

    Finite Time Analysis of Vector Autoregressive Models under Linear Restrictions

    Authors: Yao Zheng, Guang Cheng

    Abstract: This paper develops a unified finite-time theory for the ordinary least squares estimation of possibly unstable and even slightly explosive vector autoregressive models under linear restrictions, with the applicable region $ρ(A)\leq 1+c/n$, where $ρ(A)$ is the spectral radius of the transition matrix $A$ in the \VAR(1) representation, $n$ is the time horizon and $c>0$ is a universal constant. The… ▽ More

    Submitted 18 May, 2020; v1 submitted 26 November, 2018; originally announced November 2018.

    Comments: To Appear in Biometrika

  31. arXiv:1811.05761  [pdf, ps, other

    math.FA

    Random weighted shifts

    Authors: Guozheng Cheng, Xiang Fang, Sen Zhu

    Abstract: In this paper we initiate the study of a fundamental yet untapped random model of non-selfadjoint, bounded linear operators acting on a separable complex Hilbert space. We replace the weights $w_n=1$ in the classical unilateral shift $T$, defined as $Te_n=w_ne_{n+1}$, where $\{e_n\}_{n=1}^\infty$ form an orthonormal basis of a complex Hilbert space, by a sequence of i.i.d. random variables… ▽ More

    Submitted 14 November, 2018; originally announced November 2018.

    Comments: 57 pages

    MSC Class: 60H25; 47B37

  32. arXiv:1811.00535  [pdf, ps, other

    math.ST

    High Dimensional Robust Inference for Cox Regression Models

    Authors: Shengchun Kong, Zhuqing Yu, Xianyang Zhang, Guang Cheng

    Abstract: We consider high-dimensional inference for potentially misspecified Cox proportional hazard models based on low dimensional results by Lin and Wei [1989]. A de-sparsified Lasso estimator is proposed based on the log partial likelihood function and shown to converge to a pseudo-true parameter vector. Interestingly, the sparsity of the true parameter can be inferred from that of the above limiting p… ▽ More

    Submitted 1 November, 2018; originally announced November 2018.

  33. arXiv:1810.01323  [pdf, other

    math.ST

    Moderate-Dimensional Inferences on Quadratic Functionals in Ordinary Least Squares

    Authors: Xiao Guo, Guang Cheng

    Abstract: Statistical inferences for quadratic functionals of linear regression parameter have found wide applications including signal detection, global testing, inferences of error variance and fraction of variance explained. Classical theory based on ordinary least squares estimator works perfectly in the low-dimensional regime, but fails when the parameter dimension $p_n$ grows proportionally to the sam… ▽ More

    Submitted 15 June, 2020; v1 submitted 2 October, 2018; originally announced October 2018.

    Comments: To appear in JASA-T&M

  34. arXiv:1809.06019  [pdf, other

    math.ST cs.LG stat.ML

    Statistically and Computationally Efficient Variance Estimator for Kernel Ridge Regression

    Authors: Meimei Liu, Jean Honorio, Guang Cheng

    Abstract: In this paper, we propose a random projection approach to estimate variance in kernel ridge regression. Our approach leads to a consistent estimator of the true variance, while being computationally more efficient. Our variance estimator is optimal for a large family of kernels, including cubic splines and Gaussian kernels. Simulation analysis is conducted to support our theory.

    Submitted 17 September, 2018; originally announced September 2018.

    Comments: To Appear in 2018 Allerton

  35. arXiv:1808.10065  [pdf, ps, other

    math.ST

    Quadratic Discriminant Analysis under Moderate Dimension

    Authors: Qing Yang, Guang Cheng

    Abstract: Quadratic discriminant analysis (QDA) is a simple method to classify a subject into two populations, and was proven to perform as well as the Bayes rule when the data dimension p is fixed. The main purpose of this paper is to examine the empirical and theoretical behaviors of QDA where p grows proportionally to the sample sizes without imposing any structural assumption on the parameters. The firs… ▽ More

    Submitted 29 August, 2018; originally announced August 2018.

  36. arXiv:1805.09950  [pdf, other

    math.ST stat.ML

    Early Stopping for Nonparametric Testing

    Authors: Meimei Liu, Guang Cheng

    Abstract: Early stopping of iterative algorithms is an algorithmic regularization method to avoid over-fitting in estimation and classification. In this paper, we show that early stopping can also be applied to obtain the minimax optimal testing in a general non-parametric setup. Specifically, a Wald-type test statistic is obtained based on an iterated estimate produced by functional gradient descent algori… ▽ More

    Submitted 17 September, 2018; v1 submitted 24 May, 2018; originally announced May 2018.

    Comments: To appear in NIPS 2018

  37. arXiv:1805.09948  [pdf, other

    math.ST stat.ML

    How Many Machines Can We Use in Parallel Computing for Kernel Ridge Regression?

    Authors: Meimei Liu, Zuofeng Shang, Guang Cheng

    Abstract: This paper aims to solve a basic problem in distributed statistical inference: how many machines can we use in parallel computing? In kernel ridge regression, we address this question in two important settings: nonparametric estimation and hypothesis testing. Specifically, we find a range for the number of machines under which optimal estimation/testing is achievable. The employed empirical proces… ▽ More

    Submitted 23 February, 2019; v1 submitted 24 May, 2018; originally announced May 2018.

    Comments: This work extends the work in arXiv:1512.09226 to random and multivariate design

  38. arXiv:1804.08102  [pdf, ps, other

    math.FA

    Three Remarks on Carleson Measures for Dirichlet Space

    Authors: Guozheng Cheng, Xiang Fang, Zipeng Wang, Jiayang Yu

    Abstract: In this paper, we prove that all doubling measures on the unit disk $\mathbb{D}$ are Carleson measures for the standard Dirichlet space $\mathcal{D}$. The proof has three ingredients. The first one is a characterization of Carleson measures which holds true for general reproducing kernel Hilbert spaces. The second one is another new equivalent condition for Carleson measures, which holds true only… ▽ More

    Submitted 22 April, 2018; originally announced April 2018.

  39. arXiv:1802.06308  [pdf, other

    math.ST stat.ME stat.ML

    Nonparametric Testing under Random Projection

    Authors: Meimei Liu, Zuofeng Shang, Guang Cheng

    Abstract: A common challenge in nonparametric inference is its high computational complexity when data volume is large. In this paper, we develop computationally efficient nonparametric testing by employing a random projection strategy. In the specific kernel ridge regression setup, a simple distance-based test statistic is proposed. Notably, we derive the minimum number of random projections that is suffic… ▽ More

    Submitted 17 February, 2018; originally announced February 2018.

  40. arXiv:1801.09326  [pdf, other

    math.ST stat.ML

    Sparse and Low-rank Tensor Estimation via Cubic Sketchings

    Authors: Botao Hao, Anru Zhang, Guang Cheng

    Abstract: In this paper, we propose a general framework for sparse and low-rank tensor estimation from cubic sketchings. A two-stage non-convex implementation is developed based on sparse tensor decomposition and thresholded gradient descent, which ensures exact recovery in the noiseless case and stable recovery in the noisy case with high probability. The non-asymptotic analysis sheds light on an interplay… ▽ More

    Submitted 14 March, 2020; v1 submitted 28 January, 2018; originally announced January 2018.

    Comments: Accepted at IEEE Transactions on Information Theory

  41. Skew-symmetric Nitsche's formulation in isogeometric analysis: Dirichlet and symmetry conditions, patch coupling and frictionless contact

    Authors: Qingyuan Hu, Franz Chouly, Ping Hu, Gengdong Cheng, Stéphane Pierre Alain Bordas

    Abstract: A simple skew-symmetric Nitsche's formulation is introduced into the framework of isogeometric analysis (IGA) to deal with various problems in small strain elasticity: essential boundary conditions, symmetry conditions for Kirchhoff plates, patch coupling in statics and in modal analysis as well as Signorini contact conditions. For linear boundary or interface conditions, the skew-symmetric formul… ▽ More

    Submitted 27 April, 2018; v1 submitted 28 November, 2017; originally announced November 2017.

  42. arXiv:1711.09947  [pdf, other

    math.NA physics.flu-dyn

    Anisotropic Radial Basis Function Methods for Continental Size Ice Sheet Simulations

    Authors: Gong Cheng, Victor Shcherbakov

    Abstract: In this paper we develop and implement anisotropic radial basis function methods for simulating the dynamics of ice sheets and glaciers. We test the methods on two problems: the well-known benchmark ISMIP-HOM B that corresponds to a glacier size ice and a synthetic ice sheet whose geometry is inspired by the EISMINT benchmark that corresponds to a continental size ice sheet. We illustrate the adva… ▽ More

    Submitted 27 November, 2017; originally announced November 2017.

    Comments: The authors contributed equally to this work

    MSC Class: 65N15; 65N35; 76D07

  43. arXiv:1708.02564  [pdf, other

    math.ST

    High Dimensional Inference in Partially Linear Models

    Authors: Ying Zhu, Zhuqing Yu, Guang Cheng

    Abstract: We propose two semiparametric versions of the debiased Lasso procedure for the model $Y_i = X_iβ_0 + g_0(Z_i) + ε_i$, where $β_0$ is high dimensional but sparse (exactly or approximately). Both versions are shown to have the same asymptotic normal distribution and do not require the minimal signal condition for statistical inference of any component in $β_0$. Our method also works when $Z_i$ is hi… ▽ More

    Submitted 8 August, 2017; originally announced August 2017.

  44. arXiv:1704.05642  [pdf, ps, other

    math.NA

    Solving General Joint Block Diagonalization Problem via Linearly Independent Eigenvectors of a Matrix Polynomial

    Authors: Yunfeng Cai, Guanghui Cheng, Decai Shi

    Abstract: In this paper, we consider the exact/approximate general joint block diagonalization (GJBD) problem of a matrix set $\{A_i\}_{i=0}^p$ ($p\ge 1$), where a nonsingular matrix $W$ (often referred to as diagonalizer) needs to be found such that the matrices $W^{H}A_iW$'s are all exactly/approximately block diagonal matrices with as many diagonal blocks as possible. We show that the diagonalizer of the… ▽ More

    Submitted 19 April, 2017; originally announced April 2017.

    MSC Class: 15A69; 65F15

  45. arXiv:1702.07027  [pdf, other

    stat.ME math.ST

    Nonparametric Inference via Bootstrapping the Debiased Estimator

    Authors: Gang Cheng, Yen-Chi Chen

    Abstract: In this paper, we propose to construct confidence bands by bootstrapping the debiased kernel density estimator (for density estimation) and the debiased local polynomial regression estimator (for regression analysis). The idea of using a debiased estimator was recently employed by Calonico et al. (2018b) to construct a confidence interval of the density function (and regression function) at a give… ▽ More

    Submitted 4 June, 2019; v1 submitted 22 February, 2017; originally announced February 2017.

    Comments: Accepted to the Electronic Journal of Statistics. 64 pages, 6 tables, 11 figures

    MSC Class: Primary 62G15; secondary 62G09; 62G07; 62G08

  46. arXiv:1702.01330  [pdf, other

    math.ST

    Non-asymptotic theory for nonparametric testing

    Authors: Yun Yang, Zuofeng Shang, Guang Cheng

    Abstract: We consider nonparametric testing in a non-asymptotic framework. Our statistical guarantees are exact in the sense that Type I and II errors are controlled for any finite sample size. Meanwhile, one proposed test is shown to achieve minimax optimality in the asymptotic sense. An important consequence of this non-asymptotic theory is a new and practically useful formula for selecting the optimal sm… ▽ More

    Submitted 4 February, 2017; originally announced February 2017.

  47. arXiv:1701.06088  [pdf, other

    math.ST stat.ME

    Distributed inference for quantile regression processes

    Authors: Stanislav Volgushev, Shih-Kang Chao, Guang Cheng

    Abstract: The increased availability of massive data sets provides a unique opportunity to discover subtle patterns in their distributions, but also imposes overwhelming computational challenges. To fully utilize the information contained in big data, we propose a two-step procedure: (i) estimate conditional quantile functions at different levels in a parallel computing environment; (ii) construct a conditi… ▽ More

    Submitted 10 April, 2018; v1 submitted 21 January, 2017; originally announced January 2017.

  48. arXiv:1612.05906  [pdf, other

    math.ST

    Minimax Optimal Estimation in Partially Linear Additive Models under High Dimension

    Authors: Zhuqing Yu, Michael Levine, Guang Cheng

    Abstract: In this paper, we derive minimax rates for estimating both parametric and nonparametric components in partially linear additive models with high dimensional sparse vectors and smooth functional components. The minimax lower bound for Euclidean components is the typical sparse estimation rate that is independent of nonparametric smoothness indices. However, the minimax lower bound for each componen… ▽ More

    Submitted 13 January, 2018; v1 submitted 18 December, 2016; originally announced December 2016.

    Comments: To Appear in Bernoulli

  49. arXiv:1611.09391  [pdf, other

    stat.ML math.ST

    Simultaneous Clustering and Estimation of Heterogeneous Graphical Models

    Authors: Botao Hao, Will Wei Sun, Yufeng Liu, Guang Cheng

    Abstract: We consider joint estimation of multiple graphical models arising from heterogeneous and high-dimensional observations. Unlike most previous approaches which assume that the cluster structure is given in advance, an appealing feature of our method is to learn cluster structure while estimating heterogeneous graphical models. This is achieved via a high dimensional version of Expectation Conditiona… ▽ More

    Submitted 12 January, 2018; v1 submitted 28 November, 2016; originally announced November 2016.

    Comments: 61 pages. Accepted by Journal of Machine Learning Research

  50. arXiv:1610.07697  [pdf, other

    math.ST stat.ME

    Embracing the Blessing of Dimensionality in Factor Models

    Authors: Quefeng Li, Guang Cheng, Jianqing Fan, Yuyan Wang

    Abstract: Factor modeling is an essential tool for exploring intrinsic dependence structures among high-dimensional random variables. Much progress has been made for estimating the covariance matrix from a high-dimensional factor model. However, the blessing of dimensionality has not yet been fully embraced in the literature: much of the available data is often ignored in constructing covariance matrix esti… ▽ More

    Submitted 24 October, 2016; originally announced October 2016.