Skip to main content

Showing 1–39 of 39 results for author: Shang, Z

Searching in archive stat. Search in all archives.
.
  1. arXiv:2501.01610  [pdf, other

    stat.ME

    Bootstrap Nonparametric Inference under Data Integration

    Authors: Zuofeng Shang, Peijun Sang, Chong Jin

    Abstract: We propose multiplier bootstrap procedures for nonparametric inference and uncertainty quantification of the target mean function, based on a novel framework of integrating target and source data. We begin with the relatively easier covariate shift scenario with equal target and source mean functions and propose estimation and inferential procedures through a straightforward combination of all tar… ▽ More

    Submitted 2 January, 2025; originally announced January 2025.

    Comments: 29 pages, 5 figures

  2. arXiv:2407.00564  [pdf, other

    stat.ME

    Variational Nonparametric Inference in Functional Stochastic Block Model

    Authors: Zuofeng Shang, Peijun Sang, Yang Feng, Chong Jin

    Abstract: We propose a functional stochastic block model whose vertices involve functional data information. This new model extends the classic stochastic block model with vector-valued nodal information, and finds applications in real-world networks whose nodal information could be functional curves. Examples include international trade data in which a network vertex (country) is associated with the annual… ▽ More

    Submitted 29 June, 2024; originally announced July 2024.

  3. arXiv:2307.13890  [pdf, ps, other

    stat.ME

    Empirical likelihood test for community structure in networks

    Authors: Mingao Yuan, Sharmin Hossain, Zuofeng Shang

    Abstract: Network data, characterized by interconnected nodes and edges, is pervasive in various domains and has gained significant popularity in recent years. In network data analysis, testing the presence of community structure in a network is one of the important research tasks. Existing tests are mainly developed for unweighted networks. In this paper, we study the problem of testing the existence of co… ▽ More

    Submitted 25 July, 2023; originally announced July 2023.

  4. arXiv:2305.15671  [pdf, other

    stat.ME

    Matrix Autoregressive Model with Vector Time Series Covariates for Spatio-Temporal Data

    Authors: Hu Sun, Zuofeng Shang, Yang Chen

    Abstract: We develop a new methodology for forecasting matrix-valued time series with historical matrix data and auxiliary vector time series data. We focus on a time series of matrices defined on a static 2-D spatial grid and an auxiliary time series of non-spatial vectors. The proposed model, Matrix AutoRegression with Auxiliary Covariates (MARAC), contains an autoregressive component for the historical m… ▽ More

    Submitted 17 May, 2024; v1 submitted 24 May, 2023; originally announced May 2023.

  5. arXiv:2302.12717  [pdf, ps, other

    stat.ME stat.ML

    Statistical Inference with Stochastic Gradient Methods under $φ$-mixing Data

    Authors: Ruiqi Liu, Xi Chen, Zuofeng Shang

    Abstract: Stochastic gradient descent (SGD) is a scalable and memory-efficient optimization algorithm for large datasets and stream data, which has drawn a great deal of attention and popularity. The applications of SGD-based estimators to statistical inference such as interval estimation have also achieved great success. However, most of the related works are based on i.i.d. observations or Markov chains.… ▽ More

    Submitted 28 March, 2023; v1 submitted 24 February, 2023; originally announced February 2023.

    MSC Class: 62F12; 62F40; 62M10

  6. arXiv:2302.02457   

    stat.ME

    Scalable inference in functional linear regression with streaming data

    Authors: Jinhan Xie, Enze Shi, Peijun Sang, Zuofeng Shang, Bei Jiang, Linglong Kong

    Abstract: Traditional static functional data analysis is facing new challenges due to streaming data, where data constantly flow in. A major challenge is that storing such an ever-increasing amount of data in memory is nearly impossible. In addition, existing inferential tools in online learning are mainly developed for finite-dimensional problems, while inference methods for functional data are focused on… ▽ More

    Submitted 10 October, 2023; v1 submitted 5 February, 2023; originally announced February 2023.

    Comments: Due to the request of one of the co-authors, we tentatively withdrew the manuscript

  7. arXiv:2212.06505  [pdf, other

    q-bio.QM math.AT q-bio.GN stat.ME

    Multiscale topology classifies and quantifies cell types in subcellular spatial transcriptomics

    Authors: Katherine Benjamin, Aneesha Bhandari, Zhouchun Shang, Yanan Xing, Yanru An, Nannan Zhang, Yong Hou, Ulrike Tillmann, Katherine R. Bull, Heather A. Harrington

    Abstract: Spatial transcriptomics has the potential to transform our understanding of RNA expression in tissues. Classical array-based technologies produce multiple-cell-scale measurements requiring deconvolution to recover single cell information. However, rapid advances in subcellular measurement of RNA expression at whole-transcriptome depth necessitate a fundamentally different approach. To integrate si… ▽ More

    Submitted 13 December, 2022; originally announced December 2022.

    Comments: Main text: 8 pages, 4 figures. Supplement: 12 pages, 5 figures

    MSC Class: 92-08; 55N31; 62R40; 68T09

  8. arXiv:2209.13779  [pdf

    astro-ph.SR stat.ML

    Solar Flare Index Prediction Using SDO/HMI Vector Magnetic Data Products with Statistical and Machine Learning Methods

    Authors: Hewei Zhang, Qin Li, Yanxing Yang, Ju Jing, Jason T. L. Wang, Haimin Wang, Zuofeng Shang

    Abstract: Solar flares, especially the M- and X-class flares, are often associated with coronal mass ejections (CMEs). They are the most important sources of space weather effects, that can severely impact the near-Earth environment. Thus it is essential to forecast flares (especially the M-and X-class ones) to mitigate their destructive and hazardous consequences. Here, we introduce several statistical and… ▽ More

    Submitted 1 December, 2022; v1 submitted 27 September, 2022; originally announced September 2022.

    Journal ref: The Astrophysical Journal Supplement Series (2022), Volume 263, Number 2

  9. arXiv:2208.03252  [pdf, other

    stat.ME stat.AP

    Partial-Mastery Cognitive Diagnosis Models

    Authors: Zhuoran Shang, Elena A. Erosheva, Gongjun Xu

    Abstract: Cognitive diagnosis models (CDMs) are a family of discrete latent attribute models that serve as statistical basis in educational and psychological cognitive diagnosis assessments. CDMs aim to achieve fine-grained inference on individuals' latent attributes, based on their observed responses to a set of designed diagnostic items. In the literature, CDMs usually assume that items require mastery of… ▽ More

    Submitted 5 August, 2022; originally announced August 2022.

    Journal ref: This work has been published in Ann. Appl. Stat. 15(3): 1529-1555 (September 2021)

  10. arXiv:2205.08592  [pdf, other

    stat.ML cs.LG stat.ME

    Deep Neural Network Classifier for Multi-dimensional Functional Data

    Authors: Shuoyang Wang, Guanqun Cao, Zuofeng Shang

    Abstract: We propose a new approach, called as functional deep neural network (FDNN), for classifying multi-dimensional functional data. Specifically, a deep neural network is trained based on the principle components of the training data which shall be used to predict the class label of a future data function. Unlike the popular functional discriminant analysis approaches which rely on Gaussian assumption,… ▽ More

    Submitted 17 May, 2022; originally announced May 2022.

  11. arXiv:2204.09097  [pdf, other

    math.ST stat.ME

    Information-theoretic Limits for Testing Community Structures in Weighted Networks

    Authors: Mingao Yuan, Zuofeng Shang

    Abstract: Community detection refers to the problem of clustering the nodes of a network into groups. Existing inferential methods for community structure mainly focus on unweighted (binary) networks. Many real-world networks are nonetheless weighted and a common practice is to dichotomize a weighted network to an unweighted one which is known to result in information loss. Literature on hypothesis testing… ▽ More

    Submitted 19 April, 2022; originally announced April 2022.

  12. arXiv:2202.11747  [pdf, other

    math.ST stat.ME

    Statistical Inference for Functional Linear Quantile Regression

    Authors: Peijun Sang, Zuofeng Shang, Pang Du

    Abstract: We propose inferential tools for functional linear quantile regression where the conditional quantile of a scalar response is assumed to be a linear functional of a functional covariate. In contrast to conventional approaches, we employ kernel convolution to smooth the original loss function. The coefficient function is estimated under a reproducing kernel Hilbert space framework. A gradient desce… ▽ More

    Submitted 23 February, 2022; originally announced February 2022.

  13. arXiv:2202.05888  [pdf, ps, other

    math.ST stat.ML

    Statistical Limits for Testing Correlation of Hypergraphs

    Authors: Mingao Yuan, Zuofeng Shang

    Abstract: In this paper, we consider the hypothesis testing of correlation between two $m$-uniform hypergraphs on $n$ unlabelled nodes. Under the null hypothesis, the hypergraphs are independent, while under the alternative hypothesis, the hyperdges have the same marginal distributions as in the null hypothesis but are correlated after some unknown node permutation. We focus on two scenarios: the hypergraph… ▽ More

    Submitted 11 February, 2022; originally announced February 2022.

    Comments: 20pages

  14. arXiv:2111.07878  [pdf, other

    stat.ME

    An Approach of Bayesian Variable Selection for Ultrahigh Dimensional Multivariate Regression

    Authors: Xiaotian Dai, Guifang Fu, Randall Reese, Shaofei Zhao, Zuofeng Shang

    Abstract: In many practices, scientists are particularly interested in detecting which of the predictors are truly associated with a multivariate response. It is more accurate to model multiple responses as one vector rather than separating each component one by one. This is particularly true for complex traits having multiple correlated components. A Bayesian multivariate variable selection (BMVS) approach… ▽ More

    Submitted 15 November, 2021; originally announced November 2021.

  15. arXiv:2106.03591  [pdf, other

    stat.ML cs.LG stat.ME

    Calibrating multi-dimensional complex ODE from noisy data via deep neural networks

    Authors: Kexuan Li, Fangfang Wang, Ruiqi Liu, Fan Yang, Zuofeng Shang

    Abstract: Ordinary differential equations (ODEs) are widely used to model complex dynamics that arises in biology, chemistry, engineering, finance, physics, etc. Calibration of a complicated ODE system using noisy data is generally very difficult. In this work, we propose a two-stage nonparametric approach to address this problem. We first extract the de-noised data and their higher order derivatives using… ▽ More

    Submitted 18 September, 2023; v1 submitted 7 June, 2021; originally announced June 2021.

  16. arXiv:2105.13282  [pdf, ps, other

    eess.SP cs.IT stat.AP

    Detection of a rank-one signal with limited training data

    Authors: Weijian Liu, Zhaojian Zhang, Jun Liu, Zheran Shang, Yong-Liang Wang

    Abstract: In this paper, we reconsider the problem of detecting a matrix-valued rank-one signal in unknown Gaussian noise, which was previously addressed for the case of sufficient training data. We relax the above assumption to the case of limited training data. We re-derive the corresponding generalized likelihood ratio test (GLRT) and two-step GLRT (2S--GLRT) based on certain unitary transformation on th… ▽ More

    Submitted 13 April, 2021; originally announced May 2021.

    Comments: This manuscript is accepted by Signal Processing

    Report number: SIGPRO_108120

  17. arXiv:2105.10315  [pdf, ps, other

    stat.ML cs.LG

    Online Statistical Inference for Parameters Estimation with Linear-Equality Constraints

    Authors: Ruiqi Liu, Mingao Yuan, Zuofeng Shang

    Abstract: Stochastic gradient descent (SGD) and projected stochastic gradient descent (PSGD) are scalable algorithms to compute model parameters in unconstrained and constrained optimization problems. In comparison with SGD, PSGD forces its iterative values into the constrained parameter space via projection. From a statistical point of view, this paper studies the limiting distribution of PSGD-based estima… ▽ More

    Submitted 22 March, 2022; v1 submitted 21 May, 2021; originally announced May 2021.

  18. arXiv:2105.09788  [pdf, ps, other

    stat.ML cs.LG stat.ME

    Distributed Adaptive Nearest Neighbor Classifier: Algorithm and Theory

    Authors: Ruiqi Liu, Ganggang Xu, Zuofeng Shang

    Abstract: When data is of an extraordinarily large size or physically stored in different locations, the distributed nearest neighbor (NN) classifier is an attractive tool for classification. We propose a novel distributed adaptive NN classifier for which the number of nearest neighbors is a tuning parameter stochastically chosen by a data-driven criterion. An early stopping rule is proposed when searching… ▽ More

    Submitted 3 June, 2023; v1 submitted 20 May, 2021; originally announced May 2021.

  19. arXiv:2105.02259  [pdf, other

    cs.IT math.ST stat.ML

    Information Limits for Detecting a Subhypergraph

    Authors: Mingao Yuan, Zuofeng Shang

    Abstract: We consider the problem of recovering a subhypergraph based on an observed adjacency tensor corresponding to a uniform hypergraph. The uniform hypergraph is assumed to contain a subset of vertices called as subhypergraph. The edges restricted to the subhypergraph are assumed to follow a different probability distribution than other edges. We consider both weak recovery and exact recovery of the su… ▽ More

    Submitted 5 May, 2021; originally announced May 2021.

  20. arXiv:2104.04047  [pdf, ps, other

    stat.ML cs.LG math.ST

    Heterogeneous Dense Subhypergraph Detection

    Authors: Mingao Yuan, Zuofeng Shang

    Abstract: We study the problem of testing the existence of a heterogeneous dense subhypergraph. The null hypothesis corresponds to a heterogeneous Erdös-Rényi uniform random hypergraph and the alternative hypothesis corresponds to a heterogeneous uniform random hypergraph that contains a dense subhypergraph. We establish detection boundaries when the edge probabilities are known and construct an asymptotica… ▽ More

    Submitted 8 April, 2021; originally announced April 2021.

  21. arXiv:2103.00569  [pdf, other

    stat.ME

    Optimal Classification for Functional Data

    Authors: Shuoyang Wang, Zuofeng Shang, Guanqun Cao, Jun Liu

    Abstract: A central topic in functional data analysis is how to design an optimaldecision rule, based on training samples, to classify a data function. We exploit the optimal classification problem when data functions are Gaussian processes. Sharp nonasymptotic convergence rates for minimax excess mis-classification risk are derived in both settings that data functions are fully observed and discretely obse… ▽ More

    Submitted 10 September, 2021; v1 submitted 28 February, 2021; originally announced March 2021.

    MSC Class: 2010 subject classifications: 62H30(Primary) 62C20; 62H12 (Secondary)

  22. arXiv:2101.04584  [pdf, other

    math.ST stat.ML

    Sharp detection boundaries on testing dense subhypergraph

    Authors: Mingao Yuan, Zuofeng Shang

    Abstract: We study the problem of testing the existence of a dense subhypergraph. The null hypothesis is an Erdos-Renyi uniform random hypergraph and the alternative hypothesis is a uniform random hypergraph that contains a dense subhypergraph. We establish sharp detection boundaries in both scenarios: (1) the edge probabilities are known; (2) the edge probabilities are unknown. In both scenarios, sharp det… ▽ More

    Submitted 12 January, 2021; originally announced January 2021.

  23. arXiv:2012.04573  [pdf, other

    stat.ML cs.LG

    Estimation of the Mean Function of Functional Data via Deep Neural Networks

    Authors: Shuoyang Wang, Guanqun Cao, Zuofeng Shang

    Abstract: In this work, we propose a deep neural network method to perform nonparametric regression for functional data. The proposed estimators are based on sparsely connected deep neural networks with ReLU activation function. By properly choosing network architecture, our estimator achieves the optimal nonparametric convergence rate in empirical norm. Under certain circumstances such as trigonometric pol… ▽ More

    Submitted 8 December, 2020; originally announced December 2020.

  24. arXiv:2011.04147  [pdf, ps, other

    math.ST stat.ME

    A Computationally Efficient Classification Algorithm in Posterior Drift Model: Phase Transition and Minimax Adaptivity

    Authors: Ruiqi Liu, Kexuan Li, Zuofeng Shang

    Abstract: In massive data analysis, training and testing data often come from very different sources, and their probability distributions are not necessarily identical. A feature example is nonparametric classification in posterior drift model where the conditional distributions of the label given the covariates are possibly different. In this paper, we derive minimax rate of the excess risk for nonparametr… ▽ More

    Submitted 8 November, 2020; originally announced November 2020.

  25. arXiv:2004.14954  [pdf, ps, other

    math.ST cs.LG stat.ML

    On Deep Instrumental Variables Estimate

    Authors: Ruiqi Liu, Zuofeng Shang, Guang Cheng

    Abstract: The endogeneity issue is fundamentally important as many empirical applications may suffer from the omission of explanatory variables, measurement error, or simultaneous causality. Recently, \cite{hllt17} propose a "Deep Instrumental Variable (IV)" framework based on deep neural networks to address endogeneity, demonstrating superior performances than existing approaches. The aim of this paper is… ▽ More

    Submitted 30 April, 2020; originally announced April 2020.

  26. arXiv:2001.06892  [pdf, other

    stat.ML cs.LG

    Sharp Rate of Convergence for Deep Neural Network Classifiers under the Teacher-Student Setting

    Authors: Tianyang Hu, Zuofeng Shang, Guang Cheng

    Abstract: Classifiers built with neural networks handle large-scale high dimensional data, such as facial images from computer vision, extremely well while traditional statistical methods often fail miserably. In this paper, we attempt to understand this empirical success in high dimensional classification by deriving the convergence rates of excess risk. In particular, a teacher-student framework is propos… ▽ More

    Submitted 31 January, 2020; v1 submitted 19 January, 2020; originally announced January 2020.

  27. arXiv:1911.08830  [pdf, ps, other

    econ.EM stat.ME

    Statistical Inference on Partially Linear Panel Model under Unobserved Linearity

    Authors: Ruiqi Liu, Ben Boukai, Zuofeng Shang

    Abstract: A new statistical procedure, based on a modified spline basis, is proposed to identify the linear components in the panel data model with fixed effects. Under some mild assumptions, the proposed procedure is shown to consistently estimate the underlying regression function, correctly select the linear components, and effectively conduct the statistical inference. When compared to existing methods… ▽ More

    Submitted 20 November, 2019; originally announced November 2019.

  28. arXiv:1911.02171  [pdf, other

    stat.ME math.ST stat.AP stat.ML

    Minimax Nonparametric Two-sample Test under Smoothing

    Authors: Xin Xing, Zuofeng Shang, Pang Du, Ping Ma, Wenxuan Zhong, Jun S. Liu

    Abstract: We consider the problem of comparing probability densities between two groups. A new probabilistic tensor product smoothing spline framework is developed to model the joint density of two variables. Under such a framework, the probability density comparison is equivalent to testing the presence/absence of interactions. We propose a penalized likelihood ratio test for such interaction testing and s… ▽ More

    Submitted 11 January, 2021; v1 submitted 5 November, 2019; originally announced November 2019.

  29. arXiv:1902.01687  [pdf, other

    cs.LG stat.ML

    Optimal Nonparametric Inference via Deep Neural Network

    Authors: Ruiqi Liu, Ben Boukai, Zuofeng Shang

    Abstract: Deep neural network is a state-of-art method in modern science and technology. Much statistical literature have been devoted to understanding its performance in nonparametric estimation, whereas the results are suboptimal due to a redundant logarithmic sacrifice. In this paper, we show that such log-factors are not necessary. We derive upper bounds for the $L^2$ minimax risk in nonparametric estim… ▽ More

    Submitted 16 August, 2021; v1 submitted 5 February, 2019; originally announced February 2019.

  30. arXiv:1901.08571  [pdf, other

    math.ST cs.LG stat.ML

    Nonparametric Inference under B-bits Quantization

    Authors: Kexuan Li, Ruiqi Liu, Ganggang Xu, Zuofeng Shang

    Abstract: Statistical inference based on lossy or incomplete samples is often needed in research areas such as signal/image processing, medical image storage, remote sensing, signal transmission. In this paper, we propose a nonparametric testing procedure based on samples quantized to $B$ bits through a computationally efficient algorithm. Under mild technical conditions, we establish the asymptotic propert… ▽ More

    Submitted 11 August, 2023; v1 submitted 24 January, 2019; originally announced January 2019.

  31. arXiv:1807.04426  [pdf, ps, other

    stat.ME cs.LG math.ST stat.ML

    A likelihood-ratio type test for stochastic block models with bounded degrees

    Authors: Mingao Yuan, Yang Feng, Zuofeng Shang

    Abstract: A fundamental problem in network data analysis is to test Erdös-Rényi model $\mathcal{G}\left(n,\frac{a+b}{2n}\right)$ versus a bisection stochastic block model $\mathcal{G}\left(n,\frac{a}{n},\frac{b}{n}\right)$, where $a,b>0$ are constants that represent the expected degrees of the graphs and $n$ denotes the number of nodes. This problem serves as the foundation of many other problems such as te… ▽ More

    Submitted 22 November, 2018; v1 submitted 12 July, 2018; originally announced July 2018.

    Comments: In this new submission, we add a comment in introduction stating that > the classic test based on counting the $k_n$-cycles with > $k_n=\log^{1/4}{n}$ is unrealistic in practice, which is also the > motivation of our regularized LR test

  32. arXiv:1805.09948  [pdf, other

    math.ST stat.ML

    How Many Machines Can We Use in Parallel Computing for Kernel Ridge Regression?

    Authors: Meimei Liu, Zuofeng Shang, Guang Cheng

    Abstract: This paper aims to solve a basic problem in distributed statistical inference: how many machines can we use in parallel computing? In kernel ridge regression, we address this question in two important settings: nonparametric estimation and hypothesis testing. Specifically, we find a range for the number of machines under which optimal estimation/testing is achievable. The employed empirical proces… ▽ More

    Submitted 23 February, 2019; v1 submitted 24 May, 2018; originally announced May 2018.

    Comments: This work extends the work in arXiv:1512.09226 to random and multivariate design

  33. arXiv:1802.06308  [pdf, other

    math.ST stat.ME stat.ML

    Nonparametric Testing under Random Projection

    Authors: Meimei Liu, Zuofeng Shang, Guang Cheng

    Abstract: A common challenge in nonparametric inference is its high computational complexity when data volume is large. In this paper, we develop computationally efficient nonparametric testing by employing a random projection strategy. In the specific kernel ridge regression setup, a simple distance-based test statistic is proposed. Notably, we derive the minimum number of random projections that is suffic… ▽ More

    Submitted 17 February, 2018; originally announced February 2018.

  34. arXiv:1612.05907  [pdf, other

    stat.ML

    Distributed Generalized Cross-Validation for Divide-and-Conquer Kernel Ridge Regression and its Asymptotic Optimality

    Authors: Ganggang Xu, Zuofeng Shang, Guang Cheng

    Abstract: Tuning parameter selection is of critical importance for kernel ridge regression. To this date, data driven tuning method for divide-and-conquer kernel ridge regression (d-KRR) has been lacking in the literature, which limits the applicability of d-KRR for large data sets. In this paper, by modifying the Generalized Cross-validation (GCV, Wahba, 1990) score, we propose a distributed Generalized Cr… ▽ More

    Submitted 18 February, 2019; v1 submitted 18 December, 2016; originally announced December 2016.

    Comments: To appear in Journal of Computational and Graphical Statistics as an extended version of http://proceedings.mlr.press/v80/xu18f.html

  35. arXiv:1310.8633  [pdf, other

    stat.ME

    Sparse and Efficient Estimation for Partial Spline Models with Increasing Dimension

    Authors: Guang Cheng, Hao Helen Zhang, Zuofeng Shang

    Abstract: We consider model selection and estimation for partial spline models and propose a new regularization method in the context of smoothing splines. The regularization method has a simple yet elegant form, consisting of roughness penalty on the nonparametric component and shrinkage penalty on the parametric components, which can achieve function smoothing and sparse estimation simultaneously. We esta… ▽ More

    Submitted 21 November, 2013; v1 submitted 31 October, 2013; originally announced October 2013.

    Comments: 34 pages, 6 figures, 10 tables, published at Annals of the Institute of Statistical Mathematics 2013

  36. arXiv:1307.0056  [pdf, ps, other

    stat.ME

    High-Dimensional Bayesian Inference in Nonparametric Additive Models

    Authors: Zuofeng Shang, Ping Li

    Abstract: A fully Bayesian approach is proposed for ultrahigh-dimensional nonparametric additive models in which the number of additive components may be larger than the sample size, though ideally the true model is believed to include only a small number of components. Bayesian approaches can conduct stochastic model search and fulfill flexible parameter estimation by stochastic draws. The theory shows tha… ▽ More

    Submitted 23 September, 2013; v1 submitted 28 June, 2013; originally announced July 2013.

  37. arXiv:1302.1154  [pdf, ps, other

    stat.ME math.ST stat.CO

    Bayesian Ultrahigh-Dimensional Screening Via MCMC

    Authors: Zuofeng Shang, Ping Li

    Abstract: We explore the theoretical and numerical property of a fully Bayesian model selection method in sparse ultrahigh-dimensional settings, i.e., $p\gg n$, where $p$ is the number of covariates and $n$ is the sample size. Our method consists of (1) a hierarchical Bayesian model with a novel prior placed over the model space which includes a hyperparameter $t_n$ controlling the model size, and (2) an ef… ▽ More

    Submitted 12 March, 2013; v1 submitted 5 February, 2013; originally announced February 2013.

  38. arXiv:1212.6788  [pdf, ps, other

    math.ST stat.ML

    Local and global asymptotic inference in smoothing spline models

    Authors: Zuofeng Shang, Guang Cheng

    Abstract: This article studies local and global inference for smoothing spline estimation in a unified asymptotic framework. We first introduce a new technical tool called functional Bahadur representation, which significantly generalizes the traditional Bahadur representation in parametric models, that is, Bahadur [Ann. Inst. Statist. Math. 37 (1966) 577-580]. Equipped with this tool, we develop four inter… ▽ More

    Submitted 26 November, 2013; v1 submitted 30 December, 2012; originally announced December 2012.

    Comments: Published in at http://dx.doi.org/10.1214/13-AOS1164 the Annals of Statistics (http://www.imstat.org/aos/) by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-AOS-AOS1164

    Journal ref: Annals of Statistics 2013, Vol. 41, No. 5, 2608-2638

  39. arXiv:1202.0517  [pdf, other

    stat.AP

    An Application of Bayesian Variable Selection to Spatial Concurrent Linear Models

    Authors: Zuofeng Shang, Murray K. Clayton

    Abstract: Spatial concurrent linear models, in which the model coefficients are spatial processes varying at a local level, are flexible and useful tools for analyzing spatial data. One approach places stationary Gaussian process priors on the spatial processes, but in applications the data may display strong nonstationary patterns. In this article, we propose a Bayesian variable selection approach based on… ▽ More

    Submitted 2 February, 2012; originally announced February 2012.