-
Confidence Interval Construction and Conditional Variance Estimation with Dense ReLU Networks
Authors:
Carlos Misael Madrid Padilla,
Oscar Hernan Madrid Padilla,
Yik Lun Kei,
Zhi Zhang,
Yanzhen Chen
Abstract:
This paper addresses the problems of conditional variance estimation and confidence interval construction in nonparametric regression using dense networks with the Rectified Linear Unit (ReLU) activation function. We present a residual-based framework for conditional variance estimation, deriving nonasymptotic bounds for variance estimation under both heteroscedastic and homoscedastic settings. We…
▽ More
This paper addresses the problems of conditional variance estimation and confidence interval construction in nonparametric regression using dense networks with the Rectified Linear Unit (ReLU) activation function. We present a residual-based framework for conditional variance estimation, deriving nonasymptotic bounds for variance estimation under both heteroscedastic and homoscedastic settings. We relax the sub-Gaussian noise assumption, allowing the proposed bounds to accommodate sub-Exponential noise and beyond. Building on this, for a ReLU neural network estimator, we derive non-asymptotic bounds for both its conditional mean and variance estimation, representing the first result for variance estimation using ReLU networks. Furthermore, we develop a ReLU network based robust bootstrap procedure (Efron, 1992) for constructing confidence intervals for the true mean that comes with a theoretical guarantee on the coverage, providing a significant advancement in uncertainty quantification and the construction of reliable confidence intervals in deep learning settings.
△ Less
Submitted 31 December, 2024; v1 submitted 29 December, 2024;
originally announced December 2024.
-
Change Point Detection in Dynamic Graphs with Decoder-only Latent Space Model
Authors:
Yik Lun Kei,
Jialiang Li,
Hangjian Li,
Yanzhen Chen,
Oscar Hernan Madrid Padilla
Abstract:
This manuscript studies the unsupervised change point detection problem in time series of graphs using a decoder-only latent space model. The proposed framework consists of learnable prior distributions for low-dimensional graph representations and of a decoder that bridges the observed graphs and latent representations. The prior distributions of the latent spaces are learned from the observed da…
▽ More
This manuscript studies the unsupervised change point detection problem in time series of graphs using a decoder-only latent space model. The proposed framework consists of learnable prior distributions for low-dimensional graph representations and of a decoder that bridges the observed graphs and latent representations. The prior distributions of the latent spaces are learned from the observed data as empirical Bayes to assist change point detection. Specifically, the model parameters are estimated via maximum approximate likelihood, with a Group Fused Lasso regularization imposed on the prior parameters. The augmented Lagrangian is solved via Alternating Direction Method of Multipliers, and Langevin Dynamics are recruited for posterior inference. Simulation studies show good performance of the latent space model in supporting change point detection and real data experiments yield change points that align with significant events.
△ Less
Submitted 17 April, 2025; v1 submitted 6 April, 2024;
originally announced April 2024.
-
Change Point Detection on A Separable Model for Dynamic Networks
Authors:
Yik Lun Kei,
Hangjian Li,
Yanzhen Chen,
Oscar Hernan Madrid Padilla
Abstract:
This paper studies the unsupervised change point detection problem in time series of networks using the Separable Temporal Exponential-family Random Graph Model (STERGM). Inherently, dynamic network patterns can be complex due to dyadic and temporal dependence, and change points detection can identify the discrepancies in the underlying data generating processes to facilitate downstream analysis.…
▽ More
This paper studies the unsupervised change point detection problem in time series of networks using the Separable Temporal Exponential-family Random Graph Model (STERGM). Inherently, dynamic network patterns can be complex due to dyadic and temporal dependence, and change points detection can identify the discrepancies in the underlying data generating processes to facilitate downstream analysis. Moreover, the STERGM that utilizes network statistics to represent the structural patterns is a flexible and parsimonious model to fit dynamic networks. We propose a new estimator derived from the Alternating Direction Method of Multipliers (ADMM) procedure and Group Fused Lasso (GFL) regularization to simultaneously detect multiple time points, where the parameters of a time-heterogeneous STERGM have changed. We also provide a Bayesian information criterion for model selection and an R package CPDstergm to implement the proposed method. Experiments on simulated and real data show good performance of the proposed framework.
△ Less
Submitted 2 March, 2025; v1 submitted 30 March, 2023;
originally announced March 2023.
-
A Partially Separable Model for Dynamic Valued Networks
Authors:
Yik Lun Kei,
Yanzhen Chen,
Oscar Hernan Madrid Padilla
Abstract:
The Exponential-family Random Graph Model (ERGM) is a powerful model to fit networks with complex structures. However, for dynamic valued networks whose observations are matrices of counts that evolve over time, the development of the ERGM framework is still in its infancy. To facilitate the modeling of dyad value increment and decrement, a Partially Separable Temporal ERGM is proposed for dynamic…
▽ More
The Exponential-family Random Graph Model (ERGM) is a powerful model to fit networks with complex structures. However, for dynamic valued networks whose observations are matrices of counts that evolve over time, the development of the ERGM framework is still in its infancy. To facilitate the modeling of dyad value increment and decrement, a Partially Separable Temporal ERGM is proposed for dynamic valued networks. The parameter learning algorithms inherit state-of-the-art estimation techniques to approximate the maximum likelihood, by drawing Markov chain Monte Carlo (MCMC) samples conditioning on the valued network from the previous time step. The ability of the proposed model to interpret network dynamics and forecast temporal trends is demonstrated with real data.
△ Less
Submitted 16 June, 2023; v1 submitted 26 May, 2022;
originally announced May 2022.