Showing 1–30 of 30 results for author: Wong, R K W

Searching in archive stat.
  1. arXiv:2506.20048  [pdf, ps, other]

    stat.ML cs.LG

    A Principled Path to Fitted Distributional Evaluation

    Authors: Sungee Hong, Jiayi Wang, Zhengling Qi, Raymond Ka Wai Wong

    Abstract: In reinforcement learning, distributional off-policy evaluation (OPE) focuses on estimating the return distribution of a target policy using offline data collected under a different policy. This work focuses on extending the widely used fitted-Q evaluation -- developed for expectation-based reinforcement learning -- to the distributional OPE setting. We refer to this extension as fitted distributi…

    Submitted 24 June, 2025; originally announced June 2025.

  2. arXiv:2504.13467  [pdf, ps, other]

    stat.ME

    Efficient Estimation under Multiple Missing Patterns via Balancing Weights

    Authors: Jianing Dong, Raymond K. W. Wong, Kwun Chuen Gary Chan

    Abstract: As one of the most commonly seen data challenges, missing data, in particular, multiple, non-monotone missing patterns, complicates estimation and inference due to the fact that missingness mechanisms are often not missing at random, and conventional methods cannot be applied. Pattern graphs have recently been proposed as a tool to systematically relate various observed patterns in the sample. We…

    Submitted 18 April, 2025; originally announced April 2025.

    Comments: arXiv admin note: substantial text overlap with arXiv:2402.08873

  3. arXiv:2402.08873  [pdf, ps, other]

    stat.ME

    Balancing Weights for Non-monotone Missing Data

    Authors: Jianing Dong, Raymond K. W. Wong, Kwun Chuen Gary Chan

    Abstract: Balancing weights have been widely applied to single or monotone missingness due to empirical advantages over likelihood-based methods and inverse probability weighting approaches. This paper considers non-monotone missing data under the complete-case missing variable condition (CCMV), a case of missing not at random (MNAR). Using relationships between each missing pattern and the complete-case su…

    Submitted 12 December, 2024; v1 submitted 13 February, 2024; originally announced February 2024.

  4. arXiv:2402.01900  [pdf, other]

    stat.ML cs.LG

    Distributional Off-policy Evaluation with Bellman Residual Minimization

    Authors: Sungee Hong, Zhengling Qi, Raymond K. W. Wong

    Abstract: We study distributional off-policy evaluation (OPE), of which the goal is to learn the distribution of the return for a target policy using offline data generated by a different policy. The theoretical foundation of many existing work relies on the supremum-extended statistical distances such as supremum-Wasserstein distance, which are hard to estimate. In contrast, we study the more manageable ex…

    Submitted 12 March, 2025; v1 submitted 2 February, 2024; originally announced February 2024.

  5. arXiv:2310.20537  [pdf, other]

    stat.ME stat.ML

    Directed Cyclic Graph for Causal Discovery from Multivariate Functional Data

    Authors: Saptarshi Roy, Raymond K. W. Wong, Yang Ni

    Abstract: Discovering causal relationship using multivariate functional data has received a significant amount of attention very recently. In this article, we introduce a functional linear structural equation model for causal structure learning when the underlying graph involving the multivariate functions may have cycles. To enhance interpretability, our model involves a low-dimensional causal embedded spa…

    Submitted 31 October, 2023; originally announced October 2023.

    Comments: 36 pages, 2 figures, 7 tables

  6. arXiv:2309.08039  [pdf, other]

    stat.ME math.ST

    Flexible Functional Treatment Effect Estimation

    Authors: Jiayi Wang, Raymond K. W. Wong, Xiaoke Zhang, Kwun Chuen Gary Chan

    Abstract: We study treatment effect estimation with functional treatments where the average potential outcome functional is a function of functions, in contrast to continuous treatment effect estimation where the target is a function of real numbers. By considering a flexible scalar-on-function marginal structural model, a weight-modified kernel ridge regression (WMKRR) is adopted for estimation. The weight…

    Submitted 12 November, 2024; v1 submitted 14 September, 2023; originally announced September 2023.

  7. Bayesian Nonlinear Tensor Regression with Functional Fused Elastic Net Prior

    Authors: Shuoli Chen, Kejun He, Shiyuan He, Yang Ni, Raymond K. W. Wong

    Abstract: Tensor regression methods have been widely used to predict a scalar response from covariates in the form of a multiway array. In many applications, the regions of tensor covariates used for prediction are often spatially connected with unknown shapes and discontinuous jumps on the boundaries. Moreover, the relationship between the response and the tensor covariates can be nonlinear. In this articl…

    Submitted 16 February, 2023; originally announced February 2023.

    Journal ref: Technometrics, 65:4, 524-536 (2023)

  8. arXiv:2301.12540  [pdf, other]

    stat.ML cs.LG

    Implicit Regularization for Group Sparsity

    Authors: Jiangyuan Li, Thanh V. Nguyen, Chinmay Hegde, Raymond K. W. Wong

    Abstract: We study the implicit regularization of gradient descent towards structured sparsity via a novel neural reparameterization, which we call a diagonally grouped linear neural network. We show the following intriguing property of our reparameterization: gradient descent over the squared regression loss, without any explicit regularization, biases towards solutions with a group sparsity structure. In…

    Submitted 29 January, 2023; originally announced January 2023.

    Comments: accepted by ICLR 2023

  9. arXiv:2206.12891  [pdf, other]

    stat.ME

    Hierarchical nuclear norm penalization for multi-view data

    Authors: Sangyoon Yi, Raymond K. W. Wong, Irina Gaynanova

    Abstract: The prevalence of data collected on the same set of samples from multiple sources (i.e., multi-view data) has prompted significant development of data integration methods based on low-rank matrix factorizations. These methods decompose signal matrices from each view into the sum of shared and individual structures, which are further used for dimension reduction, exploratory analyses, and quantifyi…

    Submitted 26 June, 2022; originally announced June 2022.

    Comments: 39 pages, 10 figures, 3 tables

  10. arXiv:2109.04640  [pdf, other]

    cs.LG stat.ME

    Projected State-action Balancing Weights for Offline Reinforcement Learning

    Authors: Jiayi Wang, Zhengling Qi, Raymond K. W. Wong

    Abstract: Offline policy evaluation (OPE) is considered a fundamental and challenging problem in reinforcement learning (RL). This paper focuses on the value estimation of a target policy based on pre-collected data generated from a possibly different policy, under the framework of infinite-horizon Markov decision processes. Motivated by the recently developed marginal importance sampling method in RL and t…

    Submitted 9 June, 2022; v1 submitted 9 September, 2021; originally announced September 2021.

  11. arXiv:2108.05574  [pdf, other]

    stat.ML cs.LG

    Implicit Sparse Regularization: The Impact of Depth and Early Stopping

    Authors: Jiangyuan Li, Thanh V. Nguyen, Chinmay Hegde, Raymond K. W. Wong

    Abstract: In this paper, we study the implicit bias of gradient descent for sparse regression. We extend results on regression with quadratic parametrization, which amounts to depth-2 diagonal linear networks, to more general depth-N networks, under more realistic settings of noise and correlated designs. We show that early stopping is crucial for gradient descent to converge to a sparse model, a phenomenon…

    Submitted 26 October, 2021; v1 submitted 12 August, 2021; originally announced August 2021.

    Comments: 32 pages, accepted by NeurIPS 2021. arXiv admin note: text overlap with arXiv:1909.05122 by other authors

  12. arXiv:2106.05850  [pdf, other]

    stat.ML cs.LG math.ST stat.ME

    Matrix Completion with Model-free Weighting

    Authors: Jiayi Wang, Raymond K. W. Wong, Xiaojun Mao, Kwun Chuen Gary Chan

    Abstract: In this paper, we propose a novel method for matrix completion under general non-uniform missing structures. By controlling an upper bound of a novel balancing error, we construct weights that can actively adjust for the non-uniformity in the empirical risk without explicitly modeling the observation probabilities, and can be computed efficiently via convex optimization. The recovered matrix based…

    Submitted 9 June, 2021; originally announced June 2021.

    Comments: Proceedings of the 38th International Conference on Machine Learning, PMLR 139, 2021

  13. arXiv:2103.03437  [pdf, other]

    stat.ME

    Estimation of Partially Conditional Average Treatment Effect by Hybrid Kernel-covariate Balancing

    Authors: Jiayi Wang, Raymond K. W. Wong, Shu Yang, Kwun Chuen Gary Chan

    Abstract: We study nonparametric estimation for the partially conditional average treatment effect, defined as the treatment effect function over an interested subset of confounders. We propose a hybrid kernel weighting estimator where the weights aim to control the balancing error of any function of the confounders from a reproducing kernel Hilbert space after kernel smoothing over the subset of interested…

    Submitted 4 March, 2021; originally announced March 2021.

    Comments: 19 pages, 2 figures

  14. arXiv:2010.13568  [pdf, other]

    stat.ML cs.LG stat.ME

    CP Degeneracy in Tensor Regression

    Authors: Ya Zhou, Raymond K. W. Wong, Kejun He

    Abstract: Tensor linear regression is an important and useful tool for analyzing tensor data. To deal with high dimensionality, CANDECOMP/PARAFAC (CP) low-rank constraints are often imposed on the coefficient tensor parameter in the (penalized) $M$-estimation. However, we show that the corresponding optimization may not be attainable, and when this happens, the estimator is not well-defined. This is closely…

    Submitted 22 October, 2020; originally announced October 2020.

    Journal ref: IEEE Access, 9:1, 7775-7788 (2021)

  15. arXiv:2009.11452  [pdf, ps, other]

    stat.ME stat.AP

    A Wavelet-Based Independence Test for Functional Data with an Application to MEG Functional Connectivity

    Authors: Rui Miao, Xiaoke Zhang, Raymond K. W. Wong

    Abstract: Measuring and testing the dependency between multiple random functions is often an important task in functional data analysis. In the literature, a model-based method relies on a model which is subject to the risk of model misspecification, while a model-free method only provides a correlation measure which is inadequate to test independence. In this paper, we adopt the Hilbert-Schmidt Independenc…

    Submitted 23 September, 2020; originally announced September 2020.

  16. Broadcasted Nonparametric Tensor Regression

    Authors: Ya Zhou, Raymond K. W. Wong, Kejun He

    Abstract: We propose a novel use of a broadcasting operation, which distributes univariate functions to all entries of the tensor covariate, to model the nonlinearity in tensor regression nonparametrically. A penalized estimation and the corresponding algorithm are proposed. Our theoretical investigation, which allows the dimensions of the tensor covariate to diverge, indicates that the proposed estimation…

    Submitted 23 March, 2024; v1 submitted 29 August, 2020; originally announced August 2020.

  17. Low-Rank Covariance Function Estimation for Multidimensional Functional Data

    Authors: Jiayi Wang, Raymond K. W. Wong, Xiaoke Zhang

    Abstract: Multidimensional function data arise from many fields nowadays. The covariance function plays an important role in the analysis of such increasingly common data. In this paper, we propose a novel nonparametric covariance function estimation approach under the framework of reproducing kernel Hilbert spaces (RKHS) that can handle both sparse and dense functional data. We extend multilinear rank stru…

    Submitted 29 August, 2020; originally announced August 2020.

    Comments: 25 pages, 4 figures

  18. arXiv:2006.10400  [pdf, other]

    stat.ML cs.LG

    Median Matrix Completion: from Embarrassment to Optimality

    Authors: Weidong Liu, Xiaojun Mao, Raymond K. W. Wong

    Abstract: In this paper, we consider matrix completion with absolute deviation loss and obtain an estimator of the median matrix. Despite several appealing properties of median, the non-smooth absolute deviation loss leads to computational challenge for large-scale data sets which are increasingly common among matrix completion problems. A simple solution to large-scale problems is parallel computing. Howev…

    Submitted 18 June, 2020; originally announced June 2020.

    Comments: 26 pages, 1 figure, 5 tables

  19. arXiv:1911.11983  [pdf, ps, other]

    cs.LG stat.ML

    Benefits of Jointly Training Autoencoders: An Improved Neural Tangent Kernel Analysis

    Authors: Thanh V. Nguyen, Raymond K. W. Wong, Chinmay Hegde

    Abstract: A remarkable recent discovery in machine learning has been that deep neural networks can achieve impressive performance (in terms of both lower training error and higher generalization capacity) in the regime where they are massively over-parameterized. Consequently, over the past year, the community has devoted growing interest in analyzing optimization and generalization properties of over-param…

    Submitted 2 March, 2020; v1 submitted 27 November, 2019; originally announced November 2019.

    Comments: Added Sections 3.2 and 3.4 on inductive biases. Fixed an error in deriving the neural tangent kernel in Section 3.3

  20. Adjusting for Spatial Effects in Genomic Prediction

    Authors: Xiaojun Mao, Somak Dutta, Raymond K. W. Wong, Dan Nettleton

    Abstract: This paper investigates the problem of adjusting for spatial effects in genomic prediction. Despite being seldomly considered in genomic prediction, spatial effects often affect phenotypic measurements of plants. We consider a Gaussian random field model with an additive covariance structure that incorporates genotype effects, spatial effects and subpopulation effects. An empirical study shows the…

    Submitted 7 June, 2020; v1 submitted 26 July, 2019; originally announced July 2019.

    Comments: 22 pages, 6 figures, 10 tables

  21. arXiv:1812.07813  [pdf, ps, other]

    stat.ML cs.LG math.ST stat.ME

    Matrix Completion under Low-Rank Missing Mechanism

    Authors: Xiaojun Mao, Raymond K. W. Wong, Song Xi Chen

    Abstract: Matrix completion is a modern missing data problem where both the missing structure and the underlying parameter are high dimensional. Although missing structure is a key component to any missing data problems, existing matrix completion methods often assume a simple uniform missing mechanism. In this work, we study matrix completion from corrupted data under a novel low-rank missing mechanism. Th…

    Submitted 19 March, 2020; v1 submitted 19 December, 2018; originally announced December 2018.

    Comments: 29 pages, 0 figures

  22. arXiv:1809.00420  [pdf, other]

    stat.ME

    Network estimation via graphon with node features

    Authors: Yi Su, Raymond K. W. Wong, Thomas C. M. Lee

    Abstract: Estimating the probabilities of linkages in a network has gained increasing interest in recent years. One popular model for network analysis is the exchangeable graph model (ExGM) characterized by a two-dimensional function known as a graphon. Estimating an underlying graphon becomes the key of such analysis. Several nonparametric estimation methods have been proposed, and some are provably consis…

    Submitted 2 September, 2018; originally announced September 2018.

  23. arXiv:1806.00572  [pdf, ps, other]

    stat.ML cs.LG

    Autoencoders Learn Generative Linear Models

    Authors: Thanh V. Nguyen, Raymond K. W. Wong, Chinmay Hegde

    Abstract: We provide a series of results for unsupervised learning with autoencoders. Specifically, we study shallow two-layer autoencoder architectures with shared weights. We focus on three generative models for data that are common in statistical machine learning: (i) the mixture-of-gaussians model, (ii) the sparse coding model, and (iii) the sparsity model with non-negative coefficients. For each of the…

    Submitted 15 February, 2019; v1 submitted 1 June, 2018; originally announced June 2018.

    Comments: Experimental study on synthesis data added. Typos fixed

  24. arXiv:1711.03638  [pdf, ps, other]

    stat.ML cs.LG

    Provably Accurate Double-Sparse Coding

    Authors: Thanh V. Nguyen, Raymond K. W. Wong, Chinmay Hegde

    Abstract: Sparse coding is a crucial subroutine in algorithms for various signal processing, deep learning, and other machine learning applications. The central goal is to learn an overcomplete dictionary that can sparsely represent a given input dataset. However, a key challenge is that storage, transmission, and processing of the learned dictionary can be untenably high if the data dimension is high. In t…

    Submitted 12 December, 2017; v1 submitted 9 November, 2017; originally announced November 2017.

    Comments: 40 pages. An abbreviated conference version appears at AAAI 2018

  25. arXiv:1701.06263  [pdf, other]

    stat.ME

    Nonparametric Operator-Regularized Covariance Function Estimation for Functional Data

    Authors: Raymond K. W. Wong, Xiaoke Zhang

    Abstract: In functional data analysis (FDA), covariance function is fundamental not only as a critical quantity for understanding elementary aspects of functional data but also as an indispensable ingredient for many advanced FDA methods. This paper develops a new class of nonparametric covariance function estimators in terms of various spectral regularizations of an operator associated with a reproducing k…

    Submitted 22 January, 2017; originally announced January 2017.

    Comments: 26 pages, 3 figures

  26. arXiv:1508.07083  [pdf, other]

    stat.AP astro-ph.IM

    Detecting Abrupt Changes in the Spectra of High-Energy Astrophysical Sources

    Authors: Raymond K. W. Wong, Vinay L. Kashyap, Thomas C. M. Lee, David A. van Dyk

    Abstract: Variable-intensity astronomical sources are the result of complex and often extreme physical processes. Abrupt changes in source intensity are typically accompanied by equally sudden spectral shifts, i.e., sudden changes in the wavelength distribution of the emission. This article develops a method for modeling photon counts collected from observation of such sources. We embed change points into a…

    Submitted 10 December, 2015; v1 submitted 27 August, 2015; originally announced August 2015.

    Comments: 30 pages, 6 figures

  27. arXiv:1503.00214  [pdf, other]

    stat.ML

    Matrix Completion with Noisy Entries and Outliers

    Authors: Raymond K. W. Wong, Thomas C. M. Lee

    Abstract: This paper considers the problem of matrix completion when the observed entries are noisy and contain outliers. It begins with introducing a new optimization criterion for which the recovered matrix is defined as its solution. This criterion uses the celebrated Huber function from the robust statistics literature to downweigh the effects of outliers. A practical algorithm is developed to solve the…

    Submitted 27 December, 2017; v1 submitted 28 February, 2015; originally announced March 2015.

    Comments: 33 pages, 2 figures

  28. arXiv:1411.4723  [pdf, other]

    stat.ME

    A Frequentist Approach to Computer Model Calibration

    Authors: Raymond K. W. Wong, Curtis B. Storlie, Thomas C. M. Lee

    Abstract: This paper considers the computer model calibration problem and provides a general frequentist solution. Under the proposed framework, the data model is semi-parametric with a nonparametric discrepancy function which accounts for any discrepancy between the physical reality and the computer model. In an attempt to solve a fundamentally important (but often ignored) identifiability issue between th…

    Submitted 10 September, 2015; v1 submitted 17 November, 2014; originally announced November 2014.

    Comments: 21 pages, 2 figures

  29. arXiv:1406.0581  [pdf, other]

    stat.ME stat.AP

    Fiber Direction Estimation, Smoothing and Tracking in Diffusion MRI

    Authors: Raymond K. W. Wong, Thomas C. M. Lee, Debashis Paul, Jie Peng, the Alzheimer's Disease Neuroimaging Initiative

    Abstract: Diffusion magnetic resonance imaging is an imaging technology designed to probe anatomical architectures of biological samples in an in vivo and non-invasive manner through measuring water diffusion. The contribution of this paper is threefold. First it proposes a new method to identify and estimate multiple diffusion directions within a voxel through a new and identifiable parametrization of the…

    Submitted 24 September, 2015; v1 submitted 3 June, 2014; originally announced June 2014.

    Comments: 21 pages, 5 figures

  30. Automatic estimation of flux distributions of astrophysical source populations

    Authors: Raymond K. W. Wong, Paul Baines, Alexander Aue, Thomas C. M. Lee, Vinay L. Kashyap

    Abstract: In astrophysics a common goal is to infer the flux distribution of populations of scientifically interesting objects such as pulsars or supernovae. In practice, inference for the flux distribution is often conducted using the cumulative distribution of the number of sources detected at a given sensitivity. The resulting "$\log(N>S)$-$\log (S)$" relationship can be used to compare and evaluate theo…

    Submitted 24 November, 2014; v1 submitted 4 May, 2013; originally announced May 2013.

    Comments: Published at http://dx.doi.org/10.1214/14-AOAS750 in the Annals of Applied Statistics (http://www.imstat.org/aoas/) by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-AOAS-AOAS750

    Journal ref: Annals of Applied Statistics 2014, Vol. 8, No. 3, 1690-1712