Skip to main content

Showing 1–50 of 77 results for author: Mao, W

Searching in archive stat. Search in all archives.
.
  1. arXiv:2507.07025  [pdf, ps, other

    stat.ME math.ST

    Conformal Link Prediction with False Discovery Rate Control

    Authors: Wenqin Du, Wanteng Ma, Dong Xia, Yuan Zhang, Wen Zhou

    Abstract: We propose a new method for predicting multiple missing links in partially observed networks while controlling the false discovery rate (FDR), a largely unresolved challenge in network analysis. The main difficulty lies in handling complex dependencies and unknown, heterogeneous missing patterns. We introduce conformal link prediction ({\tt clp}), a distribution-free procedure grounded in the exch… ▽ More

    Submitted 9 July, 2025; originally announced July 2025.

  2. arXiv:2505.24261  [pdf, other

    cs.LG stat.ML

    Taming Hyperparameter Sensitivity in Data Attribution: Practical Selection Without Costly Retraining

    Authors: Weiyi Wang, Junwei Deng, Yuzheng Hu, Shiyuan Zhang, Xirui Jiang, Runting Zhang, Han Zhao, Jiaqi W. Ma

    Abstract: Data attribution methods, which quantify the influence of individual training data points on a machine learning model, have gained increasing popularity in data-centric applications in modern AI. Despite a recent surge of new methods developed in this space, the impact of hyperparameter tuning in these methods remains under-explored. In this work, we present the first large-scale empirical study t… ▽ More

    Submitted 30 May, 2025; originally announced May 2025.

  3. arXiv:2502.14424  [pdf, ps, other

    stat.ML cs.AI cs.LG stat.ME

    Distribution Matching for Self-Supervised Transfer Learning

    Authors: Yuling Jiao, Wensen Ma, Defeng Sun, Hansheng Wang, Yang Wang

    Abstract: In this paper, we propose a novel self-supervised transfer learning method called \underline{\textbf{D}}istribution \underline{\textbf{M}}atching (DM), which drives the representation distribution toward a predefined reference distribution while preserving augmentation invariance. DM results in a learned representation space that is intuitively structured and therefore easy to interpret. Experim… ▽ More

    Submitted 2 July, 2025; v1 submitted 20 February, 2025; originally announced February 2025.

  4. arXiv:2501.14602  [pdf, other

    stat.ME

    Minimax Optimal Design with Spillover and Carryover Effects

    Authors: Haoyang Yu, Wei Ma, Hanzhong Liu

    Abstract: In various applications, the potential outcome of a unit may be influenced by the treatments received by other units, a phenomenon known as interference, as well as by prior treatments, referred to as carryover effects. These phenomena violate the stable unit treatment value assumption and pose significant challenges in causal inference. To address these complexities, we propose a minimax optimal… ▽ More

    Submitted 24 January, 2025; originally announced January 2025.

  5. arXiv:2412.01335  [pdf, other

    cs.LG stat.ML

    A Versatile Influence Function for Data Attribution with Non-Decomposable Loss

    Authors: Junwei Deng, Weijing Tang, Jiaqi W. Ma

    Abstract: Influence function, a technique rooted in robust statistics, has been adapted in modern machine learning for a novel application: data attribution -- quantifying how individual training data points affect a model's predictions. However, the common derivation of influence functions in the data attribution literature is limited to loss functions that can be decomposed into a sum of individual data p… ▽ More

    Submitted 2 December, 2024; originally announced December 2024.

  6. arXiv:2411.16220  [pdf, other

    stat.ME math.ST

    On the achievability of efficiency bounds for covariate-adjusted response-adaptive randomization

    Authors: Jiahui Xin, Wei Ma

    Abstract: In the context of precision medicine, covariate-adjusted response-adaptive randomization (CARA) has garnered much attention from both academia and industry due to its benefits in providing ethical and tailored treatment assignments based on patients' profiles while still preserving favorable statistical properties. Recent years have seen substantial progress in understanding the inference for vari… ▽ More

    Submitted 25 November, 2024; originally announced November 2024.

  7. arXiv:2411.06329  [pdf, other

    cs.LG stat.ML

    Regret Minimization and Statistical Inference in Online Decision Making with High-dimensional Covariates

    Authors: Congyuan Duan, Wanteng Ma, Jiashuo Jiang, Dong Xia

    Abstract: This paper investigates regret minimization, statistical inference, and their interplay in high-dimensional online decision-making based on the sparse linear context bandit model. We integrate the $\varepsilon$-greedy bandit algorithm for decision-making with a hard thresholding algorithm for estimating sparse bandit parameters and introduce an inference framework based on a debiasing method using… ▽ More

    Submitted 17 May, 2025; v1 submitted 9 November, 2024; originally announced November 2024.

  8. arXiv:2410.11225  [pdf, other

    math.ST stat.ML

    Statistical Inference in Tensor Completion: Optimal Uncertainty Quantification and Statistical-to-Computational Gaps

    Authors: Wanteng Ma, Dong Xia

    Abstract: This paper presents a simple yet efficient method for statistical inference of tensor linear forms using incomplete and noisy observations. Under the Tucker low-rank tensor model and the missing-at-random assumption, we utilize an appropriate initial estimate along with a debiasing technique followed by a one-step power iteration to construct an asymptotically normal test statistic. This method is… ▽ More

    Submitted 1 November, 2024; v1 submitted 14 October, 2024; originally announced October 2024.

  9. arXiv:2410.10056  [pdf, ps, other

    cs.LG cs.AI stat.ML

    The Epochal Sawtooth Phenomenon: Unveiling Training Loss Oscillations in Adam and Other Optimizers

    Authors: Qi Liu, Wanjing Ma

    Abstract: In this paper, we identify and analyze a recurring training loss pattern, which we term the \textit{Epochal Sawtooth Phenomenon (ESP)}, commonly observed during training with adaptive gradient-based optimizers, particularly Adam optimizer. This pattern is characterized by a sharp drop in loss at the beginning of each epoch, followed by a gradual increase, resulting in a sawtooth-shaped loss curve.… ▽ More

    Submitted 17 June, 2025; v1 submitted 13 October, 2024; originally announced October 2024.

    Comments: 15 pages, 21 figures

  10. arXiv:2409.18153  [pdf, other

    cs.LG stat.ML

    Most Influential Subset Selection: Challenges, Promises, and Beyond

    Authors: Yuzheng Hu, Pingbang Hu, Han Zhao, Jiaqi W. Ma

    Abstract: How can we attribute the behaviors of machine learning models to their training data? While the classic influence function sheds light on the impact of individual samples, it often fails to capture the more complex and pronounced collective influence of a set of samples. To tackle this challenge, we study the Most Influential Subset Selection (MISS) problem, which aims to identify a subset of trai… ▽ More

    Submitted 8 January, 2025; v1 submitted 25 September, 2024; originally announced September 2024.

    Comments: Accepted at the 38th Conference on Neural Information Processing Systems (NeurIPS 2024) Edit: Added discussion on a concurrent work

  11. arXiv:2409.04126  [pdf, other

    stat.ME

    Incorporating external data for analyzing randomized clinical trials: A transfer learning approach

    Authors: Yujia Gu, Hanzhong Liu, Wei Ma

    Abstract: Randomized clinical trials are the gold standard for analyzing treatment effects, but high costs and ethical concerns can limit recruitment, potentially leading to invalid inferences. Incorporating external trial data with similar characteristics into the analysis using transfer learning appears promising for addressing these issues. In this paper, we present a formal framework for applying transf… ▽ More

    Submitted 6 September, 2024; originally announced September 2024.

  12. arXiv:2409.03505  [pdf, other

    stat.ML cs.LG

    Survey of Data-driven Newsvendor: Unified Analysis and Spectrum of Achievable Regrets

    Authors: Zhuoxin Chen, Will Ma

    Abstract: In the Newsvendor problem, the goal is to guess the number that will be drawn from some distribution, with asymmetric consequences for guessing too high vs. too low. In the data-driven version, the distribution is unknown, and one must work with samples from the distribution. Data-driven Newsvendor has been studied under many variants: additive vs. multiplicative regret, high probability vs. expec… ▽ More

    Submitted 5 May, 2025; v1 submitted 5 September, 2024; originally announced September 2024.

  13. arXiv:2408.08533  [pdf, ps, other

    stat.ML cs.LG

    Unsupervised Transfer Learning via Adversarial Contrastive Training

    Authors: Chenguang Duan, Yuling Jiao, Huazhen Lin, Wensen Ma, Jerry Zhijian Yang

    Abstract: Learning a data representation for downstream supervised learning tasks under unlabeled scenario is both critical and challenging. In this paper, we propose a novel unsupervised transfer learning approach using adversarial contrastive training (ACT). Our experimental results demonstrate outstanding classification accuracy with both fine-tuned linear probe and K-NN protocol across various datasets,… ▽ More

    Submitted 16 August, 2024; originally announced August 2024.

  14. arXiv:2408.01697  [pdf, other

    cs.LG cs.AI stat.ML

    Invariant Graph Learning Meets Information Bottleneck for Out-of-Distribution Generalization

    Authors: Wenyu Mao, Jiancan Wu, Haoyang Liu, Yongduo Sui, Xiang Wang

    Abstract: Graph out-of-distribution (OOD) generalization remains a major challenge in graph learning since graph neural networks (GNNs) often suffer from severe performance degradation under distribution shifts. Invariant learning, aiming to extract invariant features across varied distributions, has recently emerged as a promising approach for OOD generation. Despite the great success of invariant learning… ▽ More

    Submitted 12 February, 2025; v1 submitted 3 August, 2024; originally announced August 2024.

    Comments: The article has been accepted by Frontiers of Computer Science (FCS), with the DOI: {10.1007/s11704-025-40798-3}

  15. arXiv:2407.05001  [pdf, other

    stat.ME

    Treatment effect estimation under covariate-adaptive randomization with heavy-tailed outcomes

    Authors: Hongzi Li, Wei Ma, Yingying Ma, Hanzhong Liu

    Abstract: Randomized experiments are the gold standard for investigating causal relationships, with comparisons of potential outcomes under different treatment groups used to estimate treatment effects. However, outcomes with heavy-tailed distributions pose significant challenges to traditional statistical approaches. While recent studies have explored these issues under simple randomization, their applicat… ▽ More

    Submitted 6 July, 2024; originally announced July 2024.

  16. arXiv:2405.18856  [pdf, other

    stat.ME math.ST

    Inference under covariate-adaptive randomization with many strata

    Authors: Jiahui Xin, Hanzhong Liu, Wei Ma

    Abstract: Covariate-adaptive randomization is widely employed to balance baseline covariates in interventional studies such as clinical trials and experiments in development economics. Recent years have witnessed substantial progress in inference under covariate-adaptive randomization with a fixed number of strata. However, concerns have been raised about the impact of a large number of strata on its design… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

  17. arXiv:2404.11509  [pdf, other

    stat.ML cs.LG

    VC Theory for Inventory Policies

    Authors: Yaqi Xie, Will Ma, Linwei Xin

    Abstract: Advances in computational power and AI have increased interest in reinforcement learning approaches to inventory management. This paper provides a theoretical foundation for these approaches and investigates the benefits of restricting to policy structures that are well-established by inventory theory. In particular, we prove generalization guarantees for learning several well-known classes of inv… ▽ More

    Submitted 7 July, 2024; v1 submitted 17 April, 2024; originally announced April 2024.

  18. arXiv:2402.11742  [pdf, other

    cs.LG stat.ML

    Balanced Data, Imbalanced Spectra: Unveiling Class Disparities with Spectral Imbalance

    Authors: Chiraag Kaushik, Ran Liu, Chi-Heng Lin, Amrit Khera, Matthew Y Jin, Wenrui Ma, Vidya Muthukumar, Eva L Dyer

    Abstract: Classification models are expected to perform equally well for different classes, yet in practice, there are often large gaps in their performance. This issue of class bias is widely studied in cases of datasets with sample imbalance, but is relatively overlooked in balanced datasets. In this work, we introduce the concept of spectral imbalance in features as a potential source for class dispariti… ▽ More

    Submitted 3 June, 2024; v1 submitted 18 February, 2024; originally announced February 2024.

    Comments: 25 pages, 9 figures

  19. arXiv:2312.01266  [pdf, ps, other

    stat.ME math.ST

    A unified framework for covariate adjustment under stratified randomization

    Authors: Fuyi Tu, Wei Ma, Hanzhong Liu

    Abstract: Randomization, as a key technique in clinical trials, can eliminate sources of bias and produce comparable treatment groups. In randomized experiments, the treatment effect is a parameter of general interest. Researchers have explored the validity of using linear models to estimate the treatment effect and perform covariate adjustment and thus improve the estimation efficiency. However, the relati… ▽ More

    Submitted 2 December, 2023; originally announced December 2023.

  20. arXiv:2312.00305  [pdf, other

    stat.ME cs.LG math.ST stat.ML

    Multiple Testing of Linear Forms for Noisy Matrix Completion

    Authors: Wanteng Ma, Lilun Du, Dong Xia, Ming Yuan

    Abstract: Many important tasks of large-scale recommender systems can be naturally cast as testing multiple linear forms for noisy matrix completion. These problems, however, present unique challenges because of the subtle bias-and-variance tradeoff of and an intricate dependence among the estimated entries induced by the low-rank structure. In this paper, we develop a general approach to overcome these dif… ▽ More

    Submitted 10 March, 2025; v1 submitted 30 November, 2023; originally announced December 2023.

  21. arXiv:2311.17445  [pdf, ps, other

    stat.ME math.ST

    Interaction tests with covariate-adaptive randomization

    Authors: Likun Zhang, Wei Ma

    Abstract: Treatment-covariate interaction tests are commonly applied by researchers to examine whether the treatment effect varies across patient subgroups defined by baseline characteristics. The objective of this study is to explore treatment-covariate interaction tests involving covariate-adaptive randomization. Without assuming a parametric data generating model, we investigate usual interaction tests a… ▽ More

    Submitted 10 March, 2024; v1 submitted 29 November, 2023; originally announced November 2023.

  22. arXiv:2311.01327  [pdf, other

    cs.LG cs.DS stat.ML

    High-dimensional Linear Bandits with Knapsacks

    Authors: Wanteng Ma, Dong Xia, Jiashuo Jiang

    Abstract: We study the contextual bandits with knapsack (CBwK) problem under the high-dimensional setting where the dimension of the feature is large. The reward of pulling each arm equals the multiplication of a sparse high-dimensional weight vector and the feature of the current arrival, with additional random noise. In this paper, we investigate how to exploit this sparsity structure to achieve improved… ▽ More

    Submitted 2 November, 2023; originally announced November 2023.

  23. arXiv:2308.01314  [pdf, other

    cs.LG cs.SE stat.ML

    Evaluating the Robustness of Test Selection Methods for Deep Neural Networks

    Authors: Qiang Hu, Yuejun Guo, Xiaofei Xie, Maxime Cordy, Wei Ma, Mike Papadakis, Yves Le Traon

    Abstract: Testing deep learning-based systems is crucial but challenging due to the required time and labor for labeling collected raw data. To alleviate the labeling effort, multiple test selection methods have been proposed where only a subset of test data needs to be labeled while satisfying testing requirements. However, we observe that such methods with reported promising results are only evaluated und… ▽ More

    Submitted 29 July, 2023; originally announced August 2023.

    Comments: 12 pages

  24. Discovering Dynamic Causal Space for DAG Structure Learning

    Authors: Fangfu Liu, Wenchang Ma, An Zhang, Xiang Wang, Yueqi Duan, Tat-Seng Chua

    Abstract: Discovering causal structure from purely observational data (i.e., causal discovery), aiming to identify causal relationships among variables, is a fundamental task in machine learning. The recent invention of differentiable score-based DAG learners is a crucial enabler, which reframes the combinatorial optimization problem into a differentiable optimization with a DAG constraint over directed gra… ▽ More

    Submitted 11 December, 2023; v1 submitted 5 June, 2023; originally announced June 2023.

    Comments: Accepted by KDD 2023. Our codes are available at https://github.com/liuff19/CASPER

  25. arXiv:2303.03187  [pdf, other

    cs.LG stat.ML

    Boosting Differentiable Causal Discovery via Adaptive Sample Reweighting

    Authors: An Zhang, Fangfu Liu, Wenchang Ma, Zhibo Cai, Xiang Wang, Tat-seng Chua

    Abstract: Under stringent model type and variable distribution assumptions, differentiable score-based causal discovery methods learn a directed acyclic graph (DAG) from observational data by evaluating candidate graphs over an average score function. Despite great success in low-dimensional linear systems, it has been observed that these approaches overly exploit easier-to-fit samples, thus inevitably lear… ▽ More

    Submitted 6 March, 2023; originally announced March 2023.

    Comments: In proceedings of ICLR 2023

  26. arXiv:2302.08424  [pdf, ps, other

    cs.LG math.OC stat.ME

    From Contextual Data to Newsvendor Decisions: On the Actual Performance of Data-Driven Algorithms

    Authors: Omar Besbes, Will Ma, Omar Mouchtaki

    Abstract: In this work, we explore a framework for contextual decision-making to study how the relevance and quantity of past data affects the performance of a data-driven policy. We analyze a contextual Newsvendor problem in which a decision-maker needs to trade-off between an underage and an overage cost in the face of uncertain demand. We consider a setting in which past demands observed under ``close by… ▽ More

    Submitted 24 December, 2024; v1 submitted 16 February, 2023; originally announced February 2023.

  27. arXiv:2212.12658  [pdf, other

    cs.LG stat.ML

    Improving Uncertainty Quantification of Variance Networks by Tree-Structured Learning

    Authors: Wenxuan Ma, Xing Yan, Kun Zhang

    Abstract: To improve the uncertainty quantification of variance networks, we propose a novel tree-structured local neural network model that partitions the feature space into multiple regions based on uncertainty heterogeneity. A tree is built upon giving the training data, whose leaf nodes represent different regions where region-specific neural networks are trained to predict both the mean and the varianc… ▽ More

    Submitted 19 July, 2023; v1 submitted 24 December, 2022; originally announced December 2022.

  28. arXiv:2206.09642  [pdf, ps, other

    cs.LG math.OC stat.ML

    Beyond IID: data-driven decision-making in heterogeneous environments

    Authors: Omar Besbes, Will Ma, Omar Mouchtaki

    Abstract: How should one leverage historical data when past observations are not perfectly indicative of the future, e.g., due to the presence of unobserved confounders which one cannot "correct" for? Motivated by this question, we study a data-driven decision-making framework in which historical samples are generated from unknown and different distributions assumed to lie in a heterogeneity ball with known… ▽ More

    Submitted 1 January, 2025; v1 submitted 20 June, 2022; originally announced June 2022.

  29. arXiv:2206.02164  [pdf, other

    cs.LG cs.AI stat.ME

    Estimating and Mitigating the Congestion Effect of Curbside Pick-ups and Drop-offs: A Causal Inference Approach

    Authors: Xiaohui Liu, Sean Qian, Hock-Hai Teo, Wei Ma

    Abstract: Curb space is one of the busiest areas in urban road networks. Especially in recent years, the rapid increase of ride-hailing trips and commercial deliveries has induced massive pick-ups/drop-offs (PUDOs), which occupy the limited curb space that was designed and built decades ago. These PUDOs could jam curbside utilization and disturb the mainline traffic flow, evidently leading to significant ne… ▽ More

    Submitted 2 January, 2024; v1 submitted 5 June, 2022; originally announced June 2022.

    Comments: Accepted at Transportation Science

  30. arXiv:2203.03965  [pdf, other

    cs.LG stat.AP

    Few-Sample Traffic Prediction with Graph Networks using Locale as Relational Inductive Biases

    Authors: Mingxi Li, Yihong Tang, Wei Ma

    Abstract: Accurate short-term traffic prediction plays a pivotal role in various smart mobility operation and management systems. Currently, most of the state-of-the-art prediction models are based on graph neural networks (GNNs), and the required training samples are proportional to the size of the traffic network. In many cities, the available amount of traffic data is substantially below the minimum requ… ▽ More

    Submitted 10 November, 2022; v1 submitted 8 March, 2022; originally announced March 2022.

  31. arXiv:2202.01858  [pdf, other

    stat.ML cs.LG

    Modeling unknown dynamical systems with hidden parameters

    Authors: Xiaohan Fu, Weize Mao, Lo-Bin Chang, Dongbin Xiu

    Abstract: We present a data-driven numerical approach for modeling unknown dynamical systems with missing/hidden parameters. The method is based on training a deep neural network (DNN) model for the unknown system using its trajectory data. A key feature is that the unknown dynamical system contains system parameters that are completely hidden, in the sense that no information about the parameters is availa… ▽ More

    Submitted 3 February, 2022; originally announced February 2022.

  32. arXiv:2106.14177  [pdf, ps, other

    eess.SP stat.ML

    On Hyperspectral Unmixing

    Authors: Wing-Kin Ma

    Abstract: In this article the author reviews José Bioucas-Dias' key contributions to hyperspectral unmixing (HU), in memory of him as an influential scholar and for his many beautiful ideas introduced to the hyperspectral community. Our story will start with vertex component analysis (VCA) -- one of the most celebrated HU algorithms, with more than 2,000 Google Scholar citations. VCA was pioneering, invente… ▽ More

    Submitted 27 June, 2021; originally announced June 2021.

    Comments: to appear in IGARSS 2021, Special Session on "The Contributions of José Manuel Bioucas-Dias to Remote Sensing Data Processing"

  33. A Deep Latent Space Model for Graph Representation Learning

    Authors: Hanxuan Yang, Qingchao Kong, Wenji Mao

    Abstract: Graph representation learning is a fundamental problem for modeling relational data and benefits a number of downstream applications. Traditional Bayesian-based graph models and recent deep learning based GNN either suffer from impracticability or lack interpretability, thus combined models for undirected graphs have been proposed to overcome the weaknesses. As a large portion of real-world graphs… ▽ More

    Submitted 22 June, 2021; originally announced June 2021.

    Journal ref: Neurocomputing, 576 (2024) 127342

  34. Probabilistic Simplex Component Analysis

    Authors: Ruiyuan Wu, Wing-Kin Ma, Yuening Li, Anthony Man-Cho So, Nicholas D. Sidiropoulos

    Abstract: This study presents PRISM, a probabilistic simplex component analysis approach to identifying the vertices of a data-circumscribing simplex from data. The problem has a rich variety of applications, the most notable being hyperspectral unmixing in remote sensing and non-negative matrix factorization in machine learning. PRISM uses a simple probabilistic model, namely, uniform simplex data distribu… ▽ More

    Submitted 20 January, 2022; v1 submitted 18 March, 2021; originally announced March 2021.

  35. arXiv:2101.06742  [pdf, other

    cs.CV cs.AI cs.LG cs.RO stat.ML

    Deep Parametric Continuous Convolutional Neural Networks

    Authors: Shenlong Wang, Simon Suo, Wei-Chiu Ma, Andrei Pokrovsky, Raquel Urtasun

    Abstract: Standard convolutional neural networks assume a grid structured input is available and exploit discrete convolutions as their fundamental building blocks. This limits their applicability to many real-world applications. In this paper we propose Parametric Continuous Convolution, a new learnable operator that operates over non-grid structured data. The key idea is to exploit parameterized kernel fu… ▽ More

    Submitted 17 January, 2021; originally announced January 2021.

    Comments: Accepted by CVPR 2018

  36. arXiv:2011.09734  [pdf, ps, other

    stat.ME math.ST

    A general theory of regression adjustment for covariate-adaptive randomization: OLS, Lasso, and beyond

    Authors: Hanzhong Liu, Fuyi Tu, Wei Ma

    Abstract: We consider the problem of estimating and inferring treatment effects in randomized experiments. In practice, stratified randomization, or more generally, covariate-adaptive randomization, is routinely used in the design stage to balance the treatment allocations with respect to a few variables that are most relevant to the outcomes. Then, regression is performed in the analysis stage to adjust th… ▽ More

    Submitted 19 November, 2020; originally announced November 2020.

    Journal ref: Biometrika, asac036, 2022

  37. arXiv:2010.03161  [pdf, other

    cs.LG cs.AI stat.ML

    Model-Free Non-Stationary RL: Near-Optimal Regret and Applications in Multi-Agent RL and Inventory Control

    Authors: Weichao Mao, Kaiqing Zhang, Ruihao Zhu, David Simchi-Levi, Tamer Başar

    Abstract: We consider model-free reinforcement learning (RL) in non-stationary Markov decision processes. Both the reward functions and the state transition functions are allowed to vary arbitrarily over time as long as their cumulative variations do not exceed certain variation budgets. We propose Restarted Q-Learning with Upper Confidence Bounds (RestartQ-UCB), the first model-free algorithm for non-stati… ▽ More

    Submitted 19 August, 2022; v1 submitted 7 October, 2020; originally announced October 2020.

    Comments: A preliminary version of this work has appeared in ICML 2021

  38. Testing for Treatment Effect in Covariate-Adaptive Randomized Clinical Trials with Generalized Linear Models and Omitted Covariates

    Authors: Li Yang, Wei Ma, Yichen Qin, Feifang Hu

    Abstract: Concerns have been expressed over the validity of statistical inference under covariate-adaptive randomization despite the extensive use in clinical trials. In the literature, the inferential properties under covariate-adaptive randomization have been mainly studied for continuous responses; in particular, it is well known that the usual two sample t-test for treatment effect is typically conserva… ▽ More

    Submitted 2 May, 2021; v1 submitted 9 September, 2020; originally announced September 2020.

    Comments: Updated to the published version

    Journal ref: Statistical Methods in Medical Research 30, no. 9 (2021): 2148-2164

  39. Regression analysis for covariate-adaptive randomization: A robust and efficient inference perspective

    Authors: Wei Ma, Fuyi Tu, Hanzhong Liu

    Abstract: Linear regression is arguably the most fundamental statistical model; however, the validity of its use in randomized clinical trials, despite being common practice, has never been crystal clear, particularly when stratified or covariate-adaptive randomization is used. In this paper, we investigate several of the most intuitive and commonly used regression models for estimating and inferring the tr… ▽ More

    Submitted 4 September, 2020; originally announced September 2020.

    Journal ref: Statistics in Medicine 41, no. 29 (2022): 5645-5661

  40. arXiv:2008.09514  [pdf, other

    cs.LG cs.AI cs.IR cs.LO stat.ML

    Neural Logic Reasoning

    Authors: Shaoyun Shi, Hanxiong Chen, Weizhi Ma, Jiaxin Mao, Min Zhang, Yongfeng Zhang

    Abstract: Recent years have witnessed the success of deep neural networks in many research areas. The fundamental idea behind the design of most neural networks is to learn similarity patterns from data for prediction and inference, which lacks the ability of cognitive reasoning. However, the concrete ability of reasoning is critical to many theoretical and practical problems. On the other hand, traditional… ▽ More

    Submitted 20 August, 2020; originally announced August 2020.

    Comments: Accepted to ACM CIKM 2020. arXiv admin note: substantial text overlap with arXiv:1910.08629

  41. arXiv:2006.14901  [pdf, other

    math.OC cs.LG eess.SP stat.ML

    Understanding Notions of Stationarity in Non-Smooth Optimization

    Authors: Jiajin Li, Anthony Man-Cho So, Wing-Kin Ma

    Abstract: Many contemporary applications in signal processing and machine learning give rise to structured non-convex non-smooth optimization problems that can often be tackled by simple iterative methods quite effectively. One of the keys to understanding such a phenomenon---and, in fact, one of the very difficult conundrums even for experts---lie in the study of "stationary points" of the problem in quest… ▽ More

    Submitted 26 June, 2020; originally announced June 2020.

    Comments: Accepted for publication in IEEE Signal Processing Magazine, 2020

  42. arXiv:2006.14076  [pdf, other

    cs.LG stat.ML

    The Convex Relaxation Barrier, Revisited: Tightened Single-Neuron Relaxations for Neural Network Verification

    Authors: Christian Tjandraatmadja, Ross Anderson, Joey Huchette, Will Ma, Krunal Patel, Juan Pablo Vielma

    Abstract: We improve the effectiveness of propagation- and linear-optimization-based neural network verification algorithms with a new tightened convex relaxation for ReLU neurons. Unlike previous single-neuron relaxations which focus only on the univariate input space of the ReLU, our method considers the multivariate input space of the affine pre-activation function preceding the ReLU. Using results from… ▽ More

    Submitted 22 October, 2020; v1 submitted 24 June, 2020; originally announced June 2020.

    MSC Class: 68T07

  43. arXiv:2002.07345  [pdf, other

    math.OC cs.LG stat.ML

    A Distributionally Robust Area Under Curve Maximization Model

    Authors: Wenbo Ma, Miguel A. Lejeune

    Abstract: Area under ROC curve (AUC) is a widely used performance measure for classification models. We propose two new distributionally robust AUC maximization models (DR-AUC) that rely on the Kantorovich metric and approximate the AUC with the hinge loss function. We consider the two cases with respectively fixed and variable support for the worst-case distribution. We use duality theory to reformulate th… ▽ More

    Submitted 7 May, 2020; v1 submitted 17 February, 2020; originally announced February 2020.

    Journal ref: Operations Research Letters, Volume 48, Issue 4, July 2020, Pages 460-466

  44. arXiv:2001.03985  [pdf, other

    cs.LG q-bio.NC q-bio.QM stat.CO stat.ME stat.ML

    Unbiased and Efficient Log-Likelihood Estimation with Inverse Binomial Sampling

    Authors: Bas van Opheusden, Luigi Acerbi, Wei Ji Ma

    Abstract: The fate of scientific hypotheses often relies on the ability of a computational model to explain the data, quantified in modern statistical approaches by the likelihood function. The log-likelihood is the key element for parameter estimation and model evaluation. However, the log-likelihood of complex models in fields such as computational biology and neuroscience is often intractable to compute… ▽ More

    Submitted 27 October, 2020; v1 submitted 12 January, 2020; originally announced January 2020.

    Comments: Bas van Opheusden and Luigi Acerbi contributed equally to this work

  45. arXiv:1912.00295   

    stat.ME stat.AP

    Efficient Estimation of Mixture Cure Frailty Model for Clustered Current Status Data

    Authors: Tong Wang, Kejun He, Wei Ma, Dipankar Bandyopadhyay, Samiran Sinha

    Abstract: Current status data abounds in the field of epidemiology and public health, where the only observable data for a subject is the random inspection time and the event status at inspection. Motivated by such a current status data from a periodontal study where data are inherently clustered, we propose a unified methodology to analyze such complex data. We allow the time-to-event to follow the semipar… ▽ More

    Submitted 23 April, 2020; v1 submitted 30 November, 2019; originally announced December 2019.

    Comments: Unstable EM algorithm due to limited information in current status data

  46. arXiv:1911.10658  [pdf, other

    cs.LG stat.ML

    Projective Quadratic Regression for Online Learning

    Authors: Wenye Ma

    Abstract: This paper considers online convex optimization (OCO) problems - the paramount framework for online learning algorithm design. The loss function of learning task in OCO setting is based on streaming data so that OCO is a powerful tool to model large scale applications such as online recommender systems. Meanwhile, real-world data are usually of extreme high-dimensional due to modern feature engine… ▽ More

    Submitted 24 November, 2019; originally announced November 2019.

    Comments: AAAI 2020

  47. arXiv:1910.12774  [pdf, other

    stat.ML cs.LG

    Missing Not at Random in Matrix Completion: The Effectiveness of Estimating Missingness Probabilities Under a Low Nuclear Norm Assumption

    Authors: Wei Ma, George H. Chen

    Abstract: Matrix completion is often applied to data with entries missing not at random (MNAR). For example, consider a recommendation system where users tend to only reveal ratings for items they like. In this case, a matrix completion method that relies on entries being revealed at uniformly sampled row and column indices can yield overly optimistic predictions of unseen user ratings. Recently, various pa… ▽ More

    Submitted 29 October, 2019; v1 submitted 28 October, 2019; originally announced October 2019.

    Comments: Advances in Neural Information Processing Systems (NeurIPS 2019)

  48. arXiv:1910.09090  [pdf

    cs.LG cs.CV stat.ML

    A game method for improving the interpretability of convolution neural network

    Authors: Jinwei Zhao, Qizhou Wang, Fuqiang Zhang, Wanli Qiu, Yufei Wang, Yu Liu, Guo Xie, Weigang Ma, Bin Wang, Xinhong Hei

    Abstract: Real artificial intelligence always has been focused on by many machine learning researchers, especially in the area of deep learning. However deep neural network is hard to be understood and explained, and sometimes, even metaphysics. The reason is, we believe that: the network is essentially a perceptual model. Therefore, we believe that in order to complete complex intelligent activities from s… ▽ More

    Submitted 20 October, 2019; originally announced October 2019.

  49. arXiv:1908.01580  [pdf, other

    cs.LG stat.ML

    The HSIC Bottleneck: Deep Learning without Back-Propagation

    Authors: Wan-Duo Kurt Ma, J. P. Lewis, W. Bastiaan Kleijn

    Abstract: We introduce the HSIC (Hilbert-Schmidt independence criterion) bottleneck for training deep neural networks. The HSIC bottleneck is an alternative to the conventional cross-entropy loss and backpropagation that has a number of distinct advantages. It mitigates exploding and vanishing gradients, resulting in the ability to learn very deep networks without skip connections. There is no requirement f… ▽ More

    Submitted 5 December, 2019; v1 submitted 5 August, 2019; originally announced August 2019.

  50. arXiv:1907.01723  [pdf

    stat.ML cs.LG stat.AP

    Towards Interpretable Deep Extreme Multi-label Learning

    Authors: Yihuang Kang, I-Ling Cheng, Wenjui Mao, Bowen Kuo, Pei-Ju Lee

    Abstract: Many Machine Learning algorithms, such as deep neural networks, have long been criticized for being "black-boxes"-a kind of models unable to provide how it arrive at a decision without further efforts to interpret. This problem has raised concerns on model applications' trust, safety, nondiscrimination, and other ethical issues. In this paper, we discuss the machine learning interpretability of a… ▽ More

    Submitted 2 July, 2019; originally announced July 2019.

    Comments: 6 pages