Skip to main content

Showing 1–50 of 76 results for author: Yang, W

Searching in archive stat. Search in all archives.
.
  1. arXiv:2506.21154  [pdf, ps, other

    stat.ME cs.AI cs.LG

    Transformer-Based Spatial-Temporal Counterfactual Outcomes Estimation

    Authors: He Li, Haoang Chi, Mingyu Liu, Wanrong Huang, Liyang Xu, Wenjing Yang

    Abstract: The real world naturally has dimensions of time and space. Therefore, estimating the counterfactual outcomes with spatial-temporal attributes is a crucial problem. However, previous methods are based on classical statistical models, which still have limitations in performance and generalization. This paper proposes a novel framework for estimating counterfactual outcomes with spatial-temporal attr… ▽ More

    Submitted 26 June, 2025; originally announced June 2025.

    Comments: 24 pages, accepted at ICML 2025

  2. arXiv:2505.19491  [pdf, ps, other

    cs.LG stat.ML

    Discounted Online Convex Optimization: Uniform Regret Across a Continuous Interval

    Authors: Wenhao Yang, Sifan Yang, Lijun Zhang

    Abstract: Reflecting the greater significance of recent history over the distant past in non-stationary environments, $λ$-discounted regret has been introduced in online convex optimization (OCO) to gracefully forget past data as new information arrives. When the discount factor $λ$ is given, online gradient descent with an appropriate step size achieves an $O(1/\sqrt{1-λ})$ discounted regret. However, the… ▽ More

    Submitted 26 May, 2025; originally announced May 2025.

  3. arXiv:2505.02020  [pdf, other

    cs.LG cs.AI stat.ML

    Wide & Deep Learning for Node Classification

    Authors: Yancheng Chen, Wenguo Yang, Zhipeng Jiang

    Abstract: Wide & Deep, a simple yet effective learning architecture for recommendation systems developed by Google, has had a significant impact in both academia and industry due to its combination of the memorization ability of generalized linear models and the generalization ability of deep models. Graph convolutional networks (GCNs) remain dominant in node classification tasks; however, recent studies ha… ▽ More

    Submitted 4 May, 2025; originally announced May 2025.

    Comments: 16 pages, 6 figures, 13 tables

  4. arXiv:2503.00383  [pdf, other

    cs.LG cs.AI stat.ML

    Theoretical Insights in Model Inversion Robustness and Conditional Entropy Maximization for Collaborative Inference Systems

    Authors: Song Xia, Yi Yu, Wenhan Yang, Meiwen Ding, Zhuo Chen, Ling-Yu Duan, Alex C. Kot, Xudong Jiang

    Abstract: By locally encoding raw data into intermediate features, collaborative inference enables end users to leverage powerful deep learning models without exposure of sensitive raw data to cloud servers. However, recent studies have revealed that these intermediate features may not sufficiently preserve privacy, as information can be leaked and raw data can be reconstructed via model inversion attacks (… ▽ More

    Submitted 3 April, 2025; v1 submitted 1 March, 2025; originally announced March 2025.

    Comments: accepted by CVPR2025

  5. arXiv:2502.01458  [pdf, ps, other

    cs.LG stat.ML

    The Capabilities and Limitations of Weak-to-Strong Generalization: Generalization and Calibration

    Authors: Wei Yao, Wenkai Yang, Gengze Xu, Ziqiao Wang, Yankai Lin, Yong Liu

    Abstract: Weak-to-strong generalization, where weakly supervised strong models outperform their weaker teachers, offers a promising approach to aligning superhuman models with human values. To deepen the understanding of this approach, we provide theoretical insights into its capabilities and limitations. First, in the classification setting, we establish upper and lower generalization error bounds for the… ▽ More

    Submitted 3 June, 2025; v1 submitted 3 February, 2025; originally announced February 2025.

  6. arXiv:2412.02090  [pdf, other

    stat.ML cs.LG physics.data-an

    MEP-Net: Generating Solutions to Scientific Problems with Limited Knowledge by Maximum Entropy Principle

    Authors: Wuyue Yang, Liangrong Peng, Guojie Li, Liu Hong

    Abstract: Maximum entropy principle (MEP) offers an effective and unbiased approach to inferring unknown probability distributions when faced with incomplete information, while neural networks provide the flexibility to learn complex distributions from data. This paper proposes a novel neural network architecture, the MEP-Net, which combines the MEP with neural networks to generate probability distributions… ▽ More

    Submitted 2 December, 2024; originally announced December 2024.

    Comments: 35 pages, 6 figures, 2 tables

  7. arXiv:2410.24094  [pdf, ps, other

    stat.ME

    Adaptive Sphericity Tests for High Dimensional Data

    Authors: Ping Zhao, Wenwan Yang, Long Feng, Zhaojun Wang

    Abstract: In this paper, we investigate sphericity testing in high-dimensional settings, where existing methods primarily rely on sum-type test procedures that often underperform under sparse alternatives. To address this limitation, we propose two max-type test procedures utilizing the sample covariance matrix and the sample spatial-sign covariance matrix, respectively. Furthermore, we introduce two Cauchy… ▽ More

    Submitted 31 October, 2024; originally announced October 2024.

  8. arXiv:2410.19300  [pdf, other

    cs.LG stat.ME

    Golden Ratio-Based Sufficient Dimension Reduction

    Authors: Wenjing Yang, Yuhong Yang

    Abstract: Many machine learning applications deal with high dimensional data. To make computations feasible and learning more efficient, it is often desirable to reduce the dimensionality of the input variables by finding linear combinations of the predictors that can retain as much original information as possible in the relationship between the response and the original predictors. We propose a neural net… ▽ More

    Submitted 29 January, 2025; v1 submitted 25 October, 2024; originally announced October 2024.

  9. arXiv:2410.16340  [pdf, other

    stat.ML cs.LG math.PR

    Limit Theorems for Stochastic Gradient Descent with Infinite Variance

    Authors: Jose Blanchet, Aleksandar Mijatović, Wenhao Yang

    Abstract: Stochastic gradient descent is a classic algorithm that has gained great popularity especially in the last decades as the most common approach for training models in machine learning. While the algorithm has been well-studied when stochastic gradients are assumed to have a finite variance, there is significantly less research addressing its theoretical properties in the case of infinite variance g… ▽ More

    Submitted 5 December, 2024; v1 submitted 21 October, 2024; originally announced October 2024.

  10. arXiv:2410.07130  [pdf

    cs.CE stat.AP

    Analysis of vessel traffic flow characteristics in inland restricted waterways using multi-source data

    Authors: Wenzhang Yang, Peng Liao, Shangkun Jiang, Hao Wang

    Abstract: To effectively manage vessel traffic and alleviate congestion on busy inland waterways, a comprehensive understanding of vessel traffic flow characteristics is crucial. However, limited data availability has resulted in minimal research on the traffic flow characteristics of inland waterway vessels. This study addresses this gap by conducting vessel-following experiments and fixed-point video moni… ▽ More

    Submitted 21 September, 2024; originally announced October 2024.

  11. arXiv:2409.04933   

    stat.ME

    Marginal Structural Modeling of Representative Treatment Trajectories

    Authors: Jiewen Liu, Todd A. Miano, Stephen Griffiths, Michael G. S. Shashaty, Wei Yang

    Abstract: Marginal structural models (MSMs) are widely used in observational studies to estimate the causal effect of time-varying treatments. Despite its popularity, limited attention has been paid to summarizing the treatment history in the outcome model, which proves particularly challenging when individuals' treatment trajectories exhibit complex patterns over time. Commonly used metrics such as the ave… ▽ More

    Submitted 16 September, 2024; v1 submitted 7 September, 2024; originally announced September 2024.

    Comments: We have discovered that the core idea of our paper overlaps with a previously published work. In light of this, we need to conduct a more thorough update and revision of our research before proceeding further

  12. arXiv:2408.00131  [pdf, other

    stat.ML cs.AI cs.LG q-fin.RM

    Distributionally Robust Optimization as a Scalable Framework to Characterize Extreme Value Distributions

    Authors: Patrick Kuiper, Ali Hasan, Wenhao Yang, Yuting Ng, Hoda Bidkhori, Jose Blanchet, Vahid Tarokh

    Abstract: The goal of this paper is to develop distributionally robust optimization (DRO) estimators, specifically for multidimensional Extreme Value Theory (EVT) statistics. EVT supports using semi-parametric models called max-stable distributions built from spatial Poisson point processes. While powerful, these models are only asymptotically valid for large samples. However, since extreme data is by defin… ▽ More

    Submitted 31 July, 2024; originally announced August 2024.

  13. arXiv:2312.10238  [pdf, other

    cs.LG stat.ML

    Hypothesis Testing for Class-Conditional Noise Using Local Maximum Likelihood

    Authors: Weisong Yang, Rafael Poyiadzi, Niall Twomey, Raul Santos Rodriguez

    Abstract: In supervised learning, automatically assessing the quality of the labels before any learning takes place remains an open research question. In certain particular cases, hypothesis testing procedures have been proposed to assess whether a given instance-label dataset is contaminated with class-conditional label noise, as opposed to uniform label noise. The existing theory builds on the asymptotic… ▽ More

    Submitted 15 December, 2023; originally announced December 2023.

  14. arXiv:2311.15539  [pdf

    stat.CO

    A Novel Human-Based Meta-Heuristic Algorithm: Dragon Boat Optimization

    Authors: Xiang Li, Long Lan, Husam Lahza, Shaowu Yang, Shuihua Wang, Wenjing Yang, Hengzhu Liu, Yudong Zhang

    Abstract: (Aim) Dragon Boat Racing, a popular aquatic folklore team sport, is traditionally held during the Dragon Boat Festival. Inspired by this event, we propose a novel human-based meta-heuristic algorithm called dragon boat optimization (DBO) in this paper. (Method) It models the unique behaviors of each crew member on the dragon boat during the race by introducing social psychology mechanisms (social… ▽ More

    Submitted 27 November, 2023; originally announced November 2023.

  15. arXiv:2311.08005  [pdf, other

    cs.LG cs.AI stat.ML

    Iterative missing value imputation based on feature importance

    Authors: Cong Guo, Chun Liu, Wei Yang

    Abstract: Many datasets suffer from missing values due to various reasons,which not only increases the processing difficulty of related tasks but also reduces the accuracy of classification. To address this problem, the mainstream approach is to use missing value imputation to complete the dataset. Existing imputation methods estimate the missing parts based on the observed values in the original feature sp… ▽ More

    Submitted 14 November, 2023; originally announced November 2023.

  16. arXiv:2311.04408  [pdf, other

    stat.AP

    Bayesian modelling of response to therapy and drug-sensitivity in acute lymphoblastic leukemia

    Authors: Andrea Cremaschi, Wenjian Yang, Maria De Iorio, William E. Evans, Jun J. Yang, Gary L. Rosner

    Abstract: Acute lymphoblastic leukemia (ALL) is a heterogeneous hematologic malignancy involving the abnormal proliferation of immature lymphocytes, accounting for most pediatric cancer cases. ALL management in children has seen great improvement in the last decades thanks to better understanding of the disease leading to improved treatment strategies evidenced through clinical trials. Commonly a first cour… ▽ More

    Submitted 7 November, 2023; originally announced November 2023.

  17. arXiv:2311.00878  [pdf, other

    stat.ME stat.AP

    Backward Joint Model for the Dynamic Prediction of Both Competing Risk and Longitudinal Outcomes

    Authors: Wenhao Li, Brad C. Astor, Wei Yang, Tom H. Greene, Liang Li

    Abstract: Joint modeling is a useful approach to dynamic prediction of clinical outcomes using longitudinally measured predictors. When the outcomes are competing risk events, fitting the conventional shared random effects joint model often involves intensive computation, especially when multiple longitudinal biomarkers are be used as predictors, as is often desired in prediction problems. This paper propos… ▽ More

    Submitted 30 August, 2024; v1 submitted 1 November, 2023; originally announced November 2023.

  18. arXiv:2309.17262  [pdf, other

    stat.ML cs.LG

    Estimation and Inference in Distributional Reinforcement Learning

    Authors: Liangyu Zhang, Yang Peng, Jiadong Liang, Wenhao Yang, Zhihua Zhang

    Abstract: In this paper, we study distributional reinforcement learning from the perspective of statistical efficiency. We investigate distributional policy evaluation, aiming to estimate the complete return distribution (denoted $η^π$) attained by a given policy $π$. We use the certainty-equivalence method to construct our estimator $\hatη^π$, given a generative model is available. In this circumstance we… ▽ More

    Submitted 19 September, 2024; v1 submitted 29 September, 2023; originally announced September 2023.

  19. arXiv:2309.09239  [pdf, other

    cs.LG stat.ML

    Globally Convergent Accelerated Algorithms for Multilinear Sparse Logistic Regression with $\ell_0$-constraints

    Authors: Weifeng Yang, Wenwen Min

    Abstract: Tensor data represents a multidimensional array. Regression methods based on low-rank tensor decomposition leverage structural information to reduce the parameter count. Multilinear logistic regression serves as a powerful tool for the analysis of multidimensional data. To improve its efficacy and interpretability, we present a Multilinear Sparse Logistic Regression model with $\ell_0$-constraints… ▽ More

    Submitted 17 September, 2023; originally announced September 2023.

    Comments: arXiv admin note: text overlap with arXiv:2308.12126

  20. arXiv:2309.02430  [pdf, other

    stat.AP

    A Likelihood Approach to Incorporating Self-Report Data in HIV Recency Classification

    Authors: Wenlong Yang, Danping Liu, Le Bao, Runze Li

    Abstract: Estimating new HIV infections is significant yet challenging due to the difficulty in distinguishing between recent and long-term infections. We demonstrate that HIV recency status (recent v.s. long-term) could be determined from the combination of self-report testing history and biomarkers, which are increasingly available in bio-behavioral surveys. HIV recency status is partially observed, given… ▽ More

    Submitted 12 November, 2024; v1 submitted 5 September, 2023; originally announced September 2023.

  21. arXiv:2308.07176  [pdf, ps, other

    stat.CO

    Perfect simulation from unbiased simulation

    Authors: George M. Leigh, Wen-Hsi Yang, Montana E. Wickens, Amanda R. Northrop

    Abstract: We show that any application of the technique of unbiased simulation becomes perfect simulation when coalescence of the two coupled Markov chains can be practically assured in advance. This happens when a fixed number of iterations is high enough that the probability of needing any more to achieve coalescence is negligible; we suggest a value of $10^{-20}$. This finding enormously increases the ra… ▽ More

    Submitted 14 August, 2023; originally announced August 2023.

    Comments: 17 pages, 4 figures; for associated R scripts, see https://github.com/George-Leigh/PerfectSimulation

    MSC Class: 62-08; 62F15 ACM Class: G.3; I.6.3; I.6.5; I.6.8

  22. arXiv:2305.00254  [pdf, other

    cs.LG stat.ML

    Semi-Infinitely Constrained Markov Decision Processes and Efficient Reinforcement Learning

    Authors: Liangyu Zhang, Yang Peng, Wenhao Yang, Zhihua Zhang

    Abstract: We propose a novel generalization of constrained Markov decision processes (CMDPs) that we call the \emph{semi-infinitely constrained Markov decision process} (SICMDP). Particularly, we consider a continuum of constraints instead of a finite number of constraints as in the case of ordinary CMDPs. We also devise two reinforcement learning algorithms for SICMDPs that we call SI-CRL and SI-CPO. SI-CR… ▽ More

    Submitted 29 April, 2023; originally announced May 2023.

    Comments: Shorter version accepted at NeurIPS 2022

  23. arXiv:2304.07546  [pdf, other

    stat.ME

    Tests for ultrahigh-dimensional partially linear regression models

    Authors: Hongwei Shi, Bowen Sun, Weichao Yang, Xu Guo

    Abstract: In this paper, we consider tests for ultrahigh-dimensional partially linear regression models. The presence of ultrahigh-dimensional nuisance covariates and unknown nuisance function makes the inference problem very challenging. We adopt machine learning methods to estimate the unknown nuisance function and introduce quadratic-form test statistics. Interestingly, though the machine learning method… ▽ More

    Submitted 15 April, 2023; originally announced April 2023.

  24. arXiv:2304.01849  [pdf, other

    stat.ME math.ST

    Semiparametric efficient estimation of genetic relatedness with machine learning methods

    Authors: Xu Guo, Yiyuan Qian, Hongwei Shi, Weichao Yang, Niwen Zhou

    Abstract: In this paper, we propose semiparametric efficient estimators of genetic relatedness between two traits in a model-free framework. Most existing methods require specifying certain parametric models involving the traits and genetic variants. However, the bias due to model misspecification may yield misleading statistical results. Moreover, the semiparametric efficient bounds for estimators of genet… ▽ More

    Submitted 2 June, 2023; v1 submitted 4 April, 2023; originally announced April 2023.

    Comments: 46pages,9 tables, 1 figure

  25. arXiv:2303.16073  [pdf, other

    stat.AP

    Capturing episodic impacts of environmental signals

    Authors: Manuela Mendiolar, Jerzy A. Filar, Wen-Hsi Yang, Susannah Leahy, Anthony Courtney

    Abstract: Environmental scientists frequently rely on time series of explanatory variables to explain their impact on an important response variable. However, sometimes, researchers are less interested in raw observations of an explanatory variable than in derived indices induced by episodes embedded in its time series. Often these episodes are intermittent, occur within a specific limited memory, persist f… ▽ More

    Submitted 24 March, 2023; originally announced March 2023.

    Comments: 27 pages

  26. arXiv:2303.07259  [pdf, other

    stat.ME

    A novel approach of empirical likelihood with massive data

    Authors: Yang Liu, Xia Chen, Wei-min Yang

    Abstract: In this paper, we propose a novel approach for tackling the obstacles of empirical likelihood in the face of massive data, which is called split sample mean empirical likelihood (SSMEL), our approach provides a unique perspective for solving big data problems. We show that the SSMEL estimator has the same estimation efficiency as the empirical likelihood estimator with the full dataset, and mainta… ▽ More

    Submitted 5 June, 2023; v1 submitted 13 March, 2023; originally announced March 2023.

  27. Discovering a change point and piecewise linear structure in a time series of organoid networks via the iso-mirror

    Authors: Tianyi Chen, Youngser Park, Ali Saad-Eldin, Zachary Lubberts, Avanti Athreya, Benjamin D. Pedigo, Joshua T. Vogelstein, Francesca Puppo, Gabriel A. Silva, Alysson R. Muotri, Weiwei Yang, Christopher M. White, Carey E. Priebe

    Abstract: Recent advancements have been made in the development of cell-based in-vitro neuronal networks, or organoids. In order to better understand the network structure of these organoids, a super-selective algorithm has been proposed for inferring the effective connectivity networks from multi-electrode array data. In this paper, we apply a novel statistical method called spectral mirror estimation to t… ▽ More

    Submitted 12 April, 2023; v1 submitted 8 March, 2023; originally announced March 2023.

    Journal ref: Appl Netw Sci 8, 75 (2023)

  28. arXiv:2302.14186  [pdf, other

    eess.SP cs.LG stat.AP stat.ME stat.ML

    Approximately optimal domain adaptation with Fisher's Linear Discriminant

    Authors: Hayden S. Helm, Ashwin De Silva, Joshua T. Vogelstein, Carey E. Priebe, Weiwei Yang

    Abstract: We propose a class of models based on Fisher's Linear Discriminant (FLD) in the context of domain adaptation. The class is the convex combination of two hypotheses: i) an average hypothesis representing previously seen source tasks and ii) a hypothesis trained on a new target task. For a particular generative setting we derive the optimal convex combination of the two models under 0-1 loss, propos… ▽ More

    Submitted 1 March, 2024; v1 submitted 27 February, 2023; originally announced February 2023.

  29. arXiv:2302.01248  [pdf, other

    stat.ML cs.LG

    Robust Markov Decision Processes without Model Estimation

    Authors: Wenhao Yang, Han Wang, Tadashi Kozuno, Scott M. Jordan, Zhihua Zhang

    Abstract: Robust Markov Decision Processes (MDPs) are receiving much attention in learning a robust policy which is less sensitive to environment changes. There are an increasing number of works analyzing sample-efficiency of robust MDPs. However, there are two major barriers to applying robust MDPs in practice. First, most works study robust MDPs in a model-based regime, where the transition probability ne… ▽ More

    Submitted 12 September, 2023; v1 submitted 2 February, 2023; originally announced February 2023.

  30. arXiv:2212.08446  [pdf, other

    stat.ME

    Score function-based tests for ultrahigh-dimensional linear models

    Authors: Weichao Yang, Xu Guo, Lixing Zhu

    Abstract: In this paper, we investigate score function-based tests to check the significance of an ultrahigh-dimensional sub-vector of the model coefficients when the nuisance parameter vector is also ultrahigh-dimensional in linear models. We first reanalyze and extend a recently proposed score function-based test to derive, under weaker conditions, its limiting distributions under the null and local alter… ▽ More

    Submitted 9 November, 2024; v1 submitted 16 December, 2022; originally announced December 2022.

    MSC Class: Primary 62F03; secondary 62H15

  31. arXiv:2210.11540  [pdf

    stat.AP

    Inference and Prediction Using Functional Principal Components Analysis: Application to Diabetic Kidney Disease Progression in the Chronic Renal Insufficiency Cohort (CRIC) Study

    Authors: Brian Kwan, Wei Yang, Daniel Montemayor, Jing Zhang, Tobias Fuhrer, Amanda H. Anderson, Cheryl A. M. Anderson, Jing Chen, Ana C. Ricardo, Sylvia E. Rosas, Loki Natarajan, the CRIC Study Investigators

    Abstract: Repeated longitudinal measurements are commonly used to model long-term disease progression, and timing and number of assessments per patient may vary, leading to irregularly spaced and sparse data. Longitudinal trajectories may exhibit curvilinear patterns, in which mixed linear regression methods may fail to capture true trends in the data. We applied functional principal components analysis to… ▽ More

    Submitted 20 October, 2022; originally announced October 2022.

  32. arXiv:2209.05186  [pdf, ps, other

    stat.ML cs.LG

    Statistical Estimation of Confounded Linear MDPs: An Instrumental Variable Approach

    Authors: Miao Lu, Wenhao Yang, Liangyu Zhang, Zhihua Zhang

    Abstract: In an Markov decision process (MDP), unobservable confounders may exist and have impacts on the data generating process, so that the classic off-policy evaluation (OPE) estimators may fail to identify the true value function of the target policy. In this paper, we study the statistical properties of OPE in confounded MDPs with observable instrumental variables. Specifically, we propose a two-stage… ▽ More

    Submitted 12 September, 2022; originally announced September 2022.

  33. arXiv:2206.12280  [pdf, other

    stat.ME

    Bayesian Circular Lattice Filters for Computationally Efficient Estimation of Multivariate Time-Varying Autoregressive Models

    Authors: Yuelei Sui, Scott H. Holan, Wen-Hsi Yang

    Abstract: Nonstationary time series data exist in various scientific disciplines, including environmental science, biology, signal processing, econometrics, among others. Many Bayesian models have been developed to handle nonstationary time series. The time-varying vector autoregressive (TV-VAR) model is a well-established model for multivariate nonstationary time series. Nevertheless, in most cases, the la… ▽ More

    Submitted 24 June, 2022; originally announced June 2022.

  34. arXiv:2206.05604  [pdf, ps, other

    stat.ML cs.LG math.ST

    A Theoretical Understanding of Neural Network Compression from Sparse Linear Approximation

    Authors: Wenjing Yang, Ganghua Wang, Jie Ding, Yuhong Yang

    Abstract: The goal of model compression is to reduce the size of a large neural network while retaining a comparable performance. As a result, computation and memory costs in resource-limited applications may be significantly reduced by dropping redundant weights, neurons, or layers. There have been many model compression algorithms proposed that provide impressive empirical success. However, a theoretical… ▽ More

    Submitted 8 November, 2022; v1 submitted 11 June, 2022; originally announced June 2022.

  35. arXiv:2205.14211  [pdf, other

    cs.LG cs.AI stat.ML

    KL-Entropy-Regularized RL with a Generative Model is Minimax Optimal

    Authors: Tadashi Kozuno, Wenhao Yang, Nino Vieillard, Toshinori Kitamura, Yunhao Tang, Jincheng Mei, Pierre Ménard, Mohammad Gheshlaghi Azar, Michal Valko, Rémi Munos, Olivier Pietquin, Matthieu Geist, Csaba Szepesvári

    Abstract: In this work, we consider and analyze the sample complexity of model-free reinforcement learning with a generative model. Particularly, we analyze mirror descent value iteration (MDVI) by Geist et al. (2019) and Vieillard et al. (2020a), which uses the Kullback-Leibler divergence and entropy regularization in its value and policy updates. Our analysis shows that it is nearly minimax-optimal for fi… ▽ More

    Submitted 27 May, 2022; originally announced May 2022.

    Comments: 29 pages, 6 figures

  36. arXiv:2204.02634  [pdf, other

    cs.LG stat.ML

    Federated Reinforcement Learning with Environment Heterogeneity

    Authors: Hao Jin, Yang Peng, Wenhao Yang, Shusen Wang, Zhihua Zhang

    Abstract: We study a Federated Reinforcement Learning (FedRL) problem in which $n$ agents collaboratively learn a single policy without sharing the trajectories they collected during agent-environment interaction. We stress the constraint of environment heterogeneity, which means $n$ environments corresponding to these $n$ agents have different state transitions. To obtain a value function or a policy funct… ▽ More

    Submitted 6 April, 2022; originally announced April 2022.

    Comments: Artificial Intelligence and Statistics 2022

  37. arXiv:2203.00516  [pdf, other

    eess.SP cs.LG stat.ME

    Mental State Classification Using Multi-graph Features

    Authors: Guodong Chen, Hayden S. Helm, Kate Lytvynets, Weiwei Yang, Carey E. Priebe

    Abstract: We consider the problem of extracting features from passive, multi-channel electroencephalogram (EEG) devices for downstream inference tasks related to high-level mental states such as stress and cognitive load. Our proposed method leverages recently developed multi-graph tools and applies them to the time series of graphs implied by the statistical dependence structure (e.g., correlation) amongst… ▽ More

    Submitted 25 February, 2022; originally announced March 2022.

  38. arXiv:2112.14582  [pdf, other

    stat.ML cs.LG

    A Statistical Analysis of Polyak-Ruppert Averaged Q-learning

    Authors: Xiang Li, Wenhao Yang, Jiadong Liang, Zhihua Zhang, Michael I. Jordan

    Abstract: We study Q-learning with Polyak-Ruppert averaging in a discounted Markov decision process in synchronous and tabular settings. Under a Lipschitz condition, we establish a functional central limit theorem for the averaged iteration $\bar{\boldsymbol{Q}}_T$ and show that its standardized partial-sum process converges weakly to a rescaled Brownian motion. The functional central limit theorem implies… ▽ More

    Submitted 19 February, 2023; v1 submitted 29 December, 2021; originally announced December 2021.

    Comments: Accepted by AISTATS 2023

  39. arXiv:2109.09265  [pdf, other

    cs.LG cs.MS stat.ML

    Merlion: A Machine Learning Library for Time Series

    Authors: Aadyot Bhatnagar, Paul Kassianik, Chenghao Liu, Tian Lan, Wenzhuo Yang, Rowan Cassius, Doyen Sahoo, Devansh Arpit, Sri Subramanian, Gerald Woo, Amrita Saha, Arun Kumar Jagota, Gokulakrishnan Gopalakrishnan, Manpreet Singh, K C Krithika, Sukumar Maddineni, Daeki Cho, Bo Zong, Yingbo Zhou, Caiming Xiong, Silvio Savarese, Steven Hoi, Huan Wang

    Abstract: We introduce Merlion, an open-source machine learning library for time series. It features a unified interface for many commonly used models and datasets for anomaly detection and forecasting on both univariate and multivariate time series, along with standard pre/post-processing layers. It has several modules to improve ease-of-use, including visualization, anomaly score calibration to improve in… ▽ More

    Submitted 19 September, 2021; originally announced September 2021.

    Comments: 22 pages, 1 figure, 14 tables

  40. arXiv:2107.03342  [pdf, other

    cs.LG stat.ML

    A Survey of Uncertainty in Deep Neural Networks

    Authors: Jakob Gawlikowski, Cedrique Rovile Njieutcheu Tassi, Mohsin Ali, Jongseok Lee, Matthias Humt, Jianxiang Feng, Anna Kruspe, Rudolph Triebel, Peter Jung, Ribana Roscher, Muhammad Shahzad, Wen Yang, Richard Bamler, Xiao Xiang Zhu

    Abstract: Due to their increasing spread, confidence in neural network predictions became more and more important. However, basic neural networks do not deliver certainty estimates or suffer from over or under confidence. Many researchers have been working on understanding and quantifying uncertainty in a neural network's prediction. As a result, different types and sources of uncertainty have been identifi… ▽ More

    Submitted 18 January, 2022; v1 submitted 7 July, 2021; originally announced July 2021.

  41. arXiv:2106.12621  [pdf, other

    cs.LG cs.IR stat.ME

    Leveraging semantically similar queries for ranking via combining representations

    Authors: Hayden S. Helm, Marah Abdin, Benjamin D. Pedigo, Shweti Mahajan, Vince Lyzinski, Youngser Park, Amitabh Basu, Piali~Choudhury, Christopher M. White, Weiwei Yang, Carey E. Priebe

    Abstract: In modern ranking problems, different and disparate representations of the items to be ranked are often available. It is sensible, then, to try to combine these representations to improve ranking. Indeed, learning to rank via combining representations is both principled and practical for learning a ranking function for a particular query. In extremely data-scarce settings, however, the amount of l… ▽ More

    Submitted 23 June, 2021; originally announced June 2021.

  42. arXiv:2105.03863  [pdf, other

    stat.ML cs.LG

    Towards Theoretical Understandings of Robust Markov Decision Processes: Sample Complexity and Asymptotics

    Authors: Wenhao Yang, Liangyu Zhang, Zhihua Zhang

    Abstract: In this paper, we study the non-asymptotic and asymptotic performances of the optimal robust policy and value function of robust Markov Decision Processes(MDPs), where the optimal robust policy and value function are solved only from a generative model. While prior work focusing on non-asymptotic performances of robust MDPs is restricted in the setting of the KL uncertainty set and $(s,a)$-rectang… ▽ More

    Submitted 12 August, 2022; v1 submitted 9 May, 2021; originally announced May 2021.

  43. arXiv:2103.12946  [pdf, other

    stat.ME math.ST stat.CO

    Envelope Methods with Ignorable Missing Data

    Authors: Linquan Ma, Lan Liu, Wei Yang

    Abstract: Envelope method was recently proposed as a method to reduce the dimension of responses in multivariate regressions. However, when there exists missing data, the envelope method using the complete case observations may lead to biased and inefficient results. In this paper, we generalize the envelope estimation when the predictors and/or the responses are missing at random. Specifically, we incorpor… ▽ More

    Submitted 23 March, 2021; originally announced March 2021.

  44. arXiv:2102.10263  [pdf, other

    stat.ML cs.LG stat.ME

    Inducing a hierarchy for multi-class classification problems

    Authors: Hayden S. Helm, Weiwei Yang, Sujeeth Bharadwaj, Kate Lytvynets, Oriana Riva, Christopher White, Ali Geisa, Carey E. Priebe

    Abstract: In applications where categorical labels follow a natural hierarchy, classification methods that exploit the label structure often outperform those that do not. Un-fortunately, the majority of classification datasets do not come pre-equipped with a hierarchical structure and classical flat classifiers must be employed. In this paper, we investigate a class of methods that induce a hierarchy that c… ▽ More

    Submitted 20 February, 2021; originally announced February 2021.

  45. Seasonal association between viral causes of hospitalised acute lower respiratory infections and meteorological factors in China: a retrospective study

    Authors: Bing Xu, Jinfeng Wang, Zhongjie Li, Chengdong Xu, Yilan Liao, Maogui Hu, Jing Yang, Shengjie Lai, Liping Wang, Weizhong Yang

    Abstract: Acute lower respiratory infections caused by respiratory viruses are common and persistent infectious diseases worldwide and in China, which have pronounced seasonal patterns. Meteorological factors have important roles in the seasonality of some major viruses. Our aim was to identify the dominant meteorological factors and to model their effects on common respiratory viruses in different regions… ▽ More

    Submitted 15 April, 2021; v1 submitted 30 November, 2020; originally announced December 2020.

    Comments: 6 figures and tables

    Journal ref: The Lancet Planetary Health, 2021

  46. arXiv:2011.06557  [pdf, other

    stat.ML cs.LG stat.ME

    A partition-based similarity for classification distributions

    Authors: Hayden S. Helm, Ronak D. Mehta, Brandon Duderstadt, Weiwei Yang, Christoper M. White, Ali Geisa, Joshua T. Vogelstein, Carey E. Priebe

    Abstract: Herein we define a measure of similarity between classification distributions that is both principled from the perspective of statistical pattern recognition and useful from the perspective of machine learning practitioners. In particular, we propose a novel similarity on classification distributions, dubbed task similarity, that quantifies how an optimally-transformed optimal representation for a… ▽ More

    Submitted 12 November, 2020; originally announced November 2020.

  47. arXiv:2011.00213  [pdf, ps, other

    cs.LG stat.ML

    Finding the Near Optimal Policy via Adaptive Reduced Regularization in MDPs

    Authors: Wenhao Yang, Xiang Li, Guangzeng Xie, Zhihua Zhang

    Abstract: Regularized MDPs serve as a smooth version of original MDPs. However, biased optimal policy always exists for regularized MDPs. Instead of making the coefficientλof regularized term sufficiently small, we propose an adaptive reduction scheme for λ to approximate optimal policy of the original MDP. It is shown that the iteration complexity for obtaining anε-optimal policy could be reduced in compar… ▽ More

    Submitted 31 October, 2020; originally announced November 2020.

  48. arXiv:2010.10969  [pdf, other

    cs.LG stat.ML

    Incorporating Interpretable Output Constraints in Bayesian Neural Networks

    Authors: Wanqian Yang, Lars Lorch, Moritz A. Graule, Himabindu Lakkaraju, Finale Doshi-Velez

    Abstract: Domains where supervised models are deployed often come with task-specific constraints, such as prior expert knowledge on the ground-truth function, or desiderata like safety and fairness. We introduce a novel probabilistic framework for reasoning with such constraints and formulate a prior that enables us to effectively incorporate them into Bayesian neural networks (BNNs), including a variant th… ▽ More

    Submitted 6 January, 2021; v1 submitted 21 October, 2020; originally announced October 2020.

    Comments: 11 pages, with six supplementary pages. 34th Conference on Neural Information Processing Systems (NeurIPS 2020), Vancouver, Canada. Code available at: https://github.com/dtak/ocbnn-public. Updated version (final, official submission to NeurIPS in January 2021) includes post-conference revisions: improved results in Section 6.2, and corrected minor errata in Appendix C

  49. Interactive Steering of Hierarchical Clustering

    Authors: Weikai Yang, Xiting Wang, Jie Lu, Wenwen Dou, Shixia Liu

    Abstract: Hierarchical clustering is an important technique to organize big data for exploratory data analysis. However, existing one-size-fits-all hierarchical clustering methods often fail to meet the diverse needs of different users. To address this challenge, we present an interactive steering method to visually supervise constrained hierarchical clustering by utilizing both public knowledge (e.g., Wiki… ▽ More

    Submitted 21 September, 2020; originally announced September 2020.

    Comments: Accepted for IEEE Transactions on Visualization and Computer Graphics (TVCG)

  50. arXiv:2008.10055  [pdf, other

    stat.ME

    Multiple Network Embedding for Anomaly Detection in Time Series of Graphs

    Authors: Guodong Chen, Jesús Arroyo, Avanti Athreya, Joshua Cape, Joshua T. Vogelstein, Youngser Park, Chris White, Jonathan Larson, Weiwei Yang, Carey E. Priebe

    Abstract: This paper considers the graph signal processing problem of anomaly detection in time series of graphs. We examine two related, complementary inference tasks: the detection of anomalous graphs within a time series, and the detection of temporally anomalous vertices. We approach these tasks via the adaptation of statistically principled methods for joint graph inference, specifically \emph{multiple… ▽ More

    Submitted 15 July, 2024; v1 submitted 23 August, 2020; originally announced August 2020.

    Comments: 51 pages, 17 figures