Showing 1–24 of 24 results for author: Ren, H

Searching in archive stat.
  1. arXiv:2506.01452  [pdf, other]

    stat.ME

    e-GAI: e-value-based Generalized $α$-Investing for Online False Discovery Rate Control

    Authors: Yifan Zhang, Zijian Wei, Haojie Ren, Changliang Zou

    Abstract: Online multiple hypothesis testing has attracted a lot of attention in many applications, e.g., anomaly status detection and stock market price monitoring. The state-of-the-art generalized $α$-investing (GAI) algorithms can control online false discovery rate (FDR) on p-values only under specific dependence structures, a situation that rarely occurs in practice. The e-LOND algorithm (Xu & Ramdas,…

    Submitted 2 June, 2025; originally announced June 2025.

  2. arXiv:2505.04986  [pdf, other]

    stat.ML cs.LG

    Conformal Prediction with Cellwise Outliers: A Detect-then-Impute Approach

    Authors: Qian Peng, Yajie Bao, Haojie Ren, Zhaojun Wang, Changliang Zou

    Abstract: Conformal prediction is a powerful tool for constructing prediction intervals for black-box models, providing a finite sample coverage guarantee for exchangeable data. However, this exchangeability is compromised when some entries of the test feature are contaminated, such as in the case of cellwise outliers. To address this issue, this paper introduces a novel framework called detect-then-impute…

    Submitted 8 May, 2025; originally announced May 2025.

    Comments: 23 pages, 15 figures

  3. arXiv:2410.02279  [pdf, other]

    stat.ML cs.LG math.ST

    On Lai's Upper Confidence Bound in Multi-Armed Bandits

    Authors: Huachen Ren, Cun-Hui Zhang

    Abstract: In this memorial paper, we honor Tze Leung Lai's seminal contributions to the topic of multi-armed bandits, with a specific focus on his pioneering work on the upper confidence bound. We establish sharp non-asymptotic regret bounds for an upper confidence bound index with a constant level of exploration for Gaussian rewards. Furthermore, we establish a non-asymptotic regret bound for the upper con…

    Submitted 3 October, 2024; v1 submitted 3 October, 2024; originally announced October 2024.

    Comments: 25 pages

    MSC Class: 62L05; 62L10 (Primary) 68T05 (Secondary)

  4. arXiv:2403.07728  [pdf, other]

    stat.ML cs.LG stat.ME

    CAP: A General Algorithm for Online Selective Conformal Prediction with FCR Control

    Authors: Yajie Bao, Yuyang Huo, Haojie Ren, Changliang Zou

    Abstract: We study the problem of post-selection predictive inference in an online fashion. To avoid devoting resources to unimportant units, a preliminary selection of the current individual before reporting its prediction interval is common and meaningful in online predictive tasks. Since the online selection causes a temporal multiplicity in the selected prediction intervals, it is important to control t…

    Submitted 21 April, 2025; v1 submitted 12 March, 2024; originally announced March 2024.

  5. Analyzing Risk Factors for Post-Acute Recovery in Older Adults with Alzheimer's Disease and Related Dementia: A New Semi-Parametric Model for Large-Scale Medicare Claims

    Authors: Biyi Shen, Haoyu Ren, Michelle Shardell, Jason Falvey, Chixiang Chen

    Abstract: Nearly 300,000 older adults experience a hip fracture every year, the majority of which occur following a fall. Unfortunately, recovery after fall-related trauma such as hip fracture is poor, where older adults diagnosed with Alzheimer's Disease and Related Dementia (ADRD) spend a particularly long time in hospitals or rehabilitation facilities during the post-operative recuperation period. Becaus…

    Submitted 1 February, 2024; v1 submitted 6 March, 2023; originally announced March 2023.

    Comments: Published in Statistics in Medicine. Contact Emails: [email protected]

  6. Selective conformal inference with false coverage-statement rate control

    Authors: Yajie Bao, Yuyang Huo, Haojie Ren, Changliang Zou

    Abstract: Conformal inference is a popular tool for constructing prediction intervals (PI). We consider here the scenario of post-selection/selective conformal inference, that is, PIs are reported only for individuals selected from unlabeled test data. To account for multiplicity, we develop a general split conformal framework to construct selective PIs with the false coverage-statement rate (FCR) control…

    Submitted 12 March, 2024; v1 submitted 2 January, 2023; originally announced January 2023.

  7. arXiv:2103.11269  [pdf]

    cs.LG stat.ML

    Development and Validation of a Deep Learning Model for Prediction of Severe Outcomes in Suspected COVID-19 Infection

    Authors: Varun Buch, Aoxiao Zhong, Xiang Li, Marcio Aloisio Bezerra Cavalcanti Rockenbach, Dufan Wu, Hui Ren, Jiahui Guan, Andrew Liteplo, Sayon Dutta, Ittai Dayan, Quanzheng Li

    Abstract: COVID-19 patient triaging with predictive outcome of the patients upon first presentation to the emergency department (ED) is crucial for improving patient prognosis, as well as better hospital resources management and cross-infection control. We trained a deep feature fusion model to predict patient outcomes, where the model inputs were EHR data including demographic information, co-morbidities, vital sig…

    Submitted 28 March, 2021; v1 submitted 20 March, 2021; originally announced March 2021.

    Comments: Varun Buch, Aoxiao Zhong and Xiang Li contribute equally to this work

  8. arXiv:2010.12811  [pdf, other]

    cs.LG stat.ML

    Graph Information Bottleneck

    Authors: Tailin Wu, Hongyu Ren, Pan Li, Jure Leskovec

    Abstract: Representation learning of graph-structured data is challenging because both graph structure and node features carry important information. Graph Neural Networks (GNNs) provide an expressive way to fuse information from network structure and node features. However, GNNs are prone to adversarial attacks. Here we introduce Graph Information Bottleneck (GIB), an information-theoretic principle that o…

    Submitted 24 October, 2020; originally announced October 2020.

    Comments: 20 pages, 3 figures, NeurIPS 2020

  9. arXiv:2008.07087  [pdf, other]

    cs.LG cs.AI stat.ML

    OCEAN: Online Task Inference for Compositional Tasks with Context Adaptation

    Authors: Hongyu Ren, Yuke Zhu, Jure Leskovec, Anima Anandkumar, Animesh Garg

    Abstract: Real-world tasks often exhibit a compositional structure that contains a sequence of simpler sub-tasks. For instance, opening a door requires reaching, grasping, rotating, and pulling the door knob. Such compositional tasks require an agent to reason about the sub-task at hand while orchestrating global behavior accordingly. This can be cast as an online task inference problem, where the current t…

    Submitted 17 August, 2020; originally announced August 2020.

    Comments: UAI 2020

  10. arXiv:2005.00687  [pdf, other]

    cs.LG cs.SI stat.ML

    Open Graph Benchmark: Datasets for Machine Learning on Graphs

    Authors: Weihua Hu, Matthias Fey, Marinka Zitnik, Yuxiao Dong, Hongyu Ren, Bowen Liu, Michele Catasta, Jure Leskovec

    Abstract: We present the Open Graph Benchmark (OGB), a diverse set of challenging and realistic benchmark datasets to facilitate scalable, robust, and reproducible graph machine learning (ML) research. OGB datasets are large-scale, encompass multiple important graph ML tasks, and cover a diverse range of domains, ranging from social and information networks to biological networks, molecular graphs, source c…

    Submitted 24 February, 2021; v1 submitted 1 May, 2020; originally announced May 2020.

    Comments: Fix dataset bug in ogbg-code

  11. arXiv:2002.12548  [pdf, other]

    math.ST stat.ME

    A New Procedure for Controlling False Discovery Rate in Large-Scale t-tests

    Authors: Changliang Zou, Haojie Ren, Xu Guo, Runze Li

    Abstract: This paper is concerned with false discovery rate (FDR) control in large-scale multiple testing problems. We first propose a new data-driven testing procedure for controlling the FDR in large-scale t-tests for the one-sample mean problem. The proposed procedure achieves exact FDR control in finite sample settings when the populations are symmetric, no matter the number of tests or sample sizes. Compari…

    Submitted 28 February, 2020; originally announced February 2020.

  12. arXiv:2002.06757  [pdf, other]

    cs.LG stat.ML

    Relational Message Passing for Knowledge Graph Completion

    Authors: Hongwei Wang, Hongyu Ren, Jure Leskovec

    Abstract: Knowledge graph completion aims to predict missing relations between entities in a knowledge graph. In this work, we propose a relational message passing method for knowledge graph completion. Different from existing embedding-based methods, relational message passing only considers edge features (i.e., relation types) without entity IDs in the knowledge graph, and passes relational messages among…

    Submitted 27 May, 2021; v1 submitted 16 February, 2020; originally announced February 2020.

  13. arXiv:2002.05969  [pdf, other]

    cs.LG cs.CL stat.ML

    Query2box: Reasoning over Knowledge Graphs in Vector Space using Box Embeddings

    Authors: Hongyu Ren, Weihua Hu, Jure Leskovec

    Abstract: Answering complex logical queries on large-scale incomplete knowledge graphs (KGs) is a fundamental yet challenging task. Recently, a promising approach to this problem has been to embed KG entities as well as the query into a vector space such that entities that answer the query are embedded close to the query. However, prior work models queries as single points in the vector space, which is prob…

    Submitted 28 February, 2020; v1 submitted 14 February, 2020; originally announced February 2020.

    Comments: ICLR 2020

  14. arXiv:1910.04499  [pdf, other]

    cs.LG stat.ML

    DeGNN: Characterizing and Improving Graph Neural Networks with Graph Decomposition

    Authors: Xupeng Miao, Nezihe Merve Gürel, Wentao Zhang, Zhichao Han, Bo Li, Wei Min, Xi Rao, Hansheng Ren, Yinan Shan, Yingxia Shao, Yujie Wang, Fan Wu, Hui Xue, Yaming Yang, Zitao Zhang, Yang Zhao, Shuai Zhang, Yujing Wang, Bin Cui, Ce Zhang

    Abstract: Despite the wide application of Graph Convolutional Network (GCN), one major limitation is that it does not benefit from the increasing depth and suffers from the oversmoothing problem. In this work, we first characterize this phenomenon from the information-theoretic perspective and show that under certain conditions, the mutual information between the output after $l$ layers and the input of GCN…

    Submitted 29 June, 2020; v1 submitted 10 October, 2019; originally announced October 2019.

    Comments: 20 pages, 5 figures, 5 tables

  15. arXiv:1909.09142  [pdf, other]

    cs.LG cs.AI cs.LO stat.ML

    Using Quantifier Elimination to Enhance the Safety Assurance of Deep Neural Networks

    Authors: Hao Ren, Sai Krishnan Chandrasekar, Anitha Murugesan

    Abstract: Advances in the field of Machine Learning and Deep Neural Networks (DNNs) have enabled rapid development of sophisticated and autonomous systems. However, the inherent complexity to rigorously assure the safe operation of such systems hinders their real-world adoption in safety-critical domains such as aerospace and medical devices. Hence, there is a surge in interest to explore the use of advanced…

    Submitted 18 September, 2019; originally announced September 2019.

  16. arXiv:1907.13196  [pdf, other]

    cs.LG cs.AI stat.ML

    Wasserstein Robust Reinforcement Learning

    Authors: Mohammed Amin Abdullah, Hang Ren, Haitham Bou Ammar, Vladimir Milenkovic, Rui Luo, Mingtian Zhang, Jun Wang

    Abstract: Reinforcement learning algorithms, though successful, tend to over-fit to training environments hampering their application to the real-world. This paper proposes $\text{W}\text{R}^{2}\text{L}$ -- a robust reinforcement learning algorithm with significant robust performance on low and high-dimensional control tasks. Our method formalises robust reinforcement learning as a novel min-max game with a…

    Submitted 16 September, 2019; v1 submitted 30 July, 2019; originally announced July 2019.

  17. Time-Series Anomaly Detection Service at Microsoft

    Authors: Hansheng Ren, Bixiong Xu, Yujing Wang, Chao Yi, Congrui Huang, Xiaoyu Kou, Tony Xing, Mao Yang, Jie Tong, Qi Zhang

    Abstract: Large companies need to monitor various metrics (for example, Page Views and Revenue) of their applications and services in real time. At Microsoft, we develop a time-series anomaly detection service which helps customers to monitor the time-series continuously and alert for potential incidents on time. In this paper, we introduce the pipeline and algorithm of our anomaly detection service, which…

    Submitted 10 June, 2019; originally announced June 2019.

    Comments: KDD 2019

  18. arXiv:1903.06727  [pdf, ps, other]

    cs.LG stat.ML

    On Sample Complexity of Projection-Free Primal-Dual Methods for Learning Mixture Policies in Markov Decision Processes

    Authors: Masoud Badiei Khuzani, Varun Vasudevan, Hongyi Ren, Lei Xing

    Abstract: We study the problem of learning policy of an infinite-horizon, discounted cost, Markov decision process (MDP) with a large number of states. We compute the actions of a policy that is nearly as good as a policy chosen by a suitable oracle from a given mixture policy class characterized by the convex hull of a set of known base policies. To learn the coefficients of the mixture model, we recast th…

    Submitted 30 August, 2019; v1 submitted 15 March, 2019; originally announced March 2019.

    Comments: Manuscript accepted to 58th CDC, 31 pages, 2 figures

  19. arXiv:1902.10365  [pdf, other]

    cs.LG stat.ML

    A Distributionally Robust Optimization Method for Adversarial Multiple Kernel Learning

    Authors: Masoud Badiei Khuzani, Hongyi Ren, Md Tauhidul Islam, Lei Xing

    Abstract: We propose a novel data-driven method to learn a mixture of multiple kernels with random features that is certifiably robust against adversarial inputs. Specifically, we consider a distributionally robust optimization of the kernel-target alignment with respect to the distribution of training samples over a distributional ball defined by the Kullback-Leibler (KL) divergence. The distributionally…

    Submitted 13 April, 2021; v1 submitted 27 February, 2019; originally announced February 2019.

    Comments: Major revision. The title and abstract have been updated

  20. arXiv:1811.03259  [pdf, other]

    cs.LG stat.ML

    Bias and Generalization in Deep Generative Models: An Empirical Study

    Authors: Shengjia Zhao, Hongyu Ren, Arianna Yuan, Jiaming Song, Noah Goodman, Stefano Ermon

    Abstract: In high dimensional settings, density estimation algorithms rely crucially on their inductive bias. Despite recent empirical success, the inductive bias of deep generative models is not well understood. In this paper we propose a framework to systematically investigate bias and generalization in deep generative models of images. Inspired by experimental methods from cognitive psychology, we probe…

    Submitted 7 November, 2018; originally announced November 2018.

  21. arXiv:1811.02629  [pdf, other]

    cs.CV cs.AI cs.LG stat.ML

    Identifying the Best Machine Learning Algorithms for Brain Tumor Segmentation, Progression Assessment, and Overall Survival Prediction in the BRATS Challenge

    Authors: Spyridon Bakas, Mauricio Reyes, Andras Jakab, Stefan Bauer, Markus Rempfler, Alessandro Crimi, Russell Takeshi Shinohara, Christoph Berger, Sung Min Ha, Martin Rozycki, Marcel Prastawa, Esther Alberts, Jana Lipkova, John Freymann, Justin Kirby, Michel Bilello, Hassan Fathallah-Shaykh, Roland Wiest, Jan Kirschke, Benedikt Wiestler, Rivka Colen, Aikaterini Kotrotsou, Pamela Lamontagne, Daniel Marcus, Mikhail Milchenko, et al. (402 additional authors not shown)

    Abstract: Gliomas are the most common primary brain malignancies, with different degrees of aggressiveness, variable prognosis and various heterogeneous histologic sub-regions, i.e., peritumoral edematous/invaded tissue, necrotic core, active and non-enhancing core. This intrinsic heterogeneity is also portrayed in their radio-phenotype, as their sub-regions are depicted by varying intensity profiles dissem…

    Submitted 23 April, 2019; v1 submitted 5 November, 2018; originally announced November 2018.

    Comments: The International Multimodal Brain Tumor Segmentation (BraTS) Challenge

  22. arXiv:1810.09177  [pdf, other]

    cs.LG cs.IR stat.ML

    Compositional Coding Capsule Network with K-Means Routing for Text Classification

    Authors: Hao Ren, Hong Lu

    Abstract: Text classification is a challenging problem which aims to identify the category of texts. In the process of training, word embeddings occupy a large part of parameters. Under the limitation of limited computing resources, it indirectly limits the ability of subsequent network designs. In order to reduce the number of parameters, the compositional coding mechanism has been proposed recently. Based…

    Submitted 2 June, 2022; v1 submitted 22 October, 2018; originally announced October 2018.

    Comments: The paper is accepted by Pattern Recognition Letters; please refer to https://www.sciencedirect.com/science/article/pii/S016786552200188X for an updated version

  23. arXiv:1807.09936  [pdf, other]

    cs.LG cs.AI cs.MA stat.ML

    Multi-Agent Generative Adversarial Imitation Learning

    Authors: Jiaming Song, Hongyu Ren, Dorsa Sadigh, Stefano Ermon

    Abstract: Imitation learning algorithms can be used to learn a policy from expert demonstrations without access to a reward signal. However, most existing approaches are not applicable in multi-agent settings due to the existence of multiple (Nash) equilibria and non-stationary environments. We propose a new framework for multi-agent imitation learning for general Markov games, where we build upon a general…

    Submitted 25 July, 2018; originally announced July 2018.

  24. arXiv:1805.10561  [pdf, other]

    cs.LG cs.CV stat.ML

    Adversarial Constraint Learning for Structured Prediction

    Authors: Hongyu Ren, Russell Stewart, Jiaming Song, Volodymyr Kuleshov, Stefano Ermon

    Abstract: Constraint-based learning reduces the burden of collecting labels by having users specify general properties of structured outputs, such as constraints imposed by physical laws. We propose a novel framework for simultaneously learning these constraints and using them for supervision, bypassing the difficulty of using domain expertise to manually specify constraints. Learning requires a black-box s…

    Submitted 30 May, 2018; v1 submitted 26 May, 2018; originally announced May 2018.

    Comments: To appear at IJCAI 2018