Showing 1–19 of 19 results for author: Liao, P

  1. arXiv:2501.02137

    stat.AP stat.ME

    Evaluation of the HeartSteps Online Sampling Algorithm

    Authors: Xiang Meng, Walter Dempsey, Peng Liao, Nick Reid, Pedja Klasnja, Susan Murphy

    Abstract: Micro-randomized trials (MRTs), which sequentially randomize participants at multiple decision times, have gained prominence in digital intervention development. These sequential randomizations are often subject to certain constraints. In the MRT called HeartSteps V2V3, where an intervention is designed to interrupt sedentary behavior, two core design constraints need to be managed: an average of…

    Submitted 3 January, 2025; originally announced January 2025.

    Comments: The main paper spans 48 pages and includes figures numbered up to Figure 7. The entire document, including references and appendices, consists of 80 pages, ending with Figure 13

  2. arXiv:2410.07130

    cs.CE stat.AP

    Analysis of vessel traffic flow characteristics in inland restricted waterways using multi-source data

    Authors: Wenzhang Yang, Peng Liao, Shangkun Jiang, Hao Wang

    Abstract: To effectively manage vessel traffic and alleviate congestion on busy inland waterways, a comprehensive understanding of vessel traffic flow characteristics is crucial. However, limited data availability has resulted in minimal research on the traffic flow characteristics of inland waterway vessels. This study addresses this gap by conducting vessel-following experiments and fixed-point video moni…

    Submitted 21 September, 2024; originally announced October 2024.

  3. arXiv:2409.08396

    stat.ML cs.LG stat.AP

    Federated One-Shot Ensemble Clustering

    Authors: Rui Duan, Xin Xiong, Jueyi Liu, Katherine P. Liao, Tianxi Cai

    Abstract: Cluster analysis across multiple institutions poses significant challenges due to data-sharing restrictions. To overcome these limitations, we introduce the Federated One-shot Ensemble Clustering (FONT) algorithm, a novel solution tailored for multi-site analyses under such constraints. FONT requires only a single round of communication between sites and ensures privacy by exchanging only fitted m…

    Submitted 12 September, 2024; originally announced September 2024.

  4. arXiv:2304.05365

    cs.LG stat.AP stat.ME stat.ML

    Did we personalize? Assessing personalization by an online reinforcement learning algorithm using resampling

    Authors: Susobhan Ghosh, Raphael Kim, Prasidh Chhabria, Raaz Dwivedi, Predrag Klasnja, Peng Liao, Kelly Zhang, Susan Murphy

    Abstract: There is a growing interest in using reinforcement learning (RL) to personalize sequences of treatments in digital health to support users in adopting healthier behaviors. Such sequential decision-making problems involve decisions about when to treat and how to treat based on the user's context (e.g., prior activity level, location, etc.). Online RL is a promising data-driven approach for this pro…

    Submitted 7 August, 2023; v1 submitted 11 April, 2023; originally announced April 2023.

    Comments: The first two authors contributed equally

  5. arXiv:2211.16609

    stat.AP

    Harnessing electronic health records for real-world evidence

    Authors: Jue Hou, Rachel Zhao, Jessica Gronsbell, Brett K. Beaulieu-Jones, Griffin Webber, Thomas Jemielita, Shuyan Wan, Chuan Hong, Yucong Lin, Tianrun Cai, Jun Wen, Vidul A. Panickan, Clara-Lea Bonzel, Kai-Li Liaw, Katherine P. Liao, Tianxi Cai

    Abstract: While randomized controlled trials (RCTs) are the gold-standard for establishing the efficacy and safety of a medical treatment, real-world evidence (RWE) generated from real-world data (RWD) has been vital in post-approval monitoring and is being promoted for the regulatory process of experimental therapies. An emerging source of RWD is electronic health records (EHRs), which contain detailed inf…

    Submitted 29 November, 2022; originally announced November 2022.

    Comments: 39 pages, 1 figure, 1 table

  6. arXiv:2208.07927

    stat.ME

    Semi-supervised Transfer Learning for Evaluation of Model Classification Performance

    Authors: Linshanshan Wang, Xuan Wang, Katherine P. Liao, Tianxi Cai

    Abstract: In modern machine learning applications, frequent encounters of covariate shift and label scarcity have posed challenges to robust model training and evaluation. Numerous transfer learning methods have been developed to robustly adapt the model itself to some unlabeled target populations using existing labeled data in a source population. However, there is a paucity of literature on transferring p…

    Submitted 18 November, 2022; v1 submitted 16 August, 2022; originally announced August 2022.

    Comments: 3 figures, 2 tables

  7. arXiv:2206.04615

    cs.CL cs.AI cs.CY cs.LG stat.ML

    Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models

    Authors: Aarohi Srivastava, Abhinav Rastogi, Abhishek Rao, Abu Awal Md Shoeb, Abubakar Abid, Adam Fisch, Adam R. Brown, Adam Santoro, Aditya Gupta, Adrià Garriga-Alonso, Agnieszka Kluska, Aitor Lewkowycz, Akshat Agarwal, Alethea Power, Alex Ray, Alex Warstadt, Alexander W. Kocurek, Ali Safaya, Ali Tazarv, Alice Xiang, Alicia Parrish, Allen Nie, Aman Hussain, Amanda Askell, Amanda Dsouza, et al. (426 additional authors not shown)

    Abstract: Language models demonstrate both quantitative improvement and new qualitative capabilities with increasing scale. Despite their potentially transformative impact, these new capabilities are as yet poorly characterized. In order to inform future research, prepare for disruptive new model capabilities, and ameliorate socially harmful effects, it is vital that we understand the present and near-futur…

    Submitted 12 June, 2023; v1 submitted 9 June, 2022; originally announced June 2022.

    Comments: 27 pages, 17 figures + references and appendices, repo: https://github.com/google/BIG-bench

    Journal ref: Transactions on Machine Learning Research, May/2022, https://openreview.net/forum?id=uyTL5Bvosj

  8. arXiv:2011.04185

    math.ST cs.LG stat.ML

    Robust Batch Policy Learning in Markov Decision Processes

    Authors: Zhengling Qi, Peng Liao

    Abstract: We study the offline data-driven sequential decision making problem in the framework of Markov decision process (MDP). In order to enhance the generalizability and adaptivity of the learned policy, we propose to evaluate each policy by a set of the average rewards with respect to distributions centered at the policy induced stationary distribution. Given a pre-collected dataset of multiple traject…

    Submitted 9 November, 2021; v1 submitted 8 November, 2020; originally announced November 2020.

  9. arXiv:2010.02521

    stat.ME

    Augmented Transfer Regression Learning with Semi-non-parametric Nuisance Models

    Authors: Molei Liu, Yi Zhang, Katherine P Liao, Tianxi Cai

    Abstract: In contemporary statistical learning, covariate shift correction plays an important role in transfer learning when distribution of the testing data is shifted from the training data. Importance weighting, as a natural and principled strategy to adjust for covariate shift, has been commonly used in the field of transfer learning. However, this strategy is not robust to model misspecification or exce…

    Submitted 11 April, 2022; v1 submitted 6 October, 2020; originally announced October 2020.

  10. arXiv:2009.13504

    cs.LG cs.AI cs.CV stat.ML

    Information Obfuscation of Graph Neural Networks

    Authors: Peiyuan Liao, Han Zhao, Keyulu Xu, Tommi Jaakkola, Geoffrey Gordon, Stefanie Jegelka, Ruslan Salakhutdinov

    Abstract: While the advent of Graph Neural Networks (GNNs) has greatly improved node and graph representation learning in many applications, the neighborhood aggregation scheme exposes additional vulnerabilities to adversaries seeking to extract node-level information about sensitive attributes. In this paper, we study the problem of protecting sensitive attributes by information obfuscation when learning w…

    Submitted 13 June, 2021; v1 submitted 28 September, 2020; originally announced September 2020.

    Comments: ICML 2021; Code is available at https://github.com/liaopeiyuan/GAL

  11. arXiv:2008.01571

    cs.LG cs.CY stat.ML

    IntelligentPooling: Practical Thompson Sampling for mHealth

    Authors: Sabina Tomkins, Peng Liao, Predrag Klasnja, Susan Murphy

    Abstract: In mobile health (mHealth) smart devices deliver behavioral treatments repeatedly over time to a user with the goal of helping the user adopt and maintain healthy behaviors. Reinforcement learning appears ideal for learning how to optimally make these sequential treatment decisions. However, significant challenges must be overcome before reinforcement learning can be effectively deployed in a mobi…

    Submitted 12 December, 2020; v1 submitted 31 July, 2020; originally announced August 2020.

    Comments: arXiv admin note: text overlap with arXiv:2002.09971

  12. arXiv:2007.11771

    math.ST stat.ML

    Batch Policy Learning in Average Reward Markov Decision Processes

    Authors: Peng Liao, Zhengling Qi, Runzhe Wan, Predrag Klasnja, Susan Murphy

    Abstract: We consider the batch (off-line) policy learning problem in the infinite horizon Markov Decision Process. Motivated by mobile health applications, we focus on learning a policy that maximizes the long-term average reward. We propose a doubly robust estimator for the average reward and show that it achieves semiparametric efficiency. Further we develop an optimization algorithm to compute the optim…

    Submitted 17 September, 2022; v1 submitted 22 July, 2020; originally announced July 2020.

  13. arXiv:2002.09971

    cs.LG cs.CY stat.ML

    Rapidly Personalizing Mobile Health Treatment Policies with Limited Data

    Authors: Sabina Tomkins, Peng Liao, Predrag Klasnja, Serena Yeung, Susan Murphy

    Abstract: In mobile health (mHealth), reinforcement learning algorithms that adapt to one's context without learning personalized policies might fail to distinguish between the needs of individuals. Yet the high amount of noise due to the in situ delivery of mHealth interventions can cripple the ability of an algorithm to learn when given access to only a single user's data, making personalization challengi…

    Submitted 23 February, 2020; originally announced February 2020.

  14. arXiv:1912.13088

    cs.LG math.ST stat.ML

    Off-Policy Estimation of Long-Term Average Outcomes with Applications to Mobile Health

    Authors: Peng Liao, Predrag Klasnja, Susan Murphy

    Abstract: Due to the recent advancements in wearables and sensing technology, health scientists are increasingly developing mobile health (mHealth) interventions. In mHealth interventions, mobile devices are used to deliver treatment to individuals as they go about their daily lives. These treatments are generally designed to impact a near time, proximal outcome such as stress or physical activity. The mHea…

    Submitted 22 July, 2020; v1 submitted 30 December, 2019; originally announced December 2019.

  15. Off-Policy Evaluation of Probabilistic Identity Data in Lookalike Modeling

    Authors: Randell Cotta, Mingyang Hu, Dan Jiang, Peizhou Liao

    Abstract: We evaluate the impact of probabilistically-constructed digital identity data collected from Sep. to Dec. 2017 (approx.), in the context of Lookalike-targeted campaigns. The backbone of this study is a large set of probabilistically-constructed "identities", represented as small bags of cookies and mobile ad identifiers with associated metadata, that are likely all owned by the same underlying use…

    Submitted 3 January, 2019; originally announced January 2019.

    Comments: Accepted by WSDM 2019

  16. arXiv:1711.03587

    stat.AP

    The stratified micro-randomized trial design: sample size considerations for testing nested causal effects of time-varying treatments

    Authors: Walter Dempsey, Peng Liao, Santosh Kumar, Susan A. Murphy

    Abstract: Technological advancements in the field of mobile devices and wearable sensors have helped overcome obstacles in the delivery of care, making it possible to deliver behavioral treatments anytime and anywhere. Increasingly the delivery of these treatments is triggered by predictions of risk or engagement which may have been impacted by prior treatments. Furthermore the treatments are often designed…

    Submitted 9 November, 2017; originally announced November 2017.

  17. arXiv:1609.00695

    stat.ME

    MRT-SS Calculator: An R Shiny Application for Sample Size Calculation in Micro-Randomized Trials

    Authors: Nicholas J. Seewald, Ji Sun, Peng Liao

    Abstract: The micro-randomized trial (MRT) is a new experimental design which allows for the investigation of the proximal effects of a "just-in-time" treatment, often provided via a mobile device as part of a mobile health intervention. As with a traditional randomized controlled trial, computing the minimum required sample size to achieve a desired power is a crucial step in designing an MRT. We present M…

    Submitted 5 August, 2020; v1 submitted 2 September, 2016; originally announced September 2016.

    Comments: 20 pages. Source code for the application is available at https://github.com/pengliao/mrt-ss-calculator

  18. arXiv:1511.08074

    stat.ME

    Estimation and testing for multiple regulation of multivariate mixed outcomes

    Authors: Denis Agniel, Katherine P. Liao, Tianxi Cai

    Abstract: Considerable interest has recently been focused on studying multiple phenotypes simultaneously in both epidemiological and genomic studies, either to capture the multidimensionality of complex disorders or to understand shared etiology of related disorders. We seek to identify multiple regulators or predictors that are associated with multiple outcomes when these outcomes may be measured on…

    Submitted 25 November, 2015; originally announced November 2015.

    Comments: 25 pages, 6 figures

  19. Sample Size Calculations for Micro-randomized Trials in mHealth

    Authors: Peng Liao, Predrag Klasnja, Ambuj Tewari, Susan A. Murphy

    Abstract: The use and development of mobile interventions are experiencing rapid growth. In "just-in-time" mobile interventions, treatments are provided via a mobile device and they are intended to help an individual make healthy decisions "in the moment," and thus have a proximal, near future impact. Currently the development of mobile interventions is proceeding at a much faster pace than that of associat…

    Submitted 22 July, 2020; v1 submitted 1 April, 2015; originally announced April 2015.

    Comments: 29 pages, 5 figures, 18 tables

    Journal ref: Statistics in medicine 35, no. 12 (2016): 1944-1971