Skip to main content

Showing 1–50 of 65 results for author: Murphy, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2502.17645  [pdf, other

    cs.HC

    "It felt more real": Investigating the User Experience of the MiWaves Personalizing JITAI Pilot Study

    Authors: Susobhan Ghosh, Pei-Yao Hung, Lara N. Coughlin, Erin E. Bonar, Yongyi Guo, Inbal Nahum-Shani, Maureen Walton, Mark W. Newman, Susan A. Murphy

    Abstract: Cannabis use among emerging adults is increasing globally, posing significant health risks and creating a need for effective interventions. We present an exploratory analysis of the MiWaves pilot study, a digital intervention aimed at supporting cannabis use reduction among emerging adults (ages 18-25). Our findings indicate the potential of self-monitoring check-ins and trend visualizations in fo… ▽ More

    Submitted 24 February, 2025; originally announced February 2025.

  2. arXiv:2502.06835  [pdf, other

    cs.LG

    Reinforcement Learning on Dyads to Enhance Medication Adherence

    Authors: Ziping Xu, Hinal Jajal, Sung Won Choi, Inbal Nahum-Shani, Guy Shani, Alexandra M. Psihogios, Pei-Yao Hung, Susan Murphy

    Abstract: Medication adherence is critical for the recovery of adolescents and young adults (AYAs) who have undergone hematopoietic cell transplantation (HCT). However, maintaining adherence is challenging for AYAs after hospital discharge, who experience both individual (e.g. physical and emotional symptoms) and interpersonal barriers (e.g., relational difficulties with their care partner, who is often inv… ▽ More

    Submitted 21 May, 2025; v1 submitted 5 February, 2025; originally announced February 2025.

  3. arXiv:2502.01789  [pdf

    cs.AI cs.MA

    An Agentic AI Workflow for Detecting Cognitive Concerns in Real-world Data

    Authors: Jiazi Tian, Liqin Wang, Pedram Fard, Valdery Moura Junior, Deborah Blacker, Jennifer S. Haas, Chirag Patel, Shawn N. Murphy, Lidia M. V. R. Moura, Hossein Estiri

    Abstract: Early identification of cognitive concerns is critical but often hindered by subtle symptom presentation. This study developed and validated a fully automated, multi-agent AI workflow using LLaMA 3 8B to identify cognitive concerns in 3,338 clinical notes from Mass General Brigham. The agentic workflow, leveraging task-specific agents that dynamically collaborate to extract meaningful insights fro… ▽ More

    Submitted 3 February, 2025; originally announced February 2025.

  4. arXiv:2412.18799  [pdf

    cs.CY

    Quantifying the Risk of Pastoral Conflict in 4 Central African Countries

    Authors: Lirika Solaa, Youdinghuan Chen, Samantha K. Murphy, V. S. Subrahmanian

    Abstract: Climate change is becoming a widely recognized risk factor of farmer-herder conflict in Africa. Using an 8 year dataset (Jan 2015 to Sep 2022) of detailed weather and terrain data across four African nations, we apply statistical and machine learning methods to analyze pastoral conflict. We test hypotheses linking these variables with pastoral conflict within each country using geospatial and stat… ▽ More

    Submitted 25 December, 2024; originally announced December 2024.

    Comments: v1. 78 pages, 13 figures, 23 tables

  5. arXiv:2412.00308  [pdf, other

    cs.LG cs.AI stat.ML

    BOTS: Batch Bayesian Optimization of Extended Thompson Sampling for Severely Episode-Limited RL Settings

    Authors: Karine Karine, Susan A. Murphy, Benjamin M. Marlin

    Abstract: In settings where the application of reinforcement learning (RL) requires running real-world trials, including the optimization of adaptive health interventions, the number of episodes available for learning can be severely limited due to cost or time constraints. In this setting, the bias-variance trade-off of contextual bandit methods can be significantly better than that of more complex full RL… ▽ More

    Submitted 29 November, 2024; originally announced December 2024.

    Comments: Accepted at NeurIPS 2024 Workshop on Bayesian Decision-making and Uncertainty

  6. arXiv:2410.24131  [pdf, ps, other

    cs.HC

    Transit drivers' reflections on the benefits and harms of eye tracking technology

    Authors: Shaina Murphy, Bryce Grame, Ethan Smith, Siva Srinivasan, Eakta Jain

    Abstract: Eye tracking technology offers great potential for improving road safety. It is already being built into vehicles, namely cars and trucks. When this technology is integrated into transit service vehicles, employees, i.e., bus drivers, will be subject to being eye tracked on their job. Although there is much research effort advancing algorithms for eye tracking in transportation, less is known abou… ▽ More

    Submitted 6 December, 2024; v1 submitted 31 October, 2024; originally announced October 2024.

  7. arXiv:2410.14659  [pdf, other

    cs.LG stat.ML

    Harnessing Causality in Reinforcement Learning With Bagged Decision Times

    Authors: Daiqi Gao, Hsin-Yu Lai, Predrag Klasnja, Susan A. Murphy

    Abstract: We consider reinforcement learning (RL) for a class of problems with bagged decision times. A bag contains a finite sequence of consecutive decision times. The transition dynamics are non-Markovian and non-stationary within a bag. All actions within a bag jointly impact a single reward, observed at the end of the bag. For example, in mobile health, multiple activity suggestions in a day collective… ▽ More

    Submitted 6 May, 2025; v1 submitted 18 October, 2024; originally announced October 2024.

  8. Environment Scan of Generative AI Infrastructure for Clinical and Translational Science

    Authors: Betina Idnay, Zihan Xu, William G. Adams, Mohammad Adibuzzaman, Nicholas R. Anderson, Neil Bahroos, Douglas S. Bell, Cody Bumgardner, Thomas Campion, Mario Castro, James J. Cimino, I. Glenn Cohen, David Dorr, Peter L Elkin, Jungwei W. Fan, Todd Ferris, David J. Foran, David Hanauer, Mike Hogarth, Kun Huang, Jayashree Kalpathy-Cramer, Manoj Kandpal, Niranjan S. Karnik, Avnish Katoch, Albert M. Lai , et al. (32 additional authors not shown)

    Abstract: This study reports a comprehensive environmental scan of the generative AI (GenAI) infrastructure in the national network for clinical and translational science across 36 institutions supported by the Clinical and Translational Science Award (CTSA) Program led by the National Center for Advancing Translational Sciences (NCATS) of the National Institutes of Health (NIH) at the United States. With t… ▽ More

    Submitted 27 September, 2024; originally announced October 2024.

  9. arXiv:2410.03380  [pdf, ps, other

    cs.LG cs.AI q-bio.QM

    Identifying biological perturbation targets through causal differential networks

    Authors: Menghua Wu, Umesh Padia, Sean H. Murphy, Regina Barzilay, Tommi Jaakkola

    Abstract: Identifying variables responsible for changes to a biological system enables applications in drug target discovery and cell engineering. Given a pair of observational and interventional datasets, the goal is to isolate the subset of observed variables that were the targets of the intervention. Directly applying causal discovery algorithms is challenging: the data may contain thousands of variables… ▽ More

    Submitted 30 May, 2025; v1 submitted 4 October, 2024; originally announced October 2024.

    Journal ref: Proceedings of the 42nd International Conference on Machine Learning, Vancouver, Canada. PMLR 267, 2025

  10. arXiv:2409.10526  [pdf, other

    cs.CY cs.AI

    Effective Monitoring of Online Decision-Making Algorithms in Digital Intervention Implementation

    Authors: Anna L. Trella, Susobhan Ghosh, Erin E. Bonar, Lara Coughlin, Finale Doshi-Velez, Yongyi Guo, Pei-Yao Hung, Inbal Nahum-Shani, Vivek Shetty, Maureen Walton, Iris Yan, Kelly W. Zhang, Susan A. Murphy

    Abstract: Online AI decision-making algorithms are increasingly used by digital interventions to dynamically personalize treatment to individuals. These algorithms determine, in real-time, the delivery of treatment based on accruing data. The objective of this paper is to provide guidelines for enabling effective monitoring of online decision-making algorithms with the goal of (1) safeguarding individuals a… ▽ More

    Submitted 30 August, 2024; originally announced September 2024.

  11. arXiv:2409.02069  [pdf, other

    cs.AI cs.HC

    A Deployed Online Reinforcement Learning Algorithm In An Oral Health Clinical Trial

    Authors: Anna L. Trella, Kelly W. Zhang, Hinal Jajal, Inbal Nahum-Shani, Vivek Shetty, Finale Doshi-Velez, Susan A. Murphy

    Abstract: Dental disease is a prevalent chronic condition associated with substantial financial burden, personal suffering, and increased risk of systemic diseases. Despite widespread recommendations for twice-daily tooth brushing, adherence to recommended oral self-care behaviors remains sub-optimal due to factors such as forgetfulness and disengagement. To address this, we developed Oralytics, a mHealth i… ▽ More

    Submitted 18 December, 2024; v1 submitted 3 September, 2024; originally announced September 2024.

  12. arXiv:2408.15076  [pdf, other

    cs.LG cs.AI

    MiWaves Reinforcement Learning Algorithm

    Authors: Susobhan Ghosh, Yongyi Guo, Pei-Yao Hung, Lara Coughlin, Erin Bonar, Inbal Nahum-Shani, Maureen Walton, Susan Murphy

    Abstract: The escalating prevalence of cannabis use poses a significant public health challenge globally. In the U.S., cannabis use is more prevalent among emerging adults (EAs) (ages 18-25) than any other age group, with legalization in the multiple states contributing to a public perception that cannabis is less risky than in prior decades. To address this growing concern, we developed MiWaves, a reinforc… ▽ More

    Submitted 27 August, 2024; originally announced August 2024.

    Comments: arXiv admin note: substantial text overlap with arXiv:2402.17739

  13. arXiv:2406.19662  [pdf, other

    cs.LG physics.comp-ph

    Finite basis Kolmogorov-Arnold networks: domain decomposition for data-driven and physics-informed problems

    Authors: Amanda A. Howard, Bruno Jacob, Sarah H. Murphy, Alexander Heinlein, Panos Stinis

    Abstract: Kolmogorov-Arnold networks (KANs) have attracted attention recently as an alternative to multilayer perceptrons (MLPs) for scientific machine learning. However, KANs can be expensive to train, even for relatively small networks. Inspired by finite basis physics-informed neural networks (FBPINNs), in this work, we develop a domain decomposition method for KANs that allows for several small KANs to… ▽ More

    Submitted 28 June, 2024; originally announced June 2024.

  14. arXiv:2406.13127  [pdf, other

    cs.AI

    Oralytics Reinforcement Learning Algorithm

    Authors: Anna L. Trella, Kelly W. Zhang, Stephanie M. Carpenter, David Elashoff, Zara M. Greer, Inbal Nahum-Shani, Dennis Ruenger, Vivek Shetty, Susan A. Murphy

    Abstract: Dental disease is still one of the most common chronic diseases in the United States. While dental disease is preventable through healthy oral self-care behaviors (OSCB), this basic behavior is not consistently practiced. We have developed Oralytics, an online, reinforcement learning (RL) algorithm that optimizes the delivery of personalized intervention prompts to improve OSCB. In this paper, we… ▽ More

    Submitted 12 September, 2024; v1 submitted 18 June, 2024; originally announced June 2024.

  15. arXiv:2406.06306  [pdf, other

    eess.SP cs.IT math.ST

    Unified Fourier bases for signals on random graphs with group symmetries

    Authors: Mahya Ghandehari, Jeannette Janssen, Silo Murphy

    Abstract: We consider a recently proposed approach to graph signal processing (GSP) based on graphons. We show how the graphon-based approach to GSP applies to graphs sampled from a stochastic block model derived from a weighted Cayley graph. When SBM block sizes are equal, a nice Fourier basis can be derived from the representation theory of the underlying group. We explore how the SBM Fourier basis is aff… ▽ More

    Submitted 14 March, 2025; v1 submitted 10 June, 2024; originally announced June 2024.

    Comments: 25 pages

    MSC Class: 94A12

  16. arXiv:2405.19660  [pdf, other

    cs.CL

    PATIENT-Ψ: Using Large Language Models to Simulate Patients for Training Mental Health Professionals

    Authors: Ruiyi Wang, Stephanie Milani, Jamie C. Chiu, Jiayin Zhi, Shaun M. Eack, Travis Labrum, Samuel M. Murphy, Nev Jones, Kate Hardy, Hong Shen, Fei Fang, Zhiyu Zoey Chen

    Abstract: Mental illness remains one of the most critical public health issues. Despite its importance, many mental health professionals highlight a disconnect between their training and actual real-world patient practice. To help bridge this gap, we propose PATIENT-Ψ, a novel patient simulation framework for cognitive behavior therapy (CBT) training. To build PATIENT-Ψ, we construct diverse patient cogniti… ▽ More

    Submitted 3 October, 2024; v1 submitted 29 May, 2024; originally announced May 2024.

    Comments: EMNLP 2024 Main, 9 pages, 5 figures

  17. arXiv:2403.10946  [pdf, other

    stat.ML cs.LG

    The Fallacy of Minimizing Cumulative Regret in the Sequential Task Setting

    Authors: Ziping Xu, Kelly W. Zhang, Susan A. Murphy

    Abstract: Online Reinforcement Learning (RL) is typically framed as the process of minimizing cumulative regret (CR) through interactions with an unknown environment. However, real-world RL applications usually involve a sequence of tasks, and the data collected in the first task is used to warm-start the second task. The performance of the warm-start policy is measured by simple regret (SR). While minimizi… ▽ More

    Submitted 24 October, 2024; v1 submitted 16 March, 2024; originally announced March 2024.

  18. arXiv:2403.05911  [pdf, other

    cs.HC cs.AI

    Towards Optimizing Human-Centric Objectives in AI-Assisted Decision-Making With Offline Reinforcement Learning

    Authors: Zana Buçinca, Siddharth Swaroop, Amanda E. Paluch, Susan A. Murphy, Krzysztof Z. Gajos

    Abstract: Imagine if AI decision-support tools not only complemented our ability to make accurate decisions, but also improved our skills, boosted collaboration, and elevated the joy we derive from our tasks. Despite the potential to optimize a broad spectrum of such human-centric objectives, the design of current AI tools remains focused on decision accuracy alone. We propose offline reinforcement learning… ▽ More

    Submitted 14 April, 2024; v1 submitted 9 March, 2024; originally announced March 2024.

  19. arXiv:2402.17739  [pdf, other

    cs.AI cs.LG

    reBandit: Random Effects based Online RL algorithm for Reducing Cannabis Use

    Authors: Susobhan Ghosh, Yongyi Guo, Pei-Yao Hung, Lara Coughlin, Erin Bonar, Inbal Nahum-Shani, Maureen Walton, Susan Murphy

    Abstract: The escalating prevalence of cannabis use, and associated cannabis-use disorder (CUD), poses a significant public health challenge globally. With a notably wide treatment gap, especially among emerging adults (EAs; ages 18-25), addressing cannabis use and CUD remains a pivotal objective within the 2030 United Nations Agenda for Sustainable Development Goals (SDG). In this work, we develop an onlin… ▽ More

    Submitted 11 June, 2024; v1 submitted 27 February, 2024; originally announced February 2024.

  20. arXiv:2402.17003  [pdf, other

    cs.LG cs.AI cs.CY

    Monitoring Fidelity of Online Reinforcement Learning Algorithms in Clinical Trials

    Authors: Anna L. Trella, Kelly W. Zhang, Inbal Nahum-Shani, Vivek Shetty, Iris Yan, Finale Doshi-Velez, Susan A. Murphy

    Abstract: Online reinforcement learning (RL) algorithms offer great potential for personalizing treatment for participants in clinical trials. However, deploying an online, autonomous algorithm in the high-stakes healthcare setting makes quality control and data quality especially difficult to achieve. This paper proposes algorithm fidelity as a critical requirement for deploying online RL algorithms in cli… ▽ More

    Submitted 12 August, 2024; v1 submitted 26 February, 2024; originally announced February 2024.

  21. arXiv:2402.03110  [pdf, other

    cs.LG cs.AI

    Non-Stationary Latent Auto-Regressive Bandits

    Authors: Anna L. Trella, Walter Dempsey, Asim H. Gazi, Ziping Xu, Finale Doshi-Velez, Susan A. Murphy

    Abstract: For the non-stationary multi-armed bandit (MAB) problem, many existing methods allow a general mechanism for the non-stationarity, but rely on a budget for the non-stationarity that is sub-linear to the total number of time steps $T$. In many real-world settings, however, the mechanism for the non-stationarity can be modeled, but there is no budget for the non-stationarity. We instead consider the… ▽ More

    Submitted 27 February, 2025; v1 submitted 5 February, 2024; originally announced February 2024.

  22. arXiv:2402.01995  [pdf, other

    cs.LG math.OC

    Online Uniform Sampling: Randomized Learning-Augmented Approximation Algorithms with Application to Digital Health

    Authors: Xueqing Liu, Kyra Gan, Esmaeil Keyvanshokooh, Susan Murphy

    Abstract: Motivated by applications in digital health, this work studies the novel problem of online uniform sampling (OUS), where the goal is to distribute a sampling budget uniformly across unknown decision times. In the OUS problem, the algorithm is given a budget $b$ and a time horizon $T$, and an adversary then chooses a value $Ï„^* \in [b,T]$, which is revealed to the algorithm online. At each decision… ▽ More

    Submitted 19 October, 2024; v1 submitted 2 February, 2024; originally announced February 2024.

  23. arXiv:2401.14923  [pdf, other

    cs.AI cs.LG

    Reinforcement Learning Interventions on Boundedly Rational Human Agents in Frictionful Tasks

    Authors: Eura Nofshin, Siddharth Swaroop, Weiwei Pan, Susan Murphy, Finale Doshi-Velez

    Abstract: Many important behavior changes are frictionful; they require individuals to expend effort over a long period with little immediate gratification. Here, an artificial intelligence (AI) agent can provide personalized interventions to help individuals stick to their goals. In these settings, the AI agent must personalize rapidly (before the individual disengages) and interpretably, to help us unders… ▽ More

    Submitted 26 January, 2024; originally announced January 2024.

    Comments: In AAMAS 2024

  24. arXiv:2311.06483  [pdf, other

    cs.LG math.NA

    Stacked networks improve physics-informed training: applications to neural networks and deep operator networks

    Authors: Amanda A Howard, Sarah H Murphy, Shady E Ahmed, Panos Stinis

    Abstract: Physics-informed neural networks and operator networks have shown promise for effectively solving equations modeling physical systems. However, these networks can be difficult or impossible to train accurately for some systems of equations. We present a novel multifidelity framework for stacking physics-informed neural networks and operator networks that facilitates training. We successively build… ▽ More

    Submitted 20 November, 2023; v1 submitted 11 November, 2023; originally announced November 2023.

  25. arXiv:2309.05671  [pdf

    cs.LG cs.AI cs.IR

    tSPM+; a high-performance algorithm for mining transitive sequential patterns from clinical data

    Authors: Jonas Hügel, Ulrich Sax, Shawn N. Murphy, Hossein Estiri

    Abstract: The increasing availability of large clinical datasets collected from patients can enable new avenues for computational characterization of complex diseases using different analytic algorithms. One of the promising new methods for extracting knowledge from large clinical datasets involves temporal pattern mining integrated with machine learning workflows. However, mining these temporal patterns is… ▽ More

    Submitted 8 September, 2023; originally announced September 2023.

    Comments: Supplementary data: https://doi.org/10.5281/zenodo.8329519

  26. arXiv:2308.07843  [pdf, other

    cs.LG stat.AP stat.ML

    Dyadic Reinforcement Learning

    Authors: Shuangning Li, Lluis Salvat Niell, Sung Won Choi, Inbal Nahum-Shani, Guy Shani, Susan Murphy

    Abstract: Mobile health aims to enhance health outcomes by delivering interventions to individuals as they go about their daily life. The involvement of care partners and social support networks often proves crucial in helping individuals managing burdensome medical conditions. This presents opportunities in mobile health to design interventions that target the dyadic relationship -- the relationship betwee… ▽ More

    Submitted 11 August, 2024; v1 submitted 15 August, 2023; originally announced August 2023.

  27. arXiv:2307.13916  [pdf, other

    stat.ML cs.LG

    Online learning in bandits with predicted context

    Authors: Yongyi Guo, Ziping Xu, Susan Murphy

    Abstract: We consider the contextual bandit problem where at each time, the agent only has access to a noisy version of the context and the error variance (or an estimator of this variance). This setting is motivated by a wide range of applications where the true context for decision-making is unobserved, and only a prediction of the context by a potentially complex machine learning algorithm is available.… ▽ More

    Submitted 17 March, 2024; v1 submitted 25 July, 2023; originally announced July 2023.

  28. arXiv:2306.11208  [pdf, other

    cs.LG cs.AI stat.ML

    The Unintended Consequences of Discount Regularization: Improving Regularization in Certainty Equivalence Reinforcement Learning

    Authors: Sarah Rathnam, Sonali Parbhoo, Weiwei Pan, Susan A. Murphy, Finale Doshi-Velez

    Abstract: Discount regularization, using a shorter planning horizon when calculating the optimal policy, is a popular choice to restrict planning to a less complex set of policies when estimating an MDP from sparse or noisy data (Jiang et al., 2015). It is commonly understood that discount regularization functions by de-emphasizing or ignoring delayed effects. In this paper, we reveal an alternate view of d… ▽ More

    Submitted 19 June, 2023; originally announced June 2023.

  29. arXiv:2306.10983  [pdf, other

    stat.ML cs.LG

    Effect-Invariant Mechanisms for Policy Generalization

    Authors: Sorawit Saengkyongam, Niklas Pfister, Predrag Klasnja, Susan Murphy, Jonas Peters

    Abstract: Policy learning is an important component of many real-world learning systems. A major challenge in policy learning is how to adapt efficiently to unseen environments or tasks. Recently, it has been suggested to exploit invariant conditional distributions to learn models that generalize better to unseen environments. However, assuming invariance of entire conditional distributions (which we call f… ▽ More

    Submitted 27 June, 2023; v1 submitted 19 June, 2023; originally announced June 2023.

  30. arXiv:2305.18511  [pdf, other

    cs.LG math.OC

    Contextual Bandits with Budgeted Information Reveal

    Authors: Kyra Gan, Esmaeil Keyvanshokooh, Xueqing Liu, Susan Murphy

    Abstract: Contextual bandit algorithms are commonly used in digital health to recommend personalized treatments. However, to ensure the effectiveness of the treatments, patients are often requested to take actions that have no immediate benefit to them, which we refer to as pro-treatment actions. In practice, clinicians have a limited budget to encourage patients to take these actions and collect additional… ▽ More

    Submitted 13 March, 2024; v1 submitted 29 May, 2023; originally announced May 2023.

    Comments: International Conference on Artificial Intelligence and Statistics, 2024

  31. arXiv:2305.09913  [pdf, other

    cs.LG cs.AI

    Assessing the Impact of Context Inference Error and Partial Observability on RL Methods for Just-In-Time Adaptive Interventions

    Authors: Karine Karine, Predrag Klasnja, Susan A. Murphy, Benjamin M. Marlin

    Abstract: Just-in-Time Adaptive Interventions (JITAIs) are a class of personalized health interventions developed within the behavioral science community. JITAIs aim to provide the right type and amount of support by iteratively selecting a sequence of intervention options from a pre-defined set of components in response to each individual's time varying state. In this work, we explore the application of re… ▽ More

    Submitted 16 May, 2023; originally announced May 2023.

    Comments: Accepted at UAI 2023

  32. arXiv:2304.05365  [pdf, other

    cs.LG stat.AP stat.ME stat.ML

    Did we personalize? Assessing personalization by an online reinforcement learning algorithm using resampling

    Authors: Susobhan Ghosh, Raphael Kim, Prasidh Chhabria, Raaz Dwivedi, Predrag Klasnja, Peng Liao, Kelly Zhang, Susan Murphy

    Abstract: There is a growing interest in using reinforcement learning (RL) to personalize sequences of treatments in digital health to support users in adopting healthier behaviors. Such sequential decision-making problems involve decisions about when to treat and how to treat based on the user's context (e.g., prior activity level, location, etc.). Online RL is a promising data-driven approach for this pro… ▽ More

    Submitted 7 August, 2023; v1 submitted 11 April, 2023; originally announced April 2023.

    Comments: The first two authors contributed equally

  33. arXiv:2212.00863  [pdf, other

    cs.LG cs.AI

    Modeling Mobile Health Users as Reinforcement Learning Agents

    Authors: Eura Shin, Siddharth Swaroop, Weiwei Pan, Susan Murphy, Finale Doshi-Velez

    Abstract: Mobile health (mHealth) technologies empower patients to adopt/maintain healthy behaviors in their daily lives, by providing interventions (e.g. push notifications) tailored to the user's needs. In these settings, without intervention, human decision making may be impaired (e.g. valuing near term pleasure over own long term goals). In this work, we formalize this relationship with a framework in w… ▽ More

    Submitted 1 December, 2022; originally announced December 2022.

  34. arXiv:2211.14297  [pdf, ps, other

    stat.ML cs.LG

    Doubly robust nearest neighbors in factor models

    Authors: Raaz Dwivedi, Katherine Tian, Sabina Tomkins, Predrag Klasnja, Susan Murphy, Devavrat Shah

    Abstract: We introduce and analyze an improved variant of nearest neighbors (NN) for estimation with missing data in latent factor models. We consider a matrix completion problem with missing data, where the $(i, t)$-th entry, when observed, is given by its mean $f(u_i, v_t)$ plus mean-zero noise for an unknown function $f$ and latent factors $u_i$ and $v_t$. Prior NN strategies, like unit-unit NN, for esti… ▽ More

    Submitted 29 January, 2024; v1 submitted 25 November, 2022; originally announced November 2022.

  35. arXiv:2210.10441  [pdf, other

    cs.RO

    Efficient delivery of Robotics Programming educational content using Cloud Robotics

    Authors: Sean Murphy, Leonardo Militano, Giovanni Toffetti, Remo Maurer

    Abstract: In this paper, we report on our use of cloud-robotics solutions to teach a Robotics Applications Programming course at Zurich University of Applied Sciences (ZHAW). The usage of Kubernetes based cloud computing environment combined with real robots -- turtlebots and Niryo arms -- allowed us to: 1) minimize the set up times required to provide a Robotic Operating System (ROS) simulation and develop… ▽ More

    Submitted 19 October, 2022; originally announced October 2022.

    Comments: 7th International Conference on Robotics and Automation Engineering, ICRAE, 2022. arXiv admin note: text overlap with arXiv:2210.03936

  36. arXiv:2210.03936  [pdf, other

    cs.RO cs.AI cs.CV cs.DC cs.NI

    Cloud Native Robotic Applications with GPU Sharing on Kubernetes

    Authors: Giovanni Toffetti, Leonardo Militano, Seán Murphy, Remo Maurer, Mark Straub

    Abstract: In this paper we discuss our experience in teaching the Robotic Applications Programming course at ZHAW combining the use of a Kubernetes (k8s) cluster and real, heterogeneous, robotic hardware. We discuss the main advantages of our solutions in terms of seamless simulation-to-real experience for students and the main shortcomings we encountered with networking and sharing GPUs to support deep lea… ▽ More

    Submitted 31 October, 2022; v1 submitted 8 October, 2022; originally announced October 2022.

    Comments: Submission accepted at the IROS'22 Cloud Robotics Workshop

  37. arXiv:2208.07406  [pdf, other

    cs.AI cs.LG

    Reward Design For An Online Reinforcement Learning Algorithm Supporting Oral Self-Care

    Authors: Anna L. Trella, Kelly W. Zhang, Inbal Nahum-Shani, Vivek Shetty, Finale Doshi-Velez, Susan A. Murphy

    Abstract: Dental disease is one of the most common chronic diseases despite being largely preventable. However, professional advice on optimal oral hygiene practices is often forgotten or abandoned by patients. Therefore patients may benefit from timely and personalized encouragement to engage in oral self-care behaviors. In this paper, we develop an online reinforcement learning (RL) algorithm for use in o… ▽ More

    Submitted 14 September, 2022; v1 submitted 15 August, 2022; originally announced August 2022.

  38. Designing Reinforcement Learning Algorithms for Digital Interventions: Pre-implementation Guidelines

    Authors: Anna L. Trella, Kelly W. Zhang, Inbal Nahum-Shani, Vivek Shetty, Finale Doshi-Velez, Susan A. Murphy

    Abstract: Online reinforcement learning (RL) algorithms are increasingly used to personalize digital interventions in the fields of mobile health and online education. Common challenges in designing and testing an RL algorithm in these settings include ensuring the RL algorithm can learn and run stably under real-time constraints, and accounting for the complexity of the environment, e.g., a lack of accurat… ▽ More

    Submitted 18 August, 2022; v1 submitted 8 June, 2022; originally announced June 2022.

  39. arXiv:2203.00097  [pdf

    stat.ME cs.AI cs.LG econ.EM math.OC

    Estimating causal effects with optimization-based methods: A review and empirical comparison

    Authors: Martin Cousineau, Vedat Verter, Susan A. Murphy, Joelle Pineau

    Abstract: In the absence of randomized controlled and natural experiments, it is necessary to balance the distributions of (observable) covariates of the treated and control groups in order to obtain an unbiased estimate of a causal effect of interest; otherwise, a different effect size may be estimated, and incorrect recommendations may be given. To achieve this balance, there exist a wide variety of metho… ▽ More

    Submitted 28 February, 2022; originally announced March 2022.

    Comments: In Press, Corrected Proof

    Journal ref: European Journal of Operational Research, 2022, 14 pages

  40. arXiv:2202.07098  [pdf, ps, other

    cs.LG stat.ME

    Statistical Inference After Adaptive Sampling for Longitudinal Data

    Authors: Kelly W. Zhang, Lucas Janson, Susan A. Murphy

    Abstract: Online reinforcement learning and other adaptive sampling algorithms are increasingly used in digital intervention experiments to optimize treatment delivery for users over time. In this work, we focus on longitudinal user data collected by a large class of adaptive sampling algorithms that are designed to optimize treatment decisions online using accruing data from multiple users. Combining or "p… ▽ More

    Submitted 19 April, 2023; v1 submitted 14 February, 2022; originally announced February 2022.

    Comments: Fixing typos

  41. arXiv:2202.06891  [pdf, ps, other

    stat.ML cs.LG

    Counterfactual inference in sequential experiments

    Authors: Raaz Dwivedi, Katherine Tian, Sabina Tomkins, Predrag Klasnja, Susan Murphy, Devavrat Shah

    Abstract: We consider after-study statistical inference for sequentially designed experiments wherein multiple units are assigned treatments for multiple time points using treatment policies that adapt over time. Our goal is to provide inference guarantees for the counterfactual mean at the smallest possible scale -- mean outcome under different treatments for each unit and each time -- with minimal assumpt… ▽ More

    Submitted 8 June, 2025; v1 submitted 14 February, 2022; originally announced February 2022.

    Comments: Accepted at the Annals of Statistics

  42. arXiv:2110.02260  [pdf

    cs.RO eess.SY

    An Overview of the Drone Open-Source Ecosystem

    Authors: John Glossner, Samantha Murphy, Daniel Iancu

    Abstract: Unmanned aerial systems capable of beyond visual line of sight operation can be organized into a top-down hierarchy of layers including flight supervision, command and control, simulation of systems, operating systems, and physical hardware. Flight supervision includes unmanned air traffic management, flight planning, authorization, and remote identification. Command and control ensure drones can… ▽ More

    Submitted 5 October, 2021; originally announced October 2021.

  43. arXiv:2109.08134  [pdf, other

    cs.LG stat.ML

    Comparison and Unification of Three Regularization Methods in Batch Reinforcement Learning

    Authors: Sarah Rathnam, Susan A. Murphy, Finale Doshi-Velez

    Abstract: In batch reinforcement learning, there can be poorly explored state-action pairs resulting in poorly learned, inaccurate models and poorly performing associated policies. Various regularization methods can mitigate the problem of learning overly-complex models in Markov decision processes (MDPs), however they operate in technically and intuitively distinct ways and lack a common form in which to c… ▽ More

    Submitted 16 September, 2021; originally announced September 2021.

    Comments: ICML Workshop on Reinforcement Learning Theory 2021

  44. arXiv:2107.09949  [pdf, other

    cs.LG stat.ML

    Online structural kernel selection for mobile health

    Authors: Eura Shin, Pedja Klasnja, Susan Murphy, Finale Doshi-Velez

    Abstract: Motivated by the need for efficient and personalized learning in mobile health, we investigate the problem of online kernel selection for Gaussian Process regression in the multi-task setting. We propose a novel generative process on the kernel composition for this purpose. Our method demonstrates that trajectories of kernel evolutions can be transferred between users to improve learning and that… ▽ More

    Submitted 21 July, 2021; originally announced July 2021.

    Comments: Workshop paper in ICML IMLH 2021

  45. arXiv:2104.14074  [pdf, other

    cs.LG

    Statistical Inference with M-Estimators on Adaptively Collected Data

    Authors: Kelly W. Zhang, Lucas Janson, Susan A. Murphy

    Abstract: Bandit algorithms are increasingly used in real-world sequential decision-making problems. Associated with this is an increased desire to be able to use the resulting datasets to answer scientific questions like: Did one type of ad lead to more purchases? In which contexts is a mobile health intervention effective? However, classical statistical approaches fail to provide valid confidence interval… ▽ More

    Submitted 19 November, 2021; v1 submitted 28 April, 2021; originally announced April 2021.

    Journal ref: Advances in Neural Information Processing Systems, 2021

  46. arXiv:2012.11646  [pdf, other

    cs.LG cs.CY stat.ML

    Fast Physical Activity Suggestions: Efficient Hyperparameter Learning in Mobile Health

    Authors: Marianne Menictas, Sabina Tomkins, Susan Murphy

    Abstract: Users can be supported to adopt healthy behaviors, such as regular physical activity, via relevant and timely suggestions on their mobile devices. Recently, reinforcement learning algorithms have been found to be effective for learning the optimal context under which to provide suggestions. However, these algorithms are not necessarily designed for the constraints posed by mobile health (mHealth)… ▽ More

    Submitted 21 December, 2020; originally announced December 2020.

    Comments: Neurips 2020 workshop: Machine Learning in Mobile Health. arXiv admin note: substantial text overlap with arXiv:2003.12881

  47. arXiv:2008.03869  [pdf

    stat.ML cs.LG q-bio.QM

    Individualized Prediction of COVID-19 Adverse outcomes with MLHO

    Authors: Hossein Estiri, Zachary H. Strasser, Shawn N. Murphy

    Abstract: We developed MLHO (pronounced as melo), an end-to-end Machine Learning framework that leverages iterative feature and algorithm selection to predict Health Outcomes. MLHO implements iterative sequential representation mining, and feature and model selection, for predicting the patient-level risk of hospitalization, ICU admission, need for mechanical ventilation, and death. It bases this prediction… ▽ More

    Submitted 29 December, 2020; v1 submitted 9 August, 2020; originally announced August 2020.

  48. arXiv:2008.01571  [pdf, other

    cs.LG cs.CY stat.ML

    IntelligentPooling: Practical Thompson Sampling for mHealth

    Authors: Sabina Tomkins, Peng Liao, Predrag Klasnja, Susan Murphy

    Abstract: In mobile health (mHealth) smart devices deliver behavioral treatments repeatedly over time to a user with the goal of helping the user adopt and maintain healthy behaviors. Reinforcement learning appears ideal for learning how to optimally make these sequential treatment decisions. However, significant challenges must be overcome before reinforcement learning can be effectively deployed in a mobi… ▽ More

    Submitted 12 December, 2020; v1 submitted 31 July, 2020; originally announced August 2020.

    Comments: arXiv admin note: text overlap with arXiv:2002.09971

  49. arXiv:2005.05880  [pdf

    cs.HC stat.ME

    The Micro-Randomized Trial for Developing Digital Interventions: Experimental Design Considerations

    Authors: Ashley E. Walton, Linda M. Collins, Predrag Klasnja, Inbal Nahum-Shani, Mashfiqui Rabbi, Maureen A. Walton, Susan A. Murphy

    Abstract: Just-in-time adaptive interventions (JITAIs) are time-varying adaptive interventions that use frequent opportunities for the intervention to be adapted such as weekly, daily, or even many times a day. This high intensity of adaptation is facilitated by the ability of digital technology to continuously collect information about an individual's current context and deliver treatments adapted to this… ▽ More

    Submitted 23 April, 2020; originally announced May 2020.

    MSC Class: 62P15

  50. arXiv:2004.06230  [pdf, other

    cs.LG stat.ML

    Power Constrained Bandits

    Authors: Jiayu Yao, Emma Brunskill, Weiwei Pan, Susan Murphy, Finale Doshi-Velez

    Abstract: Contextual bandits often provide simple and effective personalization in decision making problems, making them popular tools to deliver personalized interventions in mobile health as well as other health applications. However, when bandits are deployed in the context of a scientific study -- e.g. a clinical trial to test if a mobile health intervention is effective -- the aim is not only to person… ▽ More

    Submitted 27 July, 2021; v1 submitted 13 April, 2020; originally announced April 2020.

    Comments: Accepted at MLHC 2021