
Showing 1–50 of 128 results for author: Doshi-velez, F

Searching in archive cs.
  1. arXiv:2506.13150  [pdf, ps, other]

    cs.LG math.OC stat.ML

    Federated ADMM from Bayesian Duality

    Authors: Thomas Möllenhoff, Siddharth Swaroop, Finale Doshi-Velez, Mohammad Emtiyaz Khan

    Abstract: ADMM is a popular method for federated deep learning which originated in the 1970s and, even though many new variants of it have been proposed since then, its core algorithmic structure has remained unchanged. Here, we take a major departure from the old structure and present a fundamentally new way to derive and extend federated ADMM. We propose to use a structure called Bayesian Duality which ex…

    Submitted 16 June, 2025; originally announced June 2025.

    Comments: Code is at https://github.com/team-approx-bayes/bayes-admm

  2. arXiv:2505.16833  [pdf, ps, other]

    cs.LG

    Strategically Linked Decisions in Long-Term Planning and Reinforcement Learning

    Authors: Alihan Hüyük, Finale Doshi-Velez

    Abstract: Long-term planning, as in reinforcement learning (RL), involves finding strategies: actions that collectively work toward a goal rather than individually optimizing their immediate outcomes. As part of a strategy, some actions are taken at the expense of short-term benefit to enable future actions with even greater returns. These actions are only advantageous if followed up by the actions they fac…

    Submitted 22 May, 2025; originally announced May 2025.

  3. arXiv:2501.17325  [pdf, other]

    cs.LG cs.AI stat.ML

    Connecting Federated ADMM to Bayes

    Authors: Siddharth Swaroop, Mohammad Emtiyaz Khan, Finale Doshi-Velez

    Abstract: We provide new connections between two distinct federated learning approaches based on (i) ADMM and (ii) Variational Bayes (VB), and propose new variants by combining their complementary strengths. Specifically, we show that the dual variables in ADMM naturally emerge through the 'site' parameters used in VB with isotropic Gaussian covariances. Using this, we derive two versions of ADMM from VB th…

    Submitted 28 February, 2025; v1 submitted 28 January, 2025; originally announced January 2025.

  4. arXiv:2412.18059  [pdf, other]

    cs.LG

    Diverse Concept Proposals for Concept Bottleneck Models

    Authors: Katrina Brown, Marton Havasi, Finale Doshi-Velez

    Abstract: Concept bottleneck models are interpretable predictive models that are often used in domains where model trust is a key priority, such as healthcare. They identify a small number of human-interpretable concepts in the data, which they then use to make predictions. Learning relevant concepts from data proves to be a challenging task. The most predictive concepts may not align with expert intuition,…

    Submitted 23 December, 2024; originally announced December 2024.

    Comments: Accepted to the ICML 2022 Workshop on Human-Machine Collaboration and Teaming

  5. arXiv:2412.02730  [pdf]

    cs.AI cs.CY cs.ET cs.LG

    Shaping AI's Impact on Billions of Lives

    Authors: Mariano-Florentino Cuéllar, Jeff Dean, Finale Doshi-Velez, John Hennessy, Andy Konwinski, Sanmi Koyejo, Pelonomi Moiloa, Emma Pierson, David Patterson

    Abstract: Artificial Intelligence (AI), like any transformative technology, has the potential to be a double-edged sword, leading either toward significant advancements or detrimental outcomes for society as a whole. As is often the case when it comes to widely-used technologies in market economies (e.g., cars and semiconductor chips), commercial interest tends to be the predominant guiding factor. The AI c…

    Submitted 11 December, 2024; v1 submitted 3 December, 2024; originally announced December 2024.

  6. arXiv:2411.05237  [pdf]

    cs.LG q-bio.QM stat.AP stat.CO stat.ML

    Pruning the Path to Optimal Care: Identifying Systematically Suboptimal Medical Decision-Making with Inverse Reinforcement Learning

    Authors: Inko Bovenzi, Adi Carmel, Michael Hu, Rebecca M. Hurwitz, Fiona McBride, Leo Benac, José Roberto Tello Ayala, Finale Doshi-Velez

    Abstract: In aims to uncover insights into medical decision-making embedded within observational data from clinical settings, we present a novel application of Inverse Reinforcement Learning (IRL) that identifies suboptimal clinician actions based on the actions of their peers. This approach centers two stages of IRL with an intermediate step to prune trajectories displaying behavior that deviates significa…

    Submitted 7 November, 2024; originally announced November 2024.

    Comments: 13 pages, 4 figures

  7. arXiv:2411.05174  [pdf, other]

    cs.LG cs.AI stat.ML

    Inverse Transition Learning: Learning Dynamics from Demonstrations

    Authors: Leo Benac, Abhishek Sharma, Sonali Parbhoo, Finale Doshi-Velez

    Abstract: We consider the problem of estimating the transition dynamics $T^*$ from near-optimal expert trajectories in the context of offline model-based reinforcement learning. We develop a novel constraint-based method, Inverse Transition Learning, that treats the limited coverage of the expert trajectories as a \emph{feature}: we use the fact that the expert is near-optimal to inform our estimate of…

    Submitted 7 November, 2024; originally announced November 2024.

  8. arXiv:2410.23880  [pdf, other]

    cs.LG

    Directly Optimizing Explanations for Desired Properties

    Authors: Hiwot Belay Tadesse, Alihan Hüyük, Weiwei Pan, Finale Doshi-Velez

    Abstract: When explaining black-box machine learning models, it's often important for explanations to have certain desirable properties. Most existing methods `encourage' desirable properties in their construction of explanations. In this work, we demonstrate that these forms of encouragement do not consistently create explanations with the properties that are supposedly being targeted. Moreover, they do no…

    Submitted 31 October, 2024; originally announced October 2024.

  9. arXiv:2410.09361  [pdf, other]

    cs.LG

    Decision-Point Guided Safe Policy Improvement

    Authors: Abhishek Sharma, Leo Benac, Sonali Parbhoo, Finale Doshi-Velez

    Abstract: Within batch reinforcement learning, safe policy improvement (SPI) seeks to ensure that the learnt policy performs at least as well as the behavior policy that generated the dataset. The core challenge in SPI is seeking improvements while balancing risk when many state-action pairs may be infrequently visited. In this work, we introduce Decision Points RL (DPRL), an algorithm that restricts the se…

    Submitted 12 October, 2024; originally announced October 2024.

  10. arXiv:2410.04253  [pdf, other]

    cs.HC cs.AI

    Contrastive Explanations That Anticipate Human Misconceptions Can Improve Human Decision-Making Skills

    Authors: Zana Buçinca, Siddharth Swaroop, Amanda E. Paluch, Finale Doshi-Velez, Krzysztof Z. Gajos

    Abstract: People's decision-making abilities often fail to improve or may even erode when they rely on AI for decision-support, even when the AI provides informative explanations. We argue this is partly because people intuitively seek contrastive explanations, which clarify the difference between the AI's decision and their own reasoning, while most AI systems offer "unilateral" explanations that justify t…

    Submitted 18 March, 2025; v1 submitted 5 October, 2024; originally announced October 2024.

  11. arXiv:2409.18051  [pdf, other]

    cs.LG

    Inverse Reinforcement Learning with Multiple Planning Horizons

    Authors: Jiayu Yao, Weiwei Pan, Finale Doshi-Velez, Barbara E Engelhardt

    Abstract: In this work, we study an inverse reinforcement learning (IRL) problem where the experts are planning under a shared reward function but with different, unknown planning horizons. Without the knowledge of discount factors, the reward function has a larger feasible solution set, which makes it harder for existing IRL approaches to identify a reward function. To overcome this challenge, we develop a…

    Submitted 26 September, 2024; originally announced September 2024.

    Comments: Accepted at RLC 2024

    Journal ref: Reinforcement Learning Journal 3 (2024) 1138-1167

  12. arXiv:2409.10526  [pdf, other]

    cs.CY cs.AI

    Effective Monitoring of Online Decision-Making Algorithms in Digital Intervention Implementation

    Authors: Anna L. Trella, Susobhan Ghosh, Erin E. Bonar, Lara Coughlin, Finale Doshi-Velez, Yongyi Guo, Pei-Yao Hung, Inbal Nahum-Shani, Vivek Shetty, Maureen Walton, Iris Yan, Kelly W. Zhang, Susan A. Murphy

    Abstract: Online AI decision-making algorithms are increasingly used by digital interventions to dynamically personalize treatment to individuals. These algorithms determine, in real-time, the delivery of treatment based on accruing data. The objective of this paper is to provide guidelines for enabling effective monitoring of online decision-making algorithms with the goal of (1) safeguarding individuals a…

    Submitted 30 August, 2024; originally announced September 2024.

  13. arXiv:2409.02069  [pdf, other]

    cs.AI cs.HC

    A Deployed Online Reinforcement Learning Algorithm In An Oral Health Clinical Trial

    Authors: Anna L. Trella, Kelly W. Zhang, Hinal Jajal, Inbal Nahum-Shani, Vivek Shetty, Finale Doshi-Velez, Susan A. Murphy

    Abstract: Dental disease is a prevalent chronic condition associated with substantial financial burden, personal suffering, and increased risk of systemic diseases. Despite widespread recommendations for twice-daily tooth brushing, adherence to recommended oral self-care behaviors remains sub-optimal due to factors such as forgetfulness and disengagement. To address this, we developed Oralytics, a mHealth i…

    Submitted 18 December, 2024; v1 submitted 3 September, 2024; originally announced September 2024.

  14. arXiv:2407.14845  [pdf, other]

    cs.LG cs.CL

    Understanding the Relationship between Prompts and Response Uncertainty in Large Language Models

    Authors: Ze Yu Zhang, Arun Verma, Finale Doshi-Velez, Bryan Kian Hsiang Low

    Abstract: Large language models (LLMs) are widely used in decision-making, but their reliability, especially in critical tasks like healthcare, is not well-established. Therefore, understanding how LLMs reason and make decisions is crucial for their safe deployment. This paper investigates how the uncertainty of responses generated by LLMs relates to the information provided in the input prompt. Leveraging…

    Submitted 24 February, 2025; v1 submitted 20 July, 2024; originally announced July 2024.

    Comments: 22 pages, Preprint

  15. arXiv:2406.08636  [pdf, other]

    cs.LG

    Towards Integrating Personal Knowledge into Test-Time Predictions

    Authors: Isaac Lage, Sonali Parbhoo, Finale Doshi-Velez

    Abstract: Machine learning (ML) models can make decisions based on large amounts of data, but they can be missing personal knowledge available to human users about whom predictions are made. For example, a model trained to predict psychiatric outcomes may know nothing about a patient's social support system, and social support may look different for different patients. In this work, we introduce the problem…

    Submitted 12 June, 2024; originally announced June 2024.

  16. arXiv:2406.00116  [pdf, other]

    cs.HC cs.LG

    A Sim2Real Approach for Identifying Task-Relevant Properties in Interpretable Machine Learning

    Authors: Eura Nofshin, Esther Brown, Brian Lim, Weiwei Pan, Finale Doshi-Velez

    Abstract: Explanations of an AI's function can assist human decision-makers, but the most useful explanation depends on the decision's context, referred to as the downstream task. User studies are necessary to determine the best explanations for each task. Unfortunately, testing every explanation and task combination is impractical, especially considering the many factors influencing human+AI collaboration…

    Submitted 18 September, 2024; v1 submitted 31 May, 2024; originally announced June 2024.

  17. arXiv:2404.14660  [pdf, ps, other]

    cs.CY cs.AI

    AI Procurement Checklists: Revisiting Implementation in the Age of AI Governance

    Authors: Tom Zick, Mason Kortz, David Eaves, Finale Doshi-Velez

    Abstract: Public sector use of AI has been quietly on the rise for the past decade, but only recently have efforts to regulate it entered the cultural zeitgeist. While simple to articulate, promoting ethical and effective roll outs of AI systems in government is a notoriously elusive task. On the one hand there are hard-to-address pitfalls associated with AI-based tools, including concerns about bias toward…

    Submitted 22 April, 2024; originally announced April 2024.

  18. arXiv:2403.08941  [pdf, other]

    stat.ML cs.LG

    Towards Model-Agnostic Posterior Approximation for Fast and Accurate Variational Autoencoders

    Authors: Yaniv Yacoby, Weiwei Pan, Finale Doshi-Velez

    Abstract: Inference for Variational Autoencoders (VAEs) consists of learning two models: (1) a generative model, which transforms a simple distribution over a latent space into the distribution over observed data, and (2) an inference model, which approximates the posterior of the latent codes given data. The two components are learned jointly via a lower bound to the generative model's log marginal likelih…

    Submitted 12 June, 2024; v1 submitted 13 March, 2024; originally announced March 2024.

    Comments: Accepted at the Workshop at the 6th Symposium on Advances in Approximate Bayesian Inference (AABI) 2024

  19. arXiv:2402.17003  [pdf, other]

    cs.LG cs.AI cs.CY

    Monitoring Fidelity of Online Reinforcement Learning Algorithms in Clinical Trials

    Authors: Anna L. Trella, Kelly W. Zhang, Inbal Nahum-Shani, Vivek Shetty, Iris Yan, Finale Doshi-Velez, Susan A. Murphy

    Abstract: Online reinforcement learning (RL) algorithms offer great potential for personalizing treatment for participants in clinical trials. However, deploying an online, autonomous algorithm in the high-stakes healthcare setting makes quality control and data quality especially difficult to achieve. This paper proposes algorithm fidelity as a critical requirement for deploying online RL algorithms in cli…

    Submitted 12 August, 2024; v1 submitted 26 February, 2024; originally announced February 2024.

  20. arXiv:2402.12737  [pdf, other]

    cs.LG

    Guarantee Regions for Local Explanations

    Authors: Marton Havasi, Sonali Parbhoo, Finale Doshi-Velez

    Abstract: Interpretability methods that utilise local surrogate models (e.g. LIME) are very good at describing the behaviour of the predictive model at a point of interest, but they are not guaranteed to extrapolate to the local region surrounding the point. However, overfitting to the local curvature of the predictive model and malicious tampering can significantly limit extrapolation. We propose an anchor…

    Submitted 20 February, 2024; originally announced February 2024.

  21. arXiv:2402.03110  [pdf, other]

    cs.LG cs.AI

    Non-Stationary Latent Auto-Regressive Bandits

    Authors: Anna L. Trella, Walter Dempsey, Asim H. Gazi, Ziping Xu, Finale Doshi-Velez, Susan A. Murphy

    Abstract: For the non-stationary multi-armed bandit (MAB) problem, many existing methods allow a general mechanism for the non-stationarity, but rely on a budget for the non-stationarity that is sub-linear to the total number of time steps $T$. In many real-world settings, however, the mechanism for the non-stationarity can be modeled, but there is no budget for the non-stationarity. We instead consider the…

    Submitted 27 February, 2025; v1 submitted 5 February, 2024; originally announced February 2024.

  22. arXiv:2401.16419  [pdf, other]

    cs.LG stat.ML

    Semi-parametric Expert Bayesian Network Learning with Gaussian Processes and Horseshoe Priors

    Authors: Yidou Weng, Finale Doshi-Velez

    Abstract: This paper proposes a model learning Semi-parametric relationships in an Expert Bayesian Network (SEBN) with linear parameter and structure constraints. We use Gaussian Processes and a Horseshoe prior to introduce minimal nonlinear components. To prioritize modifying the expert graph over adding new edges, we optimize differential Horseshoe scales. In real-world datasets with unknown truth, we gen…

    Submitted 29 January, 2024; originally announced January 2024.

    Comments: 8 pages, 4 figures, AAAI-2024 workshops

  23. arXiv:2401.14923  [pdf, other]

    cs.AI cs.LG

    Reinforcement Learning Interventions on Boundedly Rational Human Agents in Frictionful Tasks

    Authors: Eura Nofshin, Siddharth Swaroop, Weiwei Pan, Susan Murphy, Finale Doshi-Velez

    Abstract: Many important behavior changes are frictionful; they require individuals to expend effort over a long period with little immediate gratification. Here, an artificial intelligence (AI) agent can provide personalized interventions to help individuals stick to their goals. In these settings, the AI agent must personalize rapidly (before the individual disengages) and interpretably, to help us unders…

    Submitted 26 January, 2024; originally announced January 2024.

    Comments: In AAMAS 2024

  24. arXiv:2312.09983  [pdf, other]

    cs.LG cs.AI stat.ML

    Toward Computationally Efficient Inverse Reinforcement Learning via Reward Shaping

    Authors: Lauren H. Cooke, Harvey Klyne, Edwin Zhang, Cassidy Laidlaw, Milind Tambe, Finale Doshi-Velez

    Abstract: Inverse reinforcement learning (IRL) is computationally challenging, with common approaches requiring the solution of multiple reinforcement learning (RL) sub-problems. This work motivates the use of potential-based reward shaping to reduce the computational burden of each RL sub-problem. This work serves as a proof-of-concept and we hope will inspire future developments towards computationally ef…

    Submitted 18 December, 2023; v1 submitted 15 December, 2023; originally announced December 2023.

  25. arXiv:2309.11443  [pdf, other]

    cs.CV cs.LG

    Signature Activation: A Sparse Signal View for Holistic Saliency

    Authors: Jose Roberto Tello Ayala, Akl C. Fahed, Weiwei Pan, Eugene V. Pomerantsev, Patrick T. Ellinor, Anthony Philippakis, Finale Doshi-Velez

    Abstract: The adoption of machine learning in healthcare calls for model transparency and explainability. In this work, we introduce Signature Activation, a saliency method that generates holistic and class-agnostic explanations for Convolutional Neural Network (CNN) outputs. Our method exploits the fact that certain kinds of medical images, such as angiograms, have clear foreground and background objects.…

    Submitted 20 September, 2023; originally announced September 2023.

  26. arXiv:2309.00254  [pdf, other]

    cs.LG cs.CL cs.CR

    Why do universal adversarial attacks work on large language models?: Geometry might be the answer

    Authors: Varshini Subhash, Anna Bialas, Weiwei Pan, Finale Doshi-Velez

    Abstract: Transformer based large language models with emergent capabilities are becoming increasingly ubiquitous in society. However, the task of understanding and interpreting their internal workings, in the context of adversarial attacks, remains largely unsolved. Gradient-based universal adversarial attacks have been shown to be highly effective on large language models and potentially dangerous due to…

    Submitted 1 September, 2023; originally announced September 2023.

    Comments: 2nd AdvML Frontiers Workshop at 40th International Conference on Machine Learning, Honolulu, Hawaii, USA, 2023

  27. arXiv:2308.05075  [pdf, other]

    cs.LG

    Bayesian Inverse Transition Learning for Offline Settings

    Authors: Leo Benac, Sonali Parbhoo, Finale Doshi-Velez

    Abstract: Offline Reinforcement learning is commonly used for sequential decision-making in domains such as healthcare and education, where the rewards are known and the transition dynamics $T$ must be estimated on the basis of batch data. A key challenge for all tasks is how to learn a reliable estimate of the transition dynamics $T$ that produce near-optimal policies that are safe enough so that they neve…

    Submitted 9 August, 2023; originally announced August 2023.

    Comments: 8 pages, 1 plot, 2 tables

  28. arXiv:2308.01420  [pdf, other]

    cs.CL cs.LG

    SAP-sLDA: An Interpretable Interface for Exploring Unstructured Text

    Authors: Charumathi Badrinath, Weiwei Pan, Finale Doshi-Velez

    Abstract: A common way to explore text corpora is through low-dimensional projections of the documents, where one hopes that thematically similar documents will be clustered together in the projected space. However, popular algorithms for dimensionality reduction of text corpora, like Latent Dirichlet Allocation (LDA), often produce projections that do not capture human notions of document similarity. We pr…

    Submitted 28 July, 2023; originally announced August 2023.

  29. arXiv:2307.08169  [pdf, other]

    cs.LG cs.HC

    Discovering User Types: Mapping User Traits by Task-Specific Behaviors in Reinforcement Learning

    Authors: L. L. Ankile, B. S. Ham, K. Mao, E. Shin, S. Swaroop, F. Doshi-Velez, W. Pan

    Abstract: When assisting human users in reinforcement learning (RL), we can represent users as RL agents and study key parameters, called \emph{user traits}, to inform intervention design. We study the relationship between user behaviors (policy classes) and user traits. Given an environment, we introduce an intuitive tool for studying the breakdown of "user types": broad sets of traits that result in the s…

    Submitted 16 July, 2023; originally announced July 2023.

  30. arXiv:2307.06541  [pdf, other]

    cs.LG cs.AI

    On the Effective Horizon of Inverse Reinforcement Learning

    Authors: Yiqing Xu, Finale Doshi-Velez, David Hsu

    Abstract: Inverse reinforcement learning (IRL) algorithms often rely on (forward) reinforcement learning or planning, over a given time horizon, to compute an approximately optimal policy for a hypothesized reward function; they then match this policy with expert demonstrations. The time horizon plays a critical role in determining both the accuracy of reward estimates and the computational efficiency of IR…

    Submitted 20 February, 2025; v1 submitted 12 July, 2023; originally announced July 2023.

    Comments: 8 pages, accepted to AAMAS 2025

  31. arXiv:2306.12609  [pdf, other]

    cs.AI cs.CY

    Towards Regulatable AI Systems: Technical Gaps and Policy Opportunities

    Authors: Xudong Shen, Hannah Brown, Jiashu Tao, Martin Strobel, Yao Tong, Akshay Narayan, Harold Soh, Finale Doshi-Velez

    Abstract: There is increasing attention being given to how to regulate AI systems. As governing bodies grapple with what values to encapsulate into regulation, we consider the technical half of the question: To what extent can AI experts vet an AI system for adherence to regulatory requirements? We investigate this question through the lens of two public sector procurement checklists, identifying what we ca…

    Submitted 27 March, 2024; v1 submitted 21 June, 2023; originally announced June 2023.

    Comments: scheduled for publication in the Communications of the ACM, titled "Directions of Technical Innovation for Regulatable AI Systems"

  32. arXiv:2306.11208  [pdf, other]

    cs.LG cs.AI stat.ML

    The Unintended Consequences of Discount Regularization: Improving Regularization in Certainty Equivalence Reinforcement Learning

    Authors: Sarah Rathnam, Sonali Parbhoo, Weiwei Pan, Susan A. Murphy, Finale Doshi-Velez

    Abstract: Discount regularization, using a shorter planning horizon when calculating the optimal policy, is a popular choice to restrict planning to a less complex set of policies when estimating an MDP from sparse or noisy data (Jiang et al., 2015). It is commonly understood that discount regularization functions by de-emphasizing or ignoring delayed effects. In this paper, we reveal an alternate view of d…

    Submitted 19 June, 2023; originally announced June 2023.

  33. Accuracy-Time Tradeoffs in AI-Assisted Decision Making under Time Pressure

    Authors: Siddharth Swaroop, Zana Buçinca, Krzysztof Z. Gajos, Finale Doshi-Velez

    Abstract: In settings where users both need high accuracy and are time-pressured, such as doctors working in emergency rooms, we want to provide AI assistance that both increases decision accuracy and reduces decision-making time. Current literature focusses on how users interact with AI assistance when there is no time pressure, finding that different AI assistances have different benefits: some can reduce…

    Submitted 11 February, 2024; v1 submitted 12 June, 2023; originally announced June 2023.

  34. arXiv:2305.01738  [pdf, other]

    cs.LG cs.AI

    Leveraging Factored Action Spaces for Efficient Offline Reinforcement Learning in Healthcare

    Authors: Shengpu Tang, Maggie Makar, Michael W. Sjoding, Finale Doshi-Velez, Jenna Wiens

    Abstract: Many reinforcement learning (RL) applications have combinatorial action spaces, where each action is a composition of sub-actions. A standard RL approach ignores this inherent factorization structure, resulting in a potential failure to make meaningful inferences about rarely observed sub-action combinations; this is particularly problematic for offline settings, where data may be limited. In this…

    Submitted 2 May, 2023; originally announced May 2023.

    Comments: 30 pages, 18 figures, 2 tables. NeurIPS 2022. Code available at https://github.com/MLD3/OfflineRL_FactoredActions

  35. arXiv:2304.03365  [pdf, other]

    cs.LG cs.AI

    Decision-Focused Model-based Reinforcement Learning for Reward Transfer

    Authors: Abhishek Sharma, Sonali Parbhoo, Omer Gottesman, Finale Doshi-Velez

    Abstract: Model-based reinforcement learning (MBRL) provides a way to learn a transition model of the environment, which can then be used to plan personalized policies for different patient cohorts and to understand the dynamics involved in the decision-making process. However, standard MBRL algorithms are either sensitive to changes in the reward function or achieve suboptimal performance on the task when…

    Submitted 20 November, 2024; v1 submitted 6 April, 2023; originally announced April 2023.

    Comments: Machine Learning for Healthcare (MLHC) 2024

  36. arXiv:2212.00863  [pdf, other]

    cs.LG cs.AI

    Modeling Mobile Health Users as Reinforcement Learning Agents

    Authors: Eura Shin, Siddharth Swaroop, Weiwei Pan, Susan Murphy, Finale Doshi-Velez

    Abstract: Mobile health (mHealth) technologies empower patients to adopt/maintain healthy behaviors in their daily lives, by providing interventions (e.g. push notifications) tailored to the user's needs. In these settings, without intervention, human decision making may be impaired (e.g. valuing near term pleasure over own long term goals). In this work, we formalize this relationship with a framework in w…

    Submitted 1 December, 2022; originally announced December 2022.

  37. arXiv:2211.09184  [pdf, other]

    stat.ML cs.LG

    An Empirical Analysis of the Advantages of Finite- v.s. Infinite-Width Bayesian Neural Networks

    Authors: Jiayu Yao, Yaniv Yacoby, Beau Coker, Weiwei Pan, Finale Doshi-Velez

    Abstract: Comparing Bayesian neural networks (BNNs) with different widths is challenging because, as the width increases, multiple model properties change simultaneously, and, inference in the finite-width case is intractable. In this work, we empirically compare finite- and infinite-width BNNs, and provide quantitative and qualitative explanations for their performance difference. We find that when the mod…

    Submitted 28 November, 2022; v1 submitted 16 November, 2022; originally announced November 2022.

  38. arXiv:2211.07719  [pdf, other]

    cs.LG cs.HC

    (When) Are Contrastive Explanations of Reinforcement Learning Helpful?

    Authors: Sanjana Narayanan, Isaac Lage, Finale Doshi-Velez

    Abstract: Global explanations of a reinforcement learning (RL) agent's expected behavior can make it safer to deploy. However, such explanations are often difficult to understand because of the complicated nature of many RL policies. Effective human explanations are often contrastive, referencing a known contrast (policy) to reduce redundancy. At the same time, these explanations also require the additional…

    Submitted 14 November, 2022; originally announced November 2022.

    Comments: Accepted to NeurIPS 2022 workshop on Human in the Loop Learning

  39. arXiv:2211.05667  [pdf, ps, other]

    cs.LG

    What Makes a Good Explanation?: A Harmonized View of Properties of Explanations

    Authors: Zixi Chen, Varshini Subhash, Marton Havasi, Weiwei Pan, Finale Doshi-Velez

    Abstract: Interpretability provides a means for humans to verify aspects of machine learning (ML) models and empower human+ML teaming in situations where the task cannot be fully automated. Different contexts require explanations with different properties. For example, the kind of explanation required to determine if an early cardiac arrest warning system is ready to be integrated into a care setting is ver…

    Submitted 12 July, 2024; v1 submitted 10 November, 2022; originally announced November 2022.

    Comments: Short version accepted at NeurIPS 2022 workshops on Progress and Challenges in Building Trustworthy Embodied AI and Trustworthy and Socially Responsible Machine Learning

  40. arXiv:2210.15767  [pdf]

    cs.AI

    Gathering Strength, Gathering Storms: The One Hundred Year Study on Artificial Intelligence (AI100) 2021 Study Panel Report

    Authors: Michael L. Littman, Ifeoma Ajunwa, Guy Berger, Craig Boutilier, Morgan Currie, Finale Doshi-Velez, Gillian Hadfield, Michael C. Horowitz, Charles Isbell, Hiroaki Kitano, Karen Levy, Terah Lyons, Melanie Mitchell, Julie Shah, Steven Sloman, Shannon Vallor, Toby Walsh

    Abstract: In September 2021, the "One Hundred Year Study on Artificial Intelligence" project (AI100) issued the second report of its planned long-term periodic assessment of artificial intelligence (AI) and its impact on society. It was written by a panel of 17 study authors, each of whom is deeply rooted in AI research, chaired by Michael Littman of Brown University. The report, entitled "Gathering Strengt…

    Submitted 27 October, 2022; originally announced October 2022.

    Comments: 82 pages, https://ai100.stanford.edu/gathering-strength-gathering-storms-one-hundred-year-study-artificial-intelligence-ai100-2021-study

  41. Towards Robust Off-Policy Evaluation via Human Inputs

    Authors: Harvineet Singh, Shalmali Joshi, Finale Doshi-Velez, Himabindu Lakkaraju

    Abstract: Off-policy Evaluation (OPE) methods are crucial tools for evaluating policies in high-stakes domains such as healthcare, where direct deployment is often infeasible, unethical, or expensive. When deployment environments are expected to undergo changes (that is, dataset shifts), it is important for OPE methods to perform robust evaluation of the policies amidst such changes. Existing approaches con…

    Submitted 18 September, 2022; originally announced September 2022.

    Comments: 10 pages, 5 figures, 1 table. Appeared at AIES '22: Proceedings of the 2022 AAAI/ACM Conference on AI, Ethics, and Society. Expanded version of arXiv:2103.15933

  42. arXiv:2208.07406  [pdf, other]

    cs.AI cs.LG

    Reward Design For An Online Reinforcement Learning Algorithm Supporting Oral Self-Care

    Authors: Anna L. Trella, Kelly W. Zhang, Inbal Nahum-Shani, Vivek Shetty, Finale Doshi-Velez, Susan A. Murphy

    Abstract: Dental disease is one of the most common chronic diseases despite being largely preventable. However, professional advice on optimal oral hygiene practices is often forgotten or abandoned by patients. Therefore patients may benefit from timely and personalized encouragement to engage in oral self-care behaviors. In this paper, we develop an online reinforcement learning (RL) algorithm for use in o…

    Submitted 14 September, 2022; v1 submitted 15 August, 2022; originally announced August 2022.

  43. arXiv:2208.01705  [pdf, other]

    cs.LG

    Success of Uncertainty-Aware Deep Models Depends on Data Manifold Geometry

    Authors: Mark Penrod, Harrison Termotto, Varshini Reddy, Jiayu Yao, Finale Doshi-Velez, Weiwei Pan

    Abstract: For responsible decision making in safety-critical settings, machine learning models must effectively detect and process edge-case data. Although existing works show that predictive uncertainty is useful for these tasks, it is not evident from literature which uncertainty-aware models are best suited for a given dataset. Thus, we compare six uncertainty-aware deep learning models on a set of edge-…

    Submitted 5 August, 2022; v1 submitted 2 August, 2022; originally announced August 2022.

    ACM Class: I.2.6

    Journal ref: International Conference on Machine Learning. PMLR 162 (2022)

  44. arXiv:2208.00250  [pdf, other]

    cs.LG cs.AI

    A Bayesian Approach to Learning Bandit Structure in Markov Decision Processes

    Authors: Kelly W. Zhang, Omer Gottesman, Finale Doshi-Velez

    Abstract: In the reinforcement learning literature, there are many algorithms developed for either Contextual Bandit (CB) or Markov Decision Processes (MDP) environments. However, when deploying reinforcement learning algorithms in the real world, even with domain expertise, it is often difficult to know whether it is appropriate to treat a sequential decision making problem as a CB or an MDP. In other word…

    Submitted 30 July, 2022; originally announced August 2022.

    Comments: Challenges of Real-World Reinforcement Learning 2020 (NeurIPS Workshop)

  45. arXiv:2207.06269  [pdf, other]

    cs.LG

    Policy Optimization with Sparse Global Contrastive Explanations

    Authors: Jiayu Yao, Sonali Parbhoo, Weiwei Pan, Finale Doshi-Velez

    Abstract: We develop a Reinforcement Learning (RL) framework for improving an existing behavior policy via sparse, user-interpretable changes. Our goal is to make minimal changes while gaining as much benefit as possible. We define a minimal change as having a sparse, global contrastive explanation between the original and proposed policy. We improve the current policy with the constraint of keeping that gl…

    Submitted 13 July, 2022; originally announced July 2022.

    Comments: Accepted at IMLH Workshop, ICML 2022

  46. arXiv:2206.10847  [pdf, other]

    cs.AI cs.HC

    Connecting Algorithmic Research and Usage Contexts: A Perspective of Contextualized Evaluation for Explainable AI

    Authors: Q. Vera Liao, Yunfeng Zhang, Ronny Luss, Finale Doshi-Velez, Amit Dhurandhar

    Abstract: Recent years have seen a surge of interest in the field of explainable AI (XAI), with a plethora of algorithms proposed in the literature. However, a lack of consensus on how to evaluate XAI hinders the advancement of the field. We highlight that XAI is not a monolithic set of technologies -- researchers and practitioners have begun to leverage XAI algorithms to build XAI systems that serve differ…

    Submitted 20 September, 2022; v1 submitted 22 June, 2022; originally announced June 2022.

    Comments: Forthcoming for AAAI HCOMP 2022

  47. Designing Reinforcement Learning Algorithms for Digital Interventions: Pre-implementation Guidelines

    Authors: Anna L. Trella, Kelly W. Zhang, Inbal Nahum-Shani, Vivek Shetty, Finale Doshi-Velez, Susan A. Murphy

    Abstract: Online reinforcement learning (RL) algorithms are increasingly used to personalize digital interventions in the fields of mobile health and online education. Common challenges in designing and testing an RL algorithm in these settings include ensuring the RL algorithm can learn and run stably under real-time constraints, and accounting for the complexity of the environment, e.g., a lack of accurat…

    Submitted 18 August, 2022; v1 submitted 8 June, 2022; originally announced June 2022.

  48. arXiv:2204.03208  [pdf, other]

    cs.IR cs.CL cs.LG stat.ML

    A Joint Learning Approach for Semi-supervised Neural Topic Modeling

    Authors: Jeffrey Chiu, Rajat Mittal, Neehal Tumma, Abhishek Sharma, Finale Doshi-Velez

    Abstract: Topic models are some of the most popular ways to represent textual data in an interpretable manner. Recently, advances in deep generative models, specifically auto-encoding variational Bayes (AEVB), have led to the introduction of unsupervised neural topic models, which leverage deep generative models as opposed to traditional statistics-based topic models. We extend upon these neural topic mode…

    Submitted 7 April, 2022; originally announced April 2022.

    Comments: To appear in the 6th ACL Workshop on Structured Prediction for NLP (SPNLP)

  49. arXiv:2202.11670  [pdf, other]

    cs.LG stat.ML

    Wide Mean-Field Bayesian Neural Networks Ignore the Data

    Authors: Beau Coker, Wessel P. Bruinsma, David R. Burt, Weiwei Pan, Finale Doshi-Velez

    Abstract: Bayesian neural networks (BNNs) combine the expressive power of deep learning with the advantages of Bayesian formalism. In recent years, the analysis of wide, deep BNNs has provided theoretical insight into their priors and posteriors. However, we have no analogous insight into their posteriors under approximate inference. In this work, we show that mean-field variational inference entirely fails…

    Submitted 23 February, 2022; originally announced February 2022.

  50. arXiv:2201.08262  [pdf, other]

    cs.LG stat.ML

    Generalizing Off-Policy Evaluation From a Causal Perspective For Sequential Decision-Making

    Authors: Sonali Parbhoo, Shalmali Joshi, Finale Doshi-Velez

    Abstract: Assessing the effects of a policy based on observational data from a different policy is a common problem across several high-stake decision-making domains, and several off-policy evaluation (OPE) techniques have been proposed. However, these methods largely formulate OPE as a problem disassociated from the process used to generate the data (i.e. structural assumptions in the form of a causal grap…

    Submitted 20 January, 2022; originally announced January 2022.