Showing 1–16 of 16 results for author: Tambe, M

Searching in archive stat.
  1. arXiv:2402.14090

    cs.AI econ.GN stat.ML

    Social Environment Design

    Authors: Edwin Zhang, Sadie Zhao, Tonghan Wang, Safwan Hossain, Henry Gasztowtt, Stephan Zheng, David C. Parkes, Milind Tambe, Yiling Chen

    Abstract: Artificial Intelligence (AI) holds promise as a technology that can be used to improve government and economic policy-making. This paper proposes a new research agenda towards this end by introducing Social Environment Design, a general framework for the use of AI for automated policy-making that connects with the Reinforcement Learning, EconCS, and Computational Social Choice communities. The fra…

    Submitted 17 June, 2024; v1 submitted 21 February, 2024; originally announced February 2024.

    Comments: ICML 2024 Position Paper. Website at https://sed.eddie.win

  2. arXiv:2402.11771

    cs.LG cs.AI stat.ME stat.ML

    Evaluating the Effectiveness of Index-Based Treatment Allocation

    Authors: Niclas Boehmer, Yash Nair, Sanket Shah, Lucas Janson, Aparna Taneja, Milind Tambe

    Abstract: When resources are scarce, an allocation policy is needed to decide who receives a resource. This problem occurs, for instance, when allocating scarce medical resources and is often solved using modern ML methods. This paper introduces methods to evaluate index-based allocation policies -- that allocate a fixed number of resources to those who need them the most -- by using data from a randomized…

    Submitted 18 February, 2024; originally announced February 2024.

  3. arXiv:2402.04933

    cs.LG stat.AP

    Context in Public Health for Underserved Communities: A Bayesian Approach to Online Restless Bandits

    Authors: Biyonka Liang, Lily Xu, Aparna Taneja, Milind Tambe, Lucas Janson

    Abstract: Public health programs often provide interventions to encourage program adherence, and effectively allocating interventions is vital for producing the greatest overall health outcomes, especially in underserved communities where resources are limited. Such resource allocation problems are often modeled as restless multi-armed bandits (RMABs) with unknown underlying transition dynamics, hence requi…

    Submitted 5 February, 2025; v1 submitted 7 February, 2024; originally announced February 2024.

    Comments: 29 pages, 18 figures

  4. arXiv:2312.09983

    cs.LG cs.AI stat.ML

    Toward Computationally Efficient Inverse Reinforcement Learning via Reward Shaping

    Authors: Lauren H. Cooke, Harvey Klyne, Edwin Zhang, Cassidy Laidlaw, Milind Tambe, Finale Doshi-Velez

    Abstract: Inverse reinforcement learning (IRL) is computationally challenging, with common approaches requiring the solution of multiple reinforcement learning (RL) sub-problems. This work motivates the use of potential-based reward shaping to reduce the computational burden of each RL sub-problem. This work serves as a proof-of-concept and we hope will inspire future developments towards computationally ef…

    Submitted 18 December, 2023; v1 submitted 15 December, 2023; originally announced December 2023.

  5. arXiv:2305.12640

    cs.AI cs.LG stat.ML

    Limited Resource Allocation in a Non-Markovian World: The Case of Maternal and Child Healthcare

    Authors: Panayiotis Danassis, Shresth Verma, Jackson A. Killian, Aparna Taneja, Milind Tambe

    Abstract: The success of many healthcare programs depends on participants' adherence. We consider the problem of scheduling interventions in low resource settings (e.g., placing timely support calls from health workers) to increase adherence and/or engagement. Past works have successfully developed several classes of Restless Multi-armed Bandit (RMAB) based solutions for this problem. Nevertheless, all past…

    Submitted 21 May, 2023; originally announced May 2023.

    Comments: Proceedings of the 32nd International Joint Conference on Artificial Intelligence (IJCAI 2023)

  6. arXiv:2302.02570

    cs.AI cs.LG stat.ME stat.ML

    Improved Policy Evaluation for Randomized Trials of Algorithmic Resource Allocation

    Authors: Aditya Mate, Bryan Wilder, Aparna Taneja, Milind Tambe

    Abstract: We consider the task of evaluating policies of algorithmic resource allocation through randomized controlled trials (RCTs). Such policies are tasked with optimizing the utilization of limited intervention resources, with the goal of maximizing the benefits derived. Evaluation of such allocation policies through RCTs proves difficult, notwithstanding the scale of the trial, because the individuals'…

    Submitted 6 February, 2023; originally announced February 2023.

  7. arXiv:2210.00025

    cs.LG stat.ML

    Artificial Replay: A Meta-Algorithm for Harnessing Historical Data in Bandits

    Authors: Siddhartha Banerjee, Sean R. Sinclair, Milind Tambe, Lily Xu, Christina Lee Yu

    Abstract: Most real-world deployments of bandit algorithms exist somewhere in between the offline and online set-up, where some historical data is available upfront and additional data is collected dynamically online. How best to incorporate historical data to "warm start" bandit algorithms is an open question: naively initializing reward estimates using all historical samples can suffer from spurious data…

    Submitted 19 March, 2025; v1 submitted 30 September, 2022; originally announced October 2022.

    Comments: 55 pages (30 pages main paper), 9 figures

  8. arXiv:2107.03003

    cs.LG cs.AI stat.ML

    Harnessing Heterogeneity: Learning from Decomposed Feedback in Bayesian Modeling

    Authors: Kai Wang, Bryan Wilder, Sze-chuan Suen, Bistra Dilkina, Milind Tambe

    Abstract: There is significant interest in learning and optimizing a complex system composed of multiple sub-components, where these components may be agents or autonomous sensors. Among the rich literature on this topic, agent-based and domain-specific simulations can capture complex dynamics and subgroup interaction, but optimizing over such simulations can be computationally and algorithmically challengi…

    Submitted 6 July, 2021; originally announced July 2021.

  9. arXiv:2009.06560

    cs.LG stat.ML

    Dual-Mandate Patrols: Multi-Armed Bandits for Green Security

    Authors: Lily Xu, Elizabeth Bondi, Fei Fang, Andrew Perrault, Kai Wang, Milind Tambe

    Abstract: Conservation efforts in green security domains to protect wildlife and forests are constrained by the limited availability of defenders (i.e., patrollers), who must patrol vast areas to protect from attackers (e.g., poachers or illegal loggers). Defenders must choose how much time to spend in each region of the protected area, balancing exploration of infrequently visited regions and exploitation…

    Submitted 26 April, 2024; v1 submitted 14 September, 2020; originally announced September 2020.

    Comments: Published at AAAI 2021. 9 pages (paper and references), 3 page appendix. 6 figures and 1 table

  10. arXiv:2009.05863

    stat.ME cs.AI

    Tracking disease outbreaks from sparse data with Bayesian inference

    Authors: Bryan Wilder, Michael J. Mina, Milind Tambe

    Abstract: The COVID-19 pandemic provides new motivation for a classic problem in epidemiology: estimating the empirical rate of transmission during an outbreak (formally, the time-varying reproduction number) from case counts. While standard methods exist, they work best at coarse-grained national or state scales with abundant data, and struggle to accommodate the partial observability and sparse data commo…

    Submitted 12 September, 2020; originally announced September 2020.

    Report number: Accepted at AAAI 2021

  11. arXiv:2007.04432

    cs.LG cs.AI stat.ML

    Collapsing Bandits and Their Application to Public Health Interventions

    Authors: Aditya Mate, Jackson A. Killian, Haifeng Xu, Andrew Perrault, Milind Tambe

    Abstract: We propose and study Collapsing Bandits, a new restless multi-armed bandit (RMAB) setting in which each arm follows a binary-state Markovian process with a special structure: when an arm is played, the state is fully observed, thus "collapsing" any uncertainty, but when an arm is passive, no observation is made, thus allowing uncertainty to evolve. The goal is to keep as many arms in the "good" st…

    Submitted 4 July, 2020; originally announced July 2020.

  12. arXiv:2006.10815

    cs.LG stat.ML

    Automatically Learning Compact Quality-aware Surrogates for Optimization Problems

    Authors: Kai Wang, Bryan Wilder, Andrew Perrault, Milind Tambe

    Abstract: Solving optimization problems with unknown parameters often requires learning a predictive model to predict the values of the unknown parameters and then solving the problem using these values. Recent work has shown that including the optimization problem as a layer in the model training pipeline results in predictions of the unobserved parameters that lead to higher decision quality. Unfortunatel…

    Submitted 22 October, 2020; v1 submitted 18 June, 2020; originally announced June 2020.

  13. arXiv:1905.13732

    cs.LG cs.SI stat.ML

    End to end learning and optimization on graphs

    Authors: Bryan Wilder, Eric Ewing, Bistra Dilkina, Milind Tambe

    Abstract: Real-world applications often combine learning and optimization problems on graphs. For instance, our objective may be to cluster the graph in order to detect meaningful communities (or solve other common graph optimization problems such as facility location, maxcut, and so on). However, graphs or related attributes are often only partially observed, introducing learning problems such as link pred…

    Submitted 8 January, 2020; v1 submitted 31 May, 2019; originally announced May 2019.

    Comments: Accepted at NeurIPS 2019

  14. arXiv:1903.06669

    stat.AP cs.AI cs.LG

    Stay Ahead of Poachers: Illegal Wildlife Poaching Prediction and Patrol Planning Under Uncertainty with Field Test Evaluations

    Authors: Lily Xu, Shahrzad Gholami, Sara Mc Carthy, Bistra Dilkina, Andrew Plumptre, Milind Tambe, Rohit Singh, Mustapha Nsubuga, Joshua Mabonga, Margaret Driciru, Fred Wanyama, Aggrey Rwetsiba, Tom Okello, Eric Enyel

    Abstract: Illegal wildlife poaching threatens ecosystems and drives endangered species toward extinction. However, efforts for wildlife protection are constrained by the limited resources of law enforcement agencies. To help combat poaching, the Protection Assistant for Wildlife Security (PAWS) is a machine learning pipeline that has been developed as a data-driven approach to identify areas at high risk of…

    Submitted 5 November, 2019; v1 submitted 8 March, 2019; originally announced March 2019.

    Comments: 12 pages, 11 figures. Short paper published in ICDE 2020

  15. arXiv:1902.01506

    cs.LG cs.CY stat.ML

    Learning to Prescribe Interventions for Tuberculosis Patients Using Digital Adherence Data

    Authors: Jackson A. Killian, Bryan Wilder, Amit Sharma, Daksha Shah, Vinod Choudhary, Bistra Dilkina, Milind Tambe

    Abstract: Digital Adherence Technologies (DATs) are an increasingly popular method for verifying patient adherence to many medications. We analyze data from one city served by 99DOTS, a phone-call-based DAT deployed for Tuberculosis (TB) treatment in India where nearly 3 million people are afflicted with the disease each year. The data contains nearly 17,000 patients and 2.1M dose records. We lay the ground…

    Submitted 24 June, 2019; v1 submitted 4 February, 2019; originally announced February 2019.

    Comments: 10 pages, 6 figures

    Journal ref: KDD 2019: The 25th ACM SIGKDD Conference on Knowledge Discovery and Data Mining

  16. arXiv:1809.05504

    cs.LG cs.AI stat.ML

    Melding the Data-Decisions Pipeline: Decision-Focused Learning for Combinatorial Optimization

    Authors: Bryan Wilder, Bistra Dilkina, Milind Tambe

    Abstract: Creating impact in real-world settings requires artificial intelligence techniques to span the full pipeline from data, to predictive models, to decisions. These components are typically approached separately: a machine learning model is first trained via a measure of predictive accuracy, and then its predictions are used as input into an optimization algorithm which produces a decision. However,…

    Submitted 20 November, 2018; v1 submitted 14 September, 2018; originally announced September 2018.

    Comments: Full version of paper accepted at AAAI 2019