Skip to main content

Showing 1–27 of 27 results for author: Sunberg, Z

Searching in archive cs. Search in all archives.
.
  1. arXiv:2411.14404  [pdf, other

    cs.AI cs.RO

    Resolving Multiple-Dynamic Model Uncertainty in Hypothesis-Driven Belief-MDPs

    Authors: Ofer Dagan, Tyler Becker, Zachary N. Sunberg

    Abstract: When human operators of cyber-physical systems encounter surprising behavior, they often consider multiple hypotheses that might explain it. In some cases, taking information-gathering actions such as additional measurements or control inputs given to the system can help resolve uncertainty and determine the most accurate hypothesis. The task of optimizing these actions can be formulated as a beli… ▽ More

    Submitted 21 November, 2024; originally announced November 2024.

    Comments: 8 pages, 4 figures, submitted to AAMAS 2025

  2. arXiv:2409.16392  [pdf, other

    cs.AI cs.LG cs.RO

    Rao-Blackwellized POMDP Planning

    Authors: Jiho Lee, Nisar R. Ahmed, Kyle H. Wray, Zachary N. Sunberg

    Abstract: Partially Observable Markov Decision Processes (POMDPs) provide a structured framework for decision-making under uncertainty, but their application requires efficient belief updates. Sequential Importance Resampling Particle Filters (SIRPF), also known as Bootstrap Particle Filters, are commonly used as belief updaters in large approximate POMDP solvers, but they face challenges such as particle d… ▽ More

    Submitted 3 March, 2025; v1 submitted 24 September, 2024; originally announced September 2024.

  3. arXiv:2406.02871  [pdf, other

    cs.AI cs.LO cs.RO eess.SY

    Sound Heuristic Search Value Iteration for Undiscounted POMDPs with Reachability Objectives

    Authors: Qi Heng Ho, Martin S. Feather, Federico Rossi, Zachary N. Sunberg, Morteza Lahijanian

    Abstract: Partially Observable Markov Decision Processes (POMDPs) are powerful models for sequential decision making under transition and observation uncertainties. This paper studies the challenging yet important problem in POMDPs known as the (indefinite-horizon) Maximal Reachability Probability Problem (MRPP), where the goal is to maximize the probability of reaching some target states. This is also a co… ▽ More

    Submitted 4 June, 2024; originally announced June 2024.

    Comments: Accepted to the Conference on Uncertainty in Artificial Intelligence (UAI) 2024

  4. arXiv:2405.18703  [pdf, other

    cs.GT

    Bridging the Gap between Partially Observable Stochastic Games and Sparse POMDP Methods

    Authors: Tyler Becker, Zachary Sunberg

    Abstract: Many real-world decision problems involve the interaction of multiple self-interested agents with limited sensing ability. The partially observable stochastic game (POSG) provides a mathematical framework for modeling these problems, however solving a POSG requires difficult reasoning over two critical factors: (1) information revealed by partial observations and (2) decisions other agents make. I… ▽ More

    Submitted 28 October, 2024; v1 submitted 28 May, 2024; originally announced May 2024.

  5. arXiv:2403.19760  [pdf, other

    cs.AI cs.HC

    Leveraging Counterfactual Paths for Contrastive Explanations of POMDP Policies

    Authors: Benjamin Kraske, Zakariya Laouar, Zachary Sunberg

    Abstract: As humans come to rely on autonomous systems more, ensuring the transparency of such systems is important to their continued adoption. Explainable Artificial Intelligence (XAI) aims to reduce confusion and foster trust in systems by providing explanations of agent behavior. Partially observable Markov decision processes (POMDPs) provide a flexible framework capable of reasoning over transition and… ▽ More

    Submitted 28 March, 2024; originally announced March 2024.

    Comments: 5 pages, 1 figure

  6. arXiv:2402.15997  [pdf, other

    cs.HC cs.GR cs.LG

    Cieran: Designing Sequential Colormaps via In-Situ Active Preference Learning

    Authors: Matt-Heun Hong, Zachary N. Sunberg, Danielle Albers Szafir

    Abstract: Quality colormaps can help communicate important data patterns. However, finding an aesthetically pleasing colormap that looks "just right" for a given scenario requires significant design and technical expertise. We introduce Cieran, a tool that allows any data analyst to rapidly find quality colormaps while designing charts within Jupyter Notebooks. Our system employs an active preference learni… ▽ More

    Submitted 29 February, 2024; v1 submitted 25 February, 2024; originally announced February 2024.

    Comments: CHI 2024. 12 pages/9 figures

  7. arXiv:2310.09688  [pdf, other

    cs.AI cs.RO

    Recursively-Constrained Partially Observable Markov Decision Processes

    Authors: Qi Heng Ho, Tyler Becker, Benjamin Kraske, Zakariya Laouar, Martin S. Feather, Federico Rossi, Morteza Lahijanian, Zachary N. Sunberg

    Abstract: Many sequential decision problems involve optimizing one objective function while imposing constraints on other objectives. Constrained Partially Observable Markov Decision Processes (C-POMDP) model this case with transition uncertainty and partial observability. In this work, we first show that C-POMDPs violate the optimal substructure property over successive decision steps and thus may exhibit… ▽ More

    Submitted 4 June, 2024; v1 submitted 14 October, 2023; originally announced October 2023.

    Comments: Accepted to the Conference on Uncertainty in Artificial Intelligence (UAI) 2024

  8. arXiv:2309.06395  [pdf, other

    cs.RO cs.HC

    Human-Centered Autonomy for UAS Target Search

    Authors: Hunter M. Ray, Zakariya Laouar, Zachary Sunberg, Nisar Ahmed

    Abstract: Current methods of deploying robots that operate in dynamic, uncertain environments, such as Uncrewed Aerial Systems in search \& rescue missions, require nearly continuous human supervision for vehicle guidance and operation. These methods do not consider high-level mission context resulting in cumbersome manual operation or inefficient exhaustive search patterns. We present a human-centered auto… ▽ More

    Submitted 6 March, 2024; v1 submitted 12 September, 2023; originally announced September 2023.

    Comments: Accepted to ICRA 2024. 9 pages, 5 figures

  9. arXiv:2305.00931  [pdf, other

    cs.AI cs.HC cs.LG

    Explanation through Reward Model Reconciliation using POMDP Tree Search

    Authors: Benjamin D. Kraske, Anshu Saksena, Anna L. Buczak, Zachary N. Sunberg

    Abstract: As artificial intelligence (AI) algorithms are increasingly used in mission-critical applications, promoting user-trust of these systems will be essential to their success. Ensuring users understand the models over which algorithms reason promotes user trust. This work seeks to reconcile differences between the reward model that an algorithm uses for online partially observable Markov decision (PO… ▽ More

    Submitted 1 May, 2023; originally announced May 2023.

  10. arXiv:2304.06876  [pdf, other

    eess.SY cs.AI cs.RO

    Sampling-based Reactive Synthesis for Nondeterministic Hybrid Systems

    Authors: Qi Heng Ho, Zachary N. Sunberg, Morteza Lahijanian

    Abstract: This paper introduces a sampling-based strategy synthesis algorithm for nondeterministic hybrid systems with complex continuous dynamics under temporal and reachability constraints. We model the evolution of the hybrid system as a two-player game, where the nondeterminism is an adversarial player whose objective is to prevent achieving temporal and reachability goals. The aim is to synthesize a wi… ▽ More

    Submitted 23 December, 2023; v1 submitted 13 April, 2023; originally announced April 2023.

    Comments: Published in IEEE Robotics and Automation Letters (RA-L)

    Journal ref: IEEE Robotics and Automation Letters, vol. 9, no. 2, pp. 931-938, 2024

  11. arXiv:2304.01052  [pdf, other

    cs.RO

    Investigation of risk-aware MDP and POMDP contingency management autonomy for UAS

    Authors: Prashin Sharma, Benjamin Kraske, Joseph Kim, Zakariya Laouar, Zachary Sunberg, Ella Atkins

    Abstract: Unmanned aircraft systems (UAS) are being increasingly adopted for various applications. The risk UAS poses to people and property must be kept to acceptable levels. This paper proposes risk-aware contingency management autonomy to prevent an accident in the event of component malfunction, specifically propulsion unit failure and/or battery degradation. The proposed autonomy is modeled as a Markov… ▽ More

    Submitted 3 April, 2023; originally announced April 2023.

  12. arXiv:2210.10202  [pdf, other

    cs.RO cs.FL eess.SY

    Planning with SiMBA: Motion Planning under Uncertainty for Temporal Goals using Simplified Belief Guides

    Authors: Qi Heng Ho, Zachary N. Sunberg, Morteza Lahijanian

    Abstract: This paper presents a new multi-layered algorithm for motion planning under motion and sensing uncertainties for Linear Temporal Logic specifications. We propose a technique to guide a sampling-based search tree in the combined task and belief space using trajectories from a simplified model of the system, to make the problem computationally tractable. Our method eliminates the need to construct f… ▽ More

    Submitted 9 April, 2023; v1 submitted 18 October, 2022; originally announced October 2022.

    Comments: 8 pages, to appear in the IEEE International Conference on Robotics and Automation (ICRA), 2023

  13. arXiv:2210.05015  [pdf, other

    cs.AI cs.RO eess.SY stat.ML

    Optimality Guarantees for Particle Belief Approximation of POMDPs

    Authors: Michael H. Lim, Tyler J. Becker, Mykel J. Kochenderfer, Claire J. Tomlin, Zachary N. Sunberg

    Abstract: Partially observable Markov decision processes (POMDPs) provide a flexible representation for real-world decision and control problems. However, POMDPs are notoriously difficult to solve, especially when the state and observation spaces are continuous or hybrid, which is often the case for physical systems. While recent online sampling-based POMDP algorithms that plan with observation likelihood w… ▽ More

    Submitted 19 October, 2023; v1 submitted 10 October, 2022; originally announced October 2022.

    Journal ref: Journal of Artificial Intelligence Research, 77, 1591-1636 (2023)

  14. arXiv:2207.03662  [pdf, other

    eess.SY cs.FL cs.RO

    Automaton-Guided Control Synthesis for Signal Temporal Logic Specifications

    Authors: Qi Heng Ho, Roland B. Ilyes, Zachary N. Sunberg, Morteza Lahijanian

    Abstract: This paper presents an algorithmic framework for control synthesis of continuous dynamical systems subject to signal temporal logic (STL) specifications. We propose a novel algorithm to obtain a time-partitioned finite automaton from an STL specification, and introduce a multi-layered framework that utilizes this automaton to guide a sampling-based search tree both spatially and temporally. Our ap… ▽ More

    Submitted 4 October, 2022; v1 submitted 7 July, 2022; originally announced July 2022.

    Comments: 8 pages, 3 figures, to appear in CDC 2022

  15. arXiv:2206.10028  [pdf, other

    cs.RO cs.AI

    Intention-Aware Navigation in Crowds with Extended-Space POMDP Planning

    Authors: Himanshu Gupta, Bradley Hayes, Zachary Sunberg

    Abstract: This paper presents a hybrid online Partially Observable Markov Decision Process (POMDP) planning system that addresses the problem of autonomous navigation in the presence of multi-modal uncertainty introduced by other agents in the environment. As a particular example, we consider the problem of autonomous navigation in dense crowds of pedestrians and among obstacles. Popular approaches to this… ▽ More

    Submitted 20 June, 2022; originally announced June 2022.

  16. Gaussian Belief Trees for Chance Constrained Asymptotically Optimal Motion Planning

    Authors: Qi Heng Ho, Zachary N. Sunberg, Morteza Lahijanian

    Abstract: In this paper, we address the problem of sampling-based motion planning under motion and measurement uncertainty with probabilistic guarantees. We generalize traditional sampling-based tree-based motion planning algorithms for deterministic systems and propose belief-$\mathcal{A}$, a framework that extends any kinodynamical tree-based planner to the belief space for linear (or linearizable) system… ▽ More

    Submitted 4 October, 2022; v1 submitted 24 February, 2022; originally announced February 2022.

    Comments: 9 pages, 4 figures. In ICRA 2022

  17. arXiv:2112.09456  [pdf, other

    cs.AI cs.LG cs.RO eess.SY

    Compositional Learning-based Planning for Vision POMDPs

    Authors: Sampada Deglurkar, Michael H. Lim, Johnathan Tucker, Zachary N. Sunberg, Aleksandra Faust, Claire J. Tomlin

    Abstract: The Partially Observable Markov Decision Process (POMDP) is a powerful framework for capturing decision-making problems that involve state and transition uncertainty. However, most current POMDP planners cannot effectively handle high-dimensional image observations prevalent in real world applications, and often require lengthy online training that requires interaction with the environment. In thi… ▽ More

    Submitted 2 December, 2022; v1 submitted 17 December, 2021; originally announced December 2021.

  18. arXiv:2012.10140  [pdf, other

    cs.LG cs.AI cs.RO eess.SY

    Voronoi Progressive Widening: Efficient Online Solvers for Continuous State, Action, and Observation POMDPs

    Authors: Michael H. Lim, Claire J. Tomlin, Zachary N. Sunberg

    Abstract: This paper introduces Voronoi Progressive Widening (VPW), a generalization of Voronoi optimistic optimization (VOO) and action progressive widening to partially observable Markov decision processes (POMDPs). Tree search algorithms can use VPW to effectively handle continuous or hybrid action spaces by efficiently balancing local and global action searching. This paper proposes two VPW-based algori… ▽ More

    Submitted 1 April, 2021; v1 submitted 18 December, 2020; originally announced December 2020.

  19. arXiv:2010.08063  [pdf, other

    cs.RO eess.SY

    APF-PF: Probabilistic Depth Perception for 3D Reactive Obstacle Avoidance

    Authors: Shakeeb Ahmad, Zachary N. Sunberg, J. Sean Humbert

    Abstract: This paper proposes a framework for 3D obstacle avoidance in the presence of partial observability of environment obstacles. The method focuses on the utility of the Artificial Potential Function (APF) controller in a practical setting where noisy and incomplete information about the proximity is inevitable. We propose a Particle Filter (PF) approach to estimate potential obstacle locations in an… ▽ More

    Submitted 17 March, 2021; v1 submitted 15 October, 2020; originally announced October 2020.

    Comments: Accepted to appear at 2021 American Control Conference (ACC), New Orleans, Louisiana (8 pages, 11 figures)

  20. arXiv:2010.03597  [pdf, other

    cs.AI

    Bayesian Optimized Monte Carlo Planning

    Authors: John Mern, Anil Yildiz, Zachary Sunberg, Tapan Mukerji, Mykel J. Kochenderfer

    Abstract: Online solvers for partially observable Markov decision processes have difficulty scaling to problems with large action spaces. Monte Carlo tree search with progressive widening attempts to improve scaling by sampling from the action space to construct a policy search tree. The performance of progressive widening search is dependent upon the action sampling policy, often requiring problem-specific… ▽ More

    Submitted 7 October, 2020; originally announced October 2020.

    Comments: 8 pages

    ACM Class: I.2.8

    Journal ref: AAAI-21 Technical Tracks Vol. 35, No. 13, 2021, 11880-11887

  21. arXiv:2005.14549  [pdf, other

    cs.AI cs.RO

    Improving Automated Driving through POMDP Planning with Human Internal States

    Authors: Zachary Sunberg, Mykel Kochenderfer

    Abstract: This work examines the hypothesis that partially observable Markov decision process (POMDP) planning with human driver internal states can significantly improve both safety and efficiency in autonomous freeway driving. We evaluate this hypothesis in a simulated scenario where an autonomous car must safely perform three lane changes in rapid succession. Approximate POMDP solutions are obtained thro… ▽ More

    Submitted 9 June, 2022; v1 submitted 27 May, 2020; originally announced May 2020.

    Comments: Preprint before submission to IEEE Transactions on Intelligent Transportation Systems. arXiv admin note: text overlap with arXiv:1702.00858

  22. arXiv:2002.10185  [pdf, other

    cs.MA

    iLQGames.jl: Rapidly Designing and Solving Differential Games in Julia

    Authors: Lasse Peters, Zachary N. Sunberg

    Abstract: In many problems that involve multiple decision making agents, optimal choices for each agent depend on the choices of others. Differential game theory provides a principled formalism for expressing these coupled interactions and recent work offers efficient approximations to solve these problems to non-cooperative equilibria. iLQGames.jl is a framework for designing and solving differential games… ▽ More

    Submitted 6 April, 2020; v1 submitted 24 February, 2020; originally announced February 2020.

  23. arXiv:2002.04354  [pdf, other

    cs.RO eess.SY

    Inference-Based Strategy Alignment for General-Sum Differential Games

    Authors: Lasse Peters, David Fridovich-Keil, Claire J. Tomlin, Zachary N. Sunberg

    Abstract: In many settings where multiple agents interact, the optimal choices for each agent depend heavily on the choices of the others. These coupled interactions are well-described by a general-sum differential game, in which players have differing objectives, the state evolves in continuous time, and optimal play may be characterized by one of many equilibrium concepts, e.g., a Nash equilibrium. Often,… ▽ More

    Submitted 6 May, 2020; v1 submitted 11 February, 2020; originally announced February 2020.

  24. arXiv:1910.04332  [pdf, other

    cs.LG cs.RO eess.SY stat.ML

    Sparse tree search optimality guarantees in POMDPs with continuous observation spaces

    Authors: Michael H. Lim, Claire J. Tomlin, Zachary N. Sunberg

    Abstract: Partially observable Markov decision processes (POMDPs) with continuous state and observation spaces have powerful flexibility for representing real-world decision and control problems but are notoriously difficult to solve. Recent online sampling-based algorithms that use observation likelihood weighting have shown unprecedented effectiveness in domains with continuous observation spaces. However… ▽ More

    Submitted 5 June, 2023; v1 submitted 9 October, 2019; originally announced October 2019.

  25. arXiv:1709.06196  [pdf, other

    cs.AI cs.RO eess.SY

    Online algorithms for POMDPs with continuous state, action, and observation spaces

    Authors: Zachary Sunberg, Mykel Kochenderfer

    Abstract: Online solvers for partially observable Markov decision processes have been applied to problems with large discrete state spaces, but continuous state, action, and observation spaces remain a challenge. This paper begins by investigating double progressive widening (DPW) as a solution to this challenge. However, we prove that this modification alone is not sufficient because the belief representat… ▽ More

    Submitted 5 September, 2018; v1 submitted 18 September, 2017; originally announced September 2017.

    Comments: Added Multilane section

    Journal ref: Short version published in 2018 proceedings of the International Conference on Automated Planning and Scheduling (ICAPS)

  26. arXiv:1702.00858  [pdf, other

    cs.AI

    The Value of Inferring the Internal State of Traffic Participants for Autonomous Freeway Driving

    Authors: Zachary Sunberg, Christopher Ho, Mykel Kochenderfer

    Abstract: Safe interaction with human drivers is one of the primary challenges for autonomous vehicles. In order to plan driving maneuvers effectively, the vehicle's control system must infer and predict how humans will behave based on their latent internal state (e.g., intentions and aggressiveness). This research uses a simple model for human behavior with unknown parameters that make up the internal stat… ▽ More

    Submitted 2 February, 2017; originally announced February 2017.

  27. arXiv:1602.04762  [pdf, other

    cs.RO

    Optimized and Trusted Collision Avoidance for Unmanned Aerial Vehicles using Approximate Dynamic Programming (Technical Report)

    Authors: Zachary N. Sunberg, Mykel J. Kochenderfer, Marco Pavone

    Abstract: Safely integrating unmanned aerial vehicles into civil airspace is contingent upon development of a trustworthy collision avoidance system. This paper proposes an approach whereby a parameterized resolution logic that is considered trusted for a given range of its parameters is adaptively tuned online. Specifically, to address the potential conservatism of the resolution logic with static paramete… ▽ More

    Submitted 18 February, 2016; v1 submitted 15 February, 2016; originally announced February 2016.

    Comments: An abbreviated version was submitted to ICRA 2016