Skip to main content

Showing 1–40 of 40 results for author: Kochenderfer, M J

Searching in archive eess. Search in all archives.
.
  1. arXiv:2505.09119  [pdf, ps, other

    cs.RO eess.SY

    Model Identification Adaptive Control with $ρ$-POMDP Planning

    Authors: Michelle Ho, Arec Jamgochian, Mykel J. Kochenderfer

    Abstract: Accurate system modeling is crucial for safe, effective control, as misidentification can lead to accumulated errors, especially under partial observability. We address this problem by formulating informative input design and model identification adaptive control (MIAC) as belief space planning problems, modeled as partially observable Markov decision processes with belief-dependent rewards ($ρ$-P… ▽ More

    Submitted 22 May, 2025; v1 submitted 14 May, 2025; originally announced May 2025.

    Comments: Accepted to CoDIT 2025

  2. arXiv:2503.22660  [pdf, ps, other

    eess.SY

    Verifying Nonlinear Neural Feedback Systems using Polyhedral Enclosures

    Authors: Samuel I. Akinwande, Chelsea Sidrane, Mykel J. Kochenderfer, Clark Barrett

    Abstract: As dynamical systems equipped with neural network controllers (neural feedback systems) become increasingly prevalent, it is critical to develop methods to ensure their safe operation. Verifying safety requires extending control theoretic analysis methods to these systems. Although existing techniques can efficiently handle linear neural feedback systems, relatively few scalable methods address th… ▽ More

    Submitted 28 March, 2025; originally announced March 2025.

  3. arXiv:2501.16625  [pdf, other

    eess.SY cs.LG

    An Iterative Bayesian Approach for System Identification based on Linear Gaussian Models

    Authors: Alexandros E. Tzikas, Mykel J. Kochenderfer

    Abstract: We tackle the problem of system identification, where we select inputs, observe the corresponding outputs from the true system, and optimize the parameters of our model to best fit the data. We propose a flexible and computationally tractable methodology that is compatible with any system and parametric family of models. Our approach only requires input-output data from the system and first-order… ▽ More

    Submitted 30 March, 2025; v1 submitted 27 January, 2025; originally announced January 2025.

    Comments: Submitted to the IEEE CDC

    ACM Class: G.3; I.6

  4. arXiv:2412.06220  [pdf, other

    eess.SY cs.RO

    Discrete-Time Distribution Steering using Monte Carlo Tree Search

    Authors: Alexandros E. Tzikas, Liam A. Kruse, Mansur Arief, Mykel J. Kochenderfer, Stephen Boyd

    Abstract: Optimal control problems with state distribution constraints have attracted interest for their expressivity, but solutions rely on linear approximations. We approach the problem of driving the state of a dynamical system in distribution from a sequential decision-making perspective. We formulate the optimal control problem as an appropriate Markov decision process (MDP), where the actions correspo… ▽ More

    Submitted 9 December, 2024; originally announced December 2024.

    Comments: Submitted to the IEEE Robotics and Automation Letters for possible publication

    ACM Class: I.2.9; G.3

  5. Optimal Control of Mechanical Ventilators with Learned Respiratory Dynamics

    Authors: Isaac Ronald Ward, Dylan M. Asmar, Mansur Arief, Jana Krystofova Mike, Mykel J. Kochenderfer

    Abstract: Deciding on appropriate mechanical ventilator management strategies significantly impacts the health outcomes for patients with respiratory diseases. Acute Respiratory Distress Syndrome (ARDS) is one such disease that requires careful ventilator operation to be effectively treated. In this work, we frame the management of ventilators for patients with ARDS as a sequential decision making problem u… ▽ More

    Submitted 12 November, 2024; originally announced November 2024.

    Comments: 2024 IEEE 37th International Symposium on Computer-Based Medical Systems (CBMS), 7 pages, 3 figures

  6. arXiv:2410.16282  [pdf, other

    cs.NI cs.AI eess.SY

    Optimal Ground Station Selection for Low-Earth Orbiting Satellites

    Authors: Duncan Eddy, Michelle Ho, Mykel J. Kochenderfer

    Abstract: This paper presents a solution to the problem of optimal ground station selection for low-Earth orbiting (LEO) space missions that enables mission operators to precisely design their ground segment performance and costs. Space mission operators are increasingly turning to Ground-Station-as-a-Service (GSaaS) providers to supply the terrestrial communications segment to reduce costs and increase net… ▽ More

    Submitted 1 March, 2025; v1 submitted 4 October, 2024; originally announced October 2024.

    Comments: 13 pages, 3 tables, 4 figures, presented at IEEE Aeroconf 2025

  7. arXiv:2409.13088  [pdf, other

    eess.SY

    Informative Input Design for Dynamic Mode Decomposition

    Authors: Joshua Ott, Mykel J. Kochenderfer, Stephen Boyd

    Abstract: Efficiently estimating system dynamics from data is essential for minimizing data collection costs and improving model performance. This work addresses the challenge of designing future control inputs to maximize information gain, thereby improving the efficiency of the system identification process. We propose an approach that integrates informative input design into the Dynamic Mode Decompositio… ▽ More

    Submitted 28 April, 2025; v1 submitted 19 September, 2024; originally announced September 2024.

    Comments: Accepted to L4DC 2025

  8. arXiv:2409.08097  [pdf, other

    eess.SY cs.LG

    Optimizing Falsification for Learning-Based Control Systems: A Multi-Fidelity Bayesian Approach

    Authors: Zahra Shahrooei, Mykel J. Kochenderfer, Ali Baheri

    Abstract: Testing controllers in safety-critical systems is vital for ensuring their safety and preventing failures. In this paper, we address the falsification problem within learning-based closed-loop control systems through simulation. This problem involves the identification of counterexamples that violate system safety requirements and can be formulated as an optimization task based on these requiremen… ▽ More

    Submitted 12 September, 2024; originally announced September 2024.

    Comments: 13 pages, 9 figures

  9. arXiv:2408.13847  [pdf, other

    eess.SY

    Watercraft as Overwater Ambulance Exchange Points to Enhance Aeromedical Evacuation

    Authors: Mahdi Al-Husseini, Kyle H. Wray, Mykel J. Kochenderfer

    Abstract: Ambulance exchange points are preidentified sites where patients are transferred between evacuation platforms while en route to enhanced medical care. We propose a new capability for maritime medical evacuation, which involves co-opting underway watercraft as overwater ambulance exchange points to transfer patients between medical evacuation aircraft. We partner with the United States Army's 25th… ▽ More

    Submitted 25 August, 2024; originally announced August 2024.

  10. arXiv:2406.14761  [pdf, other

    cs.RO cs.AI eess.SY

    Diffusion-Based Failure Sampling for Evaluating Safety-Critical Autonomous Systems

    Authors: Harrison Delecki, Marc R. Schlichting, Mansur Arief, Anthony Corso, Marcell Vazquez-Chanlatte, Mykel J. Kochenderfer

    Abstract: Validating safety-critical autonomous systems in high-dimensional domains such as robotics presents a significant challenge. Existing black-box approaches based on Markov chain Monte Carlo may require an enormous number of samples, while methods based on importance sampling often rely on simple parametric families that may struggle to represent the distribution over failures. We propose to sample… ▽ More

    Submitted 20 May, 2025; v1 submitted 20 June, 2024; originally announced June 2024.

    Comments: Appears in IEEE International Conference on Engineering Reliable Autonomous Systems (ERAS) 2025

  11. arXiv:2401.10949  [pdf, ps, other

    cs.MA cs.LG eess.SY

    The Synergy Between Optimal Transport Theory and Multi-Agent Reinforcement Learning

    Authors: Ali Baheri, Mykel J. Kochenderfer

    Abstract: This paper explores the integration of optimal transport (OT) theory with multi-agent reinforcement learning (MARL). This integration uses OT to handle distributions and transportation problems to enhance the efficiency, coordination, and adaptability of MARL. There are five key areas where OT can impact MARL: (1) policy alignment, where OT's Wasserstein metric is used to align divergent agent str… ▽ More

    Submitted 24 January, 2024; v1 submitted 18 January, 2024; originally announced January 2024.

  12. arXiv:2309.12474  [pdf, other

    cs.RO cs.AI cs.CY cs.ET eess.SY

    SAVME: Efficient Safety Validation for Autonomous Systems Using Meta-Learning

    Authors: Marc R. Schlichting, Nina V. Boord, Anthony L. Corso, Mykel J. Kochenderfer

    Abstract: Discovering potential failures of an autonomous system is important prior to deployment. Falsification-based methods are often used to assess the safety of such systems, but the cost of running many accurate simulation can be high. The validation can be accelerated by identifying critical failure scenarios for the system under test and by reducing the simulation runtime. We propose a Bayesian appr… ▽ More

    Submitted 30 September, 2023; v1 submitted 21 September, 2023; originally announced September 2023.

    Comments: Accepted for ITSC 2023

  13. arXiv:2305.06111  [pdf, ps, other

    eess.SY

    Joint Falsification and Fidelity Settings Optimization for Validation of Safety-Critical Systems: A Theoretical Analysis

    Authors: Ali Baheri, Mykel J. Kochenderfer

    Abstract: Safety validation is a crucial component in the development and deployment of autonomous systems, such as self-driving vehicles and robotic systems. Ensuring safe operation necessitates extensive testing and verification of control policies, typically conducted in simulation environments. High-fidelity simulators accurately model real-world dynamics but entail high computational costs, limiting th… ▽ More

    Submitted 10 May, 2023; originally announced May 2023.

    Comments: Submitted to the 20th International Conference on Quantitative Evaluation of Systems (QEST 2023)

  14. arXiv:2304.09352  [pdf, other

    cs.AI eess.SY physics.flu-dyn

    Optimizing Carbon Storage Operations for Long-Term Safety

    Authors: Yizheng Wang, Markus Zechner, Gege Wen, Anthony Louis Corso, John Michael Mern, Mykel J. Kochenderfer, Jef Karel Caers

    Abstract: To combat global warming and mitigate the risks associated with climate change, carbon capture and storage (CCS) has emerged as a crucial technology. However, safely sequestering CO2 in geological formations for long-term storage presents several challenges. In this study, we address these issues by modeling the decision-making process for carbon storage operations as a partially observable Markov… ▽ More

    Submitted 18 April, 2023; originally announced April 2023.

  15. arXiv:2212.14118  [pdf, other

    eess.SY cs.LG

    Falsification of Learning-Based Controllers through Multi-Fidelity Bayesian Optimization

    Authors: Zahra Shahrooei, Mykel J. Kochenderfer, Ali Baheri

    Abstract: Simulation-based falsification is a practical testing method to increase confidence that the system will meet safety requirements. Because full-fidelity simulations can be computationally demanding, we investigate the use of simulators with different levels of fidelity. As a first step, we express the overall safety specification in terms of environmental parameters and structure this safety speci… ▽ More

    Submitted 28 April, 2023; v1 submitted 28 December, 2022; originally announced December 2022.

    Comments: 7 pages, 8 figures, Accepted for the 2023 European Control Conference (ECC)

  16. arXiv:2210.05015  [pdf, other

    cs.AI cs.RO eess.SY stat.ML

    Optimality Guarantees for Particle Belief Approximation of POMDPs

    Authors: Michael H. Lim, Tyler J. Becker, Mykel J. Kochenderfer, Claire J. Tomlin, Zachary N. Sunberg

    Abstract: Partially observable Markov decision processes (POMDPs) provide a flexible representation for real-world decision and control problems. However, POMDPs are notoriously difficult to solve, especially when the state and observation spaces are continuous or hybrid, which is often the case for physical systems. While recent online sampling-based POMDP algorithms that plan with observation likelihood w… ▽ More

    Submitted 19 October, 2023; v1 submitted 10 October, 2022; originally announced October 2022.

    Journal ref: Journal of Artificial Intelligence Research, 77, 1591-1636 (2023)

  17. arXiv:2209.14076  [pdf, other

    eess.SY cs.LG cs.RO

    Backward Reachability Analysis of Neural Feedback Loops: Techniques for Linear and Nonlinear Systems

    Authors: Nicholas Rober, Sydney M. Katz, Chelsea Sidrane, Esen Yel, Michael Everett, Mykel J. Kochenderfer, Jonathan P. How

    Abstract: As neural networks (NNs) become more prevalent in safety-critical applications such as control of vehicles, there is a growing need to certify that systems with NN components are safe. This paper presents a set of backward reachability approaches for safety certification of neural feedback loops (NFLs), i.e., closed-loop systems with NN control policies. While backward reachability strategies have… ▽ More

    Submitted 21 November, 2022; v1 submitted 28 September, 2022; originally announced September 2022.

    Comments: 17 pages, 15 figures. Journal extension of arXiv:2204.08319

  18. arXiv:2204.14250  [pdf, other

    cs.RO eess.SY

    Collision Risk and Operational Impact of Speed Change Advisories as Aircraft Collision Avoidance Maneuvers

    Authors: Sydney M. Katz, Luis E. Alvarez, Michael Owen, Samuel Wu, Marc Brittain, Anshuman Das, Mykel J. Kochenderfer

    Abstract: Aircraft collision avoidance systems have long been a key factor in keeping our airspace safe. Over the past decade, the FAA has supported the development of a new family of collision avoidance systems called the Airborne Collision Avoidance System X (ACAS X), which model the collision avoidance problem as a Markov decision process (MDP). Variants of ACAS X have been created for both manned (ACAS… ▽ More

    Submitted 29 April, 2022; originally announced April 2022.

    Comments: 10 pages, 6 figures, presented at the 2022 AIAA Aviation Forum

  19. arXiv:2203.16633  [pdf, other

    eess.SY cs.RO

    Model Predictive Optimized Path Integral Strategies

    Authors: Dylan M. Asmar, Ransalu Senanayake, Shawn Manuel, Mykel J. Kochenderfer

    Abstract: We generalize the derivation of model predictive path integral control (MPPI) to allow for a single joint distribution across controls in the control sequence. This reformation allows for the implementation of adaptive importance sampling (AIS) algorithms into the original importance sampling step while still maintaining the benefits of MPPI such as working with arbitrary system dynamics and cost… ▽ More

    Submitted 1 March, 2023; v1 submitted 30 March, 2022; originally announced March 2022.

    Comments: Repository: https://github.com/sisl/MPOPIS. Accepted to ICRA 2023

    ACM Class: I.2.8; I.2.9

  20. arXiv:2112.03911  [pdf, ps, other

    eess.IV cs.CV cs.LG

    Dyadic Sex Composition and Task Classification Using fNIRS Hyperscanning Data

    Authors: Liam A. Kruse, Allan L. Reiss, Mykel J. Kochenderfer, Stephanie Balters

    Abstract: Hyperscanning with functional near-infrared spectroscopy (fNIRS) is an emerging neuroimaging application that measures the nuanced neural signatures underlying social interactions. Researchers have assessed the effect of sex and task type (e.g., cooperation versus competition) on inter-brain coherence during human-to-human interactions. However, no work has yet used deep learning-based approaches… ▽ More

    Submitted 6 December, 2021; originally announced December 2021.

    Comments: 20th IEEE International Conference on Machine Learning and Applications

  21. arXiv:2108.01220  [pdf, ps, other

    cs.LG cs.LO eess.SY

    OVERT: An Algorithm for Safety Verification of Neural Network Control Policies for Nonlinear Systems

    Authors: Chelsea Sidrane, Amir Maleki, Ahmed Irfan, Mykel J. Kochenderfer

    Abstract: Deep learning methods can be used to produce control policies, but certifying their safety is challenging. The resulting networks are nonlinear and often very large. In response to this challenge, we present OVERT: a sound algorithm for safety verification of nonlinear discrete-time closed loop dynamical systems with neural network control policies. The novelty of OVERT lies in combining ideas fro… ▽ More

    Submitted 2 August, 2021; originally announced August 2021.

    Comments: 44 pages, under review

    MSC Class: 68Q60 (Primary) 68T07; 37N35 (Secondary) ACM Class: I.2.6; I.2.8; D.2.4

    Journal ref: Journal of Machine Learning Research 23 (2022) 1-45

  22. arXiv:2010.10618  [pdf, other

    cs.LG cs.AI eess.SY

    Runtime Safety Assurance Using Reinforcement Learning

    Authors: Christopher Lazarus, James G. Lopez, Mykel J. Kochenderfer

    Abstract: The airworthiness and safety of a non-pedigreed autopilot must be verified, but the cost to formally do so can be prohibitive. We can bypass formal verification of non-pedigreed components by incorporating Runtime Safety Assurance (RTSA) as mechanism to ensure safety. RTSA consists of a meta-controller that observes the inputs and outputs of a non-pedigreed component and verifies formally specifie… ▽ More

    Submitted 20 October, 2020; originally announced October 2020.

    Journal ref: 2020 IEEE/AIAA 39th Digital Avionics Systems Conference (DASC)

  23. arXiv:2008.08446  [pdf, other

    cs.AI eess.SY

    A Maximum Independent Set Method for Scheduling Earth Observing Satellite Constellations

    Authors: Duncan Eddy, Mykel J. Kochenderfer

    Abstract: Operating Earth observing satellites requires efficient planning methods that coordinate activities of multiple spacecraft. The satellite task planning problem entails selecting actions that best satisfy mission objectives for autonomous execution. Task scheduling is often performed by human operators assisted by heuristic or rule-based planning tools. This approach does not efficiently scale to m… ▽ More

    Submitted 15 August, 2020; originally announced August 2020.

  24. arXiv:2006.11615  [pdf, other

    cs.LG cs.RO eess.SY stat.ML

    Scalable Identification of Partially Observed Systems with Certainty-Equivalent EM

    Authors: Kunal Menda, Jean de Becdelièvre, Jayesh K. Gupta, Ilan Kroo, Mykel J. Kochenderfer, Zachary Manchester

    Abstract: System identification is a key step for model-based control, estimator design, and output prediction. This work considers the offline identification of partially observed nonlinear systems. We empirically show that the certainty-equivalent approximation to expectation-maximization can be a reliable and scalable approach for high-dimensional deterministic systems, which are common in robotics. We f… ▽ More

    Submitted 20 June, 2020; originally announced June 2020.

    Comments: First three authors contributed equally. Accepted at ICML 2020. Website: https://sites.google.com/stanford.edu/ceem/

  25. arXiv:2006.08832  [pdf, other

    eess.SY cs.AI cs.CY

    A Taxonomy and Review of Algorithms for Modeling and Predicting Human Driver Behavior

    Authors: Kyle Brown, Katherine Driggs-Campbell, Mykel J. Kochenderfer

    Abstract: We present a review and taxonomy of 200 models from the literature on driver behavior modeling. We begin by introducing a mathematical framework for describing the dynamics of interactive multi-agent traffic. Based on the partially observable stochastic game, this framework provides a basis for discussing different driver modeling techniques. Our taxonomy is constructed around the core modeling ta… ▽ More

    Submitted 28 November, 2020; v1 submitted 15 June, 2020; originally announced June 2020.

  26. arXiv:2005.02979  [pdf, ps, other

    cs.LG cs.AI eess.SY stat.ML

    A Survey of Algorithms for Black-Box Safety Validation of Cyber-Physical Systems

    Authors: Anthony Corso, Robert J. Moss, Mark Koren, Ritchie Lee, Mykel J. Kochenderfer

    Abstract: Autonomous cyber-physical systems (CPS) can improve safety and efficiency for safety-critical applications, but require rigorous testing before deployment. The complexity of these systems often precludes the use of formal verification and real-world testing can be too dangerous during development. Therefore, simulation-based techniques have been developed that treat the system under test as a blac… ▽ More

    Submitted 14 October, 2021; v1 submitted 6 May, 2020; originally announced May 2020.

    Journal ref: Journal of Artificial Intelligence Research, vol. 72, p. 377-428, 2021

  27. arXiv:2004.10301  [pdf, other

    cs.RO cs.AI cs.LG eess.SY

    Structured Mechanical Models for Robot Learning and Control

    Authors: Jayesh K. Gupta, Kunal Menda, Zachary Manchester, Mykel J. Kochenderfer

    Abstract: Model-based methods are the dominant paradigm for controlling robotic systems, though their efficacy depends heavily on the accuracy of the model used. Deep neural networks have been used to learn models of robot dynamics from data, but they suffer from data-inefficiency and the difficulty to incorporate prior knowledge. We introduce Structured Mechanical Models, a flexible model class for mechani… ▽ More

    Submitted 21 April, 2020; originally announced April 2020.

    Comments: First two authors contributed equally. Accepted at L4DC2020. Source code and videos at https://sites.google.com/stanford.edu/smm/

  28. arXiv:2004.06801  [pdf, other

    cs.RO cs.LG eess.SY stat.ML

    Scalable Autonomous Vehicle Safety Validation through Dynamic Programming and Scene Decomposition

    Authors: Anthony Corso, Ritchie Lee, Mykel J. Kochenderfer

    Abstract: An open question in autonomous driving is how best to use simulation to validate the safety of autonomous vehicles. Existing techniques rely on simulated rollouts, which can be inefficient for finding rare failure events, while other techniques are designed to only discover a single failure. In this work, we present a new safety validation approach that attempts to estimate the distribution over f… ▽ More

    Submitted 26 June, 2020; v1 submitted 14 April, 2020; originally announced April 2020.

  29. arXiv:2004.04293  [pdf, other

    cs.RO cs.LG eess.SY stat.ML

    The Adaptive Stress Testing Formulation

    Authors: Mark Koren, Anthony Corso, Mykel J. Kochenderfer

    Abstract: Validation is a key challenge in the search for safe autonomy. Simulations are often either too simple to provide robust validation, or too complex to tractably compute. Therefore, approximate validation methods are needed to tractably find failures without unsafe simplifications. This paper presents the theory behind one such black-box approach: adaptive stress testing (AST). We also provide thre… ▽ More

    Submitted 8 April, 2020; originally announced April 2020.

    Comments: Presented at the Workshop on Robust Autonomy at RSS 2019

  30. arXiv:2004.04292  [pdf, other

    cs.LG eess.SY stat.ML

    Adaptive Stress Testing without Domain Heuristics using Go-Explore

    Authors: Mark Koren, Mykel J. Kochenderfer

    Abstract: Recently, reinforcement learning (RL) has been used as a tool for finding failures in autonomous systems. During execution, the RL agents often rely on some domain-specific heuristic reward to guide them towards finding failures, but constructing such a heuristic may be difficult or infeasible. Without a heuristic, the agent may only receive rewards at the time of failure, or even rewards that gui… ▽ More

    Submitted 18 June, 2020; v1 submitted 8 April, 2020; originally announced April 2020.

    Comments: Accepted to ITSC 2020

  31. arXiv:2003.02381  [pdf, other

    cs.RO eess.SY

    Validation of Image-Based Neural Network Controllers through Adaptive Stress Testing

    Authors: Kyle D. Julian, Ritchie Lee, Mykel J. Kochenderfer

    Abstract: Neural networks have become state-of-the-art for computer vision problems because of their ability to efficiently model complex functions from large amounts of data. While neural networks can be shown to perform well empirically for a variety of tasks, their performance is difficult to guarantee. Neural network verification tools have been developed that can certify robustness with respect to a gi… ▽ More

    Submitted 4 March, 2020; originally announced March 2020.

    Comments: 7 pages, 6 figures

  32. Guaranteeing Safety for Neural Network-Based Aircraft Collision Avoidance Systems

    Authors: Kyle D. Julian, Mykel J. Kochenderfer

    Abstract: The decision logic for the ACAS X family of aircraft collision avoidance systems is represented as a large numeric table. Due to storage constraints of certified avionics hardware, neural networks have been suggested as a way to significantly compress the data while still preserving performance in terms of safety. However, neural networks are complex continuous functions with outputs that are diff… ▽ More

    Submitted 5 May, 2020; v1 submitted 15 December, 2019; originally announced December 2019.

    Comments: 10 pages, 11 figures, presented at the 2019 AIAA Digital Avionics Systems Conference (DASC)

    Journal ref: IEEE/AIAA 38th Digital Avionics Systems Conference (DASC). 2019

  33. arXiv:1908.01046  [pdf, other

    cs.RO cs.AI cs.LG eess.SY stat.ML

    Adaptive Stress Testing with Reward Augmentation for Autonomous Vehicle Validation

    Authors: Anthony Corso, Peter Du, Katherine Driggs-Campbell, Mykel J. Kochenderfer

    Abstract: Determining possible failure scenarios is a critical step in the evaluation of autonomous vehicle systems. Real-world vehicle testing is commonly employed for autonomous vehicle validation, but the costs and time requirements are high. Consequently, simulation-driven methods such as Adaptive Stress Testing (AST) have been proposed to aid in validation. AST formulates the problem of finding the mos… ▽ More

    Submitted 6 August, 2019; v1 submitted 2 August, 2019; originally announced August 2019.

    Comments: Appears in IEEE ITSC 2019

  34. arXiv:1903.03948  [pdf, other

    cs.AI eess.SY

    Rethinking System Health Management

    Authors: Edward Balaban, Stephen B. Johnson, Mykel J. Kochenderfer

    Abstract: Health management of complex dynamic systems has traditionally evolved separately from automated control, planning, and scheduling (generally referred to in the paper as decision making). A goal of Integrated System Health Management has been to enable coordination between system health management and decision making, although successful practical implementations have remained limited. This paper… ▽ More

    Submitted 10 March, 2019; originally announced March 2019.

    Comments: Published in the proceedings of the 2018 AAAI Fall Symposium on Integrating Planning, Diagnosis, and Causal Reasoning

  35. arXiv:1903.00762  [pdf, other

    eess.SY cs.LO

    Verifying Aircraft Collision Avoidance Neural Networks Through Linear Approximations of Safe Regions

    Authors: Kyle D. Julian, Shivam Sharma, Jean-Baptiste Jeannin, Mykel J. Kochenderfer

    Abstract: The next generation of aircraft collision avoidance systems frame the problem as a Markov decision process and use dynamic programming to optimize the alerting logic. The resulting system uses a large lookup table to determine advisories given to pilots, but these tables can grow very large. To enable the system to operate on limited hardware, prior work investigated compressing the table using a… ▽ More

    Submitted 2 March, 2019; originally announced March 2019.

  36. arXiv:1903.00520  [pdf, other

    eess.SY

    A Reachability Method for Verifying Dynamical Systems with Deep Neural Network Controllers

    Authors: Kyle D. Julian, Mykel J. Kochenderfer

    Abstract: Deep neural networks can be trained to be efficient and effective controllers for dynamical systems; however, the mechanics of deep neural networks are complex and difficult to guarantee. This work presents a general approach for providing guarantees for deep neural network controllers over multiple time steps using a combination of reachability methods and open source neural network verification… ▽ More

    Submitted 3 June, 2019; v1 submitted 1 March, 2019; originally announced March 2019.

  37. arXiv:1902.08705  [pdf, ps, other

    cs.RO cs.AI cs.LG eess.SY

    A General Framework for Structured Learning of Mechanical Systems

    Authors: Jayesh K. Gupta, Kunal Menda, Zachary Manchester, Mykel J. Kochenderfer

    Abstract: Learning accurate dynamics models is necessary for optimal, compliant control of robotic systems. Current approaches to white-box modeling using analytic parameterizations, or black-box modeling using neural networks, can suffer from high bias or high variance. We address the need for a flexible, gray-box model of mechanical systems that can seamlessly incorporate prior knowledge where it is avail… ▽ More

    Submitted 1 March, 2019; v1 submitted 22 February, 2019; originally announced February 2019.

    Comments: 10 pages, 7 figures. First two authors contributed equally. Submitted to IROS/RA-L. Code at https://github.com/sisl/mechamodlearn/

  38. arXiv:1809.10012  [pdf, other

    cs.RO cs.LG eess.SY

    Using Neural Networks to Generate Information Maps for Mobile Sensors

    Authors: Louis Dressel, Mykel J. Kochenderfer

    Abstract: Target localization is a critical task for mobile sensors and has many applications. However, generating informative trajectories for these sensors is a challenging research problem. A common method uses information maps that estimate the value of taking measurements from any point in the sensor state space. These information maps are used to generate trajectories; for example, a trajectory might… ▽ More

    Submitted 26 September, 2018; originally announced September 2018.

    Comments: Accepted to the 2018 IEEE Conference on Decision and Control (CDC)

  39. arXiv:1808.06652  [pdf, other

    eess.SY cs.RO

    On the Optimality of Ergodic Trajectories for Information Gathering Tasks

    Authors: Louis Dressel, Mykel J. Kochenderfer

    Abstract: Recently, ergodic control has been suggested as a means to guide mobile sensors for information gathering tasks. In ergodic control, a mobile sensor follows a trajectory that is ergodic with respect to some information density distribution. A trajectory is ergodic if time spent in a state space region is proportional to the information density of the region. Although ergodic control has shown prom… ▽ More

    Submitted 20 August, 2018; originally announced August 2018.

    Comments: Presented at 2018 American Control Conference (ACC)

  40. arXiv:1808.00888  [pdf, other

    eess.SY

    Estimation and Control Using Sampling-Based Bayesian Reinforcement Learning

    Authors: Patrick Slade, Zachary N. Sunberg, Mykel J. Kochenderfer

    Abstract: Real-world autonomous systems operate under uncertainty about both their pose and dynamics. Autonomous control systems must simultaneously perform estimation and control tasks to maintain robustness to changing dynamics or modeling errors. However, information gathering actions often conflict with optimal actions for reaching control objectives, requiring a trade-off between exploration and exploi… ▽ More

    Submitted 31 July, 2018; originally announced August 2018.

    Comments: 10 pages, 6 figures. arXiv admin note: text overlap with arXiv:1707.09055