Skip to main content

Showing 1–33 of 33 results for author: Kochenderfer, M J

Searching in archive stat. Search in all archives.
.
  1. arXiv:2505.18441  [pdf, other

    cs.LG cs.MS stat.AP

    DB-KSVD: Scalable Alternating Optimization for Disentangling High-Dimensional Embedding Spaces

    Authors: Romeo Valentin, Sydney M. Katz, Vincent Vanhoucke, Mykel J. Kochenderfer

    Abstract: Dictionary learning has recently emerged as a promising approach for mechanistic interpretability of large transformer models. Disentangling high-dimensional transformer embeddings, however, requires algorithms that scale to high-dimensional data with large sample sizes. Recent work has explored sparse autoencoders (SAEs) for this problem. However, SAEs use a simple linear encoder to solve the spa… ▽ More

    Submitted 23 May, 2025; originally announced May 2025.

    Comments: 9 pages + 4 pages appendix

  2. arXiv:2505.13335  [pdf, other

    stat.ML cs.RO

    Scalable Importance Sampling in High Dimensions with Low-Rank Mixture Proposals

    Authors: Liam A. Kruse, Marc R. Schlichting, Mykel J. Kochenderfer

    Abstract: Importance sampling is a Monte Carlo technique for efficiently estimating the likelihood of rare events by biasing the sampling distribution towards the rare event of interest. By drawing weighted samples from a learned proposal distribution, importance sampling allows for more sample-efficient estimation of rare events or tails of distributions. A common choice of proposal density is a Gaussian m… ▽ More

    Submitted 19 May, 2025; originally announced May 2025.

    Comments: Accepted at CoDIT 2025

  3. arXiv:2412.02154  [pdf, other

    cs.RO cs.AI cs.LG stat.ML

    Failure Probability Estimation for Black-Box Autonomous Systems using State-Dependent Importance Sampling Proposals

    Authors: Harrison Delecki, Sydney M. Katz, Mykel J. Kochenderfer

    Abstract: Estimating the probability of failure is a critical step in developing safety-critical autonomous systems. Direct estimation methods such as Monte Carlo sampling are often impractical due to the rarity of failures in these systems. Existing importance sampling approaches do not scale to sequential decision-making systems with large state spaces and long horizons. We propose an adaptive importance… ▽ More

    Submitted 2 December, 2024; originally announced December 2024.

    Comments: Submitted to L4DC 2025

  4. arXiv:2401.15838  [pdf, other

    stat.ML cs.LG cs.MA math.OC stat.CO

    Distributed Markov Chain Monte Carlo Sampling based on the Alternating Direction Method of Multipliers

    Authors: Alexandros E. Tzikas, Licio Romao, Mert Pilanci, Alessandro Abate, Mykel J. Kochenderfer

    Abstract: Many machine learning applications require operating on a spatially distributed dataset. Despite technological advances, privacy considerations and communication constraints may prevent gathering the entire dataset in a central unit. In this paper, we propose a distributed sampling scheme based on the alternating direction method of multipliers, which is commonly used in the optimization literatur… ▽ More

    Submitted 28 January, 2024; originally announced January 2024.

  5. arXiv:2305.09930  [pdf, other

    cs.RO cs.LG stat.ML

    Model-based Validation as Probabilistic Inference

    Authors: Harrison Delecki, Anthony Corso, Mykel J. Kochenderfer

    Abstract: Estimating the distribution over failures is a key step in validating autonomous systems. Existing approaches focus on finding failures for a small range of initial conditions or make restrictive assumptions about the properties of the system under test. We frame estimating the distribution over failure trajectories for sequential systems as Bayesian inference. Our model-based approach represents… ▽ More

    Submitted 16 May, 2023; originally announced May 2023.

    Comments: Accepted to 5th Annual Learning for Dynamics & Control Conference (L4DC) 2023

  6. arXiv:2305.02449  [pdf, other

    cs.LG stat.AP

    Bayesian Safety Validation for Failure Probability Estimation of Black-Box Systems

    Authors: Robert J. Moss, Mykel J. Kochenderfer, Maxime Gariel, Arthur Dubois

    Abstract: Estimating the probability of failure is an important step in the certification of safety-critical systems. Efficient estimation methods are often needed due to the challenges posed by high-dimensional input spaces, risky test scenarios, and computationally expensive simulators. This work frames the problem of black-box safety validation as a Bayesian optimization problem and introduces a method t… ▽ More

    Submitted 29 June, 2024; v1 submitted 3 May, 2023; originally announced May 2023.

    Journal ref: AIAA Journal of Aerospace Information Systems (JAIS) 21.7 (2024): 533-546

  7. arXiv:2210.05015  [pdf, other

    cs.AI cs.RO eess.SY stat.ML

    Optimality Guarantees for Particle Belief Approximation of POMDPs

    Authors: Michael H. Lim, Tyler J. Becker, Mykel J. Kochenderfer, Claire J. Tomlin, Zachary N. Sunberg

    Abstract: Partially observable Markov decision processes (POMDPs) provide a flexible representation for real-world decision and control problems. However, POMDPs are notoriously difficult to solve, especially when the state and observation spaces are continuous or hybrid, which is often the case for physical systems. While recent online sampling-based POMDP algorithms that plan with observation likelihood w… ▽ More

    Submitted 19 October, 2023; v1 submitted 10 October, 2022; originally announced October 2022.

    Journal ref: Journal of Artificial Intelligence Research, 77, 1591-1636 (2023)

  8. arXiv:2110.14182  [pdf, other

    cs.LG cs.AI cs.CL cs.CV stat.ML

    Evidential Softmax for Sparse Multimodal Distributions in Deep Generative Models

    Authors: Phil Chen, Masha Itkina, Ransalu Senanayake, Mykel J. Kochenderfer

    Abstract: Many applications of generative models rely on the marginalization of their high-dimensional output probability distributions. Normalization functions that yield sparse probability distributions can make exact marginalization more computationally tractable. However, sparse normalization functions usually require alternative loss functions for training since the log-likelihood is undefined for spar… ▽ More

    Submitted 27 October, 2021; originally announced October 2021.

    Comments: Accepted to NeurIPS 2021. Code is available at https://github.com/sisl/EvSoftmax

  9. arXiv:2107.12940  [pdf, other

    cs.LG stat.ML

    Finding Failures in High-Fidelity Simulation using Adaptive Stress Testing and the Backward Algorithm

    Authors: Mark Koren, Ahmed Nassar, Mykel J. Kochenderfer

    Abstract: Validating the safety of autonomous systems generally requires the use of high-fidelity simulators that adequately capture the variability of real-world scenarios. However, it is generally not feasible to exhaustively search the space of simulation scenarios for failures. Adaptive stress testing (AST) is a method that uses reinforcement learning to find the most likely failure of a system. AST wit… ▽ More

    Submitted 27 July, 2021; originally announced July 2021.

    Comments: Accepted to IROS 2021

  10. arXiv:2008.07737  [pdf, ps, other

    cs.LG stat.ML

    Provably Efficient Reward-Agnostic Navigation with Linear Value Iteration

    Authors: Andrea Zanette, Alessandro Lazaric, Mykel J. Kochenderfer, Emma Brunskill

    Abstract: There has been growing progress on theoretical analyses for provably efficient learning in MDPs with linear function approximation, but much of the existing work has made strong assumptions to enable exploration by conventional exploration frameworks. Typically these assumptions are stronger than what is needed to find good solutions in the batch setting. In this work, we show how under a more sta… ▽ More

    Submitted 21 October, 2020; v1 submitted 18 August, 2020; originally announced August 2020.

    Comments: Minor update; appears in NeurIPS

  11. arXiv:2007.00161  [pdf, other

    cs.RO cs.AI stat.AP

    Directional Primitives for Uncertainty-Aware Motion Estimation in Urban Environments

    Authors: Ransalu Senanayake, Maneekwan Toyungyernsub, Mingyu Wang, Mykel J. Kochenderfer, Mac Schwager

    Abstract: We can use driving data collected over a long period of time to extract rich information about how vehicles behave in different areas of the roads. In this paper, we introduce the concept of directional primitives, which is a representation of prior information of road networks. Specifically, we represent the uncertainty of directions using a mixture of von Mises distributions and associated speed… ▽ More

    Submitted 30 June, 2020; originally announced July 2020.

    Comments: The 23rd IEEE International Conference on Intelligent Transportation Systems. September, 2020

    MSC Class: 62P30 ACM Class: G.3

  12. arXiv:2006.11615  [pdf, other

    cs.LG cs.RO eess.SY stat.ML

    Scalable Identification of Partially Observed Systems with Certainty-Equivalent EM

    Authors: Kunal Menda, Jean de Becdelièvre, Jayesh K. Gupta, Ilan Kroo, Mykel J. Kochenderfer, Zachary Manchester

    Abstract: System identification is a key step for model-based control, estimator design, and output prediction. This work considers the offline identification of partially observed nonlinear systems. We empirically show that the certainty-equivalent approximation to expectation-maximization can be a reliable and scalable approach for high-dimensional deterministic systems, which are common in robotics. We f… ▽ More

    Submitted 20 June, 2020; originally announced June 2020.

    Comments: First three authors contributed equally. Accepted at ICML 2020. Website: https://sites.google.com/stanford.edu/ceem/

  13. arXiv:2006.10096  [pdf, other

    cs.LG stat.ML

    Towards Recurrent Autoregressive Flow Models

    Authors: John Mern, Peter Morales, Mykel J. Kochenderfer

    Abstract: Stochastic processes generated by non-stationary distributions are difficult to represent with conventional models such as Gaussian processes. This work presents Recurrent Autoregressive Flows as a method toward general stochastic process modeling with normalizing flows. The proposed method defines a conditional distribution for each variable in a sequential process by conditioning the parameters… ▽ More

    Submitted 17 June, 2020; originally announced June 2020.

    ACM Class: I.2.6; G.3

  14. arXiv:2005.02979  [pdf, ps, other

    cs.LG cs.AI eess.SY stat.ML

    A Survey of Algorithms for Black-Box Safety Validation of Cyber-Physical Systems

    Authors: Anthony Corso, Robert J. Moss, Mark Koren, Ritchie Lee, Mykel J. Kochenderfer

    Abstract: Autonomous cyber-physical systems (CPS) can improve safety and efficiency for safety-critical applications, but require rigorous testing before deployment. The complexity of these systems often precludes the use of formal verification and real-world testing can be too dangerous during development. Therefore, simulation-based techniques have been developed that treat the system under test as a blac… ▽ More

    Submitted 14 October, 2021; v1 submitted 6 May, 2020; originally announced May 2020.

    Journal ref: Journal of Artificial Intelligence Research, vol. 72, p. 377-428, 2021

  15. arXiv:2004.06805  [pdf, other

    cs.RO cs.LG stat.ML

    Interpretable Safety Validation for Autonomous Vehicles

    Authors: Anthony Corso, Mykel J. Kochenderfer

    Abstract: An open problem for autonomous driving is how to validate the safety of an autonomous vehicle in simulation. Automated testing procedures can find failures of an autonomous system but these failures may be difficult to interpret due to their high dimensionality and may be so unlikely as to not be important. This work describes an approach for finding interpretable failures of an autonomous system.… ▽ More

    Submitted 26 June, 2020; v1 submitted 14 April, 2020; originally announced April 2020.

  16. arXiv:2004.06801  [pdf, other

    cs.RO cs.LG eess.SY stat.ML

    Scalable Autonomous Vehicle Safety Validation through Dynamic Programming and Scene Decomposition

    Authors: Anthony Corso, Ritchie Lee, Mykel J. Kochenderfer

    Abstract: An open question in autonomous driving is how best to use simulation to validate the safety of autonomous vehicles. Existing techniques rely on simulated rollouts, which can be inefficient for finding rare failure events, while other techniques are designed to only discover a single failure. In this work, we present a new safety validation approach that attempts to estimate the distribution over f… ▽ More

    Submitted 26 June, 2020; v1 submitted 14 April, 2020; originally announced April 2020.

  17. arXiv:2004.04293  [pdf, other

    cs.RO cs.LG eess.SY stat.ML

    The Adaptive Stress Testing Formulation

    Authors: Mark Koren, Anthony Corso, Mykel J. Kochenderfer

    Abstract: Validation is a key challenge in the search for safe autonomy. Simulations are often either too simple to provide robust validation, or too complex to tractably compute. Therefore, approximate validation methods are needed to tractably find failures without unsafe simplifications. This paper presents the theory behind one such black-box approach: adaptive stress testing (AST). We also provide thre… ▽ More

    Submitted 8 April, 2020; originally announced April 2020.

    Comments: Presented at the Workshop on Robust Autonomy at RSS 2019

  18. arXiv:2004.04292  [pdf, other

    cs.LG eess.SY stat.ML

    Adaptive Stress Testing without Domain Heuristics using Go-Explore

    Authors: Mark Koren, Mykel J. Kochenderfer

    Abstract: Recently, reinforcement learning (RL) has been used as a tool for finding failures in autonomous systems. During execution, the RL agents often rely on some domain-specific heuristic reward to guide them towards finding failures, but constructing such a heuristic may be difficult or infeasible. Without a heuristic, the agent may only receive rewards at the time of failure, or even rewards that gui… ▽ More

    Submitted 18 June, 2020; v1 submitted 8 April, 2020; originally announced April 2020.

    Comments: Accepted to ITSC 2020

  19. arXiv:2003.09022  [pdf, other

    cs.LG cs.AI stat.ML

    Exchangeable Input Representations for Reinforcement Learning

    Authors: John Mern, Dorsa Sadigh, Mykel J. Kochenderfer

    Abstract: Poor sample efficiency is a major limitation of deep reinforcement learning in many domains. This work presents an attention-based method to project neural network inputs into an efficient representation space that is invariant under changes to input ordering. We show that our proposed representation results in an input space that is a factor of $m!$ smaller for inputs of $m$ objects. We also show… ▽ More

    Submitted 19 March, 2020; originally announced March 2020.

    Comments: 6 pages, 7 figures

    ACM Class: I.2.6

  20. arXiv:1912.10648  [pdf, other

    cs.LG cs.AI stat.ML

    Monte-Carlo Tree Search for Policy Optimization

    Authors: Xiaobai Ma, Katherine Driggs-Campbell, Zongzhang Zhang, Mykel J. Kochenderfer

    Abstract: Gradient-based methods are often used for policy optimization in deep reinforcement learning, despite being vulnerable to local optima and saddle points. Although gradient-free methods (e.g., genetic algorithms or evolution strategies) help mitigate these issues, poor initialization and local optima are still concerns in highly nonconvex spaces. This paper presents a method for policy optimization… ▽ More

    Submitted 23 December, 2019; originally announced December 2019.

    Comments: IJCAI 2019

    Journal ref: In Proceedings of the 28th International Joint Conference on Artificial Intelligence, pp. 3116-3122. AAAI Press, 2019

  21. arXiv:1912.06752  [pdf, other

    physics.comp-ph cs.LG stat.ML

    Parameter-Conditioned Sequential Generative Modeling of Fluid Flows

    Authors: Jeremy Morton, Freddie D. Witherden, Mykel J. Kochenderfer

    Abstract: The computational cost associated with simulating fluid flows can make it infeasible to run many simulations across multiple flow conditions. Building upon concepts from generative modeling, we introduce a new method for learning neural network models capable of performing efficient parameterized simulations of fluid flows. Evaluated on their ability to simulate both two-dimensional and three-dime… ▽ More

    Submitted 13 December, 2019; originally announced December 2019.

    Comments: 29 pages, 21 figures

  22. arXiv:1908.01046  [pdf, other

    cs.RO cs.AI cs.LG eess.SY stat.ML

    Adaptive Stress Testing with Reward Augmentation for Autonomous Vehicle Validation

    Authors: Anthony Corso, Peter Du, Katherine Driggs-Campbell, Mykel J. Kochenderfer

    Abstract: Determining possible failure scenarios is a critical step in the evaluation of autonomous vehicle systems. Real-world vehicle testing is commonly employed for autonomous vehicle validation, but the costs and time requirements are high. Consequently, simulation-driven methods such as Adaptive Stress Testing (AST) have been proposed to aid in validation. AST formulates the problem of finding the mos… ▽ More

    Submitted 6 August, 2019; v1 submitted 2 August, 2019; originally announced August 2019.

    Comments: Appears in IEEE ITSC 2019

  23. arXiv:1908.01022  [pdf, other

    cs.LG cs.AI cs.MA stat.ML

    Health-Informed Policy Gradients for Multi-Agent Reinforcement Learning

    Authors: Ross E. Allen, Jayesh K. Gupta, Jaime Pena, Yutai Zhou, Javona White Bear, Mykel J. Kochenderfer

    Abstract: This paper proposes a definition of system health in the context of multiple agents optimizing a joint reward function. We use this definition as a credit assignment term in a policy gradient algorithm to distinguish the contributions of individual agents to the global reward. The health-informed credit assignment is then extended to a multi-agent variant of the proximal policy optimization algori… ▽ More

    Submitted 4 January, 2021; v1 submitted 2 August, 2019; originally announced August 2019.

  24. arXiv:1903.06758  [pdf

    cs.LG stat.ML

    Algorithms for Verifying Deep Neural Networks

    Authors: Changliu Liu, Tomer Arnon, Christopher Lazarus, Christopher Strong, Clark Barrett, Mykel J. Kochenderfer

    Abstract: Deep neural networks are widely used for nonlinear function approximation with applications ranging from computer vision to control. Although these networks involve the composition of simple arithmetic operations, it can be very challenging to verify whether a particular network satisfies certain input-output properties. This article surveys methods that have emerged recently for soundly verifying… ▽ More

    Submitted 15 October, 2020; v1 submitted 15 March, 2019; originally announced March 2019.

  25. arXiv:1903.03642  [pdf, other

    cs.LG cs.RO stat.ML

    Improved Robustness and Safety for Autonomous Vehicle Control with Adversarial Reinforcement Learning

    Authors: Xiaobai Ma, Katherine Driggs-Campbell, Mykel J. Kochenderfer

    Abstract: To improve efficiency and reduce failures in autonomous vehicles, research has focused on developing robust and safe learning methods that take into account disturbances in the environment. Existing literature in robust reinforcement learning poses the learning problem as a two player game between the autonomous system and disturbances. This paper examines two different algorithms to solve the gam… ▽ More

    Submitted 8 March, 2019; originally announced March 2019.

    Comments: intelligent vehicles symposium 2018

    MSC Class: 60-06

    Journal ref: Intelligent Vehicles Symposium (IV), 2018 IEEE

  26. arXiv:1902.09742  [pdf, ps, other

    cs.LG stat.ML

    Deep Variational Koopman Models: Inferring Koopman Observations for Uncertainty-Aware Dynamics Modeling and Control

    Authors: Jeremy Morton, Freddie D Witherden, Mykel J Kochenderfer

    Abstract: Koopman theory asserts that a nonlinear dynamical system can be mapped to a linear system, where the Koopman operator advances observations of the state forward in time. However, the observable functions that map states to observations are generally unknown. We introduce the Deep Variational Koopman (DVK) model, a method for inferring distributions over observations that can be propagated linearly… ▽ More

    Submitted 1 June, 2019; v1 submitted 26 February, 2019; originally announced February 2019.

    Comments: Accepted to the 2019 International Joint Conference on Artificial Intelligence (IJCAI). 8 pages, 3 figures

  27. arXiv:1902.01909  [pdf, other

    cs.RO cs.AI cs.LG stat.ML

    Adaptive Stress Testing for Autonomous Vehicles

    Authors: Mark Koren, Saud Alsaif, Ritchie Lee, Mykel J. Kochenderfer

    Abstract: This paper presents a method for testing the decision making systems of autonomous vehicles. Our approach involves perturbing stochastic elements in the vehicle's environment until the vehicle is involved in a collision. Instead of applying direct Monte Carlo sampling to find collision scenarios, we formulate the problem as a Markov decision process and use reinforcement learning algorithms to fin… ▽ More

    Submitted 5 February, 2019; originally announced February 2019.

  28. arXiv:1811.09977  [pdf, other

    stat.ML cs.LG

    Robust Super-Level Set Estimation using Gaussian Processes

    Authors: Andrea Zanette, Junzi Zhang, Mykel J. Kochenderfer

    Abstract: This paper focuses on the problem of determining as large a region as possible where a function exceeds a given threshold with high probability. We assume that we only have access to a noise-corrupted version of the function and that function evaluations are costly. To select the next query point, we propose maximizing the expected volume of the domain identified as above the threshold as predicte… ▽ More

    Submitted 25 November, 2018; originally announced November 2018.

    Comments: Accepted to ECML 2018

  29. arXiv:1810.04240  [pdf, other

    cs.LG stat.ML

    Deep Neural Network Compression for Aircraft Collision Avoidance Systems

    Authors: Kyle D. Julian, Mykel J. Kochenderfer, Michael P. Owen

    Abstract: One approach to designing decision making logic for an aircraft collision avoidance system frames the problem as a Markov decision process and optimizes the system using dynamic programming. The resulting collision avoidance strategy can be represented as a numeric table. This methodology has been used in the development of the Airborne Collision Avoidance System X (ACAS X) family of collision avo… ▽ More

    Submitted 9 October, 2018; originally announced October 2018.

  30. arXiv:1805.08913  [pdf, other

    stat.ML cs.AI cs.LG

    Amortized Inference Regularization

    Authors: Rui Shu, Hung H. Bui, Shengjia Zhao, Mykel J. Kochenderfer, Stefano Ermon

    Abstract: The variational autoencoder (VAE) is a popular model for density estimation and representation learning. Canonically, the variational principle suggests to prefer an expressive inference model so that the variational approximation is accurate. However, it is often overlooked that an overly-expressive inference model can be detrimental to the test set performance of both the amortized posterior app… ▽ More

    Submitted 9 January, 2019; v1 submitted 22 May, 2018; originally announced May 2018.

    Comments: NeurIPS 2018

  31. arXiv:1710.05090  [pdf, other

    cs.LG stat.ML

    Burn-In Demonstrations for Multi-Modal Imitation Learning

    Authors: Alex Kuefler, Mykel J. Kochenderfer

    Abstract: Recent work on imitation learning has generated policies that reproduce expert behavior from multi-modal data. However, past approaches have focused only on recreating a small number of distinct, expert maneuvers, or have relied on supervised learning techniques that produce unstable policies. This work extends InfoGAIL, an algorithm for multi-modal imitation learning, to reproduce behavior over a… ▽ More

    Submitted 13 October, 2017; originally announced October 2017.

    Comments: 1st Conference on Robotic Learning, Non-archival Track

  32. arXiv:1709.02802  [pdf, ps, other

    cs.LG cs.CR cs.LO stat.ML

    Towards Proving the Adversarial Robustness of Deep Neural Networks

    Authors: Guy Katz, Clark Barrett, David L. Dill, Kyle Julian, Mykel J. Kochenderfer

    Abstract: Autonomous vehicles are highly complex systems, required to function reliably in a wide variety of situations. Manually crafting software controllers for these vehicles is difficult, but there has been some success in using deep neural networks generated using machine-learning. However, deep neural networks are opaque to human engineers, rendering their correctness very difficult to prove manually… ▽ More

    Submitted 8 September, 2017; originally announced September 2017.

    Comments: In Proceedings FVAV 2017, arXiv:1709.02126

    ACM Class: D.2.4; I.2.2

    Journal ref: EPTCS 257, 2017, pp. 19-26

  33. arXiv:1704.05566  [pdf, other

    cs.LG cs.AI stat.ML

    Simultaneous Policy Learning and Latent State Inference for Imitating Driver Behavior

    Authors: Jeremy Morton, Mykel J. Kochenderfer

    Abstract: In this work, we propose a method for learning driver models that account for variables that cannot be observed directly. When trained on a synthetic dataset, our models are able to learn encodings for vehicle trajectories that distinguish between four distinct classes of driver behavior. Such encodings are learned without any knowledge of the number of driver classes or any objective that directl… ▽ More

    Submitted 18 April, 2017; originally announced April 2017.

    Comments: 7 pages, 6 figures, 2 tables