Showing 1–6 of 6 results for author: Vincent, J A

  1. arXiv:2501.04823  [pdf, other]

    cs.RO math.OC stat.AP

    Learning Robot Safety from Sparse Human Feedback using Conformal Prediction

    Authors: Aaron O. Feldman, Joseph A. Vincent, Maximilian Adang, Jun En Low, Mac Schwager

    Abstract: Ensuring robot safety can be challenging; user-defined constraints can miss edge cases, policies can become unsafe even when trained from safe data, and safety can be subjective. Thus, we learn about robot safety by showing policy trajectories to a human who flags unsafe behavior. From this binary feedback, we use the statistical method of conformal prediction to identify a region of states, poten…

    Submitted 8 January, 2025; originally announced January 2025.

  2. arXiv:2405.05439  [pdf, other]

    cs.RO cs.AI cs.LG stat.AP

    How Generalizable Is My Behavior Cloning Policy? A Statistical Approach to Trustworthy Performance Evaluation

    Authors: Joseph A. Vincent, Haruki Nishimura, Masha Itkina, Paarth Shah, Mac Schwager, Thomas Kollar

    Abstract: With the rise of stochastic generative models in robot policy learning, end-to-end visuomotor policies are increasingly successful at solving complex tasks by learning from human demonstrations. Nevertheless, since real-world evaluation costs afford users only a small number of policy rollouts, it remains a challenge to accurately gauge the performance of such policies. This is exacerbated by dist…

    Submitted 18 July, 2024; v1 submitted 8 May, 2024; originally announced May 2024.

    Comments: This work has been submitted to the IEEE for possible publication

  3. arXiv:2309.10874  [pdf, other]

    cs.RO eess.SY

    Guarantees on Robot System Performance Using Stochastic Simulation Rollouts

    Authors: Joseph A. Vincent, Aaron O. Feldman, Mac Schwager

    Abstract: We provide finite-sample performance guarantees for control policies executed on stochastic robotic systems. Given an open- or closed-loop policy and a finite set of trajectory rollouts under the policy, we bound the expected value, value-at-risk, and conditional-value-at-risk of the trajectory cost, and the probability of failure in a sparse cost setting. The bounds hold, with user-specified prob…

    Submitted 13 June, 2024; v1 submitted 19 September, 2023; originally announced September 2023.

    Comments: Submitted to IEEE-TRO

  4. arXiv:2210.08339  [pdf, other]

    cs.LG cs.AI cs.RO eess.SY

    Reachable Polyhedral Marching (RPM): An Exact Analysis Tool for Deep-Learned Control Systems

    Authors: Joseph A. Vincent, Mac Schwager

    Abstract: Neural networks are increasingly used in robotics as policies, state transition models, state estimation models, or all of the above. With these components being learned from data, it is important to be able to analyze what behaviors were learned and how this affects closed-loop performance. In this paper we take steps toward this goal by developing methods for computing control invariant sets and…

    Submitted 29 March, 2025; v1 submitted 15 October, 2022; originally announced October 2022.

    Comments: Submitted to IEEE Transactions on Neural Networks and Learning Systems. arXiv admin note: text overlap with arXiv:2011.11609

  5. DiNNO: Distributed Neural Network Optimization for Multi-Robot Collaborative Learning

    Authors: Javier Yu, Joseph A. Vincent, Mac Schwager

    Abstract: We present a distributed algorithm that enables a group of robots to collaboratively optimize the parameters of a deep neural network model while communicating over a mesh network. Each robot only has access to its own data and maintains its own version of the neural network, but eventually learns a model that is as good as if it had been trained on all the data centrally. No robot sends raw data…

    Submitted 17 September, 2021; originally announced September 2021.

    Comments: Submitted to IEEE Robotics and Automation Letters (with conference ICRA)

  6. arXiv:2011.11609  [pdf, other]

    cs.RO cs.AI

    Reachable Polyhedral Marching (RPM): A Safety Verification Algorithm for Robotic Systems with Deep Neural Network Components

    Authors: Joseph A. Vincent, Mac Schwager

    Abstract: We present a method for computing exact reachable sets for deep neural networks with rectified linear unit (ReLU) activation. Our method is well-suited for use in rigorous safety analysis of robotic perception and control systems with deep neural network components. Our algorithm can compute both forward and backward reachable sets for a ReLU network iterated over multiple time steps, as would be…

    Submitted 1 April, 2021; v1 submitted 23 November, 2020; originally announced November 2020.

    Comments: Accepted to International Conference on Robotics and Automation (ICRA) 2021