Skip to main content

Showing 1–7 of 7 results for author: Baker, C L

.
  1. arXiv:2405.11669  [pdf, other

    cs.LG cs.AI

    Do No Harm: A Counterfactual Approach to Safe Reinforcement Learning

    Authors: Sean Vaskov, Wilko Schwarting, Chris L. Baker

    Abstract: Reinforcement Learning (RL) for control has become increasingly popular due to its ability to learn rich feedback policies that take into account uncertainty and complex representations of the environment. When considering safety constraints, constrained optimization approaches, where agents are penalized for constraint violations, are commonly used. In such methods, if agents are initialized in,… ▽ More

    Submitted 19 May, 2024; originally announced May 2024.

  2. arXiv:2106.09127  [pdf, other

    cs.RO

    Planning on a (Risk) Budget: Safe Non-Conservative Planning in Probabilistic Dynamic Environments

    Authors: Hung-Jui Huang, Kai-Chi Huang, Michal Čáp, Yibiao Zhao, Ying Nian Wu, Chris L. Baker

    Abstract: Planning in environments with other agents whose future actions are uncertain often requires compromise between safety and performance. Here our goal is to design efficient planning algorithms with guaranteed bounds on the probability of safety violation, which nonetheless achieve non-conservative performance. To quantify a system's risk, we define a natural criterion called interval risk bounds (… ▽ More

    Submitted 16 June, 2021; originally announced June 2021.

    Comments: 9 pages, 5 figures, International Conference on Robotics and Automation 2021

  3. arXiv:2105.06979  [pdf, other

    astro-ph.HE gr-qc nucl-ex nucl-th

    The Radius of PSR J0740+6620 from NICER and XMM-Newton Data

    Authors: M. C. Miller, F. K. Lamb, A. J. Dittmann, S. Bogdanov, Z. Arzoumanian, K. C. Gendreau, S. Guillot, W. C. G. Ho, J. M. Lattimer, M. Loewenstein, S. M. Morsink, P. S. Ray, M. T. Wolff, C. L. Baker, T. Cazeau, S. Manthripragada, C. B. Markwardt, T. Okajima, S. Pollard, I. Cognard, H. T. Cromartie, E. Fonseca, L. Guillemot, M. Kerr, A. Parthasarathy , et al. (3 additional authors not shown)

    Abstract: PSR J0740$+$6620 has a gravitational mass of $2.08\pm 0.07~M_\odot$, which is the highest reliably determined mass of any neutron star. As a result, a measurement of its radius will provide unique insight into the properties of neutron star core matter at high densities. Here we report a radius measurement based on fits of rotating hot spot patterns to Neutron Star Interior Composition Explorer (N… ▽ More

    Submitted 14 May, 2021; originally announced May 2021.

    Comments: 49 pages, 16 figures, submitted to The Astrophysical Journal Letters

  4. PODDP: Partially Observable Differential Dynamic Programming for Latent Belief Space Planning

    Authors: Dicong Qiu, Yibiao Zhao, Chris L. Baker

    Abstract: Autonomous agents are limited in their ability to observe the world state. Partially observable Markov decision processes (POMDPs) formally model the problem of planning under world state uncertainty, but POMDPs with continuous actions and nonlinear dynamics suitable for robotics applications are challenging to solve. In this paper, we present an efficient differential dynamic programming (DDP) al… ▽ More

    Submitted 14 December, 2019; originally announced December 2019.

    Comments: 16 pages, 6 figures, preprint

    Journal ref: Robotics: Science and Systems, 2020. 69.1-69.10

  5. arXiv:1912.05702  [pdf, other

    astro-ph.HE astro-ph.SR nucl-th

    A NICER View of PSR J0030+0451: Millisecond Pulsar Parameter Estimation

    Authors: Thomas E. Riley, Anna L. Watts, Slavko Bogdanov, Paul S. Ray, Renee M. Ludlam, Sebastien Guillot, Zaven Arzoumanian, Charles L. Baker, Anna V. Bilous, Deepto Chakrabarty, Keith C. Gendreau, Alice K. Harding, Wynn C. G. Ho, James M. Lattimer, Sharon M. Morsink, Tod E. Strohmayer

    Abstract: We report on Bayesian parameter estimation of the mass and equatorial radius of the millisecond pulsar PSR J0030$+$0451, conditional on pulse-profile modeling of Neutron Star Interior Composition Explorer (NICER) X-ray spectral-timing event data. We perform relativistic ray-tracing of thermal emission from hot regions of the pulsar's surface. We assume two distinct hot regions based on two clear p… ▽ More

    Submitted 11 December, 2019; originally announced December 2019.

    Comments: Appears in ApJ Letters Focus Issue on NICER Constraints on the Dense Matter Equation of State; 76 pages, 24 figures, 7 tables, 8 figure sets (available in the online journal or from the authors)

    Journal ref: ApJL, 887, L21 (2019)

  6. arXiv:1602.03924  [pdf, other

    cs.AI cs.GT cs.MA

    Modeling Human Ad Hoc Coordination

    Authors: Peter M. Krafft, Chris L. Baker, Alex Pentland, Joshua B. Tenenbaum

    Abstract: Whether in groups of humans or groups of computer agents, collaboration is most effective between individuals who have the ability to coordinate on a joint strategy for collective action. However, in general a rational actor will only intend to coordinate if that actor believes the other group members have the same intention. This circular dependence makes rational coordination difficult in uncert… ▽ More

    Submitted 11 February, 2016; originally announced February 2016.

    Comments: AAAI 2016

    ACM Class: I.2.0; I.2.11; J.4

  7. arXiv:1512.00964  [pdf, other

    cs.AI

    Modeling Human Understanding of Complex Intentional Action with a Bayesian Nonparametric Subgoal Model

    Authors: Ryo Nakahashi, Chris L. Baker, Joshua B. Tenenbaum

    Abstract: Most human behaviors consist of multiple parts, steps, or subtasks. These structures guide our action planning and execution, but when we observe others, the latent structure of their actions is typically unobservable, and must be inferred in order to learn new skills by demonstration, or to assist others in completing their tasks. For example, an assistant who has learned the subgoal structure of… ▽ More

    Submitted 3 December, 2015; originally announced December 2015.

    Comments: Accepted at AAAI 16

    Journal ref: Proceedings of 30th conference on artificial intelligence (AAAI 2016) pp. 3754--3760