Skip to main content

Showing 101–122 of 122 results for author: Boots, B

.
  1. arXiv:1710.05387  [pdf, ps, other

    cs.LG cs.AI stat.ML

    Manifold Regularization for Kernelized LSTD

    Authors: Xinyan Yan, Krzysztof Choromanski, Byron Boots, Vikas Sindhwani

    Abstract: Policy evaluation or value function or Q-function approximation is a key procedure in reinforcement learning (RL). It is a necessary component of policy iteration and can be used for variance reduction in policy gradient methods. Therefore its quality has a significant impact on most RL algorithms. Motivated by manifold regularized learning, we propose a novel kernelized policy evaluation method t… ▽ More

    Submitted 15 October, 2017; originally announced October 2017.

    Comments: 6 pages, CoRL 2017 non-archival track

  2. arXiv:1709.08520  [pdf, other

    stat.ML cs.LG

    Predictive-State Decoders: Encoding the Future into Recurrent Networks

    Authors: Arun Venkatraman, Nicholas Rhinehart, Wen Sun, Lerrel Pinto, Martial Hebert, Byron Boots, Kris M. Kitani, J. Andrew Bagnell

    Abstract: Recurrent neural networks (RNNs) are a vital modeling technique that rely on internal states learned indirectly by optimization of a supervised, unsupervised, or reinforcement training loss. RNNs are used to model dynamic processes that are characterized by underlying latent states whose form is often unknown, precluding its analytic representation inside an RNN. In the Predictive-State Representa… ▽ More

    Submitted 25 September, 2017; originally announced September 2017.

    Comments: NIPS 2017

  3. arXiv:1709.07174  [pdf, other

    cs.RO

    Agile Autonomous Driving using End-to-End Deep Imitation Learning

    Authors: Yunpeng Pan, Ching-An Cheng, Kamil Saigol, Keuntaek Lee, Xinyan Yan, Evangelos Theodorou, Byron Boots

    Abstract: We present an end-to-end imitation learning system for agile, off-road autonomous driving using only low-cost sensors. By imitating a model predictive controller equipped with advanced sensors, we train a deep neural network control policy to map raw, high-dimensional observations to continuous steering and throttle commands. Compared with recent approaches to similar tasks, our method requires ne… ▽ More

    Submitted 9 August, 2019; v1 submitted 21 September, 2017; originally announced September 2017.

    Comments: 13 pages, Robotics: Science and Systems (RSS) 2018

  4. arXiv:1709.03410  [pdf, other

    cs.CV

    One-Shot Learning for Semantic Segmentation

    Authors: Amirreza Shaban, Shray Bansal, Zhen Liu, Irfan Essa, Byron Boots

    Abstract: Low-shot learning methods for image classification support learning from sparse data. We extend these techniques to support dense semantic image segmentation. Specifically, we train a network that, given a small set of annotated images, produces parameters for a Fully Convolutional Network (FCN). We use this FCN to perform dense pixel-level prediction on a test image for the new semantic class. Ou… ▽ More

    Submitted 11 September, 2017; originally announced September 2017.

    Comments: To appear in the proceedings of the British Machine Vision Conference (BMVC) 2017. The code is available at https://github.com/lzzcd001/OSLSM

  5. Continuous-Time Gaussian Process Motion Planning via Probabilistic Inference

    Authors: Mustafa Mukadam, Jing Dong, Xinyan Yan, Frank Dellaert, Byron Boots

    Abstract: We introduce a novel formulation of motion planning, for continuous-time trajectories, as probabilistic inference. We first show how smooth continuous-time trajectories can be represented by a small number of states using sparse Gaussian process (GP) models. We next develop an efficient gradient-based optimization algorithm that exploits this sparsity and GP interpolation. We call this algorithm t… ▽ More

    Submitted 22 November, 2018; v1 submitted 23 July, 2017; originally announced July 2017.

    Comments: The International Journal of Robotics Research (IJRR), 2018, Volume 37, Issue 11

  6. arXiv:1705.09353  [pdf, other

    stat.ML

    Predictive State Recurrent Neural Networks

    Authors: Carlton Downey, Ahmed Hefny, Boyue Li, Byron Boots, Geoffrey Gordon

    Abstract: We present a new model, Predictive State Recurrent Neural Networks (PSRNNs), for filtering and prediction in dynamical systems. PSRNNs draw on insights from both Recurrent Neural Networks (RNNs) and Predictive State Representations (PSRs), and inherit advantages from both types of models. Like many successful RNN architectures, PSRNNs use (potentially deeply composed) bilinear transfer functions t… ▽ More

    Submitted 17 June, 2017; v1 submitted 25 May, 2017; originally announced May 2017.

  7. arXiv:1705.06020  [pdf, other

    cs.RO

    Sparse Gaussian Processes for Continuous-Time Trajectory Estimation on Matrix Lie Groups

    Authors: Jing Dong, Byron Boots, Frank Dellaert

    Abstract: Continuous-time trajectory representations are a powerful tool that can be used to address several issues in many practical simultaneous localization and mapping (SLAM) scenarios, like continuously collected measurements distorted by robot motion, or during with asynchronous sensor measurements. Sparse Gaussian processes (GP) allow for a probabilistic non-parametric trajectory representation that… ▽ More

    Submitted 17 May, 2017; originally announced May 2017.

    Comments: 6 pages

  8. arXiv:1703.01030  [pdf, other

    cs.LG

    Deeply AggreVaTeD: Differentiable Imitation Learning for Sequential Prediction

    Authors: Wen Sun, Arun Venkatraman, Geoffrey J. Gordon, Byron Boots, J. Andrew Bagnell

    Abstract: Researchers have demonstrated state-of-the-art performance in sequential decision making problems (e.g., robotics control, sequential prediction) with deep neural network models. One often has access to near-optimal oracles that achieve good performance on the task during training. We demonstrate that AggreVaTeD --- a policy gradient extension of the Imitation Learning (IL) approach of (Ross & Bag… ▽ More

    Submitted 2 March, 2017; originally announced March 2017.

    Comments: 17 pages

  9. arXiv:1702.07335  [pdf, other

    cs.RO eess.SY

    Approximately Optimal Continuous-Time Motion Planning and Control via Probabilistic Inference

    Authors: Mustafa Mukadam, Ching-An Cheng, Xinyan Yan, Byron Boots

    Abstract: The problem of optimal motion planing and control is fundamental in robotics. However, this problem is intractable for continuous-time stochastic systems in general and the solution is difficult to approximate if non-instantaneous nonlinear performance indices are present. In this work, we provide an efficient algorithm, PIPC (Probabilistic Inference for Planning and Control), that yields approxim… ▽ More

    Submitted 27 February, 2017; v1 submitted 23 February, 2017; originally announced February 2017.

    Comments: minor fixes and typos

  10. arXiv:1610.02482  [pdf, other

    cs.RO cs.CV

    4D Crop Monitoring: Spatio-Temporal Reconstruction for Agriculture

    Authors: Jing Dong, John Gary Burnham, Byron Boots, Glen C. Rains, Frank Dellaert

    Abstract: Autonomous crop monitoring at high spatial and temporal resolution is a critical problem in precision agriculture. While Structure from Motion and Multi-View Stereo algorithms can finely reconstruct the 3D structure of a field with low-cost image sensors, these algorithms fail to capture the dynamic nature of continuously growing crops. In this paper we propose a 4D reconstruction approach to crop… ▽ More

    Submitted 8 October, 2016; originally announced October 2016.

    Comments: Submitted to IEEE International Conference on Robotics and Automation (ICRA) 2017

  11. arXiv:1608.06235  [pdf, other

    cs.RO cs.LG

    Adaptive Probabilistic Trajectory Optimization via Efficient Approximate Inference

    Authors: Yunpeng Pan, Xinyan Yan, Evangelos Theodorou, Byron Boots

    Abstract: Robotic systems must be able to quickly and robustly make decisions when operating in uncertain and dynamic environments. While Reinforcement Learning (RL) can be used to compute optimal policies with little prior knowledge about the environment, it suffers from slow convergence. An alternative approach is Model Predictive Control (MPC), which optimizes policies quickly, but also requires accurate… ▽ More

    Submitted 11 September, 2016; v1 submitted 22 August, 2016; originally announced August 2016.

  12. arXiv:1607.04579  [pdf, other

    cs.LG math.OC stat.ML

    Learning from Conditional Distributions via Dual Embeddings

    Authors: Bo Dai, Niao He, Yunpeng Pan, Byron Boots, Le Song

    Abstract: Many machine learning tasks, such as learning with invariance and policy evaluation in reinforcement learning, can be characterized as problems of learning from conditional distributions. In such problems, each sample $x$ itself is associated with a conditional distribution $p(z|x)$ represented by samples $\{z_i\}_{i=1}^M$, and the goal is to learn a function $f$ that links these conditional distr… ▽ More

    Submitted 31 December, 2016; v1 submitted 15 July, 2016; originally announced July 2016.

    Comments: 24 pages, 11 figures

  13. arXiv:1601.03648  [pdf, other

    cs.RO

    Functional Gradient Motion Planning in Reproducing Kernel Hilbert Spaces

    Authors: Zita Marinho, Anca Dragan, Arun Byravan, Byron Boots, Siddhartha Srinivasa, Geoffrey Gordon

    Abstract: We introduce a functional gradient descent trajectory optimization algorithm for robot motion planning in Reproducing Kernel Hilbert Spaces (RKHSs). Functional gradient algorithms are a popular choice for motion planning in complex many-degree-of-freedom robots, since they (in theory) work by directly optimizing within a space of continuous trajectories to avoid obstacles while maintaining geometr… ▽ More

    Submitted 14 January, 2016; originally announced January 2016.

  14. arXiv:1512.08836  [pdf, other

    cs.LG

    Learning to Filter with Predictive State Inference Machines

    Authors: Wen Sun, Arun Venkatraman, Byron Boots, J. Andrew Bagnell

    Abstract: Latent state space models are a fundamental and widely used tool for modeling dynamical systems. However, they are difficult to learn from data and learned models often lack performance guarantees on inference tasks such as filtering and prediction. In this work, we present the PREDICTIVE STATE INFERENCE MACHINE (PSIM), a data-driven method that considers the inference procedure on a dynamical sys… ▽ More

    Submitted 30 May, 2016; v1 submitted 29 December, 2015; originally announced December 2015.

    Comments: ICML 2016

  15. arXiv:1504.02696  [pdf, other

    cs.RO

    Incremental Sparse GP Regression for Continuous-time Trajectory Estimation & Mapping

    Authors: Xinyan Yan, Vadim Indelman, Byron Boots

    Abstract: Recent work on simultaneous trajectory estimation and mapping (STEAM) for mobile robots has found success by representing the trajectory as a Gaussian process. Gaussian processes can represent a continuous-time trajectory, elegantly handle asynchronous and sparse measurements, and allow the robot to query the trajectory to recover its estimated position at any time of interest. A major drawback of… ▽ More

    Submitted 10 April, 2015; originally announced April 2015.

    Comments: 10 pages, 10 figures

  16. arXiv:1309.6819  [pdf

    cs.LG stat.ML

    Hilbert Space Embeddings of Predictive State Representations

    Authors: Byron Boots, Geoffrey Gordon, Arthur Gretton

    Abstract: Predictive State Representations (PSRs) are an expressive class of models for controlled stochastic processes. PSRs represent state as a set of predictions of future observable events. Because PSRs are defined entirely in terms of observable data, statistically consistent estimates of PSR parameters can be learned efficiently by manipulating moments of observed training data. Most learning algorit… ▽ More

    Submitted 26 September, 2013; originally announced September 2013.

    Comments: Appears in Proceedings of the Twenty-Ninth Conference on Uncertainty in Artificial Intelligence (UAI2013)

    Report number: UAI-P-2013-PG-92-101

  17. arXiv:1207.2491  [pdf, other

    cs.LG cs.RO stat.ML

    A Spectral Learning Approach to Range-Only SLAM

    Authors: Byron Boots, Geoffrey J. Gordon

    Abstract: We present a novel spectral learning algorithm for simultaneous localization and mapping (SLAM) from range data with known correspondences. This algorithm is an instance of a general spectral system identification framework, from which it inherits several desirable properties, including statistical consistency and no local optima. Compared with popular batch optimization or multiple-hypothesis tra… ▽ More

    Submitted 10 July, 2012; originally announced July 2012.

  18. arXiv:1206.4648  [pdf

    cs.LG

    Two-Manifold Problems with Applications to Nonlinear System Identification

    Authors: Byron Boots, Geoff Gordon

    Abstract: Recently, there has been much interest in spectral approaches to learning manifolds---so-called kernel eigenmap methods. These methods have had some successes, but their applicability is limited because they are not robust to noise. To address this limitation, we look at two-manifold problems, in which we simultaneously reconstruct two related manifolds, each representing a different view of the s… ▽ More

    Submitted 18 June, 2012; originally announced June 2012.

    Comments: ICML2012. arXiv admin note: text overlap with arXiv:1112.6399

  19. arXiv:1112.6399  [pdf, other

    cs.LG

    Two-Manifold Problems

    Authors: Byron Boots, Geoffrey J. Gordon

    Abstract: Recently, there has been much interest in spectral approaches to learning manifolds---so-called kernel eigenmap methods. These methods have had some successes, but their applicability is limited because they are not robust to noise. To address this limitation, we look at two-manifold problems, in which we simultaneously reconstruct two related manifolds, each representing a different view of the s… ▽ More

    Submitted 29 December, 2011; originally announced December 2011.

  20. arXiv:1011.0041  [pdf, other

    cs.LG cs.AI

    Predictive State Temporal Difference Learning

    Authors: Byron Boots, Geoffrey J. Gordon

    Abstract: We propose a new approach to value function approximation which combines linear temporal difference reinforcement learning with subspace identification. In practical applications, reinforcement learning (RL) is complicated by the fact that state is either high-dimensional or partially observable. Therefore, RL methods are designed to work with features of state rather than state itself, and the su… ▽ More

    Submitted 17 January, 2011; v1 submitted 29 October, 2010; originally announced November 2010.

  21. arXiv:0912.2385  [pdf, other

    cs.LG cs.AI

    Closing the Learning-Planning Loop with Predictive State Representations

    Authors: Byron Boots, Sajid M. Siddiqi, Geoffrey J. Gordon

    Abstract: A central problem in artificial intelligence is that of planning to maximize future reward under uncertainty in a partially observable environment. In this paper we propose and demonstrate a novel algorithm which accurately learns a model of such an environment directly from sequences of action-observation pairs. We then close the loop from observations to actions by planning in the learned mode… ▽ More

    Submitted 11 December, 2009; originally announced December 2009.

  22. arXiv:0910.0902  [pdf, other

    cs.LG cs.AI

    Reduced-Rank Hidden Markov Models

    Authors: Sajid M. Siddiqi, Byron Boots, Geoffrey J. Gordon

    Abstract: We introduce the Reduced-Rank Hidden Markov Model (RR-HMM), a generalization of HMMs that can model smooth state evolution as in Linear Dynamical Systems (LDSs) as well as non-log-concave predictive distributions as in continuous-observation HMMs. RR-HMMs assume an m-dimensional latent state and n discrete observations, with a transition matrix of rank k <= m. This implies the dynamics evolve in… ▽ More

    Submitted 22 December, 2009; v1 submitted 6 October, 2009; originally announced October 2009.

    Comments: Updated robot experiment figure, added details on KDE, fixed a couple of errors