Showing 1–10 of 10 results for author: Romeres, D

  1. arXiv:2002.10621

    cs.LG cs.RO eess.SP eess.SY stat.ML

    Model-Based Reinforcement Learning for Physical Systems Without Velocity and Acceleration Measurements

    Authors: Alberto Dalla Libera, Diego Romeres, Devesh K. Jha, Bill Yerazunis, Daniel Nikovski

    Abstract: In this paper, we propose a derivative-free model learning framework for Reinforcement Learning (RL) algorithms based on Gaussian Process Regression (GPR). In many mechanical systems, only positions can be measured by the sensing instruments. Then, instead of representing the system state as suggested by the physics with a collection of positions, velocities, and accelerations, we define the state…

    Submitted 24 February, 2020; originally announced February 2020.

    Comments: Accepted at RA-L

  2. arXiv:2001.08092

    cs.LG cs.RO eess.SY stat.ML

    Local Policy Optimization for Trajectory-Centric Reinforcement Learning

    Authors: Patrik Kolaric, Devesh K. Jha, Arvind U. Raghunathan, Frank L. Lewis, Mouhacine Benosman, Diego Romeres, Daniel Nikovski

    Abstract: The goal of this paper is to present a method for simultaneous trajectory and local stabilizing policy optimization to generate local policies for trajectory-centric model-based reinforcement learning (MBRL). This is motivated by the fact that global policy optimization for non-linear systems could be a very challenging problem both algorithmically and numerically. However, a lot of robotic manipu…

    Submitted 22 January, 2020; originally announced January 2020.

    Journal ref: ICRA 2020

  3. arXiv:1912.11912

    cs.LG cs.AI cs.RO stat.ML

    Quasi-Newton Trust Region Policy Optimization

    Authors: Devesh Jha, Arvind Raghunathan, Diego Romeres

    Abstract: We propose a trust region method for policy optimization that employs a Quasi-Newton approximation for the Hessian, called Quasi-Newton Trust Region Policy Optimization (QNTRPO). Gradient descent is the de facto algorithm for reinforcement learning tasks with continuous controls. The algorithm has achieved state-of-the-art performance when used in reinforcement learning across a wide range of tasks. H…

    Submitted 26 December, 2019; originally announced December 2019.

    Comments: 3rd Conference on Robot Learning (CoRL 2019)

  4. arXiv:1809.05074

    cs.LG stat.ML

    Derivative-free online learning of inverse dynamics models

    Authors: Diego Romeres, Mattia Zorzi, Raffaello Camoriano, Silvio Traversaro, Alessandro Chiuso

    Abstract: This paper discusses online algorithms for inverse dynamics modelling in robotics. Several model classes including rigid body dynamics (RBD) models, data-driven models and semiparametric models (which are a combination of the previous two classes) are placed in a common framework. While model classes used in the literature typically exploit joint velocities and accelerations, which need to be appr…

    Submitted 13 September, 2018; originally announced September 2018.

    Comments: 14 pages, 11 figures

  5. arXiv:1809.04993

    cs.RO cs.LG stat.ML

    Semiparametrical Gaussian Processes Learning of Forward Dynamical Models for Navigating in a Circular Maze

    Authors: Diego Romeres, Devesh Jha, Alberto Dalla Libera, William Yerazunis, Daniel Nikovski

    Abstract: This paper presents a problem of model learning for the purpose of learning how to navigate a ball to a goal state in a circular maze environment with two degrees of freedom. The motion of the ball in the maze environment is influenced by several non-linear effects such as dry friction and contacts, which are difficult to model physically. We propose a semiparametric model to estimate the motion d…

    Submitted 18 September, 2018; v1 submitted 13 September, 2018; originally announced September 2018.

    Comments: 7 pages including the references, 5 figures. Changed title, improved the structure of the article and the images

  6. arXiv:1809.04720

    cs.LG stat.ML

    Sim-to-Real Transfer Learning using Robustified Controllers in Robotic Tasks involving Complex Dynamics

    Authors: Jeroen van Baar, Alan Sullivan, Radu Cordorel, Devesh Jha, Diego Romeres, Daniel Nikovski

    Abstract: Learning robot tasks or controllers using deep reinforcement learning has been proven effective in simulations. Learning in simulation has several advantages. For example, one can fully control the simulated environment, including halting motions while performing computations. Another advantage, when robots are involved, is that the amount of time a robot is occupied learning a task---rather than b…

    Submitted 17 September, 2018; v1 submitted 12 September, 2018; originally announced September 2018.

    Comments: 7 pages

  7. arXiv:1603.05412

    math.OC cs.LG stat.ML

    Online semi-parametric learning for inverse dynamics modeling

    Authors: Diego Romeres, Mattia Zorzi, Raffaello Camoriano, Alessandro Chiuso

    Abstract: This paper presents a semi-parametric algorithm for online learning of a robot inverse dynamics model. It combines the strengths of parametric and non-parametric modeling. The former exploits the rigid body dynamics equation, while the latter exploits a suitable kernel function. We provide an extensive comparison with other methods from the literature using real data from the iCub humanoid ro…

    Submitted 9 October, 2016; v1 submitted 17 March, 2016; originally announced March 2016.

  8. arXiv:1601.04251

    eess.SY cs.LG stat.AP stat.ML

    On-line Bayesian System Identification

    Authors: Diego Romeres, Giulia Prando, Gianluigi Pillonetto, Alessandro Chiuso

    Abstract: We consider an on-line system identification setting, in which new data become available at given time steps. In order to meet real-time estimation requirements, we propose a tailored Bayesian system identification procedure, in which the hyper-parameters are still updated through Marginal Likelihood maximization, but after only one iteration of a suitable iterative optimization algorithm. Both gr…

    Submitted 17 January, 2016; originally announced January 2016.

  9. arXiv:1507.00543

    stat.ML

    Classical vs. Bayesian methods for linear system identification: point estimators and confidence sets

    Authors: D. Romeres, G. Prando, G. Pillonetto, A. Chiuso

    Abstract: This paper compares classical parametric methods with recently developed Bayesian methods for system identification. A Full Bayes solution is considered together with one of the standard approximations based on the Empirical Bayes paradigm. Results regarding point estimators for the impulse response as well as for confidence regions are reported.

    Submitted 2 July, 2015; originally announced July 2015.

    Comments: 8 pages, 4 figures

  10. arXiv:1507.00507

    stat.ML

    Identification of stable models via nonparametric prediction error methods

    Authors: Diego Romeres, Gianluigi Pillonetto, Alessandro Chiuso

    Abstract: A new Bayesian approach to linear system identification has been proposed in a series of recent papers. The main idea is to frame linear system identification as predictor estimation in an infinite dimensional space, with the aid of regularization/Bayesian techniques. This approach guarantees the identification of stable predictors based on the prediction error minimization. Unluckily, the stabili…

    Submitted 2 July, 2015; originally announced July 2015.

    Comments: 6 pages, 3 figures