Showing 1–27 of 27 results for author: Nikovski, D

  1. arXiv:2307.05093  [pdf, other]

    cs.RO eess.SY

    Forward Dynamics Estimation from Data-Driven Inverse Dynamics Learning

    Authors: Alberto Dalla Libera, Giulio Giacomuzzo, Ruggero Carli, Daniel Nikovski, Diego Romeres

    Abstract: In this paper, we propose to estimate the forward dynamics equations of mechanical systems by learning a model of the inverse dynamics and estimating individual dynamics components from it. We revisit the classical formulation of rigid body dynamics in order to extrapolate the physical dynamical components, such as inertial and gravitational components, from an inverse dynamics model. After estima…

    Submitted 11 July, 2023; originally announced July 2023.

  2. arXiv:2303.03282  [pdf, other]

    cs.RO eess.SY

    Learning Object Manipulation With Under-Actuated Impulse Generator Arrays

    Authors: Chuizheng Kong, William Yerazunis, Daniel Nikovski

    Abstract: For more than half a century, vibratory bowl feeders have been the standard in automated assembly for singulation, orientation, and manipulation of small parts. Unfortunately, these feeders are expensive, noisy, and highly specialized on a single-part-design basis. We consider an alternative device and learning control method for singulation, orientation, and manipulation by means of seven fixed-p…

    Submitted 6 March, 2023; originally announced March 2023.

    Comments: Accepted at the 2023 American Control Conference

  3. arXiv:2301.13183  [pdf, other]

    cs.RO cs.LG

    Learning Control from Raw Position Measurements

    Authors: Fabio Amadio, Alberto Dalla Libera, Daniel Nikovski, Ruggero Carli, Diego Romeres

    Abstract: We propose a Model-Based Reinforcement Learning (MBRL) algorithm named VF-MC-PILCO, specifically designed for application to mechanical systems where velocities cannot be directly measured. This circumstance, if not adequately considered, can compromise the success of MBRL approaches. To cope with this problem, we define a velocity-free state formulation which consists of the collection of past po…

    Submitted 30 January, 2023; originally announced January 2023.

    Comments: Accepted at the 2023 American Control Conference (ACC)

  4. arXiv:2212.01434  [pdf, other]

    cs.RO cs.AI

    Generalizable Human-Robot Collaborative Assembly Using Imitation Learning and Force Control

    Authors: Devesh K. Jha, Siddarth Jain, Diego Romeres, William Yerazunis, Daniel Nikovski

    Abstract: Robots have been steadily increasing their presence in our daily lives, where they can work along with humans to provide assistance in various tasks on industry floors, in offices, and in homes. Automated assembly is one of the key applications of robots, and the next generation assembly systems could become much more efficient by creating collaborative human-robot systems. However, although colla…

    Submitted 2 December, 2022; originally announced December 2022.

  5. arXiv:2209.14461  [pdf, other]

    cs.RO cs.AI

    Constrained Dynamic Movement Primitives for Safe Learning of Motor Skills

    Authors: Seiji Shaw, Devesh K. Jha, Arvind Raghunathan, Radu Corcodel, Diego Romeres, George Konidaris, Daniel Nikovski

    Abstract: Dynamic movement primitives are widely used for learning skills which can be demonstrated to a robot by a skilled human or controller. While their generalization capabilities and simple formulation make them very appealing to use, they possess no strong guarantees to satisfy operational safety constraints for a task. In this paper, we present constrained dynamic movement primitives (CDMP) which ca…

    Submitted 28 September, 2022; originally announced September 2022.

  6. arXiv:2208.08948  [pdf, other]

    physics.soc-ph cs.LG eess.SY

    Transformer Networks for Predictive Group Elevator Control

    Authors: Jing Zhang, Athanasios Tsiligkaridis, Hiroshi Taguchi, Arvind Raghunathan, Daniel Nikovski

    Abstract: We propose a Predictive Group Elevator Scheduler that uses predictive information about passenger arrivals from a Transformer-based destination predictor and a linear regression model that predicts remaining time to destinations. Through extensive empirical evaluation, we find that the savings in Average Waiting Time (AWT) could exceed 50% for light arrival streams and be around 15% for med…

    Submitted 15 August, 2022; originally announced August 2022.

    Journal ref: Presented at European Control Conference 2022

  7. arXiv:2204.10447  [pdf, other]

    cs.RO

    Design of Adaptive Compliance Controllers for Safe Robotic Assembly

    Authors: Devesh K. Jha, Diego Romeres, Siddarth Jain, William Yerazunis, Daniel Nikovski

    Abstract: Insertion operations are a critical element of most robotic assembly operations, and peg-in-hole (PiH) insertion is one of the most widely studied tasks in the industrial and academic manipulation communities. PiH insertion is in fact an entire class of problems, where the complexity of the problem can depend on the type of misalignment and contact formation during an insertion attempt. In this pap…

    Submitted 21 April, 2022; originally announced April 2022.

    Comments: 8 pages, 10 figures

  8. arXiv:2111.10488  [pdf, other]

    cs.RO cs.AI

    Imitation and Supervised Learning of Compliance for Robotic Assembly

    Authors: Devesh K. Jha, Diego Romeres, William Yerazunis, Daniel Nikovski

    Abstract: We present the design of a learning-based compliance controller for assembly operations for industrial robots. We propose a solution within the general setting of learning from demonstration (LfD), where a nominal trajectory is provided through demonstration by an expert teacher. This can be used to learn a suitable representation of the skill that can be generalized to novel positions of one of t…

    Submitted 19 November, 2021; originally announced November 2021.

    Comments: 8 pages, 7 figures

  9. Control of Mechanical Systems via Feedback Linearization Based on Black-Box Gaussian Process Models

    Authors: Alberto Dalla Libera, Fabio Amadio, Daniel Nikovski, Ruggero Carli, Diego Romeres

    Abstract: In this paper, we consider the use of black-box Gaussian process (GP) models for trajectory tracking control based on feedback linearization, in the context of mechanical systems. We consider two strategies. The first computes the control input directly by using the GP model, whereas the second computes the input after estimating the individual components of the dynamics. We tested the two strat…

    Submitted 2 May, 2021; v1 submitted 26 April, 2021; originally announced April 2021.

  10. arXiv:2104.01167  [pdf, other]

    cs.RO

    Tactile-RL for Insertion: Generalization to Objects of Unknown Geometry

    Authors: Siyuan Dong, Devesh K. Jha, Diego Romeres, Sangwoon Kim, Daniel Nikovski, Alberto Rodriguez

    Abstract: Object insertion is a classic contact-rich manipulation task. The task remains challenging, especially when considering general objects of unknown geometry, which significantly limits the ability to understand the contact configuration between the object and the environment. We study the problem of aligning the object and environment with a tactile-based feedback insertion policy. The insertion pr…

    Submitted 2 April, 2021; originally announced April 2021.

  11. Model-Based Policy Search Using Monte Carlo Gradient Estimation with Real Systems Application

    Authors: Fabio Amadio, Alberto Dalla Libera, Riccardo Antonello, Daniel Nikovski, Ruggero Carli, Diego Romeres

    Abstract: In this paper, we present a Model-Based Reinforcement Learning (MBRL) algorithm named Monte Carlo Probabilistic Inference for Learning COntrol (MC-PILCO). The algorithm relies on Gaussian Processes (GPs) to model the system dynamics and on a Monte Carlo approach to estimate the policy gradient. This defines a framework in which we ablate the choice of the following components: (i) the selec…

    Submitted 6 September, 2022; v1 submitted 28 January, 2021; originally announced January 2021.

    Comments: Accepted in IEEE Transactions on Robotics. MC-PILCO code is publicly available at https://www.merl.com/research/license/MC-PILCO

  12. arXiv:2101.08740  [pdf, other]

    cs.RO cs.LG eess.SY

    Model-based Policy Search for Partially Measurable Systems

    Authors: Fabio Amadio, Alberto Dalla Libera, Ruggero Carli, Daniel Nikovski, Diego Romeres

    Abstract: In this paper, we propose a Model-Based Reinforcement Learning (MBRL) algorithm for Partially Measurable Systems (PMS), i.e., systems where the state cannot be directly measured, but must be estimated through proper state observers. The proposed algorithm, named Monte Carlo Probabilistic Inference for Learning COntrol for Partially Measurable Systems (MC-PILCO4PMS), relies on Gaussian Processes (…

    Submitted 21 January, 2021; originally announced January 2021.

    Comments: Accepted to 3rd Robot Learning Workshop: Grounding Machine Learning Development in the Real World (NeurIPS 2020)

  13. arXiv:2011.07193  [pdf, other]

    cs.LG cs.AI cs.RO

    Data-Efficient Learning for Complex and Real-Time Physical Problem Solving using Augmented Simulation

    Authors: Kei Ota, Devesh K. Jha, Diego Romeres, Jeroen van Baar, Kevin A. Smith, Takayuki Semitsu, Tomoaki Oiki, Alan Sullivan, Daniel Nikovski, Joshua B. Tenenbaum

    Abstract: Humans quickly solve tasks in novel systems with complex dynamics, without requiring much interaction. While deep reinforcement learning algorithms have achieved tremendous success in many complex tasks, these algorithms need a large number of samples to learn meaningful policies. In this paper, we present a task for navigating a marble to the center of a circular maze. While this system is very i…

    Submitted 15 February, 2021; v1 submitted 13 November, 2020; originally announced November 2020.

    Comments: Under submission

  14. arXiv:2011.00155  [pdf, other]

    cs.RO cs.AI cs.LG

    Deep Reactive Planning in Dynamic Environments

    Authors: Kei Ota, Devesh K. Jha, Tadashi Onishi, Asako Kanezaki, Yusuke Yoshiyasu, Yoko Sasaki, Toshisada Mariyama, Daniel Nikovski

    Abstract: The main novelty of the proposed approach is that it allows a robot to learn an end-to-end policy which can adapt to changes in the environment during execution. While goal conditioning of policies has been studied in the RL literature, such approaches are not easily extended to cases where the robot's goal can change during execution. This is something that humans are naturally able to do. Howeve…

    Submitted 5 November, 2020; v1 submitted 30 October, 2020; originally announced November 2020.

    Comments: 15 pages, 5 figures. Accepted at CoRL 2020

  15. arXiv:2007.11646  [pdf, other]

    cs.RO cs.LG

    Understanding Multi-Modal Perception Using Behavioral Cloning for Peg-In-a-Hole Insertion Tasks

    Authors: Yifang Liu, Diego Romeres, Devesh K. Jha, Daniel Nikovski

    Abstract: One of the main challenges in peg-in-a-hole (PiH) insertion tasks is in handling the uncertainty in the location of the target hole. In order to address it, high-dimensional sensor inputs from sensor modalities such as vision, force/torque sensing, and proprioception can be combined to learn control policies that are robust to this uncertainty in the target pose. Whereas deep learning has shown su…

    Submitted 22 July, 2020; originally announced July 2020.

    Comments: Published at a RSS20 workshop

  16. arXiv:2003.01629  [pdf, other]

    cs.LG cs.RO stat.ML

    Can Increasing Input Dimensionality Improve Deep Reinforcement Learning?

    Authors: Kei Ota, Tomoaki Oiki, Devesh K. Jha, Toshisada Mariyama, Daniel Nikovski

    Abstract: Deep reinforcement learning (RL) algorithms have recently achieved remarkable successes in various sequential decision making tasks, leveraging advances in methods for training large deep networks. However, these methods usually require large amounts of training data, which is often a big problem for real-world applications. One natural question to ask is whether learning good representations for…

    Submitted 26 June, 2020; v1 submitted 3 March, 2020; originally announced March 2020.

    Comments: 11 pages, 10 figures. Accepted to ICML 2020

  17. arXiv:2002.10621  [pdf, other]

    cs.LG cs.RO eess.SP eess.SY stat.ML

    Model-Based Reinforcement Learning for Physical Systems Without Velocity and Acceleration Measurements

    Authors: Alberto Dalla Libera, Diego Romeres, Devesh K. Jha, Bill Yerazunis, Daniel Nikovski

    Abstract: In this paper, we propose a derivative-free model learning framework for Reinforcement Learning (RL) algorithms based on Gaussian Process Regression (GPR). In many mechanical systems, only positions can be measured by the sensing instruments. Then, instead of representing the system state as suggested by the physics with a collection of positions, velocities, and accelerations, we define the state…

    Submitted 24 February, 2020; originally announced February 2020.

    Comments: Accepted at RA-L

  18. arXiv:2001.10098  [pdf, other]

    cs.LG eess.SP stat.ML

    Multi-label Prediction in Time Series Data using Deep Neural Networks

    Authors: Wenyu Zhang, Devesh K. Jha, Emil Laftchiev, Daniel Nikovski

    Abstract: This paper addresses a multi-label predictive fault classification problem for multidimensional time-series data. While fault (event) detection problems have been thoroughly studied in the literature, most of the state-of-the-art techniques can't reliably predict faults (events) over a desired future horizon. In the most general setting of these types of problems, one or more samples of data across mu…

    Submitted 27 January, 2020; originally announced January 2020.

    Comments: Accepted by IJPHM. Presented at PHM19

  19. arXiv:2001.08092  [pdf, other]

    cs.LG cs.RO eess.SY stat.ML

    Local Policy Optimization for Trajectory-Centric Reinforcement Learning

    Authors: Patrik Kolaric, Devesh K. Jha, Arvind U. Raghunathan, Frank L. Lewis, Mouhacine Benosman, Diego Romeres, Daniel Nikovski

    Abstract: The goal of this paper is to present a method for simultaneous trajectory and local stabilizing policy optimization to generate local policies for trajectory-centric model-based reinforcement learning (MBRL). This is motivated by the fact that global policy optimization for non-linear systems could be a very challenging problem both algorithmically and numerically. However, a lot of robotic manipu…

    Submitted 22 January, 2020; originally announced January 2020.

    Journal ref: ICRA 2020

  20. arXiv:1910.10628  [pdf, other]

    cs.RO cs.LG

    Learning Deep Parameterized Skills from Demonstration for Re-targetable Visuomotor Control

    Authors: Jonathan Chang, Nishanth Kumar, Sean Hastings, Aaron Gokaslan, Diego Romeres, Devesh Jha, Daniel Nikovski, George Konidaris, Stefanie Tellex

    Abstract: Robots need to learn skills that can not only generalize across similar problems but also be directed to a specific goal. Previous methods either train a new skill for every different goal or do not infer the specific target in the presence of multiple goals from visual data. We introduce an end-to-end method that represents targetable visuomotor skills as a goal-parameterized neural network polic…

    Submitted 28 February, 2021; v1 submitted 23 October, 2019; originally announced October 2019.

    Comments: Preprint

  21. arXiv:1903.05751  [pdf, other]

    stat.ML cs.LG cs.RO

    Trajectory Optimization for Unknown Constrained Systems using Reinforcement Learning

    Authors: Kei Ota, Devesh K. Jha, Tomoaki Oiki, Mamoru Miura, Takashi Nammoto, Daniel Nikovski, Toshisada Mariyama

    Abstract: In this paper, we propose a reinforcement learning-based algorithm for trajectory optimization for constrained dynamical systems. This problem is motivated by the fact that for most robotic systems, the dynamics may not always be known. Generating smooth, dynamically feasible trajectories could be difficult for such systems. Using sampling-based algorithms for motion planning may result in traject…

    Submitted 3 March, 2020; v1 submitted 13 March, 2019; originally announced March 2019.

    Comments: 8 pages, 6 figures, Accepted to IROS 2019

  22. Learning Dynamical Demand Response Model in Real-Time Pricing Program

    Authors: Hanchen Xu, Hongbo Sun, Daniel Nikovski, Kitamura Shoichi, Kazuyuki Mori

    Abstract: Price responsiveness is a major feature of end-use customers (EUCs) that participate in demand response (DR) programs, and has been conventionally modeled with static demand functions, which take the electricity price as the input and the aggregate energy consumption as the output. This, however, neglects the inherent temporal correlation of the EUC behaviors, and may result in large errors when p…

    Submitted 22 December, 2018; originally announced December 2018.

    Comments: Accepted to IEEE ISGT NA 2019

  23. arXiv:1809.04993  [pdf, other]

    cs.RO cs.LG stat.ML

    Semiparametrical Gaussian Processes Learning of Forward Dynamical Models for Navigating in a Circular Maze

    Authors: Diego Romeres, Devesh Jha, Alberto Dalla Libera, William Yerazunis, Daniel Nikovski

    Abstract: This paper presents a problem of model learning for the purpose of learning how to navigate a ball to a goal state in a circular maze environment with two degrees of freedom. The motion of the ball in the maze environment is influenced by several non-linear effects such as dry friction and contacts, which are difficult to model physically. We propose a semiparametric model to estimate the motion d…

    Submitted 18 September, 2018; v1 submitted 13 September, 2018; originally announced September 2018.

    Comments: 7 pages including the references, 5 figures. Changed title, improved the structure of the article and the images

  24. arXiv:1809.04720  [pdf, other]

    cs.LG stat.ML

    Sim-to-Real Transfer Learning using Robustified Controllers in Robotic Tasks involving Complex Dynamics

    Authors: Jeroen van Baar, Alan Sullivan, Radu Cordorel, Devesh Jha, Diego Romeres, Daniel Nikovski

    Abstract: Learning robot tasks or controllers using deep reinforcement learning has been proven effective in simulations. Learning in simulation has several advantages. For example, one can fully control the simulated environment, including halting motions while performing computations. Another advantage, when robots are involved, is that the amount of time a robot is occupied learning a task, rather than b…

    Submitted 17 September, 2018; v1 submitted 12 September, 2018; originally announced September 2018.

    Comments: 7 pages

  25. arXiv:1806.06931  [pdf, other]

    cs.LG cs.AI stat.ML

    Reinforcement Learning with Function-Valued Action Spaces for Partial Differential Equation Control

    Authors: Yangchen Pan, Amir-massoud Farahmand, Martha White, Saleh Nabi, Piyush Grover, Daniel Nikovski

    Abstract: Recent work has shown that reinforcement learning (RL) is a promising approach to control dynamical systems described by partial differential equations (PDEs). This paper shows how to use RL to tackle more general PDE control problems that have continuous high-dimensional action spaces with spatial relationships among action dimensions. In particular, we propose the concept of action descriptors, wh…

    Submitted 12 June, 2018; originally announced June 2018.

    Comments: ICML2018

  26. arXiv:1707.00617  [pdf, other]

    cs.AI math.CO

    Submodular Function Maximization for Group Elevator Scheduling

    Authors: Srikumar Ramalingam, Arvind U. Raghunathan, Daniel Nikovski

    Abstract: We propose a novel approach for group elevator scheduling by formulating it as the maximization of a submodular function under a matroid constraint. In particular, we propose to model the total waiting time of passengers using a quadratic Boolean function. The unary and pairwise terms in the function denote the waiting time for single and pairwise allocation of passengers to elevators, respectively.…

    Submitted 27 June, 2017; originally announced July 2017.

    Comments: 10 pages; 2017 International Conference on Automated Planning and Scheduling (ICAPS)

    MSC Class: 05B35; ACM Class: F.2.2; I.2.8

  27. arXiv:1212.2499  [pdf]

    cs.AI eess.SY

    Marginalizing Out Future Passengers in Group Elevator Control

    Authors: Daniel N. Nikovski, Matthew Brand

    Abstract: Group elevator scheduling is an NP-hard sequential decision-making problem with unbounded state spaces and substantial uncertainty. Decision-theoretic reasoning plays a surprisingly limited role in fielded systems. A new opportunity for probabilistic methods has opened with the recent discovery of a tractable solution for the expected waiting times of all passengers in the buil…

    Submitted 19 October, 2012; originally announced December 2012.

    Comments: Appears in Proceedings of the Nineteenth Conference on Uncertainty in Artificial Intelligence (UAI2003)

    Report number: UAI-P-2003-PG-443-450