Skip to main content

Showing 1–8 of 8 results for author: Turcato, N

.
  1. arXiv:2504.06721  [pdf, other

    cs.RO cs.AI cs.LG

    Learning global control of underactuated systems with Model-Based Reinforcement Learning

    Authors: Niccolò Turcato, Marco Calì, Alberto Dalla Libera, Giulio Giacomuzzo, Ruggero Carli, Diego Romeres

    Abstract: This short paper describes our proposed solution for the third edition of the "AI Olympics with RealAIGym" competition, held at ICRA 2025. We employed Monte-Carlo Probabilistic Inference for Learning Control (MC-PILCO), an MBRL algorithm recognized for its exceptional data efficiency across various low-dimensional robotic tasks, including cart-pole, ball \& plate, and Furuta pendulum systems. MC-P… ▽ More

    Submitted 9 April, 2025; originally announced April 2025.

    Comments: arXiv admin note: substantial text overlap with arXiv:2409.05811

  2. arXiv:2503.15290  [pdf, other

    cs.RO

    Reinforcement Learning for Robust Athletic Intelligence: Lessons from the 2nd 'AI Olympics with RealAIGym' Competition

    Authors: Felix Wiebe, Niccolò Turcato, Alberto Dalla Libera, Jean Seong Bjorn Choe, Bumkyu Choi, Tim Lukas Faust, Habib Maraqten, Erfan Aghadavoodi, Marco Cali, Alberto Sinigaglia, Giulio Giacomuzzo, Diego Romeres, Jong-kook Kim, Gian Antonio Susto, Shubham Vyas, Dennis Mronga, Boris Belousov, Jan Peters, Frank Kirchner, Shivesh Kumar

    Abstract: In the field of robotics many different approaches ranging from classical planning over optimal control to reinforcement learning (RL) are developed and borrowed from other fields to achieve reliable control in diverse tasks. In order to get a clear understanding of their individual strengths and weaknesses and their applicability in real world robotic scenarios is it important to benchmark and co… ▽ More

    Submitted 19 March, 2025; originally announced March 2025.

    Comments: 8 pages, 7 figures

  3. arXiv:2503.04280  [pdf, ps, other

    cs.RO cs.AI cs.LG

    Towards Autonomous Reinforcement Learning for Real-World Robotic Manipulation with Large Language Models

    Authors: Niccolò Turcato, Matteo Iovino, Aris Synodinos, Alberto Dalla Libera, Ruggero Carli, Pietro Falco

    Abstract: Recent advancements in Large Language Models (LLMs) and Visual Language Models (VLMs) have significantly impacted robotics, enabling high-level semantic motion planning applications. Reinforcement Learning (RL), a complementary paradigm, enables agents to autonomously optimize complex behaviors through interaction and reward signals. However, designing effective reward functions for RL remains cha… ▽ More

    Submitted 10 June, 2025; v1 submitted 6 March, 2025; originally announced March 2025.

  4. arXiv:2502.05595  [pdf, other

    cs.RO

    Data efficient Robotic Object Throwing with Model-Based Reinforcement Learning

    Authors: Niccolò Turcato, Giulio Giacomuzzo, Matteo Terreran, Davide Allegro, Ruggero Carli, Alberto Dalla Libera

    Abstract: Pick-and-place (PnP) operations, featuring object grasping and trajectory planning, are fundamental in industrial robotics applications. Despite many advancements in the field, PnP is limited by workspace constraints, reducing flexibility. Pick-and-throw (PnT) is a promising alternative where the robot throws objects to target locations, leveraging extrinsic resources like gravity to improve effic… ▽ More

    Submitted 8 February, 2025; originally announced February 2025.

    Comments: Preprint under review

  5. arXiv:2412.06390  [pdf, other

    cs.LG cs.AI

    Edge Delayed Deep Deterministic Policy Gradient: efficient continuous control for edge scenarios

    Authors: Alberto Sinigaglia, Niccolò Turcato, Ruggero Carli, Gian Antonio Susto

    Abstract: Deep Reinforcement Learning is gaining increasing attention thanks to its capability to learn complex policies in high-dimensional settings. Recent advancements utilize a dual-network architecture to learn optimal policies through the Q-learning algorithm. However, this approach has notable drawbacks, such as an overestimation bias that can disrupt the learning process and degrade the performance… ▽ More

    Submitted 9 December, 2024; originally announced December 2024.

  6. arXiv:2409.05811  [pdf, other

    cs.RO

    Learning control of underactuated double pendulum with Model-Based Reinforcement Learning

    Authors: Niccolò Turcato, Alberto Dalla Libera, Giulio Giacomuzzo, Ruggero Carli, Diego Romeres

    Abstract: This report describes our proposed solution for the second AI Olympics competition held at IROS 2024. Our solution is based on a recent Model-Based Reinforcement Learning algorithm named MC-PILCO. Besides briefly reviewing the algorithm, we discuss the most critical aspects of the MC-PILCO implementation in the tasks at hand.

    Submitted 9 September, 2024; originally announced September 2024.

  7. arXiv:2409.01104  [pdf, other

    cs.RO cs.AI cs.LG cs.NE

    AI Olympics challenge with Evolutionary Soft Actor Critic

    Authors: Marco Calì, Alberto Sinigaglia, Niccolò Turcato, Ruggero Carli, Gian Antonio Susto

    Abstract: In the following report, we describe the solution we propose for the AI Olympics competition held at IROS 2024. Our solution is based on a Model-free Deep Reinforcement Learning approach combined with an evolutionary strategy. We will briefly describe the algorithms that have been used and then provide details of the approach

    Submitted 28 October, 2024; v1 submitted 2 September, 2024; originally announced September 2024.

    Comments: Added Sec 9 after testing on real robot

  8. arXiv:2402.09078  [pdf, other

    cs.LG cs.AI

    Exploiting Estimation Bias in Clipped Double Q-Learning for Continous Control Reinforcement Learning Tasks

    Authors: Niccolò Turcato, Alberto Sinigaglia, Alberto Dalla Libera, Ruggero Carli, Gian Antonio Susto

    Abstract: Continuous control Deep Reinforcement Learning (RL) approaches are known to suffer from estimation biases, leading to suboptimal policies. This paper introduces innovative methods in RL, focusing on addressing and exploiting estimation biases in Actor-Critic methods for continuous control tasks, using Deep Double Q-Learning. We design a Bias Exploiting (BE) mechanism to dynamically select the most… ▽ More

    Submitted 11 October, 2024; v1 submitted 14 February, 2024; originally announced February 2024.