Skip to main content

Showing 1–5 of 5 results for author: Martins, M F

Searching in archive cs. Search in all archives.
.
  1. arXiv:2506.11775  [pdf, ps, other

    cs.RO

    ExoStart: Efficient learning for dexterous manipulation with sensorized exoskeleton demonstrations

    Authors: Zilin Si, Jose Enrique Chen, M. Emre Karagozler, Antonia Bronars, Jonathan Hutchinson, Thomas Lampe, Nimrod Gileadi, Taylor Howell, Stefano Saliceti, Lukasz Barczyk, Ilan Olivarez Correa, Tom Erez, Mohit Shridhar, Murilo Fernandes Martins, Konstantinos Bousmalis, Nicolas Heess, Francesco Nori, Maria Bauza Villalonga

    Abstract: Recent advancements in teleoperation systems have enabled high-quality data collection for robotic manipulators, showing impressive results in learning manipulation at scale. This progress suggests that extending these capabilities to robotic hands could unlock an even broader range of manipulation skills, especially if we could achieve the same level of dexterity that human hands exhibit. However… ▽ More

    Submitted 13 June, 2025; originally announced June 2025.

  2. arXiv:2409.06613  [pdf, other

    cs.RO cs.LG

    DemoStart: Demonstration-led auto-curriculum applied to sim-to-real with multi-fingered robots

    Authors: Maria Bauza, Jose Enrique Chen, Valentin Dalibard, Nimrod Gileadi, Roland Hafner, Murilo F. Martins, Joss Moore, Rugile Pevceviciute, Antoine Laurens, Dushyant Rao, Martina Zambelli, Martin Riedmiller, Jon Scholz, Konstantinos Bousmalis, Francesco Nori, Nicolas Heess

    Abstract: We present DemoStart, a novel auto-curriculum reinforcement learning method capable of learning complex manipulation behaviors on an arm equipped with a three-fingered robotic hand, from only a sparse reward and a handful of demonstrations in simulation. Learning from simulation drastically reduces the development cycle of behavior generation, and domain randomization techniques are leveraged to a… ▽ More

    Submitted 12 September, 2024; v1 submitted 10 September, 2024; originally announced September 2024.

    Comments: 15 pages total with 7 pages of appendix. 9 Figures, 4 in the main text and 5 in the appendix

  3. arXiv:2005.07513  [pdf, other

    cs.LG cs.AI cs.RO stat.ML

    A Distributional View on Multi-Objective Policy Optimization

    Authors: Abbas Abdolmaleki, Sandy H. Huang, Leonard Hasenclever, Michael Neunert, H. Francis Song, Martina Zambelli, Murilo F. Martins, Nicolas Heess, Raia Hadsell, Martin Riedmiller

    Abstract: Many real-world problems require trading off multiple competing objectives. However, these objectives are often in different units and/or scales, which can make it challenging for practitioners to express numerical preferences over objectives in their native units. In this paper we propose a novel algorithm for multi-objective reinforcement learning that enables setting desired preferences for obj… ▽ More

    Submitted 15 May, 2020; originally announced May 2020.

  4. arXiv:1903.08542  [pdf, other

    cs.RO

    Learning Gentle Object Manipulation with Curiosity-Driven Deep Reinforcement Learning

    Authors: Sandy H. Huang, Martina Zambelli, Jackie Kay, Murilo F. Martins, Yuval Tassa, Patrick M. Pilarski, Raia Hadsell

    Abstract: Robots must know how to be gentle when they need to interact with fragile objects, or when the robot itself is prone to wear and tear. We propose an approach that enables deep reinforcement learning to train policies that are gentle, both during exploration and task execution. In a reward-based learning environment, a natural approach involves augmenting the (task) reward with a penalty for non-ge… ▽ More

    Submitted 20 March, 2019; originally announced March 2019.

  5. arXiv:1902.04706  [pdf, other

    cs.LG cs.RO stat.ML

    Simultaneously Learning Vision and Feature-based Control Policies for Real-world Ball-in-a-Cup

    Authors: Devin Schwab, Tobias Springenberg, Murilo F. Martins, Thomas Lampe, Michael Neunert, Abbas Abdolmaleki, Tim Hertweck, Roland Hafner, Francesco Nori, Martin Riedmiller

    Abstract: We present a method for fast training of vision based control policies on real robots. The key idea behind our method is to perform multi-task Reinforcement Learning with auxiliary tasks that differ not only in the reward to be optimized but also in the state-space in which they operate. In particular, we allow auxiliary task policies to utilize task features that are available only at training-ti… ▽ More

    Submitted 18 February, 2019; v1 submitted 12 February, 2019; originally announced February 2019.

    Comments: Videos can be found at https://sites.google.com/view/rss-2019-sawyer-bic/