Skip to main content

Showing 1–1 of 1 results for author: Frans, K

Searching in archive stat. Search in all archives.
.
  1. arXiv:2410.18076  [pdf, other

    cs.LG cs.AI stat.ML

    Leveraging Skills from Unlabeled Prior Data for Efficient Online Exploration

    Authors: Max Wilcoxson, Qiyang Li, Kevin Frans, Sergey Levine

    Abstract: Unsupervised pretraining has been transformative in many supervised domains. However, applying such ideas to reinforcement learning (RL) presents a unique challenge in that fine-tuning does not involve mimicking task-specific data, but rather exploring and locating the solution through iterative self-improvement. In this work, we study how unlabeled offline trajectory data can be leveraged to lear… ▽ More

    Submitted 23 February, 2025; v1 submitted 23 October, 2024; originally announced October 2024.

    Comments: 27 pages, 19 figures