
Showing 1–3 of 3 results for author: Kundalia, K

  1. arXiv:2505.15659  [pdf, ps, other]

    cs.RO cs.LG

    FLARE: Robot Learning with Implicit World Modeling

    Authors: Ruijie Zheng, Jing Wang, Scott Reed, Johan Bjorck, Yu Fang, Fengyuan Hu, Joel Jang, Kaushil Kundalia, Zongyu Lin, Loic Magne, Avnish Narayan, You Liang Tan, Guanzhi Wang, Qi Wang, Jiannan Xiang, Yinzhen Xu, Seonghyeon Ye, Jan Kautz, Furong Huang, Yuke Zhu, Linxi Fan

    Abstract: We introduce $\textbf{F}$uture $\textbf{LA}$tent $\textbf{RE}$presentation Alignment ($\textbf{FLARE}$), a novel framework that integrates predictive latent world modeling into robot policy learning. By aligning features from a diffusion transformer with latent embeddings of future observations, $\textbf{FLARE}$ enables a diffusion transformer policy to anticipate latent representations of future…

    Submitted 21 May, 2025; originally announced May 2025.

    Comments: Project Webpage / Blogpost: https://research.nvidia.com/labs/gear/flare

  2. arXiv:2505.12705  [pdf, ps, other]

    cs.RO cs.AI cs.LG

    DreamGen: Unlocking Generalization in Robot Learning through Video World Models

    Authors: Joel Jang, Seonghyeon Ye, Zongyu Lin, Jiannan Xiang, Johan Bjorck, Yu Fang, Fengyuan Hu, Spencer Huang, Kaushil Kundalia, Yen-Chen Lin, Loic Magne, Ajay Mandlekar, Avnish Narayan, You Liang Tan, Guanzhi Wang, Jing Wang, Qi Wang, Yinzhen Xu, Xiaohui Zeng, Kaiyuan Zheng, Ruijie Zheng, Ming-Yu Liu, Luke Zettlemoyer, Dieter Fox, Jan Kautz, et al. (3 additional authors not shown)

    Abstract: We introduce DreamGen, a simple yet highly effective 4-stage pipeline for training robot policies that generalize across behaviors and environments through neural trajectories - synthetic robot data generated from video world models. DreamGen leverages state-of-the-art image-to-video generative models, adapting them to the target robot embodiment to produce photorealistic synthetic videos of famil…

    Submitted 17 June, 2025; v1 submitted 19 May, 2025; originally announced May 2025.

    Comments: See website for videos: https://research.nvidia.com/labs/gear/dreamgen

  3. arXiv:2503.14734  [pdf, other]

    cs.RO cs.AI cs.LG

    GR00T N1: An Open Foundation Model for Generalist Humanoid Robots

    Authors: NVIDIA: Johan Bjorck, Fernando Castañeda, Nikita Cherniadev, Xingye Da, Runyu Ding, Linxi "Jim" Fan, Yu Fang, Dieter Fox, Fengyuan Hu, Spencer Huang, Joel Jang, Zhenyu Jiang, Jan Kautz, Kaushil Kundalia, Lawrence Lao, Zhiqi Li, Zongyu Lin, Kevin Lin, Guilin Liu, Edith Llontop, Loic Magne, Ajay Mandlekar, Avnish Narayan, et al. (18 additional authors not shown)

    Abstract: General-purpose robots need a versatile body and an intelligent mind. Recent advancements in humanoid robots have shown great promise as a hardware platform for building generalist autonomy in the human world. A robot foundation model, trained on massive and diverse data sources, is essential for enabling the robots to reason about novel situations, robustly handle real-world variability, and rapi…

    Submitted 26 March, 2025; v1 submitted 18 March, 2025; originally announced March 2025.

    Comments: Authors are listed alphabetically. Project leads are Linxi "Jim" Fan and Yuke Zhu. For more information, see https://developer.nvidia.com/isaac/gr00t