GAIA-1: A Generative World Model for Autonomous Driving

Hu, Anthony; Russell, Lloyd; Yeo, Hudson; Murez, Zak; Fedoseev, George; Kendall, Alex; Shotton, Jamie; Corrado, Gianluca

Computer Science > Computer Vision and Pattern Recognition

arXiv:2309.17080 (cs)

[Submitted on 29 Sep 2023]

Title:GAIA-1: A Generative World Model for Autonomous Driving

Authors:Anthony Hu, Lloyd Russell, Hudson Yeo, Zak Murez, George Fedoseev, Alex Kendall, Jamie Shotton, Gianluca Corrado

View PDF

Abstract:Autonomous driving promises transformative improvements to transportation, but building systems capable of safely navigating the unstructured complexity of real-world scenarios remains challenging. A critical problem lies in effectively predicting the various potential outcomes that may emerge in response to the vehicle's actions as the world evolves.
To address this challenge, we introduce GAIA-1 ('Generative AI for Autonomy'), a generative world model that leverages video, text, and action inputs to generate realistic driving scenarios while offering fine-grained control over ego-vehicle behavior and scene features. Our approach casts world modeling as an unsupervised sequence modeling problem by mapping the inputs to discrete tokens, and predicting the next token in the sequence. Emerging properties from our model include learning high-level structures and scene dynamics, contextual awareness, generalization, and understanding of geometry. The power of GAIA-1's learned representation that captures expectations of future events, combined with its ability to generate realistic samples, provides new possibilities for innovation in the field of autonomy, enabling enhanced and accelerated training of autonomous driving technology.

Comments:	Technical Report
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Robotics (cs.RO)
Cite as:	arXiv:2309.17080 [cs.CV]
	(or arXiv:2309.17080v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2309.17080

Submission history

From: Anthony Hu [view email]
[v1] Fri, 29 Sep 2023 09:20:37 UTC (15,940 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:GAIA-1: A Generative World Model for Autonomous Driving

Submission history

Access Paper:

References & Citations

1 blog link

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:GAIA-1: A Generative World Model for Autonomous Driving

Submission history

Access Paper:

References & Citations

1 blog link

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators