Editable Scene Simulation for Autonomous Driving via Collaborative LLM-Agents

Wei, Yuxi; Wang, Zi; Lu, Yifan; Xu, Chenxin; Liu, Changxing; Zhao, Hao; Chen, Siheng; Wang, Yanfeng

Computer Science > Computer Vision and Pattern Recognition

arXiv:2402.05746 (cs)

[Submitted on 8 Feb 2024 (v1), last revised 26 Jun 2024 (this version, v3)]

Title:Editable Scene Simulation for Autonomous Driving via Collaborative LLM-Agents

Authors:Yuxi Wei, Zi Wang, Yifan Lu, Chenxin Xu, Changxing Liu, Hao Zhao, Siheng Chen, Yanfeng Wang

View PDF HTML (experimental)

Abstract:Scene simulation in autonomous driving has gained significant attention because of its huge potential for generating customized data. However, existing editable scene simulation approaches face limitations in terms of user interaction efficiency, multi-camera photo-realistic rendering and external digital assets integration. To address these challenges, this paper introduces ChatSim, the first system that enables editable photo-realistic 3D driving scene simulations via natural language commands with external digital assets. To enable editing with high command flexibility,~ChatSim leverages a large language model (LLM) agent collaboration framework. To generate photo-realistic outcomes, ChatSim employs a novel multi-camera neural radiance field method. Furthermore, to unleash the potential of extensive high-quality digital assets, ChatSim employs a novel multi-camera lighting estimation method to achieve scene-consistent assets' rendering. Our experiments on Waymo Open Dataset demonstrate that ChatSim can handle complex language commands and generate corresponding photo-realistic scene videos.

Comments:	CVPR 2024(Highlight)
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2402.05746 [cs.CV]
	(or arXiv:2402.05746v3 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2402.05746

Submission history

From: Yuxi Wei [view email]
[v1] Thu, 8 Feb 2024 15:26:28 UTC (39,318 KB)
[v2] Mon, 11 Mar 2024 13:45:48 UTC (39,308 KB)
[v3] Wed, 26 Jun 2024 10:44:58 UTC (39,308 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Editable Scene Simulation for Autonomous Driving via Collaborative LLM-Agents

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Editable Scene Simulation for Autonomous Driving via Collaborative LLM-Agents

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators