Learning Road Scene-level Representations via Semantic Region Prediction

Xiao, Zihao; Yuille, Alan; Chen, Yi-Ting

Computer Science > Computer Vision and Pattern Recognition

arXiv:2301.00714 (cs)

[Submitted on 2 Jan 2023 (v1), last revised 27 Feb 2023 (this version, v2)]

Title:Learning Road Scene-level Representations via Semantic Region Prediction

Authors:Zihao Xiao, Alan Yuille, Yi-Ting Chen

View PDF

Abstract:In this work, we tackle two vital tasks in automated driving systems, i.e., driver intent prediction and risk object identification from egocentric images. Mainly, we investigate the question: what would be good road scene-level representations for these two tasks? We contend that a scene-level representation must capture higher-level semantic and geometric representations of traffic scenes around ego-vehicle while performing actions to their destinations. To this end, we introduce the representation of semantic regions, which are areas where ego-vehicles visit while taking an afforded action (e.g., left-turn at 4-way intersections). We propose to learn scene-level representations via a novel semantic region prediction task and an automatic semantic region labeling algorithm. Extensive evaluations are conducted on the HDD and nuScenes datasets, and the learned representations lead to state-of-the-art performance for driver intention prediction and risk object identification.

Comments:	18 pages
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
Cite as:	arXiv:2301.00714 [cs.CV]
	(or arXiv:2301.00714v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2301.00714

Submission history

From: Zihao Xiao [view email]
[v1] Mon, 2 Jan 2023 15:13:30 UTC (18,549 KB)
[v2] Mon, 27 Feb 2023 22:19:55 UTC (18,549 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Learning Road Scene-level Representations via Semantic Region Prediction

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Learning Road Scene-level Representations via Semantic Region Prediction

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators