Toward Unsupervised Test Scenario Extraction for Automated Driving Systems from Urban Naturalistic Road Traffic Data

Weber, Nico; Thiem, Christoph; Konigorski, Ulrich

doi:10.4271/12-06-03-0017

Computer Science > Software Engineering

arXiv:2202.06608 (cs)

[Submitted on 14 Feb 2022 (v1), last revised 21 Apr 2023 (this version, v2)]

Title:Toward Unsupervised Test Scenario Extraction for Automated Driving Systems from Urban Naturalistic Road Traffic Data

Authors:Nico Weber, Christoph Thiem, Ulrich Konigorski

View PDF

Abstract:Scenario-based testing is a promising approach to solve the challenge of proving the safe behavior of vehicles equipped with automated driving systems. Since an infinite number of concrete scenarios can theoretically occur in real-world road traffic, the extraction of scenarios relevant in terms of the safety-related behavior of these systems is a key aspect for their successful verification and validation. Therefore, a method for extracting multimodal urban traffic scenarios from naturalistic road traffic data in an unsupervised manner, minimizing the amount of (potentially biased) prior expert knowledge, is proposed. Rather than an (elaborate) rule-based assignment by extracting concrete scenarios into predefined functional scenarios, the presented method deploys an unsupervised machine learning pipeline. The approach allows exploring the unknown nature of the data and their interpretation as test scenarios that experts could not have anticipated. The method is evaluated for naturalistic road traffic data at urban intersections from the inD and the Silicon Valley Intersections datasets. For this purpose, it is analyzed with which clustering approach (K-Means, hierarchical clustering, and DBSCAN) the scenario extraction method performs best (referring to an elaborate rule-based implementation). Subsequently, using hierarchical clustering the results show both a jump in overall accuracy of around 20% when moving from 4 to 5 clusters and a saturation effect starting at 41 clusters with an overall accuracy of 84%. These observations can be a valuable contribution in the context of the trade-off between the number of functional scenarios (i.e., clustering accuracy) and testing effort. Possible reasons for the observed accuracy variations of different clusters, each with a fixed total number of given clusters, are discussed.

Comments:	16 pages, 9 figures
Subjects:	Software Engineering (cs.SE); Machine Learning (cs.LG)
ACM classes:	D.2; I.2
Report number:	SAE 12-06-03-0017
Cite as:	arXiv:2202.06608 [cs.SE]
	(or arXiv:2202.06608v2 [cs.SE] for this version)
	https://doi.org/10.48550/arXiv.2202.06608
Journal reference:	SAE Intl. J CAV 6(3):2023
Related DOI:	https://doi.org/10.4271/12-06-03-0017

Submission history

From: Nico Weber [view email]
[v1] Mon, 14 Feb 2022 10:55:14 UTC (5,552 KB)
[v2] Fri, 21 Apr 2023 14:15:57 UTC (3,015 KB)

Computer Science > Software Engineering

Title:Toward Unsupervised Test Scenario Extraction for Automated Driving Systems from Urban Naturalistic Road Traffic Data

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Software Engineering

Title:Toward Unsupervised Test Scenario Extraction for Automated Driving Systems from Urban Naturalistic Road Traffic Data

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators