On the Black-box Explainability of Object Detection Models for Safe and Trustworthy Industrial Applications

Andres, Alain; Martinez-Seras, Aitor; Laña, Ibai; Del Ser, Javier

doi:10.1016/j.rineng.2024.103498


arXiv:2411.00818 (cs)

[Submitted on 28 Oct 2024 (v1), last revised 28 Nov 2024 (this version, v2)]



Abstract: In the realm of human-machine interaction, artificial intelligence has become a powerful tool for accelerating data modeling tasks. Object detection methods have achieved outstanding results and are widely used in critical domains such as autonomous driving and video surveillance. However, their adoption in high-risk applications, where errors may have severe consequences, remains limited. Explainable Artificial Intelligence methods aim to address this issue, but many existing techniques are model-specific and designed for classification tasks, which makes them less effective for object detection and difficult for non-specialists to interpret. In this work we focus on model-agnostic explainability methods for object detection models and propose D-MFPP, an extension of the Morphological Fragmental Perturbation Pyramid (MFPP) technique that uses segmentation-based masks to generate explanations. Additionally, we introduce D-Deletion, a novel metric that combines faithfulness and localization and is adapted specifically to the demands of object detectors. We evaluate these methods on real-world industrial and robotic datasets, examining the influence of parameters such as the number of masks, model size, and image resolution on the quality of explanations. Our experiments use single-stage object detection models applied to two safety-critical robotic environments: i) a shared human-robot workspace, where safety is of paramount importance, and ii) an assembly area of battery kits, where safety is critical due to the potential for damage to high-risk components. Our findings show that D-Deletion effectively gauges the performance of explanations when multiple elements of the same class appear in a scene, while D-MFPP provides a promising alternative to D-RISE when fewer masks are used.
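To make the idea of perturbation-based, model-agnostic explanations for detectors more concrete, the following is a minimal sketch in the spirit of mask-based methods such as D-RISE. It is not the authors' D-MFPP or D-Deletion: random grid masks stand in for the segmentation-based masks of D-MFPP, the mask weight is simplified to IoU times detection score, and the deletion curve is a generic deletion-style check rather than the metric defined in the paper. All function names, the `detect` callable interface (a list of `(box, class_id, score)` tuples), and the toy detector are assumptions made for illustration.

```python
import numpy as np

def iou(a, b):
    """Intersection over union of two boxes given as (x1, y1, x2, y2)."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    union = (a[2] - a[0]) * (a[3] - a[1]) + (b[2] - b[0]) * (b[3] - b[1]) - inter
    return inter / union if union > 0 else 0.0

def random_grid_masks(n_masks, shape, cells=4, keep_prob=0.5, seed=0):
    """Coarse random binary masks, upsampled by block repetition.
    Assumption: a stand-in for segmentation-based masks, which are not built here."""
    rng = np.random.default_rng(seed)
    h, w = shape
    ch, cw = -(-h // cells), -(-w // cells)  # ceiling division
    grids = rng.random((n_masks, cells, cells)) < keep_prob
    masks = np.repeat(np.repeat(grids, ch, axis=1), cw, axis=2)[:, :h, :w]
    return masks.astype(np.float32)

def match_score(dets, target_box, target_cls):
    """Best IoU * score among detections of the target class (simplified similarity)."""
    return max((iou(target_box, box) * score
                for box, cls, score in dets if cls == target_cls), default=0.0)

def saliency_for_detection(image, target_box, target_cls, detect, masks):
    """Average the masks, each weighted by how well the target detection survives."""
    sal = np.zeros(image.shape[:2], dtype=np.float32)
    for m in masks:
        dets = detect(image * m[..., None])  # query the detector as a black box
        sal += match_score(dets, target_box, target_cls) * m
    return sal / max(len(masks), 1)

def deletion_curve(image, saliency, target_box, target_cls, detect, steps=10):
    """Zero out pixels from most to least salient and record the match score.
    Generic deletion-style check, not the paper's D-Deletion definition."""
    order = np.argsort(-saliency, axis=None)
    img = image.copy()
    flat = img.reshape(-1, img.shape[-1])  # view into img
    chunk = -(-order.size // steps)
    scores = []
    for i in range(steps + 1):
        scores.append(match_score(detect(img), target_box, target_cls))
        flat[order[i * chunk:(i + 1) * chunk]] = 0
    return np.array(scores)

def toy_detect(img):
    """Hypothetical detector: fires on a bright 8x8 patch at a fixed location."""
    s = float(img[8:16, 8:16].mean())
    return [((8, 8, 16, 16), 0, s)] if s > 0.5 else []

image = np.zeros((24, 24, 1), dtype=np.float32)
image[8:16, 8:16] = 1.0
target_box, target_cls = (8, 8, 16, 16), 0

masks = random_grid_masks(200, image.shape[:2], cells=3)
sal = saliency_for_detection(image, target_box, target_cls, toy_detect, masks)
curve = deletion_curve(image, sal, target_box, target_cls, toy_detect, steps=9)
```

In this toy setup the saliency peaks over the bright patch, and the deletion curve falls once those pixels are removed. With a real detector, `detect` would wrap the model's inference call, and the number of masks becomes the main cost driver, which is the trade-off the abstract refers to when comparing D-MFPP and D-RISE with fewer masks.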

Comments:	14 pages, 10 figures, 6 tables
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2411.00818 [cs.CV]
	(or arXiv:2411.00818v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2411.00818
Journal reference:	Volume 24, Year 2024, Page number 103498
Related DOI:	https://doi.org/10.1016/j.rineng.2024.103498

Submission history

From: Alain Andres
[v1] Mon, 28 Oct 2024 13:28:05 UTC (2,949 KB)
[v2] Thu, 28 Nov 2024 08:09:26 UTC (2,949 KB)
