Concept-based Analysis of Neural Networks via Vision-Language Models

Mangal, Ravi; Narodytska, Nina; Gopinath, Divya; Hu, Boyue Caroline; Roy, Anirban; Jha, Susmit; Pasareanu, Corina

Computer Science > Machine Learning

arXiv:2403.19837 (cs)

[Submitted on 28 Mar 2024 (v1), last revised 10 Apr 2024 (this version, v3)]

Title:Concept-based Analysis of Neural Networks via Vision-Language Models

Authors:Ravi Mangal, Nina Narodytska, Divya Gopinath, Boyue Caroline Hu, Anirban Roy, Susmit Jha, Corina Pasareanu

View PDF HTML (experimental)

Abstract:The analysis of vision-based deep neural networks (DNNs) is highly desirable but it is very challenging due to the difficulty of expressing formal specifications for vision tasks and the lack of efficient verification procedures. In this paper, we propose to leverage emerging multimodal, vision-language, foundation models (VLMs) as a lens through which we can reason about vision models. VLMs have been trained on a large body of images accompanied by their textual description, and are thus implicitly aware of high-level, human-understandable concepts describing the images. We describe a logical specification language $\texttt{Con}_{\texttt{spec}}$ designed to facilitate writing specifications in terms of these concepts. To define and formally check $\texttt{Con}_{\texttt{spec}}$ specifications, we build a map between the internal representations of a given vision model and a VLM, leading to an efficient verification procedure of natural-language properties for vision models. We demonstrate our techniques on a ResNet-based classifier trained on the RIVAL-10 dataset using CLIP as the multimodal model.

Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Logic in Computer Science (cs.LO)
Cite as:	arXiv:2403.19837 [cs.LG]
	(or arXiv:2403.19837v3 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2403.19837

Submission history

From: Ravi Mangal [view email]
[v1] Thu, 28 Mar 2024 21:15:38 UTC (1,929 KB)
[v2] Mon, 1 Apr 2024 22:34:37 UTC (1,929 KB)
[v3] Wed, 10 Apr 2024 23:47:34 UTC (1,929 KB)

Computer Science > Machine Learning

Title:Concept-based Analysis of Neural Networks via Vision-Language Models

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Concept-based Analysis of Neural Networks via Vision-Language Models

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators