Perceptual Score: What Data Modalities Does Your Model Perceive?

Gat, Itai; Schwartz, Idan; Schwing, Alexander

arXiv:2110.14375 (cs)

[Submitted on 27 Oct 2021]

Abstract: Machine learning advances in the last decade have relied significantly on large-scale datasets that continue to grow in size. Increasingly, those datasets also contain different data modalities. However, large multi-modal datasets are hard to annotate, and annotations may contain biases that we are often unaware of. Deep-net-based classifiers, in turn, are prone to exploit those biases and to find shortcuts. To study and quantify this concern, we introduce the perceptual score, a metric that assesses the degree to which a model relies on the different subsets of the input features, i.e., modalities. Using the perceptual score, we find a surprisingly consistent trend across four popular datasets: recent, more accurate state-of-the-art multi-modal models for visual question-answering or visual dialog tend to perceive the visual data less than their predecessors. This trend is concerning as answers are hence increasingly inferred from textual cues only. Using the perceptual score also helps to analyze model biases by decomposing the score into data subset contributions. We hope to spur a discussion on the perceptiveness of multi-modal models and also hope to encourage the community working on multi-modal classifiers to start quantifying perceptiveness via the proposed perceptual score.
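The abstract does not spell out how the score is computed. As a rough illustration of the idea of measuring how much a model relies on one modality, the sketch below assumes a permutation-based estimate: shuffle one modality across test samples and record how much accuracy drops. The function names (`accuracy`, `perceptual_score_sketch`), the two-modality setup, and the toy model are hypothetical, and any normalization the paper applies is not reproduced here.

```python
import numpy as np


def accuracy(predict, image_feats, text_feats, labels):
    """Fraction of samples for which the model's prediction matches the label."""
    return float(np.mean(predict(image_feats, text_feats) == labels))


def perceptual_score_sketch(predict, image_feats, text_feats, labels,
                            modality, n_perms=10, seed=0):
    """Accuracy drop when one modality is shuffled across samples.

    Assumption: this unnormalized, permutation-based estimate is an
    illustration only, not the paper's exact definition.
    """
    rng = np.random.default_rng(seed)
    base = accuracy(predict, image_feats, text_feats, labels)
    permuted = []
    for _ in range(n_perms):
        perm = rng.permutation(len(labels))
        if modality == "image":
            acc = accuracy(predict, image_feats[perm], text_feats, labels)
        elif modality == "text":
            acc = accuracy(predict, image_feats, text_feats[perm], labels)
        else:
            raise ValueError(f"unknown modality: {modality!r}")
        permuted.append(acc)
    return base - float(np.mean(permuted))


# Toy data: labels depend only on the text features (hypothetical setup).
rng = np.random.default_rng(0)
image_feats = rng.normal(size=(200, 4))
text_feats = rng.normal(size=(200, 4))
labels = (text_feats[:, 0] > 0).astype(int)


def text_only_model(image_feats, text_feats):
    """A model that ignores the image entirely and reads the answer off the text."""
    return (text_feats[:, 0] > 0).astype(int)


print("image:", perceptual_score_sketch(text_only_model, image_feats, text_feats, labels, "image"))
print("text: ", perceptual_score_sketch(text_only_model, image_feats, text_feats, labels, "text"))
```

In this toy setup the image score is zero because shuffling images cannot change the predictions of a model that never looks at them, while the text score is large. A model that exploits textual shortcuts would show the same pattern, which is the concern the abstract raises.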

Comments:	Accepted to NeurIPS 2021
Subjects:	Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
Cite as:	arXiv:2110.14375 [cs.LG]
	(or arXiv:2110.14375v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2110.14375

Submission history

From: Itai Gat
[v1] Wed, 27 Oct 2021 12:19:56 UTC (3,320 KB)
