Distinguishing mirror from glass: A 'big data' approach to material perception

Tamura, Hideki; Prokott, Konrad E.; Fleming, Roland W.

doi:10.1167/jov.22.4.4


arXiv:1903.01671 (cs)

[Submitted on 5 Mar 2019]


Abstract: Visually identifying materials is crucial for many tasks, yet material perception remains poorly understood. Distinguishing mirror from glass is particularly challenging as both materials derive their appearance from their surroundings, yet we rarely experience difficulties telling them apart. Here we took a 'big data' approach to uncovering the underlying visual cues and processes, leveraging recent advances in neural network models of vision. We trained thousands of convolutional neural networks on >750,000 simulated mirror and glass objects, and compared their performance with human judgments, as well as alternative classifiers based on 'hand-engineered' image features. For randomly chosen images, all classifiers and humans performed with high accuracy, and therefore correlated highly with one another. To tease the models apart, we then painstakingly assembled a diagnostic image set for which humans make highly systematic errors, allowing us to decouple accuracy from human-like performance. A large-scale, systematic search through feedforward neural architectures revealed that relatively shallow networks predicted human judgments better than any other models. However, surprisingly, no network correlated better than 0.6 with humans (below inter-human correlations). Thus, although the model sets new standards for simulating human vision in a challenging material perception task, the results cast doubt on recent claims that such architectures are generally good models of human vision.
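
The distinction the abstract draws between accuracy and human-like performance can be illustrated with a small sketch. This is not the authors' analysis code: the data below are invented toy values, the function names are hypothetical, and the use of Pearson correlation is an assumption, since the abstract does not say which correlation measure was used. On a diagnostic set where observers err systematically, a model that is always correct can correlate poorly with humans, while a model that shares the human errors correlates well despite chance-level accuracy.

```python
from math import sqrt

def accuracy(pred, truth, threshold=0.5):
    """Fraction of images whose thresholded prediction matches the true label."""
    hits = sum((p >= threshold) == bool(t) for p, t in zip(pred, truth))
    return hits / len(truth)

def pearson(x, y):
    """Pearson correlation between two equal-length sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / sqrt(var_x * var_y)

# Hypothetical toy "diagnostic" set: 1 = mirror, 0 = glass.
truth = [1, 1, 1, 1, 0, 0, 0, 0]

# Invented proportion of observers answering "mirror" for each image.
# Half of the images are systematically misjudged.
human = [0.9, 0.2, 0.8, 0.1, 0.1, 0.9, 0.2, 0.8]

# Invented model outputs (probability of "mirror").
model_a = [0.9, 0.8, 0.9, 0.7, 0.1, 0.2, 0.1, 0.3]  # always correct
model_b = [0.8, 0.3, 0.7, 0.2, 0.2, 0.8, 0.3, 0.7]  # shares the human errors

for name, model in (("model_a", model_a), ("model_b", model_b)):
    print(name,
          "accuracy:", accuracy(model, truth),
          "correlation with humans:", round(pearson(model, human), 3))
```

In this toy case model_a is the more accurate classifier but model_b is the better predictor of human responses, which is the kind of dissociation a diagnostic image set is meant to expose.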

Comments:	40 pages, 5 figures, 7 supplement figures
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:1903.01671 [cs.CV]
	(or arXiv:1903.01671v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1903.01671
Journal reference:	Journal of Vision (2022) 22(4):4
Related DOI:	https://doi.org/10.1167/jov.22.4.4

Submission history

From: Hideki Tamura
[v1] Tue, 5 Mar 2019 05:05:05 UTC (3,205 KB)
