Are "Hierarchical" Visual Representations Hierarchical?

Shen, Ethan; Farhadi, Ali; Kusupati, Aditya

Computer Science > Computer Vision and Pattern Recognition

arXiv:2311.05784 (cs)

[Submitted on 9 Nov 2023 (v1), last revised 23 Nov 2023 (this version, v2)]

Title:Are "Hierarchical" Visual Representations Hierarchical?

Authors:Ethan Shen, Ali Farhadi, Aditya Kusupati

View PDF

Abstract:Learned visual representations often capture large amounts of semantic information for accurate downstream applications. Human understanding of the world is fundamentally grounded in hierarchy. To mimic this and further improve representation capabilities, the community has explored "hierarchical" visual representations that aim at modeling the underlying hierarchy of the visual world. In this work, we set out to investigate if hierarchical visual representations truly capture the human perceived hierarchy better than standard learned representations. To this end, we create HierNet, a suite of 12 datasets spanning 3 kinds of hierarchy from the BREEDs subset of ImageNet. After extensive evaluation of Hyperbolic and Matryoshka Representations across training setups, we conclude that they do not capture hierarchy any better than the standard representations but can assist in other aspects like search efficiency and interpretability. Our benchmark and the datasets are open-sourced at this https URL.

Subjects:	Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as:	arXiv:2311.05784 [cs.CV]
	(or arXiv:2311.05784v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2311.05784

Submission history

From: Ethan Shen [view email]
[v1] Thu, 9 Nov 2023 23:25:29 UTC (781 KB)
[v2] Thu, 23 Nov 2023 20:45:53 UTC (781 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Are "Hierarchical" Visual Representations Hierarchical?

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Are "Hierarchical" Visual Representations Hierarchical?

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators