A Data-Centric Approach: Dimensions of Visual Complexity and How to find Them

Sarıtaş, Karahan; Shen, Tingke; Nath, Surabhi S; Dayan, Peter

arXiv:2501.15890v1 (cs)

[Submitted on 27 Jan 2025 (this version), latest version 20 Mar 2025 (v3)]

Abstract: Understanding how humans perceive visual complexity is a key area of study in visual cognition. Previous approaches to modeling visual complexity have often resulted in intricate, difficult-to-interpret solutions that employ numerous features or sophisticated deep learning architectures. While these complex models achieve high performance on specific datasets, they often sacrifice interpretability, making it challenging to understand the factors driving human perception of complexity. A recent model based on image segmentations showed promise in addressing this challenge; however, it presented limitations in capturing structural and semantic aspects of visual complexity. In this paper, we propose viable and effective features to overcome these shortcomings. Specifically, we develop multiscale features for the structural aspect of complexity, including the Multiscale Sobel Gradient (MSG), which captures spatial intensity variations across scales, and Multiscale Unique Colors (MUC), which quantifies image colorfulness by indexing quantized RGB values. We also introduce a new dataset, SVG, based on Visual Genome, to explore the semantic aspect of visual complexity, obtaining surprise scores that reflect the element of surprise in images, which we demonstrate contributes significantly to perceived complexity. Overall, we suggest that the nature of the data is fundamental to understanding and modeling visual complexity, highlighting the importance of both structural and semantic dimensions in providing a comprehensive, interpretable assessment. The code for our analysis, experimental setup, and dataset will be made publicly available upon acceptance.
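
The abstract names the two structural features but does not define them, so the following is only a rough sketch of what multiscale features of this kind might look like, not the authors' implementation. Everything specific here is an assumption made for illustration: the function names, the naive subsampling used to build scales, the scale factors (1, 2, 4), the equal weighting across scales, and the 5-bit color quantization. It uses plain Python lists to stay self-contained; a practical pipeline would more likely rely on an array or image library.

```python
import math
from typing import List, Sequence, Tuple

Gray = List[List[float]]                 # grayscale image, values in [0, 1]
RGB = List[List[Tuple[int, int, int]]]   # RGB image, 8-bit channels


def downsample(img, factor: int):
    """Keep every `factor`-th pixel (naive subsampling; an assumption)."""
    return [row[::factor] for row in img[::factor]]


def sobel_mean_magnitude(img: Gray) -> float:
    """Mean Sobel gradient magnitude over interior pixels (0.0 if too small)."""
    h, w = len(img), len(img[0])
    if h < 3 or w < 3:
        return 0.0
    total = 0.0
    for y in range(1, h - 1):
        for x in range(1, w - 1):
            # Horizontal and vertical 3x3 Sobel responses.
            gx = (img[y - 1][x + 1] + 2 * img[y][x + 1] + img[y + 1][x + 1]
                  - img[y - 1][x - 1] - 2 * img[y][x - 1] - img[y + 1][x - 1])
            gy = (img[y + 1][x - 1] + 2 * img[y + 1][x] + img[y + 1][x + 1]
                  - img[y - 1][x - 1] - 2 * img[y - 1][x] - img[y - 1][x + 1])
            total += math.hypot(gx, gy)
    return total / ((h - 2) * (w - 2))


def multiscale_sobel_gradient(img: Gray,
                              factors: Sequence[int] = (1, 2, 4)) -> float:
    """Hypothetical MSG-style score: Sobel magnitude averaged over scales."""
    scores = [sobel_mean_magnitude(downsample(img, f)) for f in factors]
    return sum(scores) / len(scores)


def unique_colors(img: RGB, bits: int) -> int:
    """Count distinct colors after keeping the top `bits` bits per channel."""
    shift = 8 - bits
    seen = set()
    for row in img:
        for r, g, b in row:
            # Pack the three quantized channels into one integer index.
            index = ((r >> shift) << (2 * bits)) | ((g >> shift) << bits) | (b >> shift)
            seen.add(index)
    return len(seen)


def multiscale_unique_colors(img: RGB,
                             factors: Sequence[int] = (1, 2, 4),
                             bits: int = 5) -> float:
    """Hypothetical MUC-style score: unique quantized colors averaged over scales."""
    counts = [unique_colors(downsample(img, f), bits) for f in factors]
    return sum(counts) / len(counts)
```

Under these assumptions, a uniformly gray image scores zero on the gradient feature and a single-color image scores one on the color feature, while images with more edges or more distinct colors score higher. Both scores stay directly interpretable, which is the property the abstract emphasizes.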

Subjects:	Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2501.15890 [cs.CV]
	(or arXiv:2501.15890v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2501.15890

Submission history

From: Karahan Sarıtaş
[v1] Mon, 27 Jan 2025 09:32:56 UTC (3,146 KB)
[v2] Wed, 5 Feb 2025 19:36:23 UTC (4,895 KB)
[v3] Thu, 20 Mar 2025 12:06:51 UTC (6,216 KB)
