Conformal Prediction and MLLM aided Uncertainty Quantification in Scene Graph Generation

Nag, Sayak; Ghosh, Udita; Ta, Calvin-Khang; Bose, Sarosij; Li, Jiachen; Chowdhury, Amit K Roy

Computer Science > Computer Vision and Pattern Recognition

arXiv:2503.13947 (cs)

[Submitted on 18 Mar 2025 (v1), last revised 11 Apr 2025 (this version, v2)]

Title:Conformal Prediction and MLLM aided Uncertainty Quantification in Scene Graph Generation

Authors:Sayak Nag, Udita Ghosh, Calvin-Khang Ta, Sarosij Bose, Jiachen Li, Amit K Roy Chowdhury

View PDF HTML (experimental)

Abstract:Scene Graph Generation (SGG) aims to represent visual scenes by identifying objects and their pairwise relationships, providing a structured understanding of image content. However, inherent challenges like long-tailed class distributions and prediction variability necessitate uncertainty quantification in SGG for its practical viability. In this paper, we introduce a novel Conformal Prediction (CP) based framework, adaptive to any existing SGG method, for quantifying their predictive uncertainty by constructing well-calibrated prediction sets over their generated scene graphs. These scene graph prediction sets are designed to achieve statistically rigorous coverage guarantees. Additionally, to ensure these prediction sets contain the most practically interpretable scene graphs, we design an effective MLLM-based post-processing strategy for selecting the most visually and semantically plausible scene graphs within these prediction sets. We show that our proposed approach can produce diverse possible scene graphs from an image, assess the reliability of SGG methods, and improve overall SGG performance.

Comments:	Accepted at CVPR 2025
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2503.13947 [cs.CV]
	(or arXiv:2503.13947v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2503.13947

Submission history

From: Sayak Nag [view email]
[v1] Tue, 18 Mar 2025 06:27:57 UTC (7,887 KB)
[v2] Fri, 11 Apr 2025 03:03:26 UTC (8,820 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Conformal Prediction and MLLM aided Uncertainty Quantification in Scene Graph Generation

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Conformal Prediction and MLLM aided Uncertainty Quantification in Scene Graph Generation

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators