Combining Compositional Models and Deep Networks For Robust Object Classification under Occlusion

Kortylewski, Adam; Liu, Qing; Wang, Huiyu; Zhang, Zhishuai; Yuille, Alan


arXiv:1905.11826 (cs)

[Submitted on 28 May 2019 (v1), last revised 29 Jan 2020 (this version, v4)]


Abstract: Deep convolutional neural networks (DCNNs) are powerful models that yield impressive results in object classification. However, recent work has shown that they do not generalize well to partially occluded objects or to mask attacks. Compositional models, in contrast, are robust to partial occlusion but are not as discriminative as deep models. In this work, we combine DCNNs and compositional object models to retain the best of both approaches: a discriminative model that is robust to partial occlusion and mask attacks. Our model is learned in two steps. First, a standard DCNN is trained for image classification. Subsequently, we cluster the DCNN features into dictionaries. We show that the dictionary components resemble object part detectors, and we learn the spatial distribution of parts for each object class. We propose mixtures of compositional models to account for large changes in the spatial activation patterns (e.g. due to changes in the 3D pose of an object). At runtime, an image is first classified by the DCNN in a feedforward manner. The prediction uncertainty is used to detect partially occluded objects, which in turn are classified by the compositional model. Our experimental results demonstrate that combining compositional models and DCNNs resolves a fundamental problem of current deep learning approaches to computer vision: the combined model recognizes occluded objects, even when it has not been exposed to occluded objects during training, while maintaining high discriminative performance for non-occluded objects.
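
The runtime procedure described in the abstract (classify with the DCNN first, then hand uncertain predictions to the compositional model) can be sketched roughly as follows. This is an illustrative sketch only, not the authors' implementation: the abstract does not say how prediction uncertainty is measured, so the maximum softmax probability and the `threshold` value of 0.9 are assumptions, and `classify_with_fallback` and `compositional_classify` are hypothetical names standing in for the two models.

```python
import numpy as np


def softmax(logits):
    """Numerically stable softmax over a 1-D array of class scores."""
    z = logits - np.max(logits)
    e = np.exp(z)
    return e / e.sum()


def classify_with_fallback(logits, compositional_classify, threshold=0.9):
    """Return (label, source) for one image.

    logits: class scores from the DCNN's feedforward pass.
    compositional_classify: hypothetical zero-argument callable that runs
        the compositional model on the same image and returns a class index.
    threshold: assumed confidence cut-off; the abstract does not give one.
    """
    probs = softmax(np.asarray(logits, dtype=float))
    label = int(np.argmax(probs))
    # Assumption: a low maximum class probability signals possible occlusion.
    if probs[label] >= threshold:
        return label, "dcnn"
    # Uncertain prediction: defer to the compositional model.
    return int(compositional_classify()), "compositional"
```

With confident scores such as `[10.0, 0.0, 0.0]` the DCNN label is kept, while near-uniform scores such as `[1.0, 0.9, 0.8]` fall below the assumed threshold and are routed to the compositional model. Deferring only uncertain inputs leaves the DCNN's predictions on non-occluded images untouched, which matches the abstract's stated goal.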

Comments:	WACV 2020
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:1905.11826 [cs.CV]
	(or arXiv:1905.11826v4 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1905.11826

Submission history

From: Adam Kortylewski
[v1] Tue, 28 May 2019 14:03:46 UTC (8,465 KB)
[v2] Wed, 29 May 2019 00:45:58 UTC (8,465 KB)
[v3] Wed, 13 Nov 2019 16:07:23 UTC (8,488 KB)
[v4] Wed, 29 Jan 2020 14:42:08 UTC (8,488 KB)
