PPR-FCN: Weakly Supervised Visual Relation Detection via Parallel Pairwise R-FCN

Zhang, Hanwang; Kyaw, Zawlin; Yu, Jinyang; Chang, Shih-Fu

arXiv:1708.01956 (cs)

[Submitted on 7 Aug 2017]

Abstract: We aim to tackle a novel vision task called Weakly Supervised Visual Relation Detection (WSVRD): detecting "subject-predicate-object" relations in an image when object relation ground truths are available only at the image level. This is motivated by the fact that it is extremely expensive to label the combinatorial relations between objects at the instance level. Compared to the extensively studied problem of Weakly Supervised Object Detection (WSOD), WSVRD is more challenging because it needs to examine a large set of region pairs, which is computationally prohibitive and more likely to get stuck in a locally optimal solution, such as one involving the wrong spatial context. To this end, we present a Parallel, Pairwise Region-based, Fully Convolutional Network (PPR-FCN) for WSVRD. It uses a parallel FCN architecture that simultaneously performs pair selection and classification of single regions and region pairs for object and relation detection, while sharing almost all computation over the entire image. In particular, we propose a novel position-role-sensitive score map with pairwise RoI pooling to efficiently capture the crucial context associated with a pair of objects. We demonstrate the superiority of PPR-FCN over all baselines on the WSVRD task through extensive experiments on two visual relation benchmarks.
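The abstract does not spell out how the position-role-sensitive score map or pairwise RoI pooling work, so the following is only a minimal sketch of the general idea. It shows the position-sensitive RoI pooling familiar from R-FCN, where each of the k x k bins of a region reads from its own dedicated score map. The pairing step is an assumption made purely for illustration: it supposes separate banks of maps for the subject and object roles and simply sums the two pooled scores. The names `ps_roi_pool` and `pair_score` are hypothetical and do not come from the paper.

```python
from typing import List, Tuple

ScoreMap = List[List[float]]      # H x W grid of scores
RoI = Tuple[int, int, int, int]   # (x0, y0, x1, y1), x1 and y1 exclusive


def ps_roi_pool(score_maps: List[ScoreMap], roi: RoI, k: int) -> float:
    """R-FCN-style position-sensitive RoI pooling for a single class.

    score_maps holds k*k maps; map i*k + j is read only by bin (row i, col j).
    The result is the average over bins of each bin's mean score.
    """
    x0, y0, x1, y1 = roi
    h, w = y1 - y0, x1 - x0
    if h < 1 or w < 1 or len(score_maps) != k * k:
        raise ValueError("roi must be non-empty and score_maps must have k*k maps")
    total = 0.0
    for i in range(k):
        # Integer floor/ceil bin edges, so every bin covers at least one cell.
        ys = y0 + (i * h) // k
        ye = y0 + ((i + 1) * h + k - 1) // k
        for j in range(k):
            xs = x0 + (j * w) // k
            xe = x0 + ((j + 1) * w + k - 1) // k
            bin_map = score_maps[i * k + j]  # the map dedicated to this bin
            vals = [bin_map[y][x] for y in range(ys, ye) for x in range(xs, xe)]
            total += sum(vals) / len(vals)
    return total / (k * k)


def pair_score(subj_maps: List[ScoreMap], obj_maps: List[ScoreMap],
               subj_roi: RoI, obj_roi: RoI, k: int) -> float:
    """Hypothetical pairwise score: role-specific banks, pooled scores summed.

    This combination rule is an illustrative assumption, not the paper's method.
    """
    return ps_roi_pool(subj_maps, subj_roi, k) + ps_roi_pool(obj_maps, obj_roi, k)


if __name__ == "__main__":
    # Four constant 4x4 maps (values 1..4) for k = 2.
    maps = [[[float(m + 1)] * 4 for _ in range(4)] for m in range(4)]
    print(ps_roi_pool(maps, (0, 0, 4, 4), 2))
    print(pair_score(maps, maps, (0, 0, 4, 4), (1, 1, 3, 3), 2))
```

Because the score maps are computed once for the whole image, scoring an additional region or region pair costs only the pooling step. That is one way to read the abstract's claim that almost all computation is shared over the entire image.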

Comments:	To appear in International Conference on Computer Vision (ICCV) 2017, Venice, Italy
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:1708.01956 [cs.CV]
	(or arXiv:1708.01956v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1708.01956

Submission history

From: Hanwang Zhang
[v1] Mon, 7 Aug 2017 01:07:20 UTC (1,957 KB)
