Beyond Holistic Object Recognition: Enriching Image Understanding with Part States

Lu, Cewu; Su, Hao; Lu, Yongyi; Yi, Li; Tang, Chikeung; Guibas, Leonidas


arXiv:1612.07310 (cs)

[Submitted on 15 Dec 2016]


Abstract: Important high-level vision tasks such as human-object interaction, image captioning, and robotic manipulation require rich semantic descriptions of objects at the part level. Building on previous work on part localization, this paper addresses the problem of inferring the rich semantics imparted by an object part in still images. We propose to tokenize the semantic space as a discrete set of part states. Because our modeling of part state is spatially localized, we formulate part state inference as a pixel-wise annotation problem. An iterative part-state inference neural network is designed specifically for this task; it is efficient in time and accurate in performance. Extensive experiments demonstrate that the proposed method can effectively predict the semantic states of parts and simultaneously correct localization errors, thus benefiting several visual understanding applications. A further contribution of this paper is our part state dataset, which contains rich part-level semantic annotations.
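
To make the "pixel-wise annotation" formulation in the abstract concrete, here is a minimal sketch of what labeling every pixel with one state from a discrete set could look like. It is an illustration only, not the paper's iterative network: the state names in `PART_STATES`, the helper `label_pixels`, and the score values are all hypothetical.

```python
# Hypothetical discrete part-state vocabulary (not taken from the paper).
# Index 0 is reserved for pixels that belong to no part.
PART_STATES = ["background", "hand:holding", "hand:open"]


def label_pixels(scores):
    """Assign each pixel the index of its highest-scoring part state.

    scores[y][x] is a list with one score per entry in PART_STATES.
    Returns a label map with the same height and width as the input.
    """
    return [
        [max(range(len(pixel)), key=pixel.__getitem__) for pixel in row]
        for row in scores
    ]


# Toy 2x2 "image" of per-pixel state scores (made-up numbers).
scores = [
    [[0.9, 0.05, 0.05], [0.2, 0.7, 0.1]],
    [[0.1, 0.1, 0.8], [0.6, 0.3, 0.1]],
]
label_map = label_pixels(scores)
```

In this sketch the per-pixel scores are given; in a real system they would come from a learned model, and the abstract indicates the authors refine such predictions iteratively.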

Comments:	9 pages
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
ACM classes:	F.2.2
Report number:	23452523
Cite as:	arXiv:1612.07310 [cs.CV]
	(or arXiv:1612.07310v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1612.07310

Submission history

From: Cewu Lu
[v1] Thu, 15 Dec 2016 13:46:58 UTC (9,215 KB)
