Mid-level Representation for Visual Recognition

Nabi, Moin

Computer Science > Computer Vision and Pattern Recognition

arXiv:1512.07314 (cs)

[Submitted on 23 Dec 2015]

Title:Mid-level Representation for Visual Recognition

Authors:Moin Nabi

View PDF

Abstract:Visual Recognition is one of the fundamental challenges in AI, where the goal is to understand the semantics of visual data. Employing mid-level representation, in particular, shifted the paradigm in visual recognition. The mid-level image/video representation involves discovering and training a set of mid-level visual patterns (e.g., parts and attributes) and represent a given image/video utilizing them. The mid-level patterns can be extracted from images and videos using the motion and appearance information of visual phenomenas. This thesis targets employing mid-level representations for different high-level visual recognition tasks, namely (i)image understanding and (ii)video understanding.
In the case of image understanding, we focus on object detection/recognition task. We investigate on discovering and learning a set of mid-level patches to be used for representing the images of an object category. We specifically employ the discriminative patches in a subcategory-aware webly-supervised fashion. We, additionally, study the outcomes provided by employing the subcategory-based models for undoing dataset bias.

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:1512.07314 [cs.CV]
	(or arXiv:1512.07314v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1512.07314

Submission history

From: Moin Nabi [view email]
[v1] Wed, 23 Dec 2015 00:45:41 UTC (87,731 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.CV

< prev | next >

new | recent | 2015-12

Change to browse by:

References & Citations

DBLP - CS Bibliography

listing | bibtex

Moin Nabi

export BibTeX citation

Computer Science > Computer Vision and Pattern Recognition

Title:Mid-level Representation for Visual Recognition

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Mid-level Representation for Visual Recognition

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators