Multi-Stage Reinforcement Learning For Object Detection

Koenig, Jonas; Malberg, Simon; Martens, Martin; Niehaus, Sebastian; Krohn-Grimberghe, Artus; Ramaswamy, Arunselvan

Computer Science > Computer Vision and Pattern Recognition

arXiv:1810.10325 (cs)

[Submitted on 15 Oct 2018 (v1), last revised 26 Oct 2018 (this version, v2)]

Title:Multi-Stage Reinforcement Learning For Object Detection

Authors:Jonas Koenig, Simon Malberg, Martin Martens, Sebastian Niehaus, Artus Krohn-Grimberghe, Arunselvan Ramaswamy

View PDF

Abstract:We present a reinforcement learning approach for detecting objects within an image. Our approach performs a step-wise deformation of a bounding box with the goal of tightly framing the object. It uses a hierarchical tree-like representation of predefined region candidates, which the agent can zoom in on. This reduces the number of region candidates that must be evaluated so that the agent can afford to compute new feature maps before each step to enhance detection quality. We compare an approach that is based purely on zoom actions with one that is extended by a second refinement stage to fine-tune the bounding box after each zoom step. We also improve the fitting ability by allowing for different aspect ratios of the bounding box. Finally, we propose different reward functions to lead to a better guidance of the agent while following its search trajectories. Experiments indicate that each of these extensions leads to more correct detections. The best performing approach comprises a zoom stage and a refinement stage, uses aspect-ratio modifying actions and is trained using a combination of three different reward metrics.

Comments:	Accepted for the Computer Vision Conference (CVC) 2019
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:1810.10325 [cs.CV]
	(or arXiv:1810.10325v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1810.10325

Submission history

From: Sebastian Niehaus [view email]
[v1] Mon, 15 Oct 2018 21:41:57 UTC (718 KB)
[v2] Fri, 26 Oct 2018 11:11:02 UTC (718 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Multi-Stage Reinforcement Learning For Object Detection

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Multi-Stage Reinforcement Learning For Object Detection

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators