Generating Positive Bounding Boxes for Balanced Training of Object Detectors

Oksuz, Kemal; Cam, Baris Can; Akbas, Emre; Kalkan, Sinan

Computer Science > Computer Vision and Pattern Recognition

arXiv:1909.09777 (cs)

[Submitted on 21 Sep 2019 (v1), last revised 19 Jun 2020 (this version, v3)]

Title:Generating Positive Bounding Boxes for Balanced Training of Object Detectors

Authors:Kemal Oksuz, Baris Can Cam, Emre Akbas, Sinan Kalkan

View PDF

Abstract:Two-stage deep object detectors generate a set of regions-of-interest (RoI) in the first stage, then, in the second stage, identify objects among the proposed RoIs that sufficiently overlap with a ground truth (GT) box. The second stage is known to suffer from a bias towards RoIs that have low intersection-over-union (IoU) with the associated GT boxes. To address this issue, we first propose a sampling method to generate bounding boxes (BB) that overlap with a given reference box more than a given IoU threshold. Then, we use this BB generation method to develop a positive RoI (pRoI) generator that produces RoIs following any desired spatial or IoU distribution, for the second-stage. We show that our pRoI generator is able to simulate other sampling methods for positive examples such as hard example mining and prime sampling. Using our generator as an analysis tool, we show that (i) IoU imbalance has an adverse effect on performance, (ii) hard positive example mining improves the performance only for certain input IoU distributions, and (iii) the imbalance among the foreground classes has an adverse effect on performance and that it can be alleviated at the batch level. Finally, we train Faster R-CNN using our pRoI generator and, compared to conventional training, obtain better or on-par performance for low IoUs and significant improvements when trained for higher IoUs for Pascal VOC and MS COCO datasets. The code is available at: this https URL.

Comments:	To appear in WACV 20
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:1909.09777 [cs.CV]
	(or arXiv:1909.09777v3 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1909.09777

Submission history

From: Kemal Oksuz [view email]
[v1] Sat, 21 Sep 2019 05:27:15 UTC (1,999 KB)
[v2] Thu, 12 Dec 2019 14:51:01 UTC (1,773 KB)
[v3] Fri, 19 Jun 2020 07:50:03 UTC (1,773 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Generating Positive Bounding Boxes for Balanced Training of Object Detectors

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Generating Positive Bounding Boxes for Balanced Training of Object Detectors

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators