Integrating Saliency Ranking and Reinforcement Learning for Enhanced Object Detection

Bartolo, Matthias; Seychell, Dylan; Bajada, Josef

Computer Science > Computer Vision and Pattern Recognition

arXiv:2408.06803 (cs)

[Submitted on 13 Aug 2024]

Title:Integrating Saliency Ranking and Reinforcement Learning for Enhanced Object Detection

Authors:Matthias Bartolo, Dylan Seychell, Josef Bajada

View PDF HTML (experimental)

Abstract:With the ever-growing variety of object detection approaches, this study explores a series of experiments that combine reinforcement learning (RL)-based visual attention methods with saliency ranking techniques to investigate transparent and sustainable solutions. By integrating saliency ranking for initial bounding box prediction and subsequently applying RL techniques to refine these predictions through a finite set of actions over multiple time steps, this study aims to enhance RL object detection accuracy. Presented as a series of experiments, this research investigates the use of various image feature extraction methods and explores diverse Deep Q-Network (DQN) architectural variations for deep reinforcement learning-based localisation agent training. Additionally, we focus on optimising the detection pipeline at every step by prioritising lightweight and faster models, while also incorporating the capability to classify detected objects, a feature absent in previous RL approaches. We show that by evaluating the performance of these trained agents using the Pascal VOC 2007 dataset, faster and more optimised models were developed. Notably, the best mean Average Precision (mAP) achieved in this study was 51.4, surpassing benchmarks set by RL-based single object detectors in the literature.

Comments:	Resultant work from Dissertation, Department of AI, University of Malta. Code available at: this https URL
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as:	arXiv:2408.06803 [cs.CV]
	(or arXiv:2408.06803v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2408.06803

Submission history

From: Matthias Bartolo [view email]
[v1] Tue, 13 Aug 2024 10:46:42 UTC (41,771 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Integrating Saliency Ranking and Reinforcement Learning for Enhanced Object Detection

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Integrating Saliency Ranking and Reinforcement Learning for Enhanced Object Detection

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators