Tracking as Online Decision-Making: Learning a Policy from Streaming Videos with Reinforcement Learning

Supancic III, James Steven; Ramanan, Deva

Computer Science > Computer Vision and Pattern Recognition

arXiv:1707.04991v1 (cs)

[Submitted on 17 Jul 2017]

Title:Tracking as Online Decision-Making: Learning a Policy from Streaming Videos with Reinforcement Learning

Authors:James Steven Supancic III, Deva Ramanan

View PDF

Abstract:We formulate tracking as an online decision-making process, where a tracking agent must follow an object despite ambiguous image frames and a limited computational budget. Crucially, the agent must decide where to look in the upcoming frames, when to reinitialize because it believes the target has been lost, and when to update its appearance model for the tracked object. Such decisions are typically made heuristically. Instead, we propose to learn an optimal decision-making policy by formulating tracking as a partially observable decision-making process (POMDP). We learn policies with deep reinforcement learning algorithms that need supervision (a reward signal) only when the track has gone awry. We demonstrate that sparse rewards allow us to quickly train on massive datasets, several orders of magnitude more than past work. Interestingly, by treating the data source of Internet videos as unlimited streams, we both learn and evaluate our trackers in a single, unified computational stream.

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:1707.04991 [cs.CV]
	(or arXiv:1707.04991v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1707.04991

Submission history

From: James Supancic III [view email]
[v1] Mon, 17 Jul 2017 03:38:35 UTC (8,325 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.CV

< prev | next >

new | recent | 2017-07

Change to browse by:

References & Citations

DBLP - CS Bibliography

listing | bibtex

James Steven Supancic III
Deva Ramanan

export BibTeX citation

Computer Science > Computer Vision and Pattern Recognition

Title:Tracking as Online Decision-Making: Learning a Policy from Streaming Videos with Reinforcement Learning

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Tracking as Online Decision-Making: Learning a Policy from Streaming Videos with Reinforcement Learning

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators