Spatio-Temporal Action Detection with Cascade Proposal and Location Anticipation

Yang, Zhenheng; Gao, Jiyang; Nevatia, Ram

Computer Science > Computer Vision and Pattern Recognition

arXiv:1708.00042 (cs)

[Submitted on 31 Jul 2017]

Title:Spatio-Temporal Action Detection with Cascade Proposal and Location Anticipation

Authors:Zhenheng Yang, Jiyang Gao, Ram Nevatia

View PDF

Abstract:In this work, we address the problem of spatio-temporal action detection in temporally untrimmed videos. It is an important and challenging task as finding accurate human actions in both temporal and spatial space is important for analyzing large-scale video data. To tackle this problem, we propose a cascade proposal and location anticipation (CPLA) model for frame-level action detection. There are several salient points of our model: (1) a cascade region proposal network (casRPN) is adopted for action proposal generation and shows better localization accuracy compared with single region proposal network (RPN); (2) action spatio-temporal consistencies are exploited via a location anticipation network (LAN) and thus frame-level action detection is not conducted independently. Frame-level detections are then linked by solving an linking score maximization problem, and temporally trimmed into spatio-temporal action tubes. We demonstrate the effectiveness of our model on the challenging UCF101 and LIRIS-HARL datasets, both achieving state-of-the-art performance.

Comments:	Accepted at BMVC 2017 (oral)
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:1708.00042 [cs.CV]
	(or arXiv:1708.00042v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1708.00042

Submission history

From: Zhenheng Yang [view email]
[v1] Mon, 31 Jul 2017 19:03:19 UTC (635 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.CV

< prev | next >

new | recent | 2017-08

Change to browse by:

References & Citations

DBLP - CS Bibliography

listing | bibtex

Zhenheng Yang
Jiyang Gao
Ram Nevatia

export BibTeX citation

Computer Science > Computer Vision and Pattern Recognition

Title:Spatio-Temporal Action Detection with Cascade Proposal and Location Anticipation

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Spatio-Temporal Action Detection with Cascade Proposal and Location Anticipation

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators