Semantic Segmentation of Video Sequences with Convolutional LSTMs

Pfeuffer, Andreas; Schulz, Karina; Dietmayer, Klaus

arXiv:1905.01058 (cs)

[Submitted on 3 May 2019]

Abstract: Most semantic segmentation approaches have been developed for single images, so video sequences are currently segmented by processing each frame separately. The disadvantage is that temporal information, which could improve segmentation performance, is not considered. One way to include temporal information is to use recurrent neural networks. However, only a few approaches have used recurrent networks for video segmentation so far. These approaches extend the encoder-decoder architecture of well-known segmentation approaches and place convolutional LSTM layers between encoder and decoder. This paper shows that this position is not optimal and that other positions in the network yield better performance. Moreover, state-of-the-art segmentation approaches rarely use the classical encoder-decoder structure and instead use multi-branch architectures. These architectures are more complex, which makes it more difficult to place the recurrent units at a suitable position. In this work, multi-branch architectures are extended by convolutional LSTM layers at different positions and evaluated on two different datasets in order to find the best position. The proposed approach outperforms the pure CNN-based approach by up to 1.6 percent.
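To make the recurrent building block concrete, the following is a minimal sketch of a convolutional LSTM cell applied to a sequence of feature maps. It is not the authors' implementation. It assumes a single feature channel, 3x3 kernels with zero padding, random untrained weights, and the common ConvLSTM formulation without peephole terms. The names `ConvLSTMCell`, `conv2d_same`, and `run_sequence` are hypothetical and chosen only for illustration. In a segmentation network, such a cell would receive the feature map of each frame at the chosen position (for example between encoder and decoder) and pass its hidden state on to the following layers.

```python
import math
import random


def conv2d_same(x, k):
    """Cross-correlate a 2D list x with a square kernel k, zero padded ('same' size)."""
    h, w = len(x), len(x[0])
    r = len(k) // 2
    out = [[0.0] * w for _ in range(h)]
    for i in range(h):
        for j in range(w):
            s = 0.0
            for di in range(-r, r + 1):
                for dj in range(-r, r + 1):
                    ii, jj = i + di, j + dj
                    if 0 <= ii < h and 0 <= jj < w:
                        s += x[ii][jj] * k[di + r][dj + r]
            out[i][j] = s
    return out


def sigmoid(v):
    return 1.0 / (1.0 + math.exp(-v))


class ConvLSTMCell:
    """Single-channel ConvLSTM cell: gates use convolutions instead of matrix products."""

    GATES = ("i", "f", "o", "g")  # input, forget, output gate and candidate state

    def __init__(self, ksize=3, seed=0):
        rng = random.Random(seed)

        def kernel():
            # Assumption: small random weights stand in for trained parameters.
            return [[rng.uniform(-0.1, 0.1) for _ in range(ksize)] for _ in range(ksize)]

        self.wx = {g: kernel() for g in self.GATES}  # input-to-state kernels
        self.wh = {g: kernel() for g in self.GATES}  # state-to-state kernels
        self.b = {g: 0.0 for g in self.GATES}

    def step(self, x, h, c):
        """Process one frame x given the previous hidden state h and cell state c."""
        height, width = len(x), len(x[0])
        pre = {}
        for g in self.GATES:
            cx = conv2d_same(x, self.wx[g])
            ch = conv2d_same(h, self.wh[g])
            pre[g] = [[cx[i][j] + ch[i][j] + self.b[g] for j in range(width)]
                      for i in range(height)]
        h_new = [[0.0] * width for _ in range(height)]
        c_new = [[0.0] * width for _ in range(height)]
        for i in range(height):
            for j in range(width):
                i_g = sigmoid(pre["i"][i][j])
                f_g = sigmoid(pre["f"][i][j])
                o_g = sigmoid(pre["o"][i][j])
                cand = math.tanh(pre["g"][i][j])
                c_new[i][j] = f_g * c[i][j] + i_g * cand
                h_new[i][j] = o_g * math.tanh(c_new[i][j])
        return h_new, c_new


def run_sequence(cell, frames):
    """Feed the frames in temporal order and return the final hidden state."""
    height, width = len(frames[0]), len(frames[0][0])
    h = [[0.0] * width for _ in range(height)]
    c = [[0.0] * width for _ in range(height)]
    for x in frames:
        h, c = cell.step(x, h, c)
    return h


if __name__ == "__main__":
    # Three toy 4x5 "feature maps" standing in for per-frame encoder outputs.
    frames = [[[float(t)] * 5 for _ in range(4)] for t in range(3)]
    hidden = run_sequence(ConvLSTMCell(), frames)
    print(len(hidden), len(hidden[0]))
```

Because the hidden and cell states are carried from frame to frame, the output for the current frame depends on earlier frames, which is the temporal information that frame-by-frame processing discards. The spatial layout of the feature map is preserved, so the cell can in principle be inserted at any position where a feature map is available.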

Comments:	This work has been submitted to the IEEE for possible publication
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
Cite as:	arXiv:1905.01058 [cs.CV]
	(or arXiv:1905.01058v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1905.01058
Journal reference:	IEEE Intelligent Vehicles Symposium 2019 (IV'19)

Submission history

From: Andreas Pfeuffer
[v1] Fri, 3 May 2019 07:52:32 UTC (2,246 KB)
