MultiNet++: Multi-Stream Feature Aggregation and Geometric Loss Strategy for Multi-Task Learning

Chennupati, Sumanth; Sistu, Ganesh; Yogamani, Senthil; Rawashdeh, Samir A

arXiv:1904.08492 (cs)

[Submitted on 15 Apr 2019 (v1), last revised 22 Apr 2019 (this version, v2)]

Abstract: Multi-task learning is commonly used in autonomous driving for solving various visual perception tasks. It offers significant benefits in terms of both performance and computational complexity. Current work on multi-task learning networks focuses on processing a single input image, and there is no known implementation of multi-task learning that handles a sequence of images. In this work, we propose a multi-stream multi-task network that takes advantage of feature representations from preceding frames in a video sequence for joint learning of segmentation, depth, and motion. The weights of the current and previous encoders are shared so that features computed in the previous frame can be leveraged without additional computation. In addition, we propose to use the geometric mean of task losses as a better alternative to the weighted average of task losses. The proposed loss function facilitates better handling of the difference in convergence rates of different tasks. Experimental results on the KITTI, Cityscapes and SYNTHIA datasets demonstrate that the proposed strategies outperform various existing multi-task learning solutions.
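
The contrast between the two loss strategies named in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the `eps` guard, and the example loss values are assumptions made for illustration, and the abstract does not say how the geometric mean is computed in practice. The sketch only shows one intuition for why a geometric mean may cope better with tasks whose losses live on different scales: a given relative improvement in any single task changes the combined loss by the same factor, whereas a weighted average is dominated by the task with the largest loss.

```python
import math


def weighted_average_loss(losses, weights=None):
    """Baseline: weighted arithmetic mean of per-task losses.

    With weights=None, every task gets the same weight 1/n (an assumption
    made here for illustration).
    """
    if weights is None:
        weights = [1.0 / len(losses)] * len(losses)
    return sum(w * loss for w, loss in zip(weights, losses))


def geometric_mean_loss(losses, eps=1e-12):
    """Geometric mean of n positive task losses: (L_1 * ... * L_n) ** (1/n).

    Evaluated in log space for numerical stability. `eps` is a hypothetical
    guard against log(0), not something taken from the paper.
    """
    n = len(losses)
    return math.exp(sum(math.log(loss + eps) for loss in losses) / n)


# Hypothetical losses for segmentation, depth, and motion on different scales.
base = [4.0, 1.0, 0.25]
halve_large = [2.0, 1.0, 0.25]   # largest loss improves by 50%
halve_small = [4.0, 1.0, 0.125]  # smallest loss improves by 50%

for name, fn in [("weighted average", weighted_average_loss),
                 ("geometric mean", geometric_mean_loss)]:
    ratio_large = fn(halve_large) / fn(base)
    ratio_small = fn(halve_small) / fn(base)
    print(f"{name}: ratio after halving large = {ratio_large:.4f}, "
          f"after halving small = {ratio_small:.4f}")
```

In this sketch the geometric mean shrinks by the same factor (the cube root of 0.5) whichever task improves, while the equally weighted average barely registers progress on the task with the smallest loss. Whether this is the mechanism behind the results reported on KITTI, Cityscapes and SYNTHIA is something the full paper, not this sketch, would have to confirm.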

Comments:	Accepted for CVPR 2019 Workshop on Autonomous Driving (WAD). Demo Video can be accessed at this https URL
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:1904.08492 [cs.CV]
	(or arXiv:1904.08492v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1904.08492

Submission history

From: Sumanth Chennupati
[v1] Mon, 15 Apr 2019 19:25:59 UTC (4,298 KB)
[v2] Mon, 22 Apr 2019 13:21:27 UTC (4,299 KB)
