Monocular Depth Estimation by Learning from Heterogeneous Datasets

Gurram, Akhil; Urfalioglu, Onay; Halfaoui, Ibrahim; Bouzaraa, Fahd; Lopez, Antonio M.

Computer Science > Computer Vision and Pattern Recognition

arXiv:1803.08018 (cs)

[Submitted on 21 Mar 2018 (v1), last revised 12 Sep 2018 (this version, v2)]

Title:Monocular Depth Estimation by Learning from Heterogeneous Datasets

Authors:Akhil Gurram, Onay Urfalioglu, Ibrahim Halfaoui, Fahd Bouzaraa, Antonio M. Lopez

View PDF

Abstract:Depth estimation provides essential information to perform autonomous driving and driver assistance. Especially, Monocular Depth Estimation is interesting from a practical point of view, since using a single camera is cheaper than many other options and avoids the need for continuous calibration strategies as required by stereo-vision approaches. State-of-the-art methods for Monocular Depth Estimation are based on Convolutional Neural Networks (CNNs). A promising line of work consists of introducing additional semantic information about the traffic scene when training CNNs for depth estimation. In practice, this means that the depth data used for CNN training is complemented with images having pixel-wise semantic labels, which usually are difficult to annotate (e.g. crowded urban images). Moreover, so far it is common practice to assume that the same raw training data is associated with both types of ground truth, i.e., depth and semantic labels. The main contribution of this paper is to show that this hard constraint can be circumvented, i.e., that we can train CNNs for depth estimation by leveraging the depth and semantic information coming from heterogeneous datasets. In order to illustrate the benefits of our approach, we combine KITTI depth and Cityscapes semantic segmentation datasets, outperforming state-of-the-art results on Monocular Depth Estimation.

Comments:	Accepted in IEEE-Intelligent Vehicles Symposium, IV'2018
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
Cite as:	arXiv:1803.08018 [cs.CV]
	(or arXiv:1803.08018v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1803.08018

Submission history

From: Akhil Gurram [view email]
[v1] Wed, 21 Mar 2018 17:18:25 UTC (8,956 KB)
[v2] Wed, 12 Sep 2018 17:40:58 UTC (8,956 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Monocular Depth Estimation by Learning from Heterogeneous Datasets

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Monocular Depth Estimation by Learning from Heterogeneous Datasets

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators