Weakly supervised monocular depth estimation method based on stereo matching labels

Zhimin Zhang; Jianzhong Qiao; Shukuan Lin; Han Liu

doi:10.1117/1.JEI.29.5.053013

13 October 2020 Weakly supervised monocular depth estimation method based on stereo matching labels

Zhimin Zhang, Jianzhong Qiao, Shukuan Lin, Han Liu

Author Affiliations +

Journal of Electronic Imaging, Vol. 29, Issue 5, 053013 (October 2020). https://doi.org/10.1117/1.JEI.29.5.053013

Abstract

Current self-supervised monocular methods only learn effectively by imposing consistency constraints without relying on any geometric constraints or ground truth depth constraints, which makes the accuracy of the estimation result suboptimal. Compared with the monocular algorithm, the stereo matching network usually follows the geometric process of the traditional stereo algorithm, which makes the estimation result more accurate. Inspired by these findings, we proposed a weakly supervised monocular learning approach that makes use of the disparity maps generated by the self-supervised stereo matching model as the “ground truth” labels to train a self-supervised monocular depth estimation model. To obtain more accurate ground truth labels, we improve the layer of geometry and context in self-supervised deep stereo regression by replacing the 3D convolutional layer with a guided aggregation layer. The design can also reduce computational costs and memory consumption. Then, we build our weakly supervised monocular model by improving the U-Net model and designing a loss function composed of a weakly supervised cost and a self-supervised cost. The estimation results obtained using our model outperform those of the existing self-supervised depth estimation methods under the same training conditions on the challenging KITTI dataset, and the results can easily be generalized to the Cityscapes dataset.

Citation Download Citation

Zhimin Zhang, Jianzhong Qiao, Shukuan Lin, and Han Liu "Weakly supervised monocular depth estimation method based on stereo matching labels," Journal of Electronic Imaging 29(5), 053013 (13 October 2020). https://doi.org/10.1117/1.JEI.29.5.053013

Received: 26 March 2020; Accepted: 28 September 2020; Published: 13 October 2020

ACCESS THE FULL ARTICLE

INSTITUTIONAL
Select your institution to access the SPIE Digital Library.

SELECT YOUR INSTITUTION

PERSONAL
Sign in with your SPIE account to access your personal subscriptions or to use specific features such as save to my library, sign up for alerts, save searches, etc.

PERSONAL SIGN IN

No SPIE Account? Create one

PURCHASE THIS CONTENT

SUBSCRIBE TO DIGITAL LIBRARY

50 downloads per 1-year subscription

Members: $195

Non-members: $335 ADD TO CART

25 downloads per 1 - year subscription

Members: $145

Non-members: $250 ADD TO CART

PURCHASE SINGLE ARTICLE

Includes PDF, HTML & Video, when available