Enforcing geometric constraints of virtual normal for depth prediction

Date

2019

Authors

Yin, W.
Liu, Y.
Shen, C.
Yan, Y.

Type

Conference paper

Citation

Proceedings / IEEE International Conference on Computer Vision. IEEE International Conference on Computer Vision, 2019, vol. 2019-October, pp. 5683-5692

Statement of Responsibility

Wei Yin, Yifan Liu, Chunhua Shen, Youliang Yan

Conference Name

IEEE/CVF International Conference on Computer Vision (ICCV) (27 Oct 2019 - 2 Nov 2019 : Seoul, South Korea)

Abstract

Monocular depth prediction plays a crucial role in understanding 3D scene geometry. Although recent methods have achieved impressive progress in evaluation metrics such as the pixel-wise relative error, most methods neglect the geometric constraints in the 3D space. In this work, we show the importance of the high-order 3D geometric constraints for depth prediction. By designing a loss term that enforces one simple type of geometric constraint, namely, virtual normal directions determined by three randomly sampled points in the reconstructed 3D space, we can considerably improve the depth prediction accuracy. Furthermore, we can not only predict accurate depth but also obtain other high-quality 3D information from the depth without retraining new parameters. Significantly, a byproduct of the predicted depth being sufficiently accurate is that we can now recover good 3D structures of the scene, such as the point cloud and surface normals, directly from the depth, eliminating the need to train new sub-models as was previously done. Experiments on two challenging benchmarks, NYU Depth-V2 and KITTI, demonstrate the effectiveness of our method and its state-of-the-art performance.
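
The virtual normal constraint described in the abstract can be illustrated with a small NumPy sketch. This is not the authors' implementation: the pinhole back-projection, the function names (`backproject`, `virtual_normals`, `virtual_normal_loss`), the parameters `num_triplets` and `min_area`, and the use of a mean absolute difference between normals are all assumptions made for illustration. The idea shown is only the one the abstract states: lift the depth map to 3D, sample point triplets at random, and compare the normal of the plane each triplet spans in the predicted and ground-truth reconstructions.

```python
import numpy as np


def backproject(depth, fx, fy, cx, cy):
    """Lift an HxW depth map to an (H*W, 3) point cloud (assumed pinhole camera)."""
    h, w = depth.shape
    u, v = np.meshgrid(np.arange(w), np.arange(h))
    x = (u - cx) * depth / fx
    y = (v - cy) * depth / fy
    return np.stack([x, y, depth], axis=-1).reshape(-1, 3)


def virtual_normals(points, idx, eps=1e-8):
    """Unit normals of the planes through point triplets; idx has shape (N, 3)."""
    a, b, c = points[idx[:, 0]], points[idx[:, 1]], points[idx[:, 2]]
    n = np.cross(b - a, c - a)
    norm = np.linalg.norm(n, axis=1, keepdims=True)
    # Also return the cross-product magnitude so degenerate triplets can be dropped.
    return n / np.maximum(norm, eps), norm[:, 0]


def virtual_normal_loss(pred_depth, gt_depth, intrinsics,
                        num_triplets=1000, min_area=1e-4, seed=0):
    """Mean absolute difference between predicted and ground-truth virtual normals."""
    rng = np.random.default_rng(seed)
    gt_pts = backproject(gt_depth, *intrinsics)
    pred_pts = backproject(pred_depth, *intrinsics)

    # Sample the same random triplets of pixel indices for both reconstructions.
    idx = rng.integers(0, gt_pts.shape[0], size=(num_triplets, 3))
    gt_n, gt_norm = virtual_normals(gt_pts, idx)
    pred_n, _ = virtual_normals(pred_pts, idx)

    # Hypothetical filter: skip (near-)collinear triplets, whose normal is unstable.
    valid = gt_norm > min_area
    if not valid.any():
        return 0.0
    return float(np.abs(pred_n[valid] - gt_n[valid]).sum(axis=1).mean())
```

In this sketch the loss is zero when the predicted depth equals the ground truth and grows as the predicted 3D structure bends away from it. In a training setting the same computation would be written in an autodiff framework and added to a pixel-wise depth loss; that combination is likewise an assumption here, not a detail given in this record.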

Rights

©2019 IEEE
