Residential College | false |
Status | 已發表Published |
BRNet: Exploring Comprehensive Features for Monocular Depth Estimation | |
Han, Wencheng1; Yin, Junbo2; Jin, Xiaogang3; Dai, Xiangdong4; Shen, Jianbing1 | |
2022-10-23 | |
Conference Name | 17th European Conference on Computer Vision (ECCV) |
Source Publication | Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) |
Volume | 13698 |
Pages | 586-602 |
Conference Date | OCT 23-27, 2022 |
Conference Place | Tel Aviv, ISRAEL |
Abstract | Self-supervised monocular depth estimation has achieved encouraging performance recently. A consensus is that high-resolution inputs often yield better results. However, we find that the performance gap between high and low resolutions in this task mainly lies in the inappropriate feature representation of the widely used U-Net backbone rather than the information difference. In this paper, we address the comprehensive feature representation problem for self-supervised depth estimation by paying attention to both local and global feature representation. Specifically, we first provide an in-depth analysis of the influence of different input resolutions and find out that the receptive fields play a more crucial role than the information disparity between inputs. To this end, we propose a bilateral depth encoder that can fully exploit detailed and global information. It benefits from more broad receptive fields and thus achieves substantial improvements. Furthermore, we propose a residual decoder to facilitate depth regression as well as save computations by focusing on the information difference between different layers. We named our new depth estimation model Bilateral Residual Depth Network (BRNet). Experimental results show that BRNet achieves new state-of-the-art performance on the KITTI benchmark with three types of self-supervision. Codes are available at: https://github.com/wencheng256/BRNet. |
DOI | 10.1007/978-3-031-19839-7_34 |
URL | View the original |
Indexed By | CPCI-S |
Language | 英語English |
WOS Research Area | Computer Science ; Imaging Science & Photographic Technology |
WOS Subject | Computer Science, Artificial Intelligence ; Imaging Science & Photographic Technology |
WOS ID | WOS:000903760400034 |
Scopus ID | 2-s2.0-85142678479 |
Fulltext Access | |
Citation statistics | |
Document Type | Conference paper |
Collection | Faculty of Science and Technology THE STATE KEY LABORATORY OF INTERNET OF THINGS FOR SMART CITY (UNIVERSITY OF MACAU) DEPARTMENT OF COMPUTER AND INFORMATION SCIENCE |
Corresponding Author | Shen, Jianbing |
Affiliation | 1.SKL-IOTSC, Computer and Information Science, University of Macau, Zhuhai, China 2.School of Computer Science, Beijing Institute of Technology, Beijing, China 3.State Key Lab of CAD &CG, Zhejiang University, Hangzhou, 310058, China 4.Guangdong OPPO Mobile Telecommunications Corp., Ltd., Dongguan, China |
First Author Affilication | University of Macau |
Corresponding Author Affilication | University of Macau |
Recommended Citation GB/T 7714 | Han, Wencheng,Yin, Junbo,Jin, Xiaogang,et al. BRNet: Exploring Comprehensive Features for Monocular Depth Estimation[C], 2022, 586-602. |
APA | Han, Wencheng., Yin, Junbo., Jin, Xiaogang., Dai, Xiangdong., & Shen, Jianbing (2022). BRNet: Exploring Comprehensive Features for Monocular Depth Estimation. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 13698, 586-602. |
Files in This Item: | There are no files associated with this item. |
Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.
Edit Comment