Residential College | false |
Status | 已發表Published |
Weakly Supervised Monocular 3D Object Detection Using Multi-View Projection and Direction Consistency | |
Tao, Runzhou1,3; Han, Wencheng2; Qiu, Zhongying3; Xu, Cheng Zhong2; Shen, Jianbing2 | |
2023 | |
Conference Name | IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) |
Source Publication | Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition |
Volume | 2023-June |
Pages | 17482-17492 |
Conference Date | JUN 17-24, 2023 |
Conference Place | Vancouver, CANADA |
Abstract | Monocular 3D object detection has become a mainstream approach in automatic driving for its easy application. A prominent advantage is that it does not need Li-DAR point clouds during the inference. However, most current methods still rely on 3D point cloud data for labeling the ground truths used in the training phase. This inconsistency between the training and inference makes it hard to utilize the large-scale feedback data and increases the data collection expenses. To bridge this gap, we propose a new weakly supervised monocular 3D objection detection method, which can train the model with only 2D labels marked on images. To be specific, we explore three types of consistency in this task, i.e. the projection, multi-view and direction consistency, and design a weakly-supervised architecture based on these consistencies. Moreover, we propose a new 2D direction labeling method in this task to guide the model for accurate rotation direction prediction. Experiments show that our weakly-supervised method achieves comparable performance with some fully supervised methods. When used as a pre-training method, our model can significantly outperform the corresponding fully-supervised baseline with only 1/3 3D labels. |
Keyword | Autonomous Driving |
DOI | 10.1109/CVPR52729.2023.01677 |
URL | View the original |
Indexed By | CPCI-S |
Language | 英語English |
WOS Research Area | Computer Science |
WOS Subject | Computer Science, Artificial Intelligence |
WOS ID | WOS:001062531301076 |
Scopus ID | 2-s2.0-85173971108 |
Fulltext Access | |
Citation statistics | |
Document Type | Conference paper |
Collection | THE STATE KEY LABORATORY OF INTERNET OF THINGS FOR SMART CITY (UNIVERSITY OF MACAU) Faculty of Science and Technology |
Corresponding Author | Shen, Jianbing |
Affiliation | 1.Beijing Institute of Technology, China 2.SKL-IOTSC, Cis, University of Macau, Macao 3.QCraft, |
Corresponding Author Affilication | University of Macau |
Recommended Citation GB/T 7714 | Tao, Runzhou,Han, Wencheng,Qiu, Zhongying,et al. Weakly Supervised Monocular 3D Object Detection Using Multi-View Projection and Direction Consistency[C], 2023, 17482-17492. |
APA | Tao, Runzhou., Han, Wencheng., Qiu, Zhongying., Xu, Cheng Zhong., & Shen, Jianbing (2023). Weakly Supervised Monocular 3D Object Detection Using Multi-View Projection and Direction Consistency. Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2023-June, 17482-17492. |
Files in This Item: | There are no files associated with this item. |
Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.
Edit Comment