Residential College | false |
Status | 已發表Published |
Semi-supervised 3D Object Detection with Proficient Teachers | |
Yin, Junbo1; Fang, Jin2,3,4; Zhou, Dingfu2,3; Zhang, Liangjun2,3; Xu, Cheng Zhong4; Shen, Jianbing4; Wang, Wenguan5 | |
2022-10-23 | |
Conference Name | 17th European Conference on Computer Vision (ECCV) |
Source Publication | Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) |
Volume | 13698 |
Pages | 727-743 |
Conference Date | OCT 23-27, 2022 |
Conference Place | Tel Aviv |
Country | ISRAEL |
Publisher | SPRINGER-VERLAG BERLIN, HEIDELBERGER PLATZ 3, D-14197 BERLIN, GERMANY |
Abstract | Dominated point cloud-based 3D object detectors in autonomous driving scenarios rely heavily on the huge amount of accurately labeled samples, however, 3D annotation in the point cloud is extremely tedious, expensive and time-consuming. To reduce the dependence on large supervision, semi-supervised learning (SSL) based approaches have been proposed. The Pseudo-Labeling methodology is commonly used for SSL frameworks, however, the low-quality predictions from the teacher model have seriously limited its performance. In this work, we propose a new Pseudo-Labeling framework for semi-supervised 3D object detection, by enhancing the teacher model to a proficient one with several necessary designs. First, to improve the recall of pseudo labels, a Spatial-temporal Ensemble (STE) module is proposed to generate sufficient seed boxes. Second, to improve the precision of recalled boxes, a Clustering-based Box Voting (CBV) module is designed to get aggregated votes from the clustered seed boxes. This also eliminates the necessity of sophisticated thresholds to select pseudo labels. Furthermore, to reduce the negative influence of wrongly pseudo-labeled samples during the training, a soft supervision signal is proposed by considering Box-wise Contrastive Learning (BCL). The effectiveness of our model is verified on both ONCE and Waymo datasets. For example, on ONCE, our approach significantly improves the baseline by 9.51 mAP. Moreover, with half annotations, our model outperforms the oracle model with full annotations on Waymo. |
Keyword | 3d Object Detection Point Cloud Semi-supervised Learning |
DOI | 10.1007/978-3-031-19839-7_42 |
URL | View the original |
Indexed By | CPCI-S |
Language | 英語English |
WOS Research Area | Computer Science ; Imaging Science & Photographic Technology |
WOS Subject | Computer Science, Artificial Intelligence ; Imaging Science & Photographic Technology |
WOS ID | WOS:000903760400042 |
Scopus ID | 2-s2.0-85142715900 |
Fulltext Access | |
Citation statistics | |
Document Type | Conference paper |
Collection | DEPARTMENT OF COMPUTER AND INFORMATION SCIENCE Faculty of Science and Technology THE STATE KEY LABORATORY OF INTERNET OF THINGS FOR SMART CITY (UNIVERSITY OF MACAU) |
Corresponding Author | Wang, Wenguan |
Affiliation | 1.School of Computer Science, Beijing Institute of Technology, Beijing, China 2.Baidu Research, Beijing, China 3.National Engineering Laboratory of Deep Learning Technology and Application, Beijing, China 4.SKL-IOTSC, CIS, University of Macau, Zhuhai, China 5.ReLER, AAII, University of Technology Sydney, Ultimo, Australia |
Recommended Citation GB/T 7714 | Yin, Junbo,Fang, Jin,Zhou, Dingfu,et al. Semi-supervised 3D Object Detection with Proficient Teachers[C]:SPRINGER-VERLAG BERLIN, HEIDELBERGER PLATZ 3, D-14197 BERLIN, GERMANY, 2022, 727-743. |
APA | Yin, Junbo., Fang, Jin., Zhou, Dingfu., Zhang, Liangjun., Xu, Cheng Zhong., Shen, Jianbing., & Wang, Wenguan (2022). Semi-supervised 3D Object Detection with Proficient Teachers. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 13698, 727-743. |
Files in This Item: | There are no files associated with this item. |
Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.
Edit Comment