Residential College | false |
Status | 已發表Published |
ProposalContrast: Unsupervised Pre-training for LiDAR-Based 3D Object Detection | |
Yin, Junbo1; Zhou, Dingfu2,3; Zhang, Liangjun2,3; Fang, Jin2,3,4; Xu, Cheng Zhong4; Shen, Jianbing4; Wang, Wenguan5 | |
2022-10-23 | |
Conference Name | 17th European Conference on Computer Vision (ECCV) |
Source Publication | Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) |
Volume | 13699 |
Pages | 17-33 |
Conference Date | OCT 23-27, 2022 |
Conference Place | Tel Aviv, ISRAEL |
Publisher | SPRINGER-VERLAG BERLIN, HEIDELBERGER PLATZ 3, D-14197 BERLIN, GERMANY |
Abstract | Existing approaches for unsupervised point cloud pre-training are constrained to either scene-level or point/voxel-level instance discrimination. Scene-level methods tend to lose local details that are crucial for recognizing the road objects, while point/voxel-level methods inherently suffer from limited receptive field that is incapable of perceiving large objects or context environments. Considering region-level representations are more suitable for 3D object detection, we devise a new unsupervised point cloud pre-training framework, called ProposalContrast, that learns robust 3D representations by contrasting region proposals. Specifically, with an exhaustive set of region proposals sampled from each point cloud, geometric point relations within each proposal are modeled for creating expressive proposal representations. To better accommodate 3D detection properties, ProposalContrast optimizes with both inter-cluster and inter-proposal separation, i.e., sharpening the discriminativeness of proposal representations across semantic classes and object instances. The generalizability and transferability of ProposalContrast are verified on various 3D detectors (i.e., PV-RCNN, CenterPoint, PointPillars and PointRCNN) and datasets (i.e., KITTI, Waymo and ONCE). |
Keyword | 3d Object Detection Unsupervised Point Cloud Pre-training |
DOI | 10.1007/978-3-031-19842-7_2 |
URL | View the original |
Indexed By | CPCI-S |
Language | 英語English |
WOS Research Area | Computer Science ; Imaging Science & Photographic Technology |
WOS Subject | Computer Science, Artificial Intelligence ; Imaging Science & Photographic Technology |
WOS ID | WOS:000904430800002 |
Scopus ID | 2-s2.0-85142672848 |
Fulltext Access | |
Citation statistics | |
Document Type | Conference paper |
Collection | THE STATE KEY LABORATORY OF INTERNET OF THINGS FOR SMART CITY (UNIVERSITY OF MACAU) Faculty of Science and Technology |
Corresponding Author | Shen, Jianbing; Wang, Wenguan |
Affiliation | 1.School of Computer Science, Beijing Institute of Technology, Beijing, China 2.Baidu Research, Beijing, China 3.National Engineering Laboratory of Deep Learning Technology and Application, Beijing, China 4.SKL-IOTSC, CIS, University of Macau, Zhuhai, China 5.ReLER, AAII, University of Technology Sydney, Ultimo, Australia |
Corresponding Author Affilication | University of Macau |
Recommended Citation GB/T 7714 | Yin, Junbo,Zhou, Dingfu,Zhang, Liangjun,et al. ProposalContrast: Unsupervised Pre-training for LiDAR-Based 3D Object Detection[C]:SPRINGER-VERLAG BERLIN, HEIDELBERGER PLATZ 3, D-14197 BERLIN, GERMANY, 2022, 17-33. |
APA | Yin, Junbo., Zhou, Dingfu., Zhang, Liangjun., Fang, Jin., Xu, Cheng Zhong., Shen, Jianbing., & Wang, Wenguan (2022). ProposalContrast: Unsupervised Pre-training for LiDAR-Based 3D Object Detection. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 13699, 17-33. |
Files in This Item: | There are no files associated with this item. |
Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.
Edit Comment