Residential College | false |
Status | 已發表Published |
S3Net: Self-Supervised Self-Ensembling Network for Semi-Supervised RGB-D Salient Object Detection | |
Zhu, Lei1; Wang, Xiaoqiang2; Li, Ping3; Yang, Xin4; Zhang, Qing5; Wang, Weiming6; Schonlieb, Carola Bibiane7; Chen, C. L.Philip8,9,10 | |
2023 | |
Source Publication | IEEE Transactions on Multimedia
![]() |
ISSN | 1520-9210 |
Volume | 25Pages:676 - 689 |
Abstract | RGB-D salient object detection aims to detect visually distinctive objects or regions from a pair of the RGB image and the depth image. State-of-the-art RGB-D saliency detectors are mainly based on convolutional neural networks but almost suffer from an intrinsic limitation relying on the labeled data, thus degrading detection accuracy in complex cases. In this work, we present a self-supervised self-ensembling network (S$^3$Net) for semi-supervised RGB-D salient object detection by leveraging the unlabeled data and exploring a self-supervised learning mechanism. To be specific, we first build a self-guided convolutional neural network (SG-CNN) as a baseline model by developing a series of three-layer cross-model feature fusion (TCF) modules to leverage complementary information among depth and RGB modalities and formulating an auxiliary task that predicts a self-supervised image rotation angle. After that, to further explore the knowledge from unlabeled data, we assign SG-CNN to a student network and a teacher network, and encourage the saliency predictions and self-supervised rotation predictions from these two networks to be consistent on the unlabeled data. Experimental results on seven widely-used benchmark datasets demonstrate that our network quantitatively and qualitatively outperforms the state-of-the-art methods. |
Keyword | Rgb-d Salient Object Detection Self-supervised Learning Semi-supervised Learning And Cross-model And Cross-level Feature Aggregation |
DOI | 10.1109/TMM.2021.3129730 |
URL | View the original |
Indexed By | SCIE |
Language | 英語English |
WOS Research Area | Computer Science ; Telecommunications |
WOS Subject | Computer Science, Information systemsComputer Science, Software Engineeringtelecommunications |
WOS ID | WOS:000961977900001 |
Publisher | IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC445 HOES LANE, PISCATAWAY, NJ 08855-4141 |
Scopus ID | 2-s2.0-85120066638 |
Fulltext Access | |
Citation statistics | |
Document Type | Journal article |
Collection | Faculty of Science and Technology |
Affiliation | 1.University of Cambridge, Hong Kong University of Science and Technology, Department of Applied Mathematics and Theoretical Physics (DAMTP), Cambridge, CB3 0WA, United Kingdom 2.Zhejiang University, College of Computer Science and Technology, Shatin, 310058, China 3.Hong Kong Polytechnic University, Department of Computing, Kowloon, 00852, Hong Kong 4.Dalian University of Technology, Department of Computer Science, Dalian, 116024, China 5.Sun Yat-Sen University, School of Data and Computer Science, Guangzhou, 510006, China 6.Hong Kong Metropolitan University, School of Science and Technology, Ho Man Tin, 00852, Hong Kong 7.University of Cambridge, Department of Applied Mathematics and Theoretical Physics (DAMTP), Cambridge, CB30WA, United Kingdom 8.School of Computer Science and Engineering, South China University of Technology, Guangzhou, 510006, China 9.Dalian Maritime University, Navigation College, Dalian, 116026, China 10.University of Macau, Faculty of Science, and Technology, 999078, Macao |
Recommended Citation GB/T 7714 | Zhu, Lei,Wang, Xiaoqiang,Li, Ping,et al. S3Net: Self-Supervised Self-Ensembling Network for Semi-Supervised RGB-D Salient Object Detection[J]. IEEE Transactions on Multimedia, 2023, 25, 676 - 689. |
APA | Zhu, Lei., Wang, Xiaoqiang., Li, Ping., Yang, Xin., Zhang, Qing., Wang, Weiming., Schonlieb, Carola Bibiane., & Chen, C. L.Philip (2023). S3Net: Self-Supervised Self-Ensembling Network for Semi-Supervised RGB-D Salient Object Detection. IEEE Transactions on Multimedia, 25, 676 - 689. |
MLA | Zhu, Lei,et al."S3Net: Self-Supervised Self-Ensembling Network for Semi-Supervised RGB-D Salient Object Detection".IEEE Transactions on Multimedia 25(2023):676 - 689. |
Files in This Item: | There are no files associated with this item. |
Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.
Edit Comment