Residential College | false |
Status | 已發表Published |
Siamese Network for RGB-D Salient Object Detection and beyond | |
Fu, Keren1,2; Fan, Deng Ping3; Ji, Ge Peng4; Zhao, Qijun1,6; Shen, Jianbing5; Zhu, Ce6 | |
2022-09-01 | |
Source Publication | IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE |
ISSN | 0162-8828 |
Volume | 44Issue:9Pages:5541-5559 |
Abstract | Existing RGB-D salient object detection (SOD) models usually treat RGB and depth as independent information and design separate networks for feature extraction from each. Such schemes can easily be constrained by a limited amount of training data or over-reliance on an elaborately designed training process. Inspired by the observation that RGB and depth modalities actually present certain commonality in distinguishing salient objects, a novel joint learning and densely cooperative fusion (JL-DCF) architecture is designed to learn from both RGB and depth inputs through a shared network backbone, known as the Siamese architecture. In this paper, we propose two effective components: joint learning (JL), and densely cooperative fusion (DCF). The JL module provides robust saliency feature learning by exploiting cross-modal commonality via a Siamese network, while the DCF module is introduced for complementary feature discovery. Comprehensive experiments using five popular metrics show that the designed framework yields a robust RGB-D saliency detector with good generalization. As a result, JL-DCF significantly advances the state-of-the-art models by an average of \sim 2.0\%∼2.0% (max F-measure) across seven challenging datasets. In addition, we show that JL-DCF is readily applicable to other related multi-modal detection tasks, including RGB-T (thermal infrared) SOD and video SOD, achieving comparable or even better performance against state-of-the-art methods. We also link JL-DCF to the RGB-D semantic segmentation field, showing its capability of outperforming several semantic segmentation models on the task of RGB-D SOD. These facts further confirm that the proposed framework could offer a potential solution for various applications and provide more insight into the cross-modal complementarity task. |
Keyword | Rgb-d Semantic Segmentation Rgb-d Sod Saliency Detection Salient Object Detection Siamese Network |
DOI | 10.1109/TPAMI.2021.3073689 |
URL | View the original |
Indexed By | SCIE |
Language | 英語English |
WOS Research Area | Computer Science ; Engineering |
WOS Subject | Computer Science, Artificial Intelligence ; Engineering, Electrical & Electronic |
WOS ID | WOS:000836666600074 |
Publisher | IEEE COMPUTER SOC10662 LOS VAQUEROS CIRCLE, PO BOX 3014, LOS ALAMITOS, CA 90720-1314 |
Scopus ID | 2-s2.0-85104572858 |
Fulltext Access | |
Citation statistics | |
Document Type | Journal article |
Collection | Faculty of Science and Technology THE STATE KEY LABORATORY OF INTERNET OF THINGS FOR SMART CITY (UNIVERSITY OF MACAU) DEPARTMENT OF COMPUTER AND INFORMATION SCIENCE |
Corresponding Author | Fu, Keren; Fan, Deng Ping; Ji, Ge Peng; Zhao, Qijun; Shen, Jianbing; Zhu, Ce |
Affiliation | 1.Sichuan University, College of Computer Science, Chengdu, Sichuan, 610065, China 2.Sichuan University, National Key Laboratory of Fundamental Science on Synthetic Vision, Chengdu, Sichuan, 610017, China 3.Nankai University, College of Computer Science, Tianjin, 300350, China 4.Wuhan University, School of Computer Science, Wuhan, Hubei, 430072, China 5.University of Macau, State Key Laboratory of Internet of Things for Smart City, Department of Computer and Information Science, Macao 6.University of Electronic Science and Technology of China, School of Information and Communication Engineering, Chengdu, Sichuan, 611731, China |
Corresponding Author Affilication | University of Macau |
Recommended Citation GB/T 7714 | Fu, Keren,Fan, Deng Ping,Ji, Ge Peng,et al. Siamese Network for RGB-D Salient Object Detection and beyond[J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44(9), 5541-5559. |
APA | Fu, Keren., Fan, Deng Ping., Ji, Ge Peng., Zhao, Qijun., Shen, Jianbing., & Zhu, Ce (2022). Siamese Network for RGB-D Salient Object Detection and beyond. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 44(9), 5541-5559. |
MLA | Fu, Keren,et al."Siamese Network for RGB-D Salient Object Detection and beyond".IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE 44.9(2022):5541-5559. |
Files in This Item: | There are no files associated with this item. |
Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.
Edit Comment