Residential College | false |
Status | 已發表Published |
Zero-Shot Video Object Segmentation with Co-Attention Siamese Networks | |
Lu, Xiankai1; Wang, Wenguan2; Shen, Jianbing3; Crandall, David4; Luo, Jiebo5 | |
2020-11-24 | |
Source Publication | IEEE Transactions on Pattern Analysis and Machine Intelligence |
ISSN | 0162-8828 |
Volume | 44Issue:4Pages:2228-2242 |
Abstract | We introduce a novel network, called CO-attention siamese network (COSNet), to address the zero-shot video object segmentation task in a holistic fashion. We exploit the inherent correlation among video frames and incorporate a global co-attention mechanism to further improve the state-of-the-art deep learning based solutions that primarily focus on learning discriminative foreground representations over appearance and motion in short-term temporal segments. The co-attention layers in COSNet provide efficient and competent stages for capturing global correlations and scene context by jointly computing and appending co-attention responses into a joint feature space. COSNet is a unified and end-to-end trainable framework where different co-attention variants can be derived for capturing diverse properties of the learned joint feature space. We train COSNet with pairs (or groups) of video frames, and this naturally augments training data and allows increased learning capacity. During the segmentation stage, the co-attention model encodes useful information by processing multiple reference frames together, which is leveraged to infer the frequently reappearing and salient foreground objects better. Our extensive experiments over three large benchmarks demonstrate that COSNet outperforms the current alternatives by a large margin. Our implementations are available at https://github.com/carrierlxk/COSNet. |
Keyword | Differentiable Co-attention Mechanism Siamese Network Zero-shot Video Object Segmentation |
DOI | 10.1109/TPAMI.2020.3040258 |
URL | View the original |
Indexed By | SCIE |
Language | 英語English |
WOS Research Area | Computer Science ; Engineering |
WOS Subject | Computer Science, Artificial Intelligence ; Engineering, Electrical & Electronic |
WOS ID | WOS:000764815300041 |
Publisher | IEEE Computer Society |
Scopus ID | 2-s2.0-85097143029 |
Fulltext Access | |
Citation statistics | |
Document Type | Journal article |
Collection | DEPARTMENT OF COMPUTER AND INFORMATION SCIENCE Faculty of Science and Technology THE STATE KEY LABORATORY OF INTERNET OF THINGS FOR SMART CITY (UNIVERSITY OF MACAU) |
Corresponding Author | Wang, Wenguan |
Affiliation | 1.School of Software, Shangdong University, Jinan, 250100, China 2.ETH Zurich, Zurich, 8092, Switzerland 3.Department of Computer and Information Science, State Key Laboratory of Internet of Things for Smart City, University of Macau, Macao 4.School of Informatics Computing and Engineering, Indiana University, Bloomington, 47405, United States 5.Department of Computer Scienece, University of Rochester, Rochester, 14627, United States |
Recommended Citation GB/T 7714 | Lu, Xiankai,Wang, Wenguan,Shen, Jianbing,et al. Zero-Shot Video Object Segmentation with Co-Attention Siamese Networks[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2020, 44(4), 2228-2242. |
APA | Lu, Xiankai., Wang, Wenguan., Shen, Jianbing., Crandall, David., & Luo, Jiebo (2020). Zero-Shot Video Object Segmentation with Co-Attention Siamese Networks. IEEE Transactions on Pattern Analysis and Machine Intelligence, 44(4), 2228-2242. |
MLA | Lu, Xiankai,et al."Zero-Shot Video Object Segmentation with Co-Attention Siamese Networks".IEEE Transactions on Pattern Analysis and Machine Intelligence 44.4(2020):2228-2242. |
Files in This Item: | There are no files associated with this item. |
Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.
Edit Comment