Residential Collegefalse
Status即將出版Forthcoming
Space-time Reinforcement Network for Video Object Segmentation
Chen, Yadang1; Zhu, Wentao1; Yang, Zhi Xin2; Wu, Enhua3
2024
Conference Name2024 IEEE International Conference on Multimedia and Expo, ICME 2024
Source PublicationProceedings - IEEE International Conference on Multimedia and Expo
Pages203042
Conference Date15 July 2024through 19 July 2024
Conference PlaceNiagra Falls
PublisherIEEE Computer Society
Abstract

Recently, video object segmentation (VOS) networks typically use memory-based methods: for each query frame, the mask is predicted by space-time matching to memory frames. Despite these methods having superior performance, they suffer from two issues: 1) Challenging data can destroy the space-time coherence between adjacent video frames. 2) Pixel-level matching will lead to undesired mismatching caused by the noises or distractors. To address the aforementioned issues, we first propose to generate an auxiliary frame between adjacent frames, serving as an implicit short-temporal reference for the query one. Next, we learn a prototype for each video object and prototype-level matching can be implemented between the query and memory. The experiment demonstrated that our network outperforms the state-of-the-art method on the DAVIS 2017, achieving a ℐ&ℱ score of 86.4%, and attains a competitive result 85.0% on YouTube VOS 2018. In addition, our network exhibits a high inference speed of 32+ FPS.

KeywordAuxiliary Frame Memory-based Methods Prototype Learning Video Object Segmentation
DOI10.1109/ICME57554.2024.10687407
URLView the original
Language英語English
Scopus ID2-s2.0-85206566288
Fulltext Access
Citation statistics
Document TypeConference paper
CollectionTHE STATE KEY LABORATORY OF INTERNET OF THINGS FOR SMART CITY (UNIVERSITY OF MACAU)
DEPARTMENT OF ELECTROMECHANICAL ENGINEERING
Affiliation1.Nanjing University of Information Science and Technology, School of Computer Science, Nanjing, China
2.University of Macau, State Key Laboratory of Internet of Things for Smart City, Macao
3.Chinese Academy of Sciences, Key Laboratory of System Software, State Key Laboratory of Computer Science Institute of Software, Beijing, China
Recommended Citation
GB/T 7714
Chen, Yadang,Zhu, Wentao,Yang, Zhi Xin,et al. Space-time Reinforcement Network for Video Object Segmentation[C]:IEEE Computer Society, 2024, 203042.
APA Chen, Yadang., Zhu, Wentao., Yang, Zhi Xin., & Wu, Enhua (2024). Space-time Reinforcement Network for Video Object Segmentation. Proceedings - IEEE International Conference on Multimedia and Expo, 203042.
Files in This Item:
There are no files associated with this item.
Related Services
Recommend this item
Bookmark
Usage statistics
Export to Endnote
Google Scholar
Similar articles in Google Scholar
[Chen, Yadang]'s Articles
[Zhu, Wentao]'s Articles
[Yang, Zhi Xin]'s Articles
Baidu academic
Similar articles in Baidu academic
[Chen, Yadang]'s Articles
[Zhu, Wentao]'s Articles
[Yang, Zhi Xin]'s Articles
Bing Scholar
Similar articles in Bing Scholar
[Chen, Yadang]'s Articles
[Zhu, Wentao]'s Articles
[Yang, Zhi Xin]'s Articles
Terms of Use
No data!
Social Bookmark/Share
All comments (0)
No comment.
 

Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.