UM  > Faculty of Science and Technology
Residential Collegefalse
Status已發表Published
SCNet: Scale-aware coupling-structure network for efficient video object detection
Wang,Fengchao1; Xu,Zhewei1; Gan,Yanfen2; Vong,Chi Man3; Liu,Qiong1
2020-09
Source PublicationNEUROCOMPUTING
ISSN0925-2312
Volume404Pages:283-293
Abstract

In recent years, outstanding image-based object detectors were extended for video object detection. Such extension requires to address two challenging problems in videos, namely, scale variation and deformation anomaly. Most image-based works deal with scale variation by resizing the input image for multi-scale training and testing. Such way incurs very high computational cost, which becomes even higher in videos. For deformation anomaly, low-quality RoI feature can be caused by motion blur, video defocus and rare poses, resulting to poor detection accuracy in videos. In this paper, an end-to-end scale-aware coupling-structure network (SCNet) focusing on the two issues is proposed for video object detection with high accuracy and affordable computation. In SCNet, a lightweight scale-aware module is structured to flexibly model the object scale variation, which mainly consists of a set of dilated convolutional layers with parameter constraint. Furthermore, a coupling-structure RoI (region of interest) module is designed to extract robust RoI feature with position-sensitive and context-sensitive information for accuracy improvement. Besides, the feature aggregation strategy is simplified for efficiency. Experiments are conducted on the ImageNet VID dataset. SCNet achieves the state-of-the-art detection performance, exactly 79.5% mAP, with 5.9 points improvement compared to the strong single-frame baseline.

KeywordVideo Object Detection Scale Variation Deformation Anomaly Scale-aware Module Coupling-structure Roi Module
DOI10.1016/j.neucom.2020.03.110
URLView the original
Indexed BySCIE
Language英語English
WOS Research AreaComputer Science
WOS SubjectComputer Science, Artificial Intelligence
WOS IDWOS:000542634100010
PublisherELSEVIERRADARWEG 29, 1043 NX AMSTERDAM, NETHERLANDS
The Source to ArticlePB_Publication
Scopus ID2-s2.0-85084942233
Fulltext Access
Citation statistics
Document TypeJournal article
CollectionFaculty of Science and Technology
DEPARTMENT OF COMPUTER AND INFORMATION SCIENCE
Corresponding AuthorLiu,Qiong
Affiliation1.South China University of Technology, Guangzhou, 510006, China
2.South China Business College Guangdong University of Foreign Studies, Guangzhou, 510545, China
3.University of Macau, Macau, 999078, China
Recommended Citation
GB/T 7714
Wang,Fengchao,Xu,Zhewei,Gan,Yanfen,et al. SCNet: Scale-aware coupling-structure network for efficient video object detection[J]. NEUROCOMPUTING, 2020, 404, 283-293.
APA Wang,Fengchao., Xu,Zhewei., Gan,Yanfen., Vong,Chi Man., & Liu,Qiong (2020). SCNet: Scale-aware coupling-structure network for efficient video object detection. NEUROCOMPUTING, 404, 283-293.
MLA Wang,Fengchao,et al."SCNet: Scale-aware coupling-structure network for efficient video object detection".NEUROCOMPUTING 404(2020):283-293.
Files in This Item:
There are no files associated with this item.
Related Services
Recommend this item
Bookmark
Usage statistics
Export to Endnote
Google Scholar
Similar articles in Google Scholar
[Wang,Fengchao]'s Articles
[Xu,Zhewei]'s Articles
[Gan,Yanfen]'s Articles
Baidu academic
Similar articles in Baidu academic
[Wang,Fengchao]'s Articles
[Xu,Zhewei]'s Articles
[Gan,Yanfen]'s Articles
Bing Scholar
Similar articles in Bing Scholar
[Wang,Fengchao]'s Articles
[Xu,Zhewei]'s Articles
[Gan,Yanfen]'s Articles
Terms of Use
No data!
Social Bookmark/Share
All comments (0)
No comment.
 

Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.