Video action recognition with Key-detail Motion Capturing based on motion spectrum analysis and multiscale feature fusion

doi:10.1007/s00371-021-02355-4


Residential College	false
Status	Published
	Ganghan Zhang 1; Guoheng Huang 1; Haiyuan Chen 1; Chi-Man Pun 2; Zhiwen Yu 3; Wing-Kuen Ling 4
	2022-01-10
Source Publication	VISUAL COMPUTER
ISSN	0178-2789
Volume	39
Pages	539-556
Abstract	Existing action recognition methods still fall short when most of the video content is redundant, such as clips without any object motion, and when the human actions in the video are complex. There are two reasons: (1) most methods pay little attention to the key-motion information in the video, so irrelevant information is fed into the model; (2) the spatial and temporal information in the video do not interact, which may cause detailed motion information to be lost. In this paper, we propose a Key-detail Motion Capturing Network (K-MCN) to solve these problems. It contains two modules. The first is the Video Key-motion Spectrum Analyzer (VKSA) module, which applies frequency spectrum analysis, filtering and clustering to the video optical flow to extract the key-motion frames. The second is the Multiscale Motion Spatiotemporal Interaction module, which models and fuses, at multiple scales, the spatial and temporal features extracted from the key-motion frames, so that multiscale spatiotemporal information can interact and complement each other. Finally, we conducted extensive experiments on the UCF101, HMDB51 and Something-Something V1 datasets, and the results show that our method outperforms other state-of-the-art methods.
Keyword	Action Recognition; Key Frame Extraction; Multiscale Feature Fusion; Spatiotemporal Feature Pyramid
DOI	10.1007/s00371-021-02355-4
Indexed By	SCIE
Language	English
WOS Research Area	Computer Science
WOS Subject	Computer Science, Software Engineering
WOS ID	WOS:000741634600006
Publisher	SPRINGER, ONE NEW YORK PLAZA, SUITE 4600, NEW YORK, NY 10004, UNITED STATES
Scopus ID	2-s2.0-85122962234
Document Type	Journal article
Collection	Faculty of Science and Technology; DEPARTMENT OF COMPUTER AND INFORMATION SCIENCE
Corresponding Author	Guoheng Huang
Affiliation	1. School of Computers, Guangdong University of Technology, Guangzhou 510006, China; 2. Department of Computer and Information Science, University of Macau, Macau SAR 999078, China; 3. School of Computer Science and Engineering, South China University of Technology, Guangzhou 510006, China; 4. School of Information Engineering, Guangdong University of Technology, Guangzhou 510006
Recommended Citation GB/T 7714	Ganghan Zhang, Guoheng Huang, Haiyuan Chen, et al. Video action recognition with Key-detail Motion Capturing based on motion spectrum analysis and multiscale feature fusion[J]. VISUAL COMPUTER, 2022, 39, 539-556.
APA	Ganghan Zhang, Guoheng Huang, Haiyuan Chen, Chi-Man Pun, Zhiwen Yu, & Wing-Kuen Ling (2022). Video action recognition with Key-detail Motion Capturing based on motion spectrum analysis and multiscale feature fusion. VISUAL COMPUTER, 39, 539-556.
MLA	Ganghan Zhang, et al. "Video action recognition with Key-detail Motion Capturing based on motion spectrum analysis and multiscale feature fusion." VISUAL COMPUTER 39 (2022): 539-556.
