3D Human Pose Lifting with Grid Convolution

doi:10.48550/arXiv.2302.08760

UM > Faculty of Science and Technology > DEPARTMENT OF COMPUTER AND INFORMATION SCIENCE

Residential College	false
Status	已發表Published
	3D Human Pose Lifting with Grid Convolution
	Kang, Yangyuxuan 1,2; Liu, Yuyang 3; Yao, Anbang 4; Wang, Shandong 4; Wu, Enhua1,2,5
	2023-02-17
Conference Name	37th AAAI Conference on Artificial Intelligence
Source Publication	Proceedings of the 37th AAAI Conference on Artificial Intelligence, AAAI 2023
Volume	37
Pages	1105-1113
Conference Date	2023/02/07-2023/02/14
Conference Place	Washington
Publisher	AAAI Press
Abstract	Existing lifting networks for regressing 3D human poses from 2D single-view poses are typically constructed with linear layers based on graph-structured representation learning. In sharp contrast to them, this paper presents Grid Convolution (GridConv), mimicking the wisdom of regular convolution operations in image space. GridConv is based on a novel Semantic Grid Transformation (SGT) which leverages a binary assignment matrix to map the irregular graph-structured human pose onto a regular weave-like grid pose representation joint by joint, enabling layer-wise feature learning with GridConv operations. We provide two ways to implement SGT, including handcrafted and learnable designs. Surprisingly, both designs turn out to achieve promising results and the learnable one is better, demonstrating the great potential of this new lifting representation learning formulation. To improve the ability of GridConv to encode contextual cues, we introduce an attention module over the convolutional kernel, making grid convolution operations input-dependent, spatial-aware and grid-specific. We show that our fully convolutional grid lifting network outperforms state-of-the-art methods with noticeable margins under (1) conventional evaluation on Human3.6M and (2) cross-evaluation on MPI-INF-3DHP. Code is available at https://github.com/OSVAI/GridConv.
DOI	10.48550/arXiv.2302.08760
URL	View the original
Language	英語English
Scopus ID	2-s2.0-85167688105
Fulltext Access	View Full-Text via DOI View Full-Text via Scopus
Citation statistics
Document Type	Conference paper
Collection	DEPARTMENT OF COMPUTER AND INFORMATION SCIENCE
Corresponding Author	Yao, Anbang; Wu, Enhua
Affiliation	1.State Key Laboratory of Computer Science, Institute of Software, Chinese Academy of Sciences, China 2.University of Chinese Academy of Sciences, China 3.Tsinghua University, China 4.Intel Labs China, China 5.Faculty of Science and Technology, University of Macau, Macao
Corresponding Author Affilication	Faculty of Science and Technology
Recommended Citation GB/T 7714	Kang, Yangyuxuan,Liu, Yuyang,Yao, Anbang,et al. 3D Human Pose Lifting with Grid Convolution[C]:AAAI Press, 2023, 1105-1113.
APA	Kang, Yangyuxuan., Liu, Yuyang., Yao, Anbang., Wang, Shandong., & Wu, Enhua (2023). 3D Human Pose Lifting with Grid Convolution. Proceedings of the 37th AAAI Conference on Artificial Intelligence, AAAI 2023, 37, 1105-1113.

Files in This Item:
There are no files associated with this item.

If you have any objections to this item, please fill out the form below and the administrator will contact you as soon as possible.
Content:
Email：	*
Affiliation No.
Verification Code:	Refresh

Any comments and suggestions are welcomed.
Title:	*
Content:
Email：	*
Verification Code:	Refresh