UM
Residential Collegefalse
Status已發表Published
Clip Fusion with Bi-level Optimization for Human Mesh Reconstruction from Monocular Videos
Wu, Peng1; Lu, Xiankai1; Shen, Jianbing2; Yin, Yilong1
2023-10-27
Conference Name31st ACM International Conference on Multimedia, MM 2023
Source PublicationMM 2023 - Proceedings of the 31st ACM International Conference on Multimedia
Pages105-115
Conference Date2023/10/29-2023/11/03
Conference PlaceOttawa
Abstract

Human mesh reconstruction (HMR) from monocular video is the key step to many mixed reality and robotic applications. Although existing methods show promising results by capturing frames' temporal information, these methods predict human mesh with the design of implicit temporal learning modules in a sequence to frame manner. To mine more temporal information from the video, we present a bi-level clip inference network for HMR, which leverages both local motion and global context explicitly for dense 3D reconstruction. Specifically, we propose a novel bi-level temporal fusion strategy that takes both neighboring and long-range relations into consideration. In addition, different from traditional frame-wise operation, we investigate an alternative perspective by treating video-based HMR as clip-wise inference. We evaluate the proposed method on multiple datasets (3DPW, Human3.6M, and MPI-INF-3DHP) quantitatively and qualitatively, demonstrating a significant improvement over existing methods (in terms of PA-MPJPE, ACC-Error etc). Furthermore, we extend the proposed method on more challenging Multiple Shots HMR task to demonstrate its generalizability. Some visual demos can be seen https://github.com/bicf0/bicf-demo.

KeywordBi-level Optimization Clip-wise Inference Human Mesh Reconstruction
DOI10.1145/3581783.3611978
URLView the original
Language英語English
Scopus ID2-s2.0-85179547893
Fulltext Access
Citation statistics
Document TypeConference paper
CollectionUniversity of Macau
THE STATE KEY LABORATORY OF INTERNET OF THINGS FOR SMART CITY (UNIVERSITY OF MACAU)
Corresponding AuthorLu, Xiankai
Affiliation1.Shandong University, Jinan, China
2.University of Macao, Macao
Recommended Citation
GB/T 7714
Wu, Peng,Lu, Xiankai,Shen, Jianbing,et al. Clip Fusion with Bi-level Optimization for Human Mesh Reconstruction from Monocular Videos[C], 2023, 105-115.
APA Wu, Peng., Lu, Xiankai., Shen, Jianbing., & Yin, Yilong (2023). Clip Fusion with Bi-level Optimization for Human Mesh Reconstruction from Monocular Videos. MM 2023 - Proceedings of the 31st ACM International Conference on Multimedia, 105-115.
Files in This Item:
There are no files associated with this item.
Related Services
Recommend this item
Bookmark
Usage statistics
Export to Endnote
Google Scholar
Similar articles in Google Scholar
[Wu, Peng]'s Articles
[Lu, Xiankai]'s Articles
[Shen, Jianbing]'s Articles
Baidu academic
Similar articles in Baidu academic
[Wu, Peng]'s Articles
[Lu, Xiankai]'s Articles
[Shen, Jianbing]'s Articles
Bing Scholar
Similar articles in Bing Scholar
[Wu, Peng]'s Articles
[Lu, Xiankai]'s Articles
[Shen, Jianbing]'s Articles
Terms of Use
No data!
Social Bookmark/Share
All comments (0)
No comment.
 

Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.