UM  > Faculty of Science and Technology
Residential Collegefalse
Status已發表Published
Orthogonalization-Guided Feature Fusion Network for Multimodal 2D+3D Facial Expression Recognition
Lin, Shisong1,2; Bai, Mengchao1,2; Liu, Feng1,2; Shen, Linlin1,2,3; Zhou, Yicong3
2020-06-11
Source PublicationIEEE Transactions on Multimedia
ISSN1520-9210
Volume23Pages:1581-1591
Abstract

As 2D and 3D data present different views of the same face, the features extracted from them can be both complementary and redundant. In this paper, we present a novel and efficient orthogonalization-guided feature fusion network, namely OGF^2Net, to fuse the features extracted from 2D and 3D faces for facial expression recognition. While 2D texture maps are fed into a 2D feature extraction pipeline (FE2DNet), the attribute maps generated from 3D data are concatenated as input of the 3D feature extraction pipeline (FE3DNet). The two networks are separately trained at the first stage and frozen in the second stage for late feature fusion, which can well address the unavailability of a large number of 3D+2D face pairs. To reduce the redundancies among features extracted from 2D and 3D streams, we design an orthogonal loss-guided feature fusion network to orthogonalize the features before fusing them. Experimental results show that the proposed method significantly outperforms the state-of-the-art algorithms on both the BU-3DFE and Bosphorus databases. While accuracies as high as 89.05% (P1 protocol) and 89.07% (P2 protocol) are achieved on the BU-3DFE database, an accuracy of 89.28% is achieved on the Bosphorus database. The complexity analysis also suggests that our approach achieves a higher processing speed while simultaneously requiring lower memory costs.

KeywordFeature Fusion Multimodal Facial Expression Recognition
DOI10.1109/TMM.2020.3001497
URLView the original
Indexed BySCIE
Language英語English
WOS Research AreaComputer Science ; Telecommunications
WOS SubjectComputer Science, Information systems;Computer Science, Software Engineering;telecommunications
WOS IDWOS:000658333200003
PublisherIEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC445 HOES LANE, PISCATAWAY, NJ 08855-4141
Scopus ID2-s2.0-85104947893
Fulltext Access
Citation statistics
Document TypeJournal article
CollectionFaculty of Science and Technology
Corresponding AuthorShen, Linlin
Affiliation1.Computer Vision Institute, School of Computer Science and Software Engineering, Shenzhen University, Shenzhen, 518060, China
2.Shenzhen Institute of Artificial Intelligence and Robotics for Society, Shenzhen, 518172, China
3.Faculty of Science and Technology, University of Macau, 999078, Macao
Corresponding Author AffilicationFaculty of Science and Technology
Recommended Citation
GB/T 7714
Lin, Shisong,Bai, Mengchao,Liu, Feng,et al. Orthogonalization-Guided Feature Fusion Network for Multimodal 2D+3D Facial Expression Recognition[J]. IEEE Transactions on Multimedia, 2020, 23, 1581-1591.
APA Lin, Shisong., Bai, Mengchao., Liu, Feng., Shen, Linlin., & Zhou, Yicong (2020). Orthogonalization-Guided Feature Fusion Network for Multimodal 2D+3D Facial Expression Recognition. IEEE Transactions on Multimedia, 23, 1581-1591.
MLA Lin, Shisong,et al."Orthogonalization-Guided Feature Fusion Network for Multimodal 2D+3D Facial Expression Recognition".IEEE Transactions on Multimedia 23(2020):1581-1591.
Files in This Item:
There are no files associated with this item.
Related Services
Recommend this item
Bookmark
Usage statistics
Export to Endnote
Google Scholar
Similar articles in Google Scholar
[Lin, Shisong]'s Articles
[Bai, Mengchao]'s Articles
[Liu, Feng]'s Articles
Baidu academic
Similar articles in Baidu academic
[Lin, Shisong]'s Articles
[Bai, Mengchao]'s Articles
[Liu, Feng]'s Articles
Bing Scholar
Similar articles in Bing Scholar
[Lin, Shisong]'s Articles
[Bai, Mengchao]'s Articles
[Liu, Feng]'s Articles
Terms of Use
No data!
Social Bookmark/Share
All comments (0)
No comment.
 

Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.