UM  > Faculty of Science and Technology
Residential Collegefalse
Status已發表Published
Fine-Grained Multimodal DeepFake Classification via Heterogeneous Graphs
Yin, Qilin1,2,3; Lu, Wei1,2,3; Cao, Xiaochun4; Luo, Xiangyang5; Zhou, Yicong6; Huang, Jiwu7
2024
Source PublicationInternational Journal of Computer Vision
ISSN0920-5691
Abstract

Nowadays, the abuse of deepfakes is a well-known issue since deepfakes can lead to severe security and privacy problems. And this situation is getting worse, as attackers are no longer limited to unimodal deepfakes, but use multimodal deepfakes, i.e., both audio forgery and video forgery, to better achieve malicious purposes. The existing unimodal or ensemble deepfake detectors are demanded with fine-grained classification capabilities for the growing technique on multimodal deepfakes. To address this gap, we propose a graph attention network based on heterogeneous graph for fine-grained multimodal deepfake classification, i.e., not only distinguishing the authenticity of samples, but also identifying the forged types, e.g., video or audio or both. To this end, we propose a positional coding-based heterogeneous graph construction method that converts an audio-visual sample into a multimodal heterogeneous graph according to relevant hyperparameters. Moreover, a cross-modal graph interaction module is devised to utilize audio-visual synchronization patterns for capturing inter-modal complementary information. The de-homogenization graph pooling operation is elaborately designed to keep differences in graph node features for enhancing the representation of graph-level features. Through the heterogeneous graph attention network, we can efficiently model intra- and inter-modal relationships of multimodal data both at spatial and temporal scales. Extensive experimental results on two audio-visual datasets FakeAVCeleb and LAV-DF demonstrate that our proposed model obtains significant performance gains as compared to other state-of-the-art competitors. The code is available at https://github.com/yinql1995/Fine-grained-Multimodal-DeepFake-Classification/.

KeywordAudio-visual Model Graph Neural Networks Heterogeneous Graphs Multimodal Deepfake Classification
DOI10.1007/s11263-024-02128-1
URLView the original
Indexed BySCIE
Language英語English
WOS Research AreaComputer Science
WOS SubjectComputer Science, Artificial Intelligence
WOS IDWOS:001242312700002
PublisherSPRINGER, VAN GODEWIJCKSTRAAT 30, 3311 GZ DORDRECHT, NETHERLANDS
Scopus ID2-s2.0-85195317870
Fulltext Access
Citation statistics
Document TypeJournal article
CollectionFaculty of Science and Technology
DEPARTMENT OF COMPUTER AND INFORMATION SCIENCE
Corresponding AuthorLu, Wei
Affiliation1.School of Computer Science and Engineering, Sun Yat-sen University, Guangzhou, 510006, China
2.Ministry of Education Key Laboratory of Information Technology, Sun Yat-sen University, Guangzhou, 510006, China
3.Guangdong Province Key Laboratory of Information Security Technology, Sun Yat-sen University, Guangzhou, 510006, China
4.School of Cyber Science and Technology, Sun Yat-sen University, Shenzhen, 518107, China
5.State Key Laboratory of Mathematical Engineering and Advanced Computing, Henan, 450001, China
6.Department of Computer and Information Science, University of Macau, Macao
7.Guangdong Laboratory of Machine Perception and Intelligent Computing, Faculty of Engineering, Shenzhen MSU-BIT University, Shenzhen, 518116, China
Recommended Citation
GB/T 7714
Yin, Qilin,Lu, Wei,Cao, Xiaochun,et al. Fine-Grained Multimodal DeepFake Classification via Heterogeneous Graphs[J]. International Journal of Computer Vision, 2024.
APA Yin, Qilin., Lu, Wei., Cao, Xiaochun., Luo, Xiangyang., Zhou, Yicong., & Huang, Jiwu (2024). Fine-Grained Multimodal DeepFake Classification via Heterogeneous Graphs. International Journal of Computer Vision.
MLA Yin, Qilin,et al."Fine-Grained Multimodal DeepFake Classification via Heterogeneous Graphs".International Journal of Computer Vision (2024).
Files in This Item:
There are no files associated with this item.
Related Services
Recommend this item
Bookmark
Usage statistics
Export to Endnote
Google Scholar
Similar articles in Google Scholar
[Yin, Qilin]'s Articles
[Lu, Wei]'s Articles
[Cao, Xiaochun]'s Articles
Baidu academic
Similar articles in Baidu academic
[Yin, Qilin]'s Articles
[Lu, Wei]'s Articles
[Cao, Xiaochun]'s Articles
Bing Scholar
Similar articles in Bing Scholar
[Yin, Qilin]'s Articles
[Lu, Wei]'s Articles
[Cao, Xiaochun]'s Articles
Terms of Use
No data!
Social Bookmark/Share
All comments (0)
No comment.
 

Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.