Fine-Grained Multimodal DeepFake Classification via Heterogeneous Graphs

doi:10.1007/s11263-024-02128-1

UM > Faculty of Science and Technology

Residential College	false
Status	已發表Published
	Fine-Grained Multimodal DeepFake Classification via Heterogeneous Graphs
	Yin, Qilin 1,2,3; Lu, Wei 1,2,3; Cao, Xiaochun 4; Luo, Xiangyang 5; Zhou, Yicong6 ; Huang, Jiwu 7
	2024
Source Publication	International Journal of Computer Vision
ISSN	0920-5691
Abstract	Nowadays, the abuse of deepfakes is a well-known issue since deepfakes can lead to severe security and privacy problems. And this situation is getting worse, as attackers are no longer limited to unimodal deepfakes, but use multimodal deepfakes, i.e., both audio forgery and video forgery, to better achieve malicious purposes. The existing unimodal or ensemble deepfake detectors are demanded with fine-grained classification capabilities for the growing technique on multimodal deepfakes. To address this gap, we propose a graph attention network based on heterogeneous graph for fine-grained multimodal deepfake classification, i.e., not only distinguishing the authenticity of samples, but also identifying the forged types, e.g., video or audio or both. To this end, we propose a positional coding-based heterogeneous graph construction method that converts an audio-visual sample into a multimodal heterogeneous graph according to relevant hyperparameters. Moreover, a cross-modal graph interaction module is devised to utilize audio-visual synchronization patterns for capturing inter-modal complementary information. The de-homogenization graph pooling operation is elaborately designed to keep differences in graph node features for enhancing the representation of graph-level features. Through the heterogeneous graph attention network, we can efficiently model intra- and inter-modal relationships of multimodal data both at spatial and temporal scales. Extensive experimental results on two audio-visual datasets FakeAVCeleb and LAV-DF demonstrate that our proposed model obtains significant performance gains as compared to other state-of-the-art competitors. The code is available at https://github.com/yinql1995/Fine-grained-Multimodal-DeepFake-Classification/.
Keyword	Audio-visual Model Graph Neural Networks Heterogeneous Graphs Multimodal Deepfake Classification
DOI	10.1007/s11263-024-02128-1
URL	View the original
Indexed By	SCIE
Language	英語English
WOS Research Area	Computer Science
WOS Subject	Computer Science, Artificial Intelligence
WOS ID	WOS:001242312700002
Publisher	SPRINGER, VAN GODEWIJCKSTRAAT 30, 3311 GZ DORDRECHT, NETHERLANDS
Scopus ID	2-s2.0-85195317870
Fulltext Access	View Full-Text via DOI View Full-Text via Web of Science View Full-Text via Scopus
Citation statistics
Document Type	Journal article
Collection	Faculty of Science and Technology DEPARTMENT OF COMPUTER AND INFORMATION SCIENCE
Corresponding Author	Lu, Wei
Affiliation	1.School of Computer Science and Engineering, Sun Yat-sen University, Guangzhou, 510006, China 2.Ministry of Education Key Laboratory of Information Technology, Sun Yat-sen University, Guangzhou, 510006, China 3.Guangdong Province Key Laboratory of Information Security Technology, Sun Yat-sen University, Guangzhou, 510006, China 4.School of Cyber Science and Technology, Sun Yat-sen University, Shenzhen, 518107, China 5.State Key Laboratory of Mathematical Engineering and Advanced Computing, Henan, 450001, China 6.Department of Computer and Information Science, University of Macau, Macao 7.Guangdong Laboratory of Machine Perception and Intelligent Computing, Faculty of Engineering, Shenzhen MSU-BIT University, Shenzhen, 518116, China
Recommended Citation GB/T 7714	Yin, Qilin,Lu, Wei,Cao, Xiaochun,et al. Fine-Grained Multimodal DeepFake Classification via Heterogeneous Graphs[J]. International Journal of Computer Vision, 2024.
APA	Yin, Qilin., Lu, Wei., Cao, Xiaochun., Luo, Xiangyang., Zhou, Yicong., & Huang, Jiwu (2024). Fine-Grained Multimodal DeepFake Classification via Heterogeneous Graphs. International Journal of Computer Vision.
MLA	Yin, Qilin,et al."Fine-Grained Multimodal DeepFake Classification via Heterogeneous Graphs".International Journal of Computer Vision (2024).

Files in This Item:
There are no files associated with this item.

If you have any objections to this item, please fill out the form below and the administrator will contact you as soon as possible.
Content:
Email：	*
Affiliation No.
Verification Code:	Refresh

Any comments and suggestions are welcomed.
Title:	*
Content:
Email：	*
Verification Code:	Refresh