Residential College | false |
Status | 已發表Published |
Fine-Grained Multimodal DeepFake Classification via Heterogeneous Graphs | |
Yin, Qilin1,2,3; Lu, Wei1,2,3; Cao, Xiaochun4; Luo, Xiangyang5; Zhou, Yicong6; Huang, Jiwu7 | |
2024 | |
Source Publication | International Journal of Computer Vision |
ISSN | 0920-5691 |
Abstract | Nowadays, the abuse of deepfakes is a well-known issue since deepfakes can lead to severe security and privacy problems. And this situation is getting worse, as attackers are no longer limited to unimodal deepfakes, but use multimodal deepfakes, i.e., both audio forgery and video forgery, to better achieve malicious purposes. The existing unimodal or ensemble deepfake detectors are demanded with fine-grained classification capabilities for the growing technique on multimodal deepfakes. To address this gap, we propose a graph attention network based on heterogeneous graph for fine-grained multimodal deepfake classification, i.e., not only distinguishing the authenticity of samples, but also identifying the forged types, e.g., video or audio or both. To this end, we propose a positional coding-based heterogeneous graph construction method that converts an audio-visual sample into a multimodal heterogeneous graph according to relevant hyperparameters. Moreover, a cross-modal graph interaction module is devised to utilize audio-visual synchronization patterns for capturing inter-modal complementary information. The de-homogenization graph pooling operation is elaborately designed to keep differences in graph node features for enhancing the representation of graph-level features. Through the heterogeneous graph attention network, we can efficiently model intra- and inter-modal relationships of multimodal data both at spatial and temporal scales. Extensive experimental results on two audio-visual datasets FakeAVCeleb and LAV-DF demonstrate that our proposed model obtains significant performance gains as compared to other state-of-the-art competitors. The code is available at https://github.com/yinql1995/Fine-grained-Multimodal-DeepFake-Classification/. |
Keyword | Audio-visual Model Graph Neural Networks Heterogeneous Graphs Multimodal Deepfake Classification |
DOI | 10.1007/s11263-024-02128-1 |
URL | View the original |
Indexed By | SCIE |
Language | 英語English |
WOS Research Area | Computer Science |
WOS Subject | Computer Science, Artificial Intelligence |
WOS ID | WOS:001242312700002 |
Publisher | SPRINGER, VAN GODEWIJCKSTRAAT 30, 3311 GZ DORDRECHT, NETHERLANDS |
Scopus ID | 2-s2.0-85195317870 |
Fulltext Access | |
Citation statistics | |
Document Type | Journal article |
Collection | Faculty of Science and Technology DEPARTMENT OF COMPUTER AND INFORMATION SCIENCE |
Corresponding Author | Lu, Wei |
Affiliation | 1.School of Computer Science and Engineering, Sun Yat-sen University, Guangzhou, 510006, China 2.Ministry of Education Key Laboratory of Information Technology, Sun Yat-sen University, Guangzhou, 510006, China 3.Guangdong Province Key Laboratory of Information Security Technology, Sun Yat-sen University, Guangzhou, 510006, China 4.School of Cyber Science and Technology, Sun Yat-sen University, Shenzhen, 518107, China 5.State Key Laboratory of Mathematical Engineering and Advanced Computing, Henan, 450001, China 6.Department of Computer and Information Science, University of Macau, Macao 7.Guangdong Laboratory of Machine Perception and Intelligent Computing, Faculty of Engineering, Shenzhen MSU-BIT University, Shenzhen, 518116, China |
Recommended Citation GB/T 7714 | Yin, Qilin,Lu, Wei,Cao, Xiaochun,et al. Fine-Grained Multimodal DeepFake Classification via Heterogeneous Graphs[J]. International Journal of Computer Vision, 2024. |
APA | Yin, Qilin., Lu, Wei., Cao, Xiaochun., Luo, Xiangyang., Zhou, Yicong., & Huang, Jiwu (2024). Fine-Grained Multimodal DeepFake Classification via Heterogeneous Graphs. International Journal of Computer Vision. |
MLA | Yin, Qilin,et al."Fine-Grained Multimodal DeepFake Classification via Heterogeneous Graphs".International Journal of Computer Vision (2024). |
Files in This Item: | There are no files associated with this item. |
Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.
Edit Comment