Residential College | false |
Status | 已發表Published |
Data Generation Scheme for Thermal Modality with Edge-Guided Adversarial Conditional Diffusion Model | |
Zhu, Guoqing1; Pan, Honghu2; Wang, Qiang1; Tian, Chao1; Yang, Chao1; He, Zhenyu1 | |
2024 | |
Conference Name | 32nd ACM International Conference on Multimedia, MM 2024 |
Source Publication | MM 2024 - Proceedings of the 32nd ACM International Conference on Multimedia
![]() |
Pages | 10544-10553 |
Conference Date | 28 October 2024 - 1 November 2024 |
Conference Place | Melbourne |
Country | Australia |
Publication Place | New York, NY, USA |
Publisher | Association for Computing Machinery, Inc |
Abstract | In challenging low-light and adverse weather conditions, thermal vision algorithms, especially object detection, have exhibited remarkable potential, contrasting with the frequent struggles encountered by visible vision algorithms. Nevertheless, the efficacy of thermal vision algorithms driven by deep learning models remains constrained by the paucity of available training data samples. To this end, this paper introduces a novel approach termed the edge-guided conditional diffusion model (ECDM). This framework aims to produce meticulously aligned pseudo thermal images at the pixel level, leveraging edge information extracted from visible images. By utilizing edges as contextual cues from the visible domain, the diffusion model achieves meticulous control over the delineation of objects within the generated images. To alleviate the impacts of those visible-specific edge information that should not appear in the thermal domain, a two-stage modality adversarial training (TMAT) strategy is proposed to filter them out from the generated images by differentiating the visible and thermal modality. Extensive experiments on LLVIP demonstrate ECDM's superiority over existing state-of-the-art approaches in terms of image generation quality. The pseudo thermal images generated by ECDM also help to boost the performance of various thermal object detectors by up to 7.1 mAP. Code is available at https://github.com/lengmo1996/ECDM. |
Keyword | Diffusion Model Thermal Image Generation Thermal Object Detection |
DOI | 10.1145/3664647.3680922 |
URL | View the original |
Language | 英語English |
Scopus ID | 2-s2.0-85209802845 |
Fulltext Access | |
Citation statistics | |
Document Type | Conference paper |
Collection | THE STATE KEY LABORATORY OF INTERNET OF THINGS FOR SMART CITY (UNIVERSITY OF MACAU) |
Affiliation | 1.Harbin Institute of Technology, Shenzhen, Shenzhen, China 2.University of Macau, Macao |
Recommended Citation GB/T 7714 | Zhu, Guoqing,Pan, Honghu,Wang, Qiang,et al. Data Generation Scheme for Thermal Modality with Edge-Guided Adversarial Conditional Diffusion Model[C], New York, NY, USA:Association for Computing Machinery, Inc, 2024, 10544-10553. |
APA | Zhu, Guoqing., Pan, Honghu., Wang, Qiang., Tian, Chao., Yang, Chao., & He, Zhenyu (2024). Data Generation Scheme for Thermal Modality with Edge-Guided Adversarial Conditional Diffusion Model. MM 2024 - Proceedings of the 32nd ACM International Conference on Multimedia, 10544-10553. |
Files in This Item: | There are no files associated with this item. |
Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.
Edit Comment