Residential College | false |
Status | 即將出版Forthcoming |
MedPrompt: Cross-modal Prompting for Multi-task Medical Image Translation | |
Chen, Xuhang1,2,3; Luo, Shenghong2; Pun, Chi Man2![]() | |
2025 | |
Conference Name | 7th Chinese Conference on Pattern Recognition and Computer Vision, PRCV 2024 |
Source Publication | Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
![]() |
Volume | 15044 LNCS |
Pages | 61-75 |
Conference Date | 18 October 2024 to 20 October 2024 |
Conference Place | Urumqi; China |
Publisher | Springer Science and Business Media Deutschland GmbH |
Abstract | The ability to translate medical images across different modalities is crucial for synthesizing missing data and aiding in clinical diagnosis. However, existing learning-based techniques have limitations when it comes to capturing cross-modal and global features. These techniques are often tailored to specific pairs of modalities, limiting their practical utility, especially considering the variability of missing modalities in different cases. In this study, we introduce MedPrompt, a multi-task framework designed to efficiently translate diverse modalities. Our framework incorporates the Self-adaptive Prompt Block, which dynamically guides the translation network to handle different modalities effectively. To encode the cross-modal prompt efficiently, we introduce the Prompt Extraction Block and the Prompt Fusion Block. Additionally, we leverage the Transformer model to enhance the extraction of global features across various modalities. Through extensive experimentation involving five datasets and four pairs of modalities, we demonstrate that our proposed model achieves state-of-the-art visual quality and exhibits excellent generalization capability. The results highlight the effectiveness and versatility of MedPrompt in addressing the challenges associated with cross-modal medical image translation. |
Keyword | Medical Image Translation Vision Transformer Visual Prompting |
DOI | 10.1007/978-981-97-8496-7_5 |
URL | View the original |
Language | 英語English |
Scopus ID | 2-s2.0-85209364062 |
Fulltext Access | |
Citation statistics | |
Document Type | Conference paper |
Collection | DEPARTMENT OF COMPUTER AND INFORMATION SCIENCE |
Affiliation | 1.Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, Shenzhen, China 2.University of Macau, Macao 3.Huizhou University, Huizhou, China |
First Author Affilication | University of Macau |
Recommended Citation GB/T 7714 | Chen, Xuhang,Luo, Shenghong,Pun, Chi Man,et al. MedPrompt: Cross-modal Prompting for Multi-task Medical Image Translation[C]:Springer Science and Business Media Deutschland GmbH, 2025, 61-75. |
APA | Chen, Xuhang., Luo, Shenghong., Pun, Chi Man., & Wang, Shuqiang (2025). MedPrompt: Cross-modal Prompting for Multi-task Medical Image Translation. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 15044 LNCS, 61-75. |
Files in This Item: | There are no files associated with this item. |
Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.
Edit Comment