MedPrompt: Cross-modal Prompting for Multi-task Medical Image Translation

doi:10.1007/978-981-97-8496-7_5

UM > Faculty of Science and Technology > DEPARTMENT OF COMPUTER AND INFORMATION SCIENCE

Residential College	false
Status	即將出版Forthcoming
	MedPrompt: Cross-modal Prompting for Multi-task Medical Image Translation
	Chen, Xuhang 1,2,3; Luo, Shenghong 2; Pun, Chi Man2 ; Wang, Shuqiang 1
	2025
Conference Name	7th Chinese Conference on Pattern Recognition and Computer Vision, PRCV 2024
Source Publication	Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume	15044 LNCS
Pages	61-75
Conference Date	18 October 2024 to 20 October 2024
Conference Place	Urumqi; China
Publisher	Springer Science and Business Media Deutschland GmbH
Abstract	The ability to translate medical images across different modalities is crucial for synthesizing missing data and aiding in clinical diagnosis. However, existing learning-based techniques have limitations when it comes to capturing cross-modal and global features. These techniques are often tailored to specific pairs of modalities, limiting their practical utility, especially considering the variability of missing modalities in different cases. In this study, we introduce MedPrompt, a multi-task framework designed to efficiently translate diverse modalities. Our framework incorporates the Self-adaptive Prompt Block, which dynamically guides the translation network to handle different modalities effectively. To encode the cross-modal prompt efficiently, we introduce the Prompt Extraction Block and the Prompt Fusion Block. Additionally, we leverage the Transformer model to enhance the extraction of global features across various modalities. Through extensive experimentation involving five datasets and four pairs of modalities, we demonstrate that our proposed model achieves state-of-the-art visual quality and exhibits excellent generalization capability. The results highlight the effectiveness and versatility of MedPrompt in addressing the challenges associated with cross-modal medical image translation.
Keyword	Medical Image Translation Vision Transformer Visual Prompting
DOI	10.1007/978-981-97-8496-7_5
URL	View the original
Language	英語English
Scopus ID	2-s2.0-85209364062
Fulltext Access	View Full-Text via DOI View Full-Text via Scopus
Citation statistics
Document Type	Conference paper
Collection	DEPARTMENT OF COMPUTER AND INFORMATION SCIENCE
Affiliation	1.Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, Shenzhen, China 2.University of Macau, Macao 3.Huizhou University, Huizhou, China
First Author Affilication	University of Macau
Recommended Citation GB/T 7714	Chen, Xuhang,Luo, Shenghong,Pun, Chi Man,et al. MedPrompt: Cross-modal Prompting for Multi-task Medical Image Translation[C]:Springer Science and Business Media Deutschland GmbH, 2025, 61-75.
APA	Chen, Xuhang., Luo, Shenghong., Pun, Chi Man., & Wang, Shuqiang (2025). MedPrompt: Cross-modal Prompting for Multi-task Medical Image Translation. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 15044 LNCS, 61-75.

Files in This Item:
There are no files associated with this item.

If you have any objections to this item, please fill out the form below and the administrator will contact you as soon as possible.
Content:
Email：	*
Affiliation No.
Verification Code:	Refresh

Any comments and suggestions are welcomed.
Title:	*
Content:
Email：	*
Verification Code:	Refresh