Residential Collegefalse
Status即將出版Forthcoming
MedPrompt: Cross-modal Prompting for Multi-task Medical Image Translation
Chen, Xuhang1,2,3; Luo, Shenghong2; Pun, Chi Man2; Wang, Shuqiang1
2025
Conference Name7th Chinese Conference on Pattern Recognition and Computer Vision, PRCV 2024
Source PublicationLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume15044 LNCS
Pages61-75
Conference Date18 October 2024 to 20 October 2024
Conference PlaceUrumqi; China
PublisherSpringer Science and Business Media Deutschland GmbH
Abstract

The ability to translate medical images across different modalities is crucial for synthesizing missing data and aiding in clinical diagnosis. However, existing learning-based techniques have limitations when it comes to capturing cross-modal and global features. These techniques are often tailored to specific pairs of modalities, limiting their practical utility, especially considering the variability of missing modalities in different cases. In this study, we introduce MedPrompt, a multi-task framework designed to efficiently translate diverse modalities. Our framework incorporates the Self-adaptive Prompt Block, which dynamically guides the translation network to handle different modalities effectively. To encode the cross-modal prompt efficiently, we introduce the Prompt Extraction Block and the Prompt Fusion Block. Additionally, we leverage the Transformer model to enhance the extraction of global features across various modalities. Through extensive experimentation involving five datasets and four pairs of modalities, we demonstrate that our proposed model achieves state-of-the-art visual quality and exhibits excellent generalization capability. The results highlight the effectiveness and versatility of MedPrompt in addressing the challenges associated with cross-modal medical image translation.

KeywordMedical Image Translation Vision Transformer Visual Prompting
DOI10.1007/978-981-97-8496-7_5
URLView the original
Language英語English
Scopus ID2-s2.0-85209364062
Fulltext Access
Citation statistics
Document TypeConference paper
CollectionDEPARTMENT OF COMPUTER AND INFORMATION SCIENCE
Affiliation1.Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, Shenzhen, China
2.University of Macau, Macao
3.Huizhou University, Huizhou, China
First Author AffilicationUniversity of Macau
Recommended Citation
GB/T 7714
Chen, Xuhang,Luo, Shenghong,Pun, Chi Man,et al. MedPrompt: Cross-modal Prompting for Multi-task Medical Image Translation[C]:Springer Science and Business Media Deutschland GmbH, 2025, 61-75.
APA Chen, Xuhang., Luo, Shenghong., Pun, Chi Man., & Wang, Shuqiang (2025). MedPrompt: Cross-modal Prompting for Multi-task Medical Image Translation. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 15044 LNCS, 61-75.
Files in This Item:
There are no files associated with this item.
Related Services
Recommend this item
Bookmark
Usage statistics
Export to Endnote
Google Scholar
Similar articles in Google Scholar
[Chen, Xuhang]'s Articles
[Luo, Shenghong]'s Articles
[Pun, Chi Man]'s Articles
Baidu academic
Similar articles in Baidu academic
[Chen, Xuhang]'s Articles
[Luo, Shenghong]'s Articles
[Pun, Chi Man]'s Articles
Bing Scholar
Similar articles in Bing Scholar
[Chen, Xuhang]'s Articles
[Luo, Shenghong]'s Articles
[Pun, Chi Man]'s Articles
Terms of Use
No data!
Social Bookmark/Share
All comments (0)
No comment.
 

Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.