Vision Transformer based UNet with Multi-Head Attention for Medical Image Segmentation

doi:10.1109/CCDC62350.2024.10587821

Status	Published
Title	Vision Transformer based UNet with Multi-Head Attention for Medical Image Segmentation
Author	Yu, Huina 1; Gao, Lingyan 1; Yu, Huitao 1; Zhang, Anguo 2
Issue Date	2024-07
Conference Name	2024 36th Chinese Control and Decision Conference (CCDC)
Source Publication	Proceedings of the 36th Chinese Control and Decision Conference, CCDC 2024
Pages	1737-1741
Conference Date	25-27 May 2024
Conference Place	Xi'an, China
Country	China
Publisher	Institute of Electrical and Electronics Engineers Inc.
Abstract	In recent years, deep learning models, such as the UNet architecture, have shown great potential in medical image segmentation tasks. However, these models often require large amounts of annotated data to achieve high performance. In this paper, we propose a novel approach that combines the UNet architecture with the Vision Transformer with Multi-Head Attention model (MAViT-UNet) for medical image segmentation. Our method leverages the ability of ViT to capture long-range dependencies in images, while still benefitting from the UNet's skip connections for local feature extraction. Through extensive experiments on various medical datasets, we demonstrate that our proposed Vision Transformer based UNet achieves state-of-the-art performance in medical image segmentation tasks, even with limited annotated data. The results suggest the potential of transformer-based models in the field of medical image analysis.
Keyword	Deep Learning; Medical Image Segmentation; Self-attention; UNet; Vision Transformer
DOI	10.1109/CCDC62350.2024.10587821
Language	English
Scopus ID	2-s2.0-85200345641
Document Type	Conference paper
Collection	INSTITUTE OF MICROELECTRONICS
Affiliation	1. First Affiliated Hospital of Xiamen University, Xiamen, China; 2. University of Macau, Institute of Microelectronics, Macao
Recommended Citation GB/T 7714	YU H, GAO L, YU H, et al. Vision Transformer based UNet with Multi-Head Attention for Medical Image Segmentation[C]//Proceedings of the 36th Chinese Control and Decision Conference, CCDC 2024. Institute of Electrical and Electronics Engineers Inc., 2024: 1737-1741.
APA	Yu, H., Gao, L., Yu, H., & Zhang, A. (2024). Vision Transformer based UNet with Multi-Head Attention for Medical Image Segmentation. Proceedings of the 36th Chinese Control and Decision Conference, CCDC 2024, 1737-1741.
