UM  > INSTITUTE OF MICROELECTRONICS
Residential Collegefalse
Status已發表Published
Vision Transformer based UNet with Multi-Head Attention for Medical Image Segmentation
Yu, Huina1; Gao, Lingyan1; Yu, Huitao1; Zhang, Anguo2
2024-07
Conference Name2024 36th Chinese Control and Decision Conference (CCDC)
Source PublicationProceedings of the 36th Chinese Control and Decision Conference, CCDC 2024
Pages1737-1741
Conference Date25-27 May 2024
Conference PlaceXi'an, China
CountryChina
PublisherInstitute of Electrical and Electronics Engineers Inc.
Abstract

In recent years, deep learning models, such as the UNet architecture, have shown great potential in medical image segmentation tasks. However, these models often require large amounts of annotated data to achieve high performance. In this paper, we propose a novel approach that combines the UNet architecture with the Vision Transformer with Multi-Head Attention model (MAViT-UNet) for medical image segmentation. Our method leverages the ability of ViT to capture long-range dependencies in images, while still benefitting from the UNet's skip connections for local feature extraction. Through extensive experiments on various medical datasets, we demonstrate that our proposed Vision Transformer based UNet achieves state-of-the-art performance in medical image segmentation tasks, even with limited annotated data. The results suggest the potential of transformer-based models in the field of medical image analysis.

KeywordDeep Learning Medical Image Segmentation Self-attention Unet Vision Transformer
DOI10.1109/CCDC62350.2024.10587821
URLView the original
Language英語English
Scopus ID2-s2.0-85200345641
Fulltext Access
Citation statistics
Document TypeConference paper
CollectionINSTITUTE OF MICROELECTRONICS
Affiliation1.First Affiliated Hospital Of Xiamen University, Xiamen, China
2.University Of Macau, Institute Of Microelectronics, Macao
Recommended Citation
GB/T 7714
Yu, Huina,Gao, Lingyan,Yu, Huitao,et al. Vision Transformer based UNet with Multi-Head Attention for Medical Image Segmentation[C]:Institute of Electrical and Electronics Engineers Inc., 2024, 1737-1741.
APA Yu, Huina., Gao, Lingyan., Yu, Huitao., & Zhang, Anguo (2024). Vision Transformer based UNet with Multi-Head Attention for Medical Image Segmentation. Proceedings of the 36th Chinese Control and Decision Conference, CCDC 2024, 1737-1741.
Files in This Item:
There are no files associated with this item.
Related Services
Recommend this item
Bookmark
Usage statistics
Export to Endnote
Google Scholar
Similar articles in Google Scholar
[Yu, Huina]'s Articles
[Gao, Lingyan]'s Articles
[Yu, Huitao]'s Articles
Baidu academic
Similar articles in Baidu academic
[Yu, Huina]'s Articles
[Gao, Lingyan]'s Articles
[Yu, Huitao]'s Articles
Bing Scholar
Similar articles in Bing Scholar
[Yu, Huina]'s Articles
[Gao, Lingyan]'s Articles
[Yu, Huitao]'s Articles
Terms of Use
No data!
Social Bookmark/Share
All comments (0)
No comment.
 

Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.