Residential College | false |
Status | 已發表Published |
Simultaneous Segmentation and Classification of Esophageal Lesions Using Attention Gating Pyramid Vision Transformer | |
Ge, Peixuan1,2; Yan, Tao1,3; Wong, Pak Kin1; Li, Zheng4; Chan, In Neng1; Yu, Hon Ho5; Chan, Chon In5; Yao, Liang6; Hu, Ying2; Gao, Shan4 | |
2024-11 | |
Source Publication | IEEE Transactions on Emerging Topics in Computational Intelligence |
ISSN | 2471-285X |
Abstract | Automatic and accurate segmentation and classification of esophageal lesions are two essential tasks to assist endoscopists in Upper Gastrointestinal Endoscopy. However, there is no intelligent system that can diagnose more lesion types, handle multiple tasks simultaneously, and be more accurate in clinical work. Therefore, we present an innovative Multi-Task deep learning architecture named Attention Gating Pyramid Vision Transformer (AGPVT), which provides a solution for the accurate classification and precise segmentation of lesion types and regions simultaneously. The proposed AGPVT combines the benefits of cutting-edge deep learning model designs with Multi-Task Learning (MTL) in order to advance the field. Furthermore, a patch-wise multi-head attention gating method alongside a hybrid design MTL decoder, is employed as the core driving architecture of the AGPVT. Comprehensive experiments are conducted on a multicenter dataset which contains esophageal cancer, Barrett's esophagus, esophageal protruded lesions, esophagitis, and normal esophagus. Experimental results show that the proposed AGPVT achieves a classification accuracy of 96.84%, an IoU score of 85.61%, and a Dice score of 90.75%, outperforming existing methods and demonstrating its effectiveness in this domain. |
Keyword | Medical Image Classification Medical Image Segmentation Multi-task Learning Esophageal Lesion Transformer |
DOI | 10.1109/TETCI.2024.3485704 |
URL | View the original |
Indexed By | SCIE |
Language | 英語English |
WOS Research Area | Computer Science |
WOS Subject | Computer Science, Artificial Intelligence |
WOS ID | WOS:001351505600001 |
Publisher | IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC, 445 HOES LANE, PISCATAWAY, NJ 08855-4141 |
Scopus ID | 2-s2.0-85208756915 |
Fulltext Access | |
Citation statistics | |
Document Type | Journal article |
Collection | Faculty of Science and Technology DEPARTMENT OF ELECTROMECHANICAL ENGINEERING |
Corresponding Author | Wong, Pak Kin; Hu, Ying; Gao, Shan |
Affiliation | 1.University of Macau, Department of Electromechanical Engineering, Taipa, 999078, Macao 2.Chinese Academy of Sciences, Shenzhen Institutes of Advanced Technology, Shenzhen, Guangdong, 518055, China 3.Hubei University of Arts and Science, School of Mechanical Engineering, Xiangyang, Hubei, 441053, China 4.Affiliated Hospital of Hubei University of Arts and Science, Xiangyang Central Hospital, Xiangyang, Hubei, 441021, China 5.Kiang Wu Hospital, Macao, 999078, Macao 6.The Chinese University of Hong Kong, Department of Computer Science and Engineering, Hong Kong, 999077, Hong Kong |
First Author Affilication | University of Macau |
Corresponding Author Affilication | University of Macau |
Recommended Citation GB/T 7714 | Ge, Peixuan,Yan, Tao,Wong, Pak Kin,et al. Simultaneous Segmentation and Classification of Esophageal Lesions Using Attention Gating Pyramid Vision Transformer[J]. IEEE Transactions on Emerging Topics in Computational Intelligence, 2024. |
APA | Ge, Peixuan., Yan, Tao., Wong, Pak Kin., Li, Zheng., Chan, In Neng., Yu, Hon Ho., Chan, Chon In., Yao, Liang., Hu, Ying., & Gao, Shan (2024). Simultaneous Segmentation and Classification of Esophageal Lesions Using Attention Gating Pyramid Vision Transformer. IEEE Transactions on Emerging Topics in Computational Intelligence. |
MLA | Ge, Peixuan,et al."Simultaneous Segmentation and Classification of Esophageal Lesions Using Attention Gating Pyramid Vision Transformer".IEEE Transactions on Emerging Topics in Computational Intelligence (2024). |
Files in This Item: | There are no files associated with this item. |
Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.
Edit Comment