Residential College | false |
Status | 已發表Published |
Image Recognition Based on Enhanced-Conformer | |
Gong, Runlin1; Qi, Ke1; Zhou, Yicong2; Chen, Wenbin1; Zhang, Jingdong3 | |
2023-01-02 | |
Conference Name | 2022 IEEE 5th International Conference on Automation, Electronics and Electrical Engineering (AUTEEE) |
Source Publication | 2022 IEEE 5th International Conference on Automation, Electronics and Electrical Engineering, AUTEEE 2022 |
Pages | 114-120 |
Conference Date | 2022/11/18-2022/11/20 |
Conference Place | Shenyang, China |
Abstract | Convolutional Neural Networks (CNNs) has always dominated visual recognition tasks, and it is difficult to link distant information in images due to the size limitation of each convolution filter. Vision Transformer (ViT) can capture features at a distance in an image, but lacks the details of local features. Conformer combines the advantages of both using Convolutional Neural Networks (CNNs) and Attention mechanisms in parallel, but it does not take into account the relationship between different samples. Therefore, we propose a new attention calculation method, Extra-Attention, which can effectively learn intra-sample and inter-sample relationships. In order to combine the advantages of CNNs and Attention, in our work, we proposed a new network Enhanced-Conformer (ENC) based on Conformer, in which the attention mechanism adopts a more efficient computational module Inside And Outside Transformer (IAOT), which contains three parallel Attention: Extra-Attention, Self-Attention, External-Attention. Enhanced-Conformer (ENC) can fuse local features, global features and external features at the same time. We conduct experiments on the commonly used image recognition datasets, the recognition accuracies reach 93.29%, 68.70% and 57.63% on Tiny- ImageNet, CIFAR-10 and CIFAR-100, respectively. |
Keyword | Cnns Extra-attention Image Recognition Transformer |
DOI | 10.1109/AUTEEE56487.2022.9994542 |
URL | View the original |
Language | 英語English |
Scopus ID | 2-s2.0-85146719884 |
Fulltext Access | |
Citation statistics | |
Document Type | Conference paper |
Collection | DEPARTMENT OF COMPUTER AND INFORMATION SCIENCE Faculty of Science and Technology |
Corresponding Author | Qi, Ke |
Affiliation | 1.School of Computer Science and Cyber Engineering, Guangzhou University, Guangzhou, China 2.Department of Computer and Information Science University of Macau Macau, China 3.Aberdeen Institute of Data Science and Artificial Intelligence, South China Normal University, Guangzhou, China |
Recommended Citation GB/T 7714 | Gong, Runlin,Qi, Ke,Zhou, Yicong,et al. Image Recognition Based on Enhanced-Conformer[C], 2023, 114-120. |
APA | Gong, Runlin., Qi, Ke., Zhou, Yicong., Chen, Wenbin., & Zhang, Jingdong (2023). Image Recognition Based on Enhanced-Conformer. 2022 IEEE 5th International Conference on Automation, Electronics and Electrical Engineering, AUTEEE 2022, 114-120. |
Files in This Item: | There are no files associated with this item. |
Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.
Edit Comment