Status | Published |
Image Recognition based on Multi-scale Feature Fusion Transformer | |
Zhu, Zhefeng1; Qi, Ke1 | |
2022 | |
Conference Name | 2022 IEEE International Conference on Artificial Intelligence and Computer Applications (ICAICA) |
Source Publication | 2022 IEEE International Conference on Artificial Intelligence and Computer Applications, ICAICA 2022 |
Pages | 7-13 |
Conference Date | 2022/06/24-2022/06/26 |
Conference Place | Dalian, China |
Abstract | Transformer-based image recognition often suffers from a low recognition rate because it ignores the local information of image blocks. To address this problem, an image recognition framework based on a multi-scale Feature Fusion Transformer (FFT) is proposed, in which the FFT block fuses feature information of different scales and a residual attention module emphasizes feature channels and feature regions of interest. The FFT framework avoids the loss of internal structure and local information of image feature blocks that affects the vision transformer, and it also captures richer detailed features, which effectively improves the image recognition rate. Extensive experiments on the common image recognition datasets Tiny-ImageNet, CIFAR-10 and CIFAR-100 show recognition accuracies of up to 57.81%, 82.04% and 56.98%, respectively, which are significantly higher than those of mainstream image recognition algorithms. |
Keyword | FFT Block; Image Recognition; Multi-scale Feature Fusion; Vision Transformer |
DOI | 10.1109/ICAICA54878.2022.9844458 |
Language | English |
Scopus ID | 2-s2.0-85136705759 |
Document Type | Conference paper |
Collection | Department of Computer and Information Science; Faculty of Science and Technology |
Corresponding Author | Qi, Ke |
Affiliation | 1. Guangzhou University, School of Computer Science and Cyber Engineering, Guangzhou, China; 2. University of Macau, Department of Computer and Information Science, Macao |
Recommended Citation GB/T 7714 | Zhu, Zhefeng, Qi, Ke, Chen, Wenbin, et al. Image Recognition based on Multi-scale Feature Fusion Transformer[C], 2022, 7-13. |
APA | Zhu, Zhefeng., Qi, Ke., Chen, Wenbin., Zhou, Yicong., Li, Peiyue., & Liu, Zhenxian (2022). Image Recognition based on Multi-scale Feature Fusion Transformer. 2022 IEEE International Conference on Artificial Intelligence and Computer Applications, ICAICA 2022, 7-13. |