Residential Collegefalse
Status已發表Published
Image Recognition based on Multi-scale Feature Fusion Transformer
Zhu, Zhefeng1; Qi, Ke1; Chen, Wenbin1; Zhou, Yicong2; Li, Peiyue1; Liu, Zhenxian1
2022
Conference Name2022 IEEE International Conference on Artificial Intelligence and Computer Applications (ICAICA)
Source Publication2022 IEEE International Conference on Artificial Intelligence and Computer Applications, ICAICA 2022
Pages7-13
Conference Date2022/06/24-2022/06/26
Conference PlaceDalian, China
Abstract

Aiming at the problem that image recognition based on transformers has low image recognition rate due to ignoring local information of image blocks, an image recognition framework based on multi-scale Feature Fusion Transformer (FFT) is proposed, where the FFT block is designed to fuse feature information of different scales, and the residual attention module is introduced to emphasize feature channels and feature regions of interest. The FFT framework not only avoids the problem of vision transformer internal structure and local information loss of image feature blocks but also captures richer detailed features, which effectively improves the image recognition rate. A large number of experiments are performed on common image recognition datasets Tiny-ImageNet, CIFAR-10 and CIFAR-100, and the recognition accuracy can reach 57.81%, 82.04% and 56.98%, respectively, which are significantly higher than the mainstream image recognition algorithms.

KeywordFft Block Image Recognition Multi-scale Feature Fusion Vision Transformer
DOI10.1109/ICAICA54878.2022.9844458
URLView the original
Language英語English
Scopus ID2-s2.0-85136705759
Fulltext Access
Citation statistics
Document TypeConference paper
CollectionDEPARTMENT OF COMPUTER AND INFORMATION SCIENCE
Faculty of Science and Technology
Corresponding AuthorQi, Ke
Affiliation1.Guangzhou University, School of Computer Science and Cyber Engineering, Guangzhou, China
2.University of Macau, Department of Computer and Information Science, Macao
Recommended Citation
GB/T 7714
Zhu, Zhefeng,Qi, Ke,Chen, Wenbin,et al. Image Recognition based on Multi-scale Feature Fusion Transformer[C], 2022, 7-13.
APA Zhu, Zhefeng., Qi, Ke., Chen, Wenbin., Zhou, Yicong., Li, Peiyue., & Liu, Zhenxian (2022). Image Recognition based on Multi-scale Feature Fusion Transformer. 2022 IEEE International Conference on Artificial Intelligence and Computer Applications, ICAICA 2022, 7-13.
Files in This Item:
There are no files associated with this item.
Related Services
Recommend this item
Bookmark
Usage statistics
Export to Endnote
Google Scholar
Similar articles in Google Scholar
[Zhu, Zhefeng]'s Articles
[Qi, Ke]'s Articles
[Chen, Wenbin]'s Articles
Baidu academic
Similar articles in Baidu academic
[Zhu, Zhefeng]'s Articles
[Qi, Ke]'s Articles
[Chen, Wenbin]'s Articles
Bing Scholar
Similar articles in Bing Scholar
[Zhu, Zhefeng]'s Articles
[Qi, Ke]'s Articles
[Chen, Wenbin]'s Articles
Terms of Use
No data!
Social Bookmark/Share
All comments (0)
No comment.
 

Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.