Residential College | false |
Status | 已發表Published |
Enhanced Long-Tailed Recognition With Contrastive CutMix Augmentation | |
Pan, Haolin1; Guo, Yong2; Yu, Mianjie3; Chen, Jian1 | |
2024-07 | |
Source Publication | IEEE Transactions on Image Processing |
ISSN | 1057-7149 |
Volume | 33Pages:4215-4230 |
Abstract | Real-world data often follows a long-tailed distribution, where a few head classes occupy most of the data and a large number of tail classes only contain very limited samples. In practice, deep models often show poor generalization performance on tail classes due to the imbalanced distribution. To tackle this, data augmentation has become an effective way by synthesizing new samples for tail classes. Among them, one popular way is to use CutMix that explicitly mixups the images of tail classes and the others, while constructing the labels according to the ratio of areas cropped from two images. However, the area-based labels entirely ignore the inherent semantic information of the augmented samples, often leading to misleading training signals. To address this issue, we propose a Contrastive CutMix (ConCutMix) that constructs augmented samples with semantically consistent labels to boost the performance of long-tailed recognition. Specifically, we compute the similarities between samples in the semantic space learned by contrastive learning, and use them to rectify the area-based labels. Experiments show that our ConCutMix significantly improves the accuracy on tail classes as well as the overall performance. For example, based on ResNeXt-50, we improve the overall accuracy on ImageNet-LT by 3.0% thanks to the significant improvement of 3.3% on tail classes. We highlight that the improvement also generalizes well to other benchmarks and models. Our code and pretrained models are available at https://github.com/PanHaulin/ConCutMix. |
Keyword | Contrastive Learning Data Augmentation Long-tailed Recognition |
DOI | 10.1109/TIP.2024.3425148 |
URL | View the original |
Indexed By | SCIE |
Language | 英語English |
WOS Research Area | Computer Science ; Engineering |
WOS Subject | Computer Science, Artificial Intelligence ; Engineering, Electrical & Electronic |
WOS ID | WOS:001276392200003 |
Publisher | IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC, 445 HOES LANE, PISCATAWAY, NJ 08855-4141 |
Scopus ID | 2-s2.0-85199115708 |
Fulltext Access | |
Citation statistics | |
Document Type | Journal article |
Collection | Faculty of Science and Technology |
Corresponding Author | Chen, Jian |
Affiliation | 1.South China University of Technology (SCUT), School of Software Engineering, Guangzhou, 510006, China 2.Max Planck Institute for Informatics (MPI-INF), Saarbrücken, 66123, Germany 3.University of Macau (UM), Faculty of Science and Technology, Macao |
Recommended Citation GB/T 7714 | Pan, Haolin,Guo, Yong,Yu, Mianjie,et al. Enhanced Long-Tailed Recognition With Contrastive CutMix Augmentation[J]. IEEE Transactions on Image Processing, 2024, 33, 4215-4230. |
APA | Pan, Haolin., Guo, Yong., Yu, Mianjie., & Chen, Jian (2024). Enhanced Long-Tailed Recognition With Contrastive CutMix Augmentation. IEEE Transactions on Image Processing, 33, 4215-4230. |
MLA | Pan, Haolin,et al."Enhanced Long-Tailed Recognition With Contrastive CutMix Augmentation".IEEE Transactions on Image Processing 33(2024):4215-4230. |
Files in This Item: | There are no files associated with this item. |
Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.
Edit Comment