UM  > Faculty of Science and Technology
Residential Collegefalse
Status已發表Published
Subspace-based minority oversampling for imbalance classification
Li, Tianjun1,2; Wang, Yingxu3; Liu, Licheng5; Chen, Long4; Chen, C. L.Philip1,2
2023-04
Source PublicationInformation Sciences
ISSN0020-0255
Volume621Pages:371-388
Abstract

In pattern classification, the class imbalance problem always occurs when the number of observations in some classes is significantly different from that of other categories, which leads to the learning bias in the classifiers. One possible solution to this problem is to re-balance the training set by over-sampling the minority class. However, over-samplings always push the classification boundaries to the majority part, thus the recall increases while the precision decreases. To avoid this situation and better handle the class imbalance problem, this paper proposes a new over-sampling method, namely Subspace-based Minority Over-Sampling (abbr. SMO). This approach considers that each category of samples is formed by common and unique characteristics, and such characteristics can be extracted by subspace. To obtain the balanced data, the common part is over-sampled for more accurately depicting the minority, and the unique part can be expanded by some generative methods. The balanced data are obtained by restoring the generated products of the subspace to the original space. The experimental results demonstrate that the SMO has the ability to model complex data distributions and outperforms both classical and newly designed over-sampling algorithms. Also, SMO can be used to generate simple images, and the generation results of MNIST can be clearly identified by both human vision and machine vision.

KeywordClass Imbalance Low-rank Representation Matrix Completion Minority Over-sampling
DOI10.1016/j.ins.2022.11.108
URLView the original
Indexed BySCIE
Language英語English
WOS Research AreaComputer Science
WOS SubjectComputer Science, Information Systems
WOS IDWOS:000901785200001
PublisherELSEVIER SCIENCE INC, STE 800, 230 PARK AVE, NEW YORK, NY 10169
Scopus ID2-s2.0-85143642995
Fulltext Access
Citation statistics
Document TypeJournal article
CollectionFaculty of Science and Technology
DEPARTMENT OF COMPUTER AND INFORMATION SCIENCE
Corresponding AuthorChen, Long
Affiliation1.School of Computer Science and Engineering, South China University of Technology, Guangzhou, 510006, China
2.Brain and Affective Cognitive Research Center, Pazhou Lab, Guangzhou, 510335, China
3.Shandong Provincial Key Laboratory of Network-Based Intelligent Computing, University of Jinan, Jinan, 250022, China
4.Department of Computer and Information Science, Faculty of Science and Technology, University of Macau, Macau, Taipa, China
5.Department of Electrical and Information Engineering, Hunan University, Hunan, 410082, China
Corresponding Author AffilicationFaculty of Science and Technology
Recommended Citation
GB/T 7714
Li, Tianjun,Wang, Yingxu,Liu, Licheng,et al. Subspace-based minority oversampling for imbalance classification[J]. Information Sciences, 2023, 621, 371-388.
APA Li, Tianjun., Wang, Yingxu., Liu, Licheng., Chen, Long., & Chen, C. L.Philip (2023). Subspace-based minority oversampling for imbalance classification. Information Sciences, 621, 371-388.
MLA Li, Tianjun,et al."Subspace-based minority oversampling for imbalance classification".Information Sciences 621(2023):371-388.
Files in This Item:
There are no files associated with this item.
Related Services
Recommend this item
Bookmark
Usage statistics
Export to Endnote
Google Scholar
Similar articles in Google Scholar
[Li, Tianjun]'s Articles
[Wang, Yingxu]'s Articles
[Liu, Licheng]'s Articles
Baidu academic
Similar articles in Baidu academic
[Li, Tianjun]'s Articles
[Wang, Yingxu]'s Articles
[Liu, Licheng]'s Articles
Bing Scholar
Similar articles in Bing Scholar
[Li, Tianjun]'s Articles
[Wang, Yingxu]'s Articles
[Liu, Licheng]'s Articles
Terms of Use
No data!
Social Bookmark/Share
All comments (0)
No comment.
 

Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.