Residential Collegefalse
Status已發表Published
Similarity majority under-sampling technique for easing imbalanced classification problem
Jinyan Li1; Simon Fong1; Shimin Hu1; Raymond K. Wong2; Sabah Mohammed3
2018-04-11
Conference Name2017 AusDM: Australasian Conference on Data Mining
Source PublicationData Mining
Volume845
Pages3-23
Conference DateAugust 19-20, 2017
Conference PlaceMelbourne, VIC, Australia
Abstract

Imbalanced classification problem is an enthusiastic topic in the fields of data mining, machine learning and pattern recognition. The imbalanced distributions of different class samples result in the classifier being over-fitted by learning too many majority class samples and under-fitted in recognizing minority class samples. Prior methods attempt to ease imbalanced problem through sampling techniques, in order to re-assign and rebalance the distributions of imbalanced dataset. In this paper, we proposed a novel notion to under-sample the majority class size for adjusting the original imbalanced class distributions. This method is called Similarity Majority Under-sampling Technique (SMUTE). By calculating the similarity of each majority class sample and observing its surrounding minority class samples, SMUTE effectively separates the majority and minority class samples to increase the recognition power for each class. The experimental results show that SMUTE could outperform the current under-sampling methods when the same under-sampling rate is used.

KeywordImbalanced Classification Under-sampling Similarity Measure Smute
DOI10.1007/978-981-13-0292-3_1
URLView the original
Language英語English
WOS IDWOS:000922750100001
Scopus ID2-s2.0-85045832453
Fulltext Access
Citation statistics
Document TypeConference paper
CollectionDEPARTMENT OF COMPUTER AND INFORMATION SCIENCE
Corresponding AuthorSimon Fong
Affiliation1.Department of Computer and Information Science, University of Macau, Taipa, Macau SAR, China
2.School of Computer Science and Engineering, University of New South Wales, Sydney, NSW, Australia
3.Department of Computer Science, Lakehead University, Thunder Bay, Canada
First Author AffilicationUniversity of Macau
Corresponding Author AffilicationUniversity of Macau
Recommended Citation
GB/T 7714
Jinyan Li,Simon Fong,Shimin Hu,et al. Similarity majority under-sampling technique for easing imbalanced classification problem[C], 2018, 3-23.
APA Jinyan Li., Simon Fong., Shimin Hu., Raymond K. Wong., & Sabah Mohammed (2018). Similarity majority under-sampling technique for easing imbalanced classification problem. Data Mining, 845, 3-23.
Files in This Item:
There are no files associated with this item.
Related Services
Recommend this item
Bookmark
Usage statistics
Export to Endnote
Google Scholar
Similar articles in Google Scholar
[Jinyan Li]'s Articles
[Simon Fong]'s Articles
[Shimin Hu]'s Articles
Baidu academic
Similar articles in Baidu academic
[Jinyan Li]'s Articles
[Simon Fong]'s Articles
[Shimin Hu]'s Articles
Bing Scholar
Similar articles in Bing Scholar
[Jinyan Li]'s Articles
[Simon Fong]'s Articles
[Shimin Hu]'s Articles
Terms of Use
No data!
Social Bookmark/Share
All comments (0)
No comment.
 

Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.