Residential College | false |
Status | 已發表Published |
Similarity majority under-sampling technique for easing imbalanced classification problem | |
Jinyan Li1; Simon Fong1; Shimin Hu1; Raymond K. Wong2; Sabah Mohammed3 | |
2018-04-11 | |
Conference Name | 2017 AusDM: Australasian Conference on Data Mining |
Source Publication | Data Mining |
Volume | 845 |
Pages | 3-23 |
Conference Date | August 19-20, 2017 |
Conference Place | Melbourne, VIC, Australia |
Abstract | Imbalanced classification problem is an enthusiastic topic in the fields of data mining, machine learning and pattern recognition. The imbalanced distributions of different class samples result in the classifier being over-fitted by learning too many majority class samples and under-fitted in recognizing minority class samples. Prior methods attempt to ease imbalanced problem through sampling techniques, in order to re-assign and rebalance the distributions of imbalanced dataset. In this paper, we proposed a novel notion to under-sample the majority class size for adjusting the original imbalanced class distributions. This method is called Similarity Majority Under-sampling Technique (SMUTE). By calculating the similarity of each majority class sample and observing its surrounding minority class samples, SMUTE effectively separates the majority and minority class samples to increase the recognition power for each class. The experimental results show that SMUTE could outperform the current under-sampling methods when the same under-sampling rate is used. |
Keyword | Imbalanced Classification Under-sampling Similarity Measure Smute |
DOI | 10.1007/978-981-13-0292-3_1 |
URL | View the original |
Language | 英語English |
WOS ID | WOS:000922750100001 |
Scopus ID | 2-s2.0-85045832453 |
Fulltext Access | |
Citation statistics | |
Document Type | Conference paper |
Collection | DEPARTMENT OF COMPUTER AND INFORMATION SCIENCE |
Corresponding Author | Simon Fong |
Affiliation | 1.Department of Computer and Information Science, University of Macau, Taipa, Macau SAR, China 2.School of Computer Science and Engineering, University of New South Wales, Sydney, NSW, Australia 3.Department of Computer Science, Lakehead University, Thunder Bay, Canada |
First Author Affilication | University of Macau |
Corresponding Author Affilication | University of Macau |
Recommended Citation GB/T 7714 | Jinyan Li,Simon Fong,Shimin Hu,et al. Similarity majority under-sampling technique for easing imbalanced classification problem[C], 2018, 3-23. |
APA | Jinyan Li., Simon Fong., Shimin Hu., Raymond K. Wong., & Sabah Mohammed (2018). Similarity majority under-sampling technique for easing imbalanced classification problem. Data Mining, 845, 3-23. |
Files in This Item: | There are no files associated with this item. |
Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.
Edit Comment