UM
Residential Collegefalse
Status已發表Published
Hybrid Sampling with Bagging for Class Imbalance Learning
Yang Lu1; Yiu-ming Cheung1; Yuan Yan Tang2
2016
Conference Name20th Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD)
Source PublicationLecture Notes in Computer Science (ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING)
Volume9651
Pages14-26
Conference DateAPR 19-22, 2016
Conference PlaceUniv Auckland, Auckland, NEW ZEALAND
CountryNEW ZEALAND
PublisherSPRINGER-VERLAG BERLIN, HEIDELBERGER PLATZ 3, D-14197 BERLIN, GERMANY
Abstract

For class imbalance problem, the integration of sampling and ensemble methods has shown great success among various methods. Nevertheless, as the representatives of sampling methods, undersampling and oversampling cannot outperform each other. That is, undersampling fits some data sets while oversampling fits some other. Besides, the sampling rate also significantly influences the performance of a classifier, while existing methods usually adopt full sampling rate to produce balanced training set. In this paper, we propose a new algorithm that utilizes a new hybrid scheme of undersampling and oversampling with sampling rate selection to preprocess the data in each ensemble iteration. Bagging is adopted as the ensemble framework because the sampling rate selection can benefit from the Out-Of-Bag estimate in bagging. The proposed method features both of undersampling and oversampling, and the specifically selected sampling rate for each data set. The experiments are conducted on 26 data sets from the UCI data repository, in which the proposed method in comparison with the existing counterparts is evaluated by three evaluation metrics. Experiments show that, combined with bagging, the proposed hybrid sampling method significantly outperforms the other state-of-the-art bagging-based methods for class imbalance problem. Meanwhile, the superiority of sampling rate selection is also demonstrated.

KeywordClass Imbalance Learning Ensemble Method Hybrid Sampling Sampling Method
DOI10.1007/978-3-319-31753-3_2
URLView the original
Language英語English
WOS Research AreaComputer Science
WOS SubjectComputer Science, Artificial Intelligence ; Computer Science, Information Systems ; Computer Science, Theory & Methods
WOS IDWOS:000389019500002
Scopus ID2-s2.0-84963994580
Fulltext Access
Citation statistics
Document TypeConference paper
CollectionUniversity of Macau
Affiliation1.Department of Computer ScienceHong Kong Baptist UniversityHong KongChina
2.Department of Computer and Information Science, Faculty of Science and TechnologyUniversity of MacauMacauChina
Recommended Citation
GB/T 7714
Yang Lu,Yiu-ming Cheung,Yuan Yan Tang. Hybrid Sampling with Bagging for Class Imbalance Learning[C]:SPRINGER-VERLAG BERLIN, HEIDELBERGER PLATZ 3, D-14197 BERLIN, GERMANY, 2016, 14-26.
APA Yang Lu., Yiu-ming Cheung., & Yuan Yan Tang (2016). Hybrid Sampling with Bagging for Class Imbalance Learning. Lecture Notes in Computer Science (ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING), 9651, 14-26.
Files in This Item:
There are no files associated with this item.
Related Services
Recommend this item
Bookmark
Usage statistics
Export to Endnote
Google Scholar
Similar articles in Google Scholar
[Yang Lu]'s Articles
[Yiu-ming Cheung]'s Articles
[Yuan Yan Tang]'s Articles
Baidu academic
Similar articles in Baidu academic
[Yang Lu]'s Articles
[Yiu-ming Cheung]'s Articles
[Yuan Yan Tang]'s Articles
Bing Scholar
Similar articles in Bing Scholar
[Yang Lu]'s Articles
[Yiu-ming Cheung]'s Articles
[Yuan Yan Tang]'s Articles
Terms of Use
No data!
Social Bookmark/Share
All comments (0)
No comment.
 

Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.