Residential College | false |
Status | 已發表Published |
Hybrid Sampling with Bagging for Class Imbalance Learning | |
Yang Lu1; Yiu-ming Cheung1; Yuan Yan Tang2 | |
2016 | |
Conference Name | 20th Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD) |
Source Publication | Lecture Notes in Computer Science (ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING)
![]() |
Volume | 9651 |
Pages | 14-26 |
Conference Date | APR 19-22, 2016 |
Conference Place | Univ Auckland, Auckland, NEW ZEALAND |
Country | NEW ZEALAND |
Publisher | SPRINGER-VERLAG BERLIN, HEIDELBERGER PLATZ 3, D-14197 BERLIN, GERMANY |
Abstract | For class imbalance problem, the integration of sampling and ensemble methods has shown great success among various methods. Nevertheless, as the representatives of sampling methods, undersampling and oversampling cannot outperform each other. That is, undersampling fits some data sets while oversampling fits some other. Besides, the sampling rate also significantly influences the performance of a classifier, while existing methods usually adopt full sampling rate to produce balanced training set. In this paper, we propose a new algorithm that utilizes a new hybrid scheme of undersampling and oversampling with sampling rate selection to preprocess the data in each ensemble iteration. Bagging is adopted as the ensemble framework because the sampling rate selection can benefit from the Out-Of-Bag estimate in bagging. The proposed method features both of undersampling and oversampling, and the specifically selected sampling rate for each data set. The experiments are conducted on 26 data sets from the UCI data repository, in which the proposed method in comparison with the existing counterparts is evaluated by three evaluation metrics. Experiments show that, combined with bagging, the proposed hybrid sampling method significantly outperforms the other state-of-the-art bagging-based methods for class imbalance problem. Meanwhile, the superiority of sampling rate selection is also demonstrated. |
Keyword | Class Imbalance Learning Ensemble Method Hybrid Sampling Sampling Method |
DOI | 10.1007/978-3-319-31753-3_2 |
URL | View the original |
Language | 英語English |
WOS Research Area | Computer Science |
WOS Subject | Computer Science, Artificial Intelligence ; Computer Science, Information Systems ; Computer Science, Theory & Methods |
WOS ID | WOS:000389019500002 |
Scopus ID | 2-s2.0-84963994580 |
Fulltext Access | |
Citation statistics | |
Document Type | Conference paper |
Collection | University of Macau |
Affiliation | 1.Department of Computer ScienceHong Kong Baptist UniversityHong KongChina 2.Department of Computer and Information Science, Faculty of Science and TechnologyUniversity of MacauMacauChina |
Recommended Citation GB/T 7714 | Yang Lu,Yiu-ming Cheung,Yuan Yan Tang. Hybrid Sampling with Bagging for Class Imbalance Learning[C]:SPRINGER-VERLAG BERLIN, HEIDELBERGER PLATZ 3, D-14197 BERLIN, GERMANY, 2016, 14-26. |
APA | Yang Lu., Yiu-ming Cheung., & Yuan Yan Tang (2016). Hybrid Sampling with Bagging for Class Imbalance Learning. Lecture Notes in Computer Science (ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING), 9651, 14-26. |
Files in This Item: | There are no files associated with this item. |
Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.
Edit Comment