Residential College | false |
Status | 已發表Published |
A Lightweight Data Pre-Processing Strategy with Fast Contradiction Analysis for Incremental Classifier Learning | |
Fong, S.; Biuk-Aghai, R. P.; Si, Y. W.; Yap, B. W. | |
2015-02-01 | |
Source Publication | Mathematical Problems in Engineering |
ISSN | 1024-123X |
Pages | 1-11 |
Abstract | A prime objective in constructing data streaming mining models is to achieve good accuracy, fast learning, robustness to noise and a compact tree size. Although many techniques have been proposed in the past, efforts to improve the accuracy of classification models have been somewhat disparate. These techniques include, but are not limited to, feature selection, dimensionality reduction and the removal of noise from training data. One limitation common to all of these techniques is the assumption that the full training dataset must be applied. Although this has been effective for traditional batch training, it may not be practical for incremental classifier learning, also known as data stream mining, where only a single pass of the data stream is seen at a time. Because data streams can amount to infinity, and the so-called Big Data phenomenon, the data pre-processing time must be kept to a minimum to fulfill the need for high speed. This paper introduces a new data pre-processing strategy suitable for the progressive purging of noisy data from the training dataset without the need to process the whole dataset at one time. This strategy is shown via a computer simulation to provide the significant benefit of allowing for the dynamic removal of bad records from the incremental classifier learning process. |
Keyword | Data Pre-processing Conflict Analysis Incremental Learning Pairwise Classification |
URL | View the original |
Language | 英語English |
The Source to Article | PB_Publication |
Document Type | Journal article |
Collection | University of Macau |
Corresponding Author | Fong, S. |
Recommended Citation GB/T 7714 | Fong, S.,Biuk-Aghai, R. P.,Si, Y. W.,et al. A Lightweight Data Pre-Processing Strategy with Fast Contradiction Analysis for Incremental Classifier Learning[J]. Mathematical Problems in Engineering, 2015, 1-11. |
APA | Fong, S.., Biuk-Aghai, R. P.., Si, Y. W.., & Yap, B. W. (2015). A Lightweight Data Pre-Processing Strategy with Fast Contradiction Analysis for Incremental Classifier Learning. Mathematical Problems in Engineering, 1-11. |
MLA | Fong, S.,et al."A Lightweight Data Pre-Processing Strategy with Fast Contradiction Analysis for Incremental Classifier Learning".Mathematical Problems in Engineering (2015):1-11. |
Files in This Item: | There are no files associated with this item. |
Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.
Edit Comment