Status | Published |
Optimized Very Fast Decision Tree with Balanced Classification Accuracy and Compact Tree Size | |
Hang Yang; Simon Fong | |
2011-12-01 | |
Conference Name | The 3rd International Conference on Data Mining and Intelligent Information Technology Applications |
Source Publication | Proceedings - 3rd International Conference on Data Mining and Intelligent Information Technology Applications, ICMIA 2011 |
Pages | 57-64 |
Conference Date | 24-26 Oct. 2011 |
Conference Place | Macao, China |
Publisher | IEEE |
Abstract | The Very Fast Decision Tree (VFDT) has been widely studied in data stream mining for more than a decade. In essence, VFDT mines one portion of an unbounded data stream at a time and updates the structure of the decision tree whenever new data arrive, so its predictions improve as fresh data are fed in. Like the traditional decision trees from which it inherits the use of information gain for tree induction, VFDT may suffer from over-fitting: noise in the training data leads to excessive tree branches and therefore to a decline in prediction accuracy. The problem is aggravated in stream mining, where limited run-time memory (for storing the whole decision tree) and reasonable accuracy are often the criteria for implementing VFDT. Post-pruning, a popular technique for keeping tree size in check in traditional decision trees, may not be applicable to VFDT in situ. This paper proposes a new model that extends VFDT, called Optimized VFDT (OVFDT), which controls the tree size while sustaining good prediction accuracy. This is achieved by using an adaptive tie threshold and incremental pruning during tree induction. Experimental results show that OVFDT can achieve an optimal ratio of tree size to accuracy. |
Keyword | Incremental Optimization; OVFDT; Stream Mining; Tree Pruning; Very Fast Decision Tree |
Language | English |
Document Type | Conference paper |
Collection | DEPARTMENT OF COMPUTER AND INFORMATION SCIENCE |
Affiliation | Faculty of Science and Technology, University of Macau, Av. Padre Tomás Pereira, Taipa, Macau, China |
First Author Affiliation | Faculty of Science and Technology |
Recommended Citation GB/T 7714 | Hang Yang, Simon Fong. Optimized Very Fast Decision Tree with Balanced Classification Accuracy and Compact Tree Size[C]: IEEE, 2011, 57-64. |
APA | Hang Yang, & Simon Fong (2011). Optimized Very Fast Decision Tree with Balanced Classification Accuracy and Compact Tree Size. Proceedings - 3rd International Conference on Data Mining and Intelligent Information Technology Applications, ICMIA 2011, 57-64. |
Files in This Item: | There are no files associated with this item. |
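
The abstract refers to a tie threshold used during VFDT tree induction. As a rough illustration only, the sketch below shows the split test commonly described for the original VFDT, which uses the Hoeffding bound and a fixed tie threshold. It is not the adaptive threshold or the incremental pruning proposed for OVFDT, whose details this record does not give. The function names, the default `delta`, and the default `tie_threshold` are assumptions chosen for the example.

```python
import math


def hoeffding_bound(value_range: float, delta: float, n: int) -> float:
    """Hoeffding bound: epsilon = sqrt(R^2 * ln(1/delta) / (2n))."""
    return math.sqrt(value_range ** 2 * math.log(1.0 / delta) / (2.0 * n))


def should_split(best_gain: float, second_gain: float, n: int,
                 num_classes: int = 2, delta: float = 1e-7,
                 tie_threshold: float = 0.05) -> bool:
    """Decide whether a leaf that has seen n examples should be split.

    best_gain and second_gain are the information gains of the two best
    candidate attributes. delta and tie_threshold are illustrative
    defaults (assumptions), not values taken from the paper.
    """
    # For information gain, the range R of the measure is log2(#classes).
    value_range = math.log2(num_classes)
    eps = hoeffding_bound(value_range, delta, n)
    # Split when the best attribute is confidently better than the runner-up,
    # or when the two are so close that waiting longer is unlikely to help
    # (the tie case).
    return (best_gain - second_gain) > eps or eps < tie_threshold


if __name__ == "__main__":
    print(should_split(0.30, 0.10, n=1000))  # clear winner
    print(should_split(0.30, 0.28, n=1000))  # too close, too few examples
    print(should_split(0.30, 0.28, n=5000))  # tie broken by the threshold
```

With a fixed threshold, a small value delays splits and a large value forces them early, which is one reason tree size and accuracy trade off against each other in this kind of learner.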