Residential Collegefalse
Status已發表Published
Optimized Very Fast Decision Tree with Balanced Classification Accuracy and Compact Tree Size
Hang Yang; Simon Fong
2011-12-01
Conference NameThe 3rd International Conference on Data Mining and Intelligent Information Technology Applications
Source PublicationProceedings - 3rd International Conference on Data Mining and Intelligent Information Technology Applications, ICMIA 2011
Pages57-64
Conference Date24-26 Oct. 2011
Conference PlaceMacao, China
PublisherIEEE
Abstract

Very Fast Decision Tree (VFDT) in data stream mining has been widely studied for more than a decade. VFDT in essence can mine over a portion of an unbounded data stream at a time, and the structure of the decision tree gets updated whenever new data feed in; hence it can predict better upon the input of fresh data. Inherent from traditional decision trees that use information gains for tree induction, VFDT may suffer the same over-fitting problem where noise in the training data leads to excessive tree branches therefore decline in prediction accuracy. This problem is aggravated in stream mining because limited run-time memory (for storing the whole decision tree) and reasonable accuracy are often the criteria for implementing VFDT. Post-pruning that was a popular technique used in traditional decision tree to keep the tree size in check, however may not be applicable for VFDT in situ. In this paper a new model that extends from VFDT called Optimized VFDT is proposed, for controlling the tree size while sustaining good prediction accuracy. This is enabled by using an adaptive threshold tie and incremental pruning in tree induction. Experimental results show that an optimal ratio of tree size and accuracy can be achieved by OVFDT. 

KeywordIncremental Optimization Ovfdt Stream Mining Tree Pruning Very Fast Decision Tree
URLView the original
Language英語English
Fulltext Access
Document TypeConference paper
CollectionDEPARTMENT OF COMPUTER AND INFORMATION SCIENCE
AffiliationFaculty of Science and Technology, University of Macau Av. Padre Tomás Pereira Taipa, Macau, China
First Author AffilicationFaculty of Science and Technology
Recommended Citation
GB/T 7714
Hang Yang,Simon Fong. Optimized Very Fast Decision Tree with Balanced Classification Accuracy and Compact Tree Size[C]:IEEE, 2011, 57-64.
APA Hang Yang., & Simon Fong (2011). Optimized Very Fast Decision Tree with Balanced Classification Accuracy and Compact Tree Size. Proceedings - 3rd International Conference on Data Mining and Intelligent Information Technology Applications, ICMIA 2011, 57-64.
Files in This Item:
There are no files associated with this item.
Related Services
Recommend this item
Bookmark
Usage statistics
Export to Endnote
Google Scholar
Similar articles in Google Scholar
[Hang Yang]'s Articles
[Simon Fong]'s Articles
Baidu academic
Similar articles in Baidu academic
[Hang Yang]'s Articles
[Simon Fong]'s Articles
Bing Scholar
Similar articles in Bing Scholar
[Hang Yang]'s Articles
[Simon Fong]'s Articles
Terms of Use
No data!
Social Bookmark/Share
All comments (0)
No comment.
 

Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.