Residential Collegefalse
Status已發表Published
An experimental comparison of decision trees in traditional data mining and data stream mining
Yang Hang; Simon Fong
2011-02-14
Conference Name2010 6th International Conference on Advanced Information Management and Service (IMS)
Source PublicationProc. - 6th Intl. Conference on Advanced Information Management and Service, IMS2010, with ICMIA2010 - 2nd International Conference on Data Mining and Intelligent Information Technology Applications
Pages442-447
Conference Date30 Nov.-2 Dec. 2010
Conference PlaceSeoul, South Korea
Abstract

Data Stream mining (DSM) is claimed to be the successor of traditional data mining where it is capable of mining continuous incoming data streams in real-time with an acceptable performance. Nowadays many computer applications evolved to online and on-demand basis, fresh data are feeding in at high speeds. Not only a decision response needs to be made rapidly, the trained decision tree models would have to be updated recurrently as frequent as the latest data arrive. By the nature of traditional data mining, training datasets are assumed structured and static, and the decision tree models are either refreshed in batches or never. That is, the full dataset will be completely scanned (sometimes in multiple repetitions), induction of rules by Greedy algorithm that proceeds in manner of divide-and-conquer in the case of constructing a C4.5 decision tree. DSM on the other hand progressively builds and renews the decision tree model at a time when a new pass of data come by. In this paper, we evaluated the performance of a popular decision tree in DSM, which is known as Hoeffding Tree vis-à-vis that of C4.5. A good mix of types of datasets was used in the experiments for investigating the apparent differences between the decision trees. An open-source DSM simulator was programmed in JAVA for the experiments.

KeywordData Stream Mining Java Simulator Hoeffding Tree Algorithm Decision Tree Noise Data
URLView the original
Language英語English
Fulltext Access
Document TypeConference paper
CollectionDEPARTMENT OF COMPUTER AND INFORMATION SCIENCE
AffiliationFaculty of Science and Technology, University of Macau, China
First Author AffilicationFaculty of Science and Technology
Recommended Citation
GB/T 7714
Yang Hang,Simon Fong. An experimental comparison of decision trees in traditional data mining and data stream mining[C], 2011, 442-447.
APA Yang Hang., & Simon Fong (2011). An experimental comparison of decision trees in traditional data mining and data stream mining. Proc. - 6th Intl. Conference on Advanced Information Management and Service, IMS2010, with ICMIA2010 - 2nd International Conference on Data Mining and Intelligent Information Technology Applications, 442-447.
Files in This Item:
There are no files associated with this item.
Related Services
Recommend this item
Bookmark
Usage statistics
Export to Endnote
Google Scholar
Similar articles in Google Scholar
[Yang Hang]'s Articles
[Simon Fong]'s Articles
Baidu academic
Similar articles in Baidu academic
[Yang Hang]'s Articles
[Simon Fong]'s Articles
Bing Scholar
Similar articles in Bing Scholar
[Yang Hang]'s Articles
[Simon Fong]'s Articles
Terms of Use
No data!
Social Bookmark/Share
All comments (0)
No comment.
 

Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.