Residential College | false |
Status | 已發表Published |
An experimental comparison of decision trees in traditional data mining and data stream mining | |
Yang Hang; Simon Fong | |
2011-02-14 | |
Conference Name | 2010 6th International Conference on Advanced Information Management and Service (IMS) |
Source Publication | Proc. - 6th Intl. Conference on Advanced Information Management and Service, IMS2010, with ICMIA2010 - 2nd International Conference on Data Mining and Intelligent Information Technology Applications |
Pages | 442-447 |
Conference Date | 30 Nov.-2 Dec. 2010 |
Conference Place | Seoul, South Korea |
Abstract | Data Stream mining (DSM) is claimed to be the successor of traditional data mining where it is capable of mining continuous incoming data streams in real-time with an acceptable performance. Nowadays many computer applications evolved to online and on-demand basis, fresh data are feeding in at high speeds. Not only a decision response needs to be made rapidly, the trained decision tree models would have to be updated recurrently as frequent as the latest data arrive. By the nature of traditional data mining, training datasets are assumed structured and static, and the decision tree models are either refreshed in batches or never. That is, the full dataset will be completely scanned (sometimes in multiple repetitions), induction of rules by Greedy algorithm that proceeds in manner of divide-and-conquer in the case of constructing a C4.5 decision tree. DSM on the other hand progressively builds and renews the decision tree model at a time when a new pass of data come by. In this paper, we evaluated the performance of a popular decision tree in DSM, which is known as Hoeffding Tree vis-à-vis that of C4.5. A good mix of types of datasets was used in the experiments for investigating the apparent differences between the decision trees. An open-source DSM simulator was programmed in JAVA for the experiments. |
Keyword | Data Stream Mining Java Simulator Hoeffding Tree Algorithm Decision Tree Noise Data |
URL | View the original |
Language | 英語English |
Fulltext Access | |
Document Type | Conference paper |
Collection | DEPARTMENT OF COMPUTER AND INFORMATION SCIENCE |
Affiliation | Faculty of Science and Technology, University of Macau, China |
First Author Affilication | Faculty of Science and Technology |
Recommended Citation GB/T 7714 | Yang Hang,Simon Fong. An experimental comparison of decision trees in traditional data mining and data stream mining[C], 2011, 442-447. |
APA | Yang Hang., & Simon Fong (2011). An experimental comparison of decision trees in traditional data mining and data stream mining. Proc. - 6th Intl. Conference on Advanced Information Management and Service, IMS2010, with ICMIA2010 - 2nd International Conference on Data Mining and Intelligent Information Technology Applications, 442-447. |
Files in This Item: | There are no files associated with this item. |
Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.
Edit Comment