Residential College | false |
Status | 已發表Published |
Kennard-Stone Balance Algorithm for Time-series Big Data Stream Mining | |
Tengyue Li1; Simon Fong1; Yaoyang Wu1; Antonio J. Tallón-Ballesteros2 | |
2020-11 | |
Conference Name | 2020 International Conference on Data Mining Workshops (ICDMW) |
Source Publication | IEEE International Conference on Data Mining Workshops, ICDMW |
Volume | 2020-November |
Pages | 851-858 |
Conference Date | 17-20 November 2020 |
Conference Place | Sorrento, Italy |
Country | Italy |
Publisher | IEEE |
Abstract | Nowadays time series are generated relatively more easily and in larger quantity than ever, by the advances of IoT and sensor applications. Training a prediction model effectively using such big data streams poses certain challenges in machine learning. Data sampling has been an important technique in handling over-sized data in pre-processing which converts the huge data streams into a manageable and representative subset before loading them into a model induction process. In this paper a novel data conversion method, namely Kennard-Stone Balance (KSB) Algorithm is proposed. In the past decades, KS has been used by researchers for partitioning a bounded dataset into appropriate portions of training and testing data in cross-validation. In this new proposal, we extend KS into balancing the sub-sampled data in consideration of the class distribution by round-robin. It is also the first time KS is applied on time-series for the purpose of extracting a meaningful representation of big data streams, for improving the performance of a machine learning model. Preliminary simulation results show the advantages of KBS. Analysis, discussion and future works are reported in this short paper. It is anticipated that KBS brings a new alternative of data sampling to data stream mining with lots of potentials. |
DOI | 10.1109/ICDMW51313.2020.00122 |
URL | View the original |
Indexed By | CPCI-S |
Language | 英語English |
WOS Research Area | Computer Science |
WOS Subject | Computer Science, Artificial Intelligence ; Computer Science, Information Systems ; Computer Science, Theory & Methods |
WOS ID | WOS:000657112800115 |
Scopus ID | 2-s2.0-85101342912 |
Fulltext Access | |
Citation statistics | |
Document Type | Conference paper |
Collection | Faculty of Science and Technology DEPARTMENT OF COMPUTER AND INFORMATION SCIENCE |
Affiliation | 1.Department of Computer and Information Science University of Macau Macau, China 2.2 Department of Electronic, Computer Systems and Automation Engineering University of Huelva Huelva, Spain |
First Author Affilication | University of Macau |
Recommended Citation GB/T 7714 | Tengyue Li,Simon Fong,Yaoyang Wu,et al. Kennard-Stone Balance Algorithm for Time-series Big Data Stream Mining[C]:IEEE, 2020, 851-858. |
APA | Tengyue Li., Simon Fong., Yaoyang Wu., & Antonio J. Tallón-Ballesteros (2020). Kennard-Stone Balance Algorithm for Time-series Big Data Stream Mining. IEEE International Conference on Data Mining Workshops, ICDMW, 2020-November, 851-858. |
Files in This Item: | There are no files associated with this item. |
Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.
Edit Comment