Residential Collegefalse
Status已發表Published
Fast Cluster-learning with Prior Probability from Big Dataset
Tengyue Li1; Simon Fong1; Joao Alexandre Lobo Marques2; Raymond K. Wong3
2019-05-02
Conference Name2018 5th International Conference on Soft Computing & Machine Intelligence (ISCMI)
Source Publication5th International Conference on Soft Computing and Machine Intelligence, ISCMI 2018
Pages60-66
Conference Date21-22 Nov. 2018
Conference PlaceNairobi, Kenya
PublisherIEEE, 345 E 47TH ST, NEW YORK, NY 10017 USA
Abstract

Association Rule Mining by Aprior method has been one of the popular data mining techniques for decades, where knowledge in the form of item-association rules is harvested from a dataset. The quality of item-association rules nevertheless depends on the concentration of frequent items from the input dataset. When the dataset becomes large, the items are scattered far apart. It is known from previous literature that clustering helps produce some data groups which are concentrated with frequent items. Among all the data clusters generated by a clustering algorithm, there must be one or more clusters which contain suitable and frequent items. In turn, the association rules that are mined from such clusters would be assured of better qualities in terms of high confidence than those mined from the whole dataset. However, it is not known in advance which cluster is the suitable one until all the clusters are tried by association rule mining. It is time consuming if they were to be tested by brute-force. In this paper, a statistical property called prior probability is investigated with respect to selecting the best out of many clusters by a clustering algorithm as a pre-processing step before association rule mining. Experiment results indicate that there is correlation between prior probability of the best cluster and the relatively high quality of association rules generated from that cluster. The results are significant as it is possible to know which cluster should be best used for association rule mining instead of testing them all out exhaustively.

KeywordAssociation Rule Mining Clustering Preprocessing Prior Probability
DOI10.1109/ISCMI.2018.8703219
URLView the original
Indexed ByCPCI-S
Language英語English
WOS Research AreaComputer Science
WOS SubjectComputer Science, Artificial Intelligence ; Computer Science, Theory & Methods
WOS IDWOS:000470762100011
Scopus ID2-s2.0-85065698950
Fulltext Access
Citation statistics
Document TypeConference paper
CollectionDEPARTMENT OF COMPUTER AND INFORMATION SCIENCE
Corresponding AuthorTengyue Li
Affiliation1.Department of Computer and Information Science,University of Macau,Macao
2.School of Business,University of Saint Joseph,Macao
3.School of Computer Science and Engineering,University of New South Wales,Sydney,Australia
First Author AffilicationUniversity of Macau
Corresponding Author AffilicationUniversity of Macau
Recommended Citation
GB/T 7714
Tengyue Li,Simon Fong,Joao Alexandre Lobo Marques,et al. Fast Cluster-learning with Prior Probability from Big Dataset[C]:IEEE, 345 E 47TH ST, NEW YORK, NY 10017 USA, 2019, 60-66.
APA Tengyue Li., Simon Fong., Joao Alexandre Lobo Marques., & Raymond K. Wong (2019). Fast Cluster-learning with Prior Probability from Big Dataset. 5th International Conference on Soft Computing and Machine Intelligence, ISCMI 2018, 60-66.
Files in This Item:
There are no files associated with this item.
Related Services
Recommend this item
Bookmark
Usage statistics
Export to Endnote
Google Scholar
Similar articles in Google Scholar
[Tengyue Li]'s Articles
[Simon Fong]'s Articles
[]'s Articles
Baidu academic
Similar articles in Baidu academic
[Tengyue Li]'s Articles
[Simon Fong]'s Articles
[Joao Alexandre ...]'s Articles
Bing Scholar
Similar articles in Bing Scholar
[Tengyue Li]'s Articles
[Simon Fong]'s Articles
[Joao Alexandre ...]'s Articles
Terms of Use
No data!
Social Bookmark/Share
All comments (0)
No comment.
 

Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.