Residential College | false |
Status | 已發表Published |
Novel structures for counting frequent items in time decayed streams | |
Wu, Shanshan1; Lin, Huaizhong1; Hou, Leong U.2; Gao, Yunjun1; Lu, Dongming1 | |
2017-09 | |
Source Publication | WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS |
ISSN | 1386-145X |
Volume | 20Issue:5Pages:1111-1133 |
Abstract | Identifying frequently occurring items is a fundamental building block in many data stream applications. A great deal of work for efficiently identifying frequent items has been studied on the landmark and sliding window models. In this work, we revisit this problem on a new streaming model based on the time decay, where the importance of every arrival item is decreased over the time. To address the importance changes over time, we propose an innovative heap structure, named Quasi-heap, which maintains the item order using a lazy update mechanism. Two approximation algorithm, Space Saving with Quasi-heap (SSQ) and Filtered Space Saving with Quasi-heap (FSSQ), are proposed to find the frequently occurring items based on the Quasi-heap structure. To achieve better accuracy of frequency estimation for all the items in the stream, we introduce a new count-min-min (CMM) sketch structure, which can estimate the count of an item with almost error free. Extensive experiments conducted on both real-world and synthetic data demonstrate the superiority of proposed methods in terms of both efficiency (i.e., response time) and effectiveness (i.e., accuracy). |
Keyword | Frequent Item Data Stream Time Decay Model Data Structure |
DOI | 10.1007/s11280-017-0433-5 |
URL | View the original |
Indexed By | SCIE |
Language | 英語English |
WOS Research Area | Computer Science |
WOS Subject | Computer Science, Information Systems ; Computer Science, Software Engineering |
WOS ID | WOS:000402177100011 |
Publisher | SPRINGER |
The Source to Article | WOS |
Scopus ID | 2-s2.0-85010817750 |
Fulltext Access | |
Citation statistics | |
Document Type | Journal article |
Collection | DEPARTMENT OF COMPUTER AND INFORMATION SCIENCE Faculty of Science and Technology |
Affiliation | 1.Zhejiang Univ, Coll Comp Sci & Technol, Hangzhou, Zhejiang, Peoples R China 2.Univ Macau, Fac Sci & Technol, Macau, Peoples R China |
Recommended Citation GB/T 7714 | Wu, Shanshan,Lin, Huaizhong,Hou, Leong U.,et al. Novel structures for counting frequent items in time decayed streams[J]. WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS, 2017, 20(5), 1111-1133. |
APA | Wu, Shanshan., Lin, Huaizhong., Hou, Leong U.., Gao, Yunjun., & Lu, Dongming (2017). Novel structures for counting frequent items in time decayed streams. WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS, 20(5), 1111-1133. |
MLA | Wu, Shanshan,et al."Novel structures for counting frequent items in time decayed streams".WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS 20.5(2017):1111-1133. |
Files in This Item: | There are no files associated with this item. |
Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.
Edit Comment