Residential College | false |
Status | 已發表Published |
Multidimensional similarity join using MapReduce | |
Li Y.; Wang J.; U L.H. | |
2016 | |
Conference Name | 17th International Conference on Web-Age Information Management (WAIM) |
Source Publication | Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) |
Volume | 9659 |
Pages | 457-468 |
Conference Date | JUN 03-05, 2016 |
Conference Place | Nanchang, PEOPLES R CHINA |
Abstract | Similarity join is arguably one of the most important operators in multidimensional data analysis tasks. However, processing a similarity join is costly especially for large volume and high dimensional data. In this work, we attempt to process the similarity join on MapReduce such that the join computation can be scaled horizontally. In order to make the workload balancing among all MapReduce nodes, we systemically select the most profitable feature based on a novel data selectivity approach. Given the selected feature, we develop the partitioning scheme for MapReduce processing based on two different optimization goals. Our proposed techniques are extensively evaluated on real datasets. |
DOI | 10.1007/978-3-319-39958-4_36 |
URL | View the original |
Indexed By | SCIE |
Language | 英語English |
WOS Research Area | Computer Science |
WOS Subject | Computer Science, Information Systems ; Computer Science, Software Engineering ; Computer Science, Theory & Methods |
WOS ID | WOS:000389411500036 |
Scopus ID | 2-s2.0-84976649853 |
Fulltext Access | |
Citation statistics | |
Document Type | Conference paper |
Collection | DEPARTMENT OF COMPUTER AND INFORMATION SCIENCE Faculty of Science and Technology |
Affiliation | Universidade de Macau |
First Author Affilication | University of Macau |
Recommended Citation GB/T 7714 | Li Y.,Wang J.,U L.H.. Multidimensional similarity join using MapReduce[C], 2016, 457-468. |
APA | Li Y.., Wang J.., & U L.H. (2016). Multidimensional similarity join using MapReduce. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 9659, 457-468. |
Files in This Item: | There are no files associated with this item. |
Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.
Edit Comment