Residential College | false |
Status | 已發表Published |
FastDEC: Clustering by Fast Dominance Estimation | |
Geping Yang1; Hongzhang Lv1; Yiyang Yang1; Zhiguo Gong2; Xiang Chen3; Zhifeng Hao4 | |
2023-03-17 | |
Conference Name | 22nd Joint European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, ECML PKDD 2022 |
Source Publication | Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) |
Volume | 13713 LNAI |
Pages | 138-156 |
Conference Date | 19 September 2022through 23 September 2022 |
Conference Place | Grenoble |
Country | France |
Author of Source | Massih-Reza Amini ; Stéphane Canu ; Asja Fischer ; Tias Guns ; Petra Kralj Novak ; Grigorios Tsoumakas |
Publisher | Springer Science and Business Media Deutschland GmbH |
Abstract | k-Nearest Neighbors (k-NN) graph is essential for the various graph mining tasks. In this work, we study the density-based clustering on the k-NN graph and propose FastDEC, a clustering framework by fast dominance estimation. The nearest density higher (NDH) relation and dominance-component (DC), more specifically their integration with the k-NN graph, are formally defined and theoretically analyzed. FastDEC includes two extensions to satisfy different clustering scenarios: FastDEC for partitioning data into clusters with arbitrary shapes, and FastDEC for K-Way partition. Firstly, a set of DCs is detected as the results of FastDEC by segmenting the given k-NN graph. Then, the K-Way partition is generated by selecting the top-K DCs in terms of the inter-dominance (ID) as the seeds, and assigning the remaining DCs to their nearest dominators. FastDEC can be viewed as a much faster, more robust, and k-NN based variant of the classical density-based clustering algorithm: Density Peak Clustering (DPC). DPC estimates the significance of data points from the density and geometric distance factors, while FastDEC innovatively uses the global rank of the dominator as an additional factor in the significance estimation. FastDEC naturally holds several critical characteristics: (1) excellent clustering performance; (2) easy to interpret and implement; (3) efficiency and robustness. Experiments on both the artificial and real datasets demonstrate that FastDEC outperforms the state-of-the-art density methods including DPC. |
Keyword | Clustering Density Estimation k Nearest Neighbors |
DOI | 10.1007/978-3-031-26387-3_9 |
URL | View the original |
Indexed By | SCIE |
Language | 英語English |
Scopus ID | 2-s2.0-85151048158 |
Fulltext Access | |
Citation statistics | |
Document Type | Conference paper |
Collection | Faculty of Science and Technology THE STATE KEY LABORATORY OF INTERNET OF THINGS FOR SMART CITY (UNIVERSITY OF MACAU) DEPARTMENT OF COMPUTER AND INFORMATION SCIENCE |
Corresponding Author | Yiyang Yang; Zhiguo Gong |
Affiliation | 1.Faculty of Computer,Guangdong University of Technology,Guangzhou,China 2.State Key Laboratory of Internet of Things for Smart City and Department of Computer and Information Science,University of Macau,Macao 3.School of Electronics and Information Technology,Sun Yat-Sen University,Guangzhou,China 4.College of Engineering,Shantou University,shantou,China |
Corresponding Author Affilication | University of Macau |
Recommended Citation GB/T 7714 | Geping Yang,Hongzhang Lv,Yiyang Yang,et al. FastDEC: Clustering by Fast Dominance Estimation[C]. Massih-Reza Amini, Stéphane Canu, Asja Fischer, Tias Guns, Petra Kralj Novak, Grigorios Tsoumakas:Springer Science and Business Media Deutschland GmbH, 2023, 138-156. |
APA | Geping Yang., Hongzhang Lv., Yiyang Yang., Zhiguo Gong., Xiang Chen., & Zhifeng Hao (2023). FastDEC: Clustering by Fast Dominance Estimation. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 13713 LNAI, 138-156. |
Files in This Item: | There are no files associated with this item. |
Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.
Edit Comment