Residential College | false |
Status | 已發表Published |
Efficient Detection of Soft Concatenation Mapping | |
Liu, Hao; Xiao, Jiang; Tan, Haoyu; Luo, Qiong; Zhao, Jintao; Ni, Lionel M. | |
2018-11 | |
Source Publication | IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING |
ISSN | 1041-4347 |
Volume | 30Issue:11Pages:2106-2119 |
Abstract | In modern big data warehouse systems, we observe a common phenomenon that a column of data values can be derived from one or several other columns by transforming and concatenating these columns. We call this relationship between columns a Soft Concatenation Mapping (SCM). SCMs imply significant redundancy in the schema or data, and therefore can be exploited for data integration or data compression. In this paper, we formalize the problem of SCM detection and prove it is NP-hard. We then propose efficient approximate algorithms to detect all SCMs or an optimal set of SCMs in a table. Our experiments on both real-world and synthetic datasets show promising results. |
Keyword | Big Data Management Data Profiling Soft Concatenation Mapping |
DOI | 10.1109/TKDE.2018.2812822 |
URL | View the original |
Indexed By | SCIE |
Language | 英語English |
WOS Research Area | Computer Science ; Engineering |
WOS Subject | Computer Science, Artificial Intelligence ; Computer Science, Information Systems ; Engineering, Electrical & Electronic |
WOS ID | WOS:000446795900007 |
Publisher | IEEE COMPUTER SOC |
The Source to Article | WOS |
Scopus ID | 2-s2.0-85043382079 |
Fulltext Access | |
Citation statistics | |
Document Type | Journal article |
Collection | University of Macau |
Recommended Citation GB/T 7714 | Liu, Hao,Xiao, Jiang,Tan, Haoyu,et al. Efficient Detection of Soft Concatenation Mapping[J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2018, 30(11), 2106-2119. |
APA | Liu, Hao., Xiao, Jiang., Tan, Haoyu., Luo, Qiong., Zhao, Jintao., & Ni, Lionel M. (2018). Efficient Detection of Soft Concatenation Mapping. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 30(11), 2106-2119. |
MLA | Liu, Hao,et al."Efficient Detection of Soft Concatenation Mapping".IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING 30.11(2018):2106-2119. |
Files in This Item: | There are no files associated with this item. |
Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.
Edit Comment