Residential College | false |
Status | 已發表Published |
Semi-supervised Drifted Stream Learning with Short Lookback | |
Weijieying Ren1; Pengyang Wang2; Xiaolin Li3; Charles E. Hughes1; Yanjie Fu1 | |
2022-05 | |
Conference Name | The 28th ACM SIGKDD international conference on knowledge discovery & data mining |
Source Publication | Proceedings - The 28th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD |
Pages | 1504–1513 |
Conference Date | 2022-08-14 |
Conference Place | Washington D.C. |
Country | USA |
Publisher | Association for Computing Machinery |
Abstract | In many scenarios, 1) data streams are generated in real time; 2) labeled data are expensive and only limited labels are available in the beginning; 3) real-world data is not always i.i.d. and data drift over time gradually; 4) the storage of historical streams is limited and model updating can only be achieved based on a very short lookback window. This learning setting limits the applicability and availability of many Machine Learning (ML) algorithms. We generalize the learning task under such setting as a semi-supervised drifted stream learning with short lookback problem (SDSL). SDSL imposes two under-addressed challenges on existing methods in semi-supervised learning, continuous learning, and domain adaptation: 1) robust pseudo-labeling under gradual shifts and 2) anti-forgetting adaptation with short lookback. To tackle these challenges, we propose a principled and generic generation-replay framework to solve SDSL. The framework is able to accomplish: 1) robust pseudo-labeling in the generation step; 2) anti-forgetting adaption in the replay step. To achieve robust pseudo-labeling, we develop a novel pseudo-label classification model to leverage supervised knowledge of previously labeled data, unsupervised knowledge of new data, and, structure knowledge of invariant label semantics. To achieve adaptive antiforgetting model replay, we propose to view the anti-forgetting adaptation task as a flat region search problem. We propose a novel minimax game-based replay objective function to solve the flat region search problem and develop an effective optimization solver. Finally, we present extensive experiments to demonstrate our framework can effectively address the task of anti-forgetting learning in drifted streams with short lookback |
Keyword | Continual Learning Distribution Shift Semi-supervised Learning |
DOI | 10.1145/3534678.3539297 |
Language | 英語English |
Scopus ID | 2-s2.0-85137150776 |
Fulltext Access | |
Citation statistics | |
Document Type | Conference paper |
Collection | THE STATE KEY LABORATORY OF INTERNET OF THINGS FOR SMART CITY (UNIVERSITY OF MACAU) |
Corresponding Author | Yanjie Fu |
Affiliation | 1.University of Central Florida 2.University of Macau 3.Nanjing University |
Recommended Citation GB/T 7714 | Weijieying Ren,Pengyang Wang,Xiaolin Li,et al. Semi-supervised Drifted Stream Learning with Short Lookback[C]:Association for Computing Machinery, 2022, 1504–1513. |
APA | Weijieying Ren., Pengyang Wang., Xiaolin Li., Charles E. Hughes., & Yanjie Fu (2022). Semi-supervised Drifted Stream Learning with Short Lookback. Proceedings - The 28th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD, 1504–1513. |
Files in This Item: | There are no files associated with this item. |
Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.
Edit Comment