Residential College | false |
Status | 已發表Published |
Interference-aware Multiplexing for Deep Learning in GPU Clusters: A Middleware Approach | |
Wenyan Chen2; Zizhao Mo1; Huanle Xu1; Kejiang Ye2; Chengzhong Xu1 | |
2023-06 | |
Conference Name | The International Conference for High Performance Computing, Networking, Storage, and Analysis (SC) |
Source Publication | The International Conference for High Performance Computing, Networking, Storage, and Analysis (SC) |
Volume | 30 |
Pages | 1–15 |
Conference Date | 2023-11 |
Conference Place | DENVER, CO |
Country | USA |
Abstract | A common strategy for improving efficiency in training deep learning entails multiplexing tasks on a single GPU. To mitigate the interference caused by multiplexing, existing approaches primarily employ kernel-level solutions to regulate GPU kernel execution, or harness hardware-level techniques to explicitly restrict GPU streaming multiprocessors and memory. Nevertheless, none of them perform satisfactorily in optimizing the completion time of tasks. In this paper, we present IADeep, a middleware solution designed to significantly improve multiplexing efficiency. The core concept is the co-optimization of task assignments within a cluster and interference mitigation on each device. IADeep coordinates the configuration of all co-located tasks in a less fine-grained fashion, effectively reducing interference and enhancing task training performance. Across the entire cluster, IADeep intelligently selects applications suitable for multiplexing to further amplify the advantages of optimizing task configurations. Evaluations on a 20 RTX 3090-GPU cluster demonstrate that IADeep can significantly outperform state-of-the-art multiplexing solutions. |
Keyword | Cloud Computing Interference-aware Multiplexing Deep Learning |
DOI | 10.1145/3581784.3607060 |
URL | View the original |
Scopus ID | 2-s2.0-85179551264 |
Fulltext Access | |
Citation statistics | |
Document Type | Conference paper |
Collection | DEPARTMENT OF COMPUTER AND INFORMATION SCIENCE Faculty of Science and Technology |
Corresponding Author | Huanle Xu |
Affiliation | 1.University of Macau 2.Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, University of Macau, Macao |
First Author Affilication | University of Macau |
Corresponding Author Affilication | University of Macau |
Recommended Citation GB/T 7714 | Wenyan Chen,Zizhao Mo,Huanle Xu,et al. Interference-aware Multiplexing for Deep Learning in GPU Clusters: A Middleware Approach[C], 2023, 1–15. |
APA | Wenyan Chen., Zizhao Mo., Huanle Xu., Kejiang Ye., & Chengzhong Xu (2023). Interference-aware Multiplexing for Deep Learning in GPU Clusters: A Middleware Approach. The International Conference for High Performance Computing, Networking, Storage, and Analysis (SC), 30, 1–15. |
Files in This Item: | There are no files associated with this item. |
Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.
Edit Comment