Residential Collegefalse
Status已發表Published
Interference-aware Multiplexing for Deep Learning in GPU Clusters: A Middleware Approach
Wenyan Chen2; Zizhao Mo1; Huanle Xu1; Kejiang Ye2; Chengzhong Xu1
2023-06
Conference NameThe International Conference for High Performance Computing, Networking, Storage, and Analysis (SC)
Source PublicationThe International Conference for High Performance Computing, Networking, Storage, and Analysis (SC)
Volume30
Pages1–15
Conference Date2023-11
Conference PlaceDENVER, CO
CountryUSA
Abstract

A common strategy for improving efficiency in training deep learning entails multiplexing tasks on a single GPU. To mitigate the interference caused by multiplexing, existing approaches primarily employ kernel-level solutions to regulate GPU kernel execution, or harness hardware-level techniques to explicitly restrict GPU streaming multiprocessors and memory. Nevertheless, none of them perform satisfactorily in optimizing the completion time of tasks. In this paper, we present IADeep, a middleware solution designed to significantly improve multiplexing efficiency. The core concept is the co-optimization of task assignments within a cluster and interference mitigation on each device. IADeep coordinates the configuration of all co-located tasks in a less fine-grained fashion, effectively reducing interference and enhancing task training performance. Across the entire cluster, IADeep intelligently selects applications suitable for multiplexing to further amplify the advantages of optimizing task configurations. Evaluations on a 20 RTX 3090-GPU cluster demonstrate that IADeep can significantly outperform state-of-the-art multiplexing solutions.

KeywordCloud Computing Interference-aware Multiplexing Deep Learning
DOI10.1145/3581784.3607060
URLView the original
Scopus ID2-s2.0-85179551264
Fulltext Access
Citation statistics
Document TypeConference paper
CollectionDEPARTMENT OF COMPUTER AND INFORMATION SCIENCE
Faculty of Science and Technology
Corresponding AuthorHuanle Xu
Affiliation1.University of Macau
2.Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, University of Macau, Macao
First Author AffilicationUniversity of Macau
Corresponding Author AffilicationUniversity of Macau
Recommended Citation
GB/T 7714
Wenyan Chen,Zizhao Mo,Huanle Xu,et al. Interference-aware Multiplexing for Deep Learning in GPU Clusters: A Middleware Approach[C], 2023, 1–15.
APA Wenyan Chen., Zizhao Mo., Huanle Xu., Kejiang Ye., & Chengzhong Xu (2023). Interference-aware Multiplexing for Deep Learning in GPU Clusters: A Middleware Approach. The International Conference for High Performance Computing, Networking, Storage, and Analysis (SC), 30, 1–15.
Files in This Item:
There are no files associated with this item.
Related Services
Recommend this item
Bookmark
Usage statistics
Export to Endnote
Google Scholar
Similar articles in Google Scholar
[Wenyan Chen]'s Articles
[Zizhao Mo]'s Articles
[Huanle Xu]'s Articles
Baidu academic
Similar articles in Baidu academic
[Wenyan Chen]'s Articles
[Zizhao Mo]'s Articles
[Huanle Xu]'s Articles
Bing Scholar
Similar articles in Bing Scholar
[Wenyan Chen]'s Articles
[Zizhao Mo]'s Articles
[Huanle Xu]'s Articles
Terms of Use
No data!
Social Bookmark/Share
All comments (0)
No comment.
 

Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.