Residential College | false |
Status | 已發表Published |
Holmes: SMT Interference Diagnosis and CPU Scheduling for Job Co-location | |
Pi, Aidi1; Zhou, Xiaobo1; Xu, Chengzhong2 | |
2022-06-27 | |
Conference Name | 31st International Symposium on High-Performance Parallel and Distributed Computing, HPDC 2022 |
Source Publication | HPDC 2022 - Proceedings of the 31st International Symposium on High-Performance Parallel and Distributed Computing |
Pages | 110-121 |
Conference Date | 27 June 2022through 30 June 2022 |
Conference Place | Virtual, Online |
Publisher | Association for Computing Machinery, Inc |
Abstract | Co-location of latency-critical services with best-effort batch jobs is commonly adopted in production systems to increase resource utilization. Although memory and CPU isolation have been extensively studied, we find Simultaneous Multi-Threading (SMT) technology imposes non-trivial interference on memory access which jeopardizes efficient co-location and performance assurance of latency-critical services. However, there is not an existing metric to quantitatively measure and lacks a deterministic approach to tackle SMT interference on memory access. We present Holmes, a user-space approach to SMT interference diagnosis and adaptive CPU scheduling for efficient job co-location in multi-tenant systems. Holmes tackles two challenges: accurately measuring SMT interference on memory access, and efficiently adjusting CPU allocation to achieve low latency and high resource utilization at the same time. It leverages CPU hardware performance events to diagnose SMT interference on memory access and form a metric. It deploys an interference-aware scheduler to adaptively allocate CPU cores to latency-critical services and batch jobs. Experiments with four real-world key-value stores show that compared to a representative CPU isolation approach, Holmes reduces the average (99th percentile) query latency by up to 49.0% (52.3%) for four real-world latency-critical services. It also significantly improves convergence speed, resource utilization, and system throughput. |
Keyword | Job Co-location Latency-critical Service Smt Interference |
DOI | 10.1145/3502181.3531464 |
URL | View the original |
Language | 英語English |
Scopus ID | 2-s2.0-85134157039 |
Fulltext Access | |
Citation statistics | |
Document Type | Conference paper |
Collection | Faculty of Science and Technology THE STATE KEY LABORATORY OF INTERNET OF THINGS FOR SMART CITY (UNIVERSITY OF MACAU) |
Corresponding Author | Zhou, Xiaobo |
Affiliation | 1.University of Colorado Colorado Springs, Colorado Springs, United States 2.University of Macau, Macau, Macao |
Recommended Citation GB/T 7714 | Pi, Aidi,Zhou, Xiaobo,Xu, Chengzhong. Holmes: SMT Interference Diagnosis and CPU Scheduling for Job Co-location[C]:Association for Computing Machinery, Inc, 2022, 110-121. |
APA | Pi, Aidi., Zhou, Xiaobo., & Xu, Chengzhong (2022). Holmes: SMT Interference Diagnosis and CPU Scheduling for Job Co-location. HPDC 2022 - Proceedings of the 31st International Symposium on High-Performance Parallel and Distributed Computing, 110-121. |
Files in This Item: | There are no files associated with this item. |
Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.
Edit Comment