UM  > Faculty of Science and Technology
Residential Collegefalse
Status已發表Published
Holmes: SMT Interference Diagnosis and CPU Scheduling for Job Co-location
Pi, Aidi1; Zhou, Xiaobo1; Xu, Chengzhong2
2022-06-27
Conference Name31st International Symposium on High-Performance Parallel and Distributed Computing, HPDC 2022
Source PublicationHPDC 2022 - Proceedings of the 31st International Symposium on High-Performance Parallel and Distributed Computing
Pages110-121
Conference Date27 June 2022through 30 June 2022
Conference PlaceVirtual, Online
PublisherAssociation for Computing Machinery, Inc
Abstract

Co-location of latency-critical services with best-effort batch jobs is commonly adopted in production systems to increase resource utilization. Although memory and CPU isolation have been extensively studied, we find Simultaneous Multi-Threading (SMT) technology imposes non-trivial interference on memory access which jeopardizes efficient co-location and performance assurance of latency-critical services. However, there is not an existing metric to quantitatively measure and lacks a deterministic approach to tackle SMT interference on memory access. We present Holmes, a user-space approach to SMT interference diagnosis and adaptive CPU scheduling for efficient job co-location in multi-tenant systems. Holmes tackles two challenges: accurately measuring SMT interference on memory access, and efficiently adjusting CPU allocation to achieve low latency and high resource utilization at the same time. It leverages CPU hardware performance events to diagnose SMT interference on memory access and form a metric. It deploys an interference-aware scheduler to adaptively allocate CPU cores to latency-critical services and batch jobs. Experiments with four real-world key-value stores show that compared to a representative CPU isolation approach, Holmes reduces the average (99th percentile) query latency by up to 49.0% (52.3%) for four real-world latency-critical services. It also significantly improves convergence speed, resource utilization, and system throughput.

KeywordJob Co-location Latency-critical Service Smt Interference
DOI10.1145/3502181.3531464
URLView the original
Language英語English
Scopus ID2-s2.0-85134157039
Fulltext Access
Citation statistics
Document TypeConference paper
CollectionFaculty of Science and Technology
THE STATE KEY LABORATORY OF INTERNET OF THINGS FOR SMART CITY (UNIVERSITY OF MACAU)
Corresponding AuthorZhou, Xiaobo
Affiliation1.University of Colorado Colorado Springs, Colorado Springs, United States
2.University of Macau, Macau, Macao
Recommended Citation
GB/T 7714
Pi, Aidi,Zhou, Xiaobo,Xu, Chengzhong. Holmes: SMT Interference Diagnosis and CPU Scheduling for Job Co-location[C]:Association for Computing Machinery, Inc, 2022, 110-121.
APA Pi, Aidi., Zhou, Xiaobo., & Xu, Chengzhong (2022). Holmes: SMT Interference Diagnosis and CPU Scheduling for Job Co-location. HPDC 2022 - Proceedings of the 31st International Symposium on High-Performance Parallel and Distributed Computing, 110-121.
Files in This Item:
There are no files associated with this item.
Related Services
Recommend this item
Bookmark
Usage statistics
Export to Endnote
Google Scholar
Similar articles in Google Scholar
[Pi, Aidi]'s Articles
[Zhou, Xiaobo]'s Articles
[Xu, Chengzhong]'s Articles
Baidu academic
Similar articles in Baidu academic
[Pi, Aidi]'s Articles
[Zhou, Xiaobo]'s Articles
[Xu, Chengzhong]'s Articles
Bing Scholar
Similar articles in Bing Scholar
[Pi, Aidi]'s Articles
[Zhou, Xiaobo]'s Articles
[Xu, Chengzhong]'s Articles
Terms of Use
No data!
Social Bookmark/Share
All comments (0)
No comment.
 

Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.