Residential College | false |
Status | 已發表Published |
Cloud Configuration Optimization for Recurring Batch-Processing Applications | |
Liu, Yang1,2; Xu, Huanle3; Lau, Wing Cheong2 | |
2023-02-17 | |
Source Publication | IEEE Transactions on Parallel and Distributed Systems |
ISSN | 1045-9219 |
Volume | 34Issue:5Pages:1495-1507 |
Abstract | Recognizing the diversity of Big Data analytic jobs, cloud providers offer a wide range of VM instance types or even clusters to cater for different use cases. The choice of cloud configurations can have a significant impact on the response time and running cost of batch-processing applications, which may need to be re-run regularly with cloud-scale resources. However, identifying the best cloud configuration with a low search cost is quite challenging due to i) the large and high-dimensional configuration space, ii) the time-varying cloud service cost (e.g., AWS Spot instances), and iii) job response time variation even given the same configuration. To tackle these challenges, we design and implement Accordia, a system that enables Adaptive Cloud Configuration Optimization for Recurring Data-Intensive Applications. By leveraging recent algorithmic advances in Gaussian Process UCB techniques, Accordia can unearth the cost-optimal configuration with a deadline constraint (i.e., maximum tolerated running time) under the time-varying cloud service cost. More importantly, Accordia manages to achieve a theoretical performance guarantee, sub-linearly increasing dynamic regret of the job completion cost. Using extensive trace-driven simulations and empirical measurements of our Kubernetes-based implementation, we demonstrate that Accordia can identify a near-cost-optimal configuration (i.e., within 10% of the optimum) after fewer than 20 runs from over 7000 candidate choices, which translates to a 2X-speedup and up to 17.9% cost-savings, when comparing to the state-of-the-art approach, CherryPick. |
Keyword | Big Data Analytics Cloud Configuration Gaussian-process Ucb Kubernetes |
DOI | 10.1109/TPDS.2023.3246086 |
URL | View the original |
Indexed By | SCIE |
Language | 英語English |
WOS Research Area | Computer Science ; Engineering |
WOS Subject | Computer Science, Theory & Methods ; Engineering, Electrical & Electronic |
WOS ID | WOS:000954366000006 |
Publisher | IEEE COMPUTER SOC, 10662 LOS VAQUEROS CIRCLE, PO BOX 3014, LOS ALAMITOS, CA 90720-1314 |
Scopus ID | 2-s2.0-85149388446 |
Fulltext Access | |
Citation statistics | |
Document Type | Journal article |
Collection | DEPARTMENT OF COMPUTER AND INFORMATION SCIENCE |
Corresponding Author | Liu, Yang |
Affiliation | 1.Shanghai University, Shanghai, 200444, China 2.Chinese University of Hong Kong, Hong Kong, Hong Kong 3.University of Macau, Taipa, 999078, Macao |
Recommended Citation GB/T 7714 | Liu, Yang,Xu, Huanle,Lau, Wing Cheong. Cloud Configuration Optimization for Recurring Batch-Processing Applications[J]. IEEE Transactions on Parallel and Distributed Systems, 2023, 34(5), 1495-1507. |
APA | Liu, Yang., Xu, Huanle., & Lau, Wing Cheong (2023). Cloud Configuration Optimization for Recurring Batch-Processing Applications. IEEE Transactions on Parallel and Distributed Systems, 34(5), 1495-1507. |
MLA | Liu, Yang,et al."Cloud Configuration Optimization for Recurring Batch-Processing Applications".IEEE Transactions on Parallel and Distributed Systems 34.5(2023):1495-1507. |
Files in This Item: | There are no files associated with this item. |
Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.
Edit Comment