Automatic cloud instance provisioning with quality and efficiency
Loading...
Date
2021
Publisher
DOI
ISSN
ISBN
Abstract
A distinctive feature of cloud computing is that it enables customers to dynamically summon server instances. Service providers facing uncertain demand patterns may exploit this feature by setting automatic provisioning rules for right-sizing the capacity contracted from the cloud. This situation can be modeled by a queueing system where the numbers of both jobs and servers evolve in time, the latter subject to delays in creation and deletion. We study in this context different feedback rules with the objective of efficiently matching capacity and load, while simultaneously providing a high quality of service. These rules are analyzed by means of fluid and diffusion limits for Markov chains. In particular we develop suitable extensions of the classical literature on this topic, required to accommodate non-homogeneous intensity scalings and non-differentiable drift fields. With these tools, our final proposal is shown to exhibit properties akin to the Halfin-Whitt regime, achieved automatically without knowledge of the system load. We further investigate by simulation its behavior under time-varying load, demonstrating the capabilities of our design to provide quality and efficiency in highly dynamic scenarios.
En
Thesis note
Thesis degree name
31 p.
Notes
En: Performance Evaluation, 149-150.
Incluye bibliografía.
Incluye bibliografía.
Subject
CLOUD COMPUTING, RIGHT-SIZING, FLUID AND DIFFUSION LIMITS
Type
Preprint