Automatic cloud instance provisioning with quality and efficiency

Abstract

A distinctive feature of cloud computing is that it enables customers to dynamically summon server instances. Service providers facing uncertain demand patterns may exploit this feature by setting automatic provisioning rules for right-sizing the capacity contracted from the cloud. This situation can be modeled by a queueing system where the numbers of both jobs and servers evolve in time, the latter subject to delays in creation and deletion. We study in this context different feedback rules with the objective of efficiently matching capacity and load, while simultaneously providing a high quality of service. These rules are analyzed by means of fluid and diffusion limits for Markov chains. In particular we develop suitable extensions of the classical literature on this topic, required to accommodate non-homogeneous intensity scalings and non-differentiable drift fields. With these tools, our final proposal is shown to exhibit properties akin to the Halfin-Whitt regime, achieved automatically without knowledge of the system load. We further investigate by simulation its behavior under time-varying load, demonstrating the capabilities of our design to provide quality and efficiency in highly dynamic scenarios.

En

Thesis note

Thesis degree name

31 p.

Notes

En: Performance Evaluation, 149-150.
Incluye bibliografía.

Subject

CLOUD COMPUTING, RIGHT-SIZING, FLUID AND DIFFUSION LIMITS

Type

Preprint

Access the full text

Citation

View in library

Rights license

Campus Centro
Cuareim 1451, Montevideo, Uruguay

Teléfono central: (598) 2902 1505
Campus Pocitos
Bvar. España 2633, Montevideo, Uruguay