IEEE Transactions on Cloud Computing | 2019

Correlation Modeling and Resource Optimization for Cloud Service With Fault Recovery

 
 
 
 

Abstract


Energy-efficient cloud computing has recently attracted much attention, where not only performance but also energy consumption are important metrics to be considered for designing rational resource scheduling strategies. Most of existing approaches for achieving energy efficient computing focus on connecting these two metrics and balancing the tradeoff between them, which however is inadequate because another important factor reliability is not considered. In fact, both virtual machine (VM) failures and server failures inevitably interrupt execution of a cloud service, and eventually result in spending more time and consuming more energy on completing the cloud service. Therefore, reliability significantly affects service performance and energy consumption, and thus they should not be handled separately. Connecting these correlated metrics is essential for making more precise evaluation and further for developing rational cloud resource scheduling strategies. In this paper, we present a correlated modeling approach applying Semi-Markov models, the Laplace-Stieltjes transform (LST), a Bayesian approach to analyze reliability-performance (R-P) and reliability-energy (R-E) correlations for cloud services using a retrying fault recovery mechanism. A recursive method is also proposed for modeling the correlations for cloud services using a check-pointing fault recovery mechanism. The proposed correlation models can be used to calculate the expected service time and energy consumption for completing a cloud service. Moreover, the models can contribute to analyzing the expected performance-energy tradeoff. We formulate the expected performance-energy optimization problem by describing performance and energy consumption metrics as functions of assigned CPU frequencies. Finally, we use a derivation approach to determine Pareto optimal solutions for the formulated optimization problem. Illustrative examples are provided.

Volume 7
Pages 693-704
DOI 10.1109/TCC.2017.2691323
Language English
Journal IEEE Transactions on Cloud Computing

Full Text