A Forecasting Methodology for Workload Forecasting in Cloud Systems
Metadatos
Mostrar el registro completo del ítemAutor
Baldán Lozano, Francisco Javier; Ramírez-Gallego, Sergio; Bergmeir, Christoph Norbert; Herrera Triguero, Francisco; Benítez Sánchez, José ManuelMateria
Cloud Computing Elasticity Workload forecasting
Fecha
2014Referencia bibliográfica
Published version: F. J. Baldan, S. Ramirez-Gallego, C. Bergmeir, F. Herrera and J. M. Benitez, "A Forecasting Methodology for Workload Forecasting in Cloud Systems," in IEEE Transactions on Cloud Computing, vol. 6, no. 4, pp. 929-941, 1 Oct.-Dec. 2018, doi: 10.1109/TCC.2016.2586064
Resumen
Cloud Computing is an essential paradigm of computing services based on the “elasticity” property, where available
resources are adapted efficiently to different workloads over time. In elastic platforms, the forecasting component can be considered
by far the most important element and the differentiating factor when comparing such systems, with workload forecasting one of the
problems to solve if we want to achieve a truly elastic system. When properly addressed the cloud workload forecasting problem
becomes a really interesting case study. As there is no general methodology in the literature that addresses this problem analytically
and from a time series forecasting perspective (even less so in the cloud field), we propose a combination of these tools based on
a state-of-the-art forecasting methodology which we have enhanced with some elements, such as: a specific cost function, statistical
tests, visual analysis, etc. The insights obtained from this analysis are used to detect the asymmetrical nature of the forecasting problem
and to find the best forecasting model from the viewpoint of the current state of the art in time series forecasting. From an operational
point of view the most interesting forecast is a short-time horizon, so we focus on this. To show the feasibility of this methodology, we
apply it to several realistic workload datasets from different datacenters. The results indicate that the analyzed series are non-linear in
nature and that no seasonal patterns can be found. Moreover, on the analyzed datasets, the penalty cost as usually included in the
SLA can be reduced to a 30% on average.