STLF With Xgboost
STLF With Xgboost
Using XGBoost
Abstract. For efficient use of smart grid, exact prediction about the in-
future coming load is of great importance to the utility. In this proposed
scheme initially we converted daily Australian energy market operator
load data to weekly data time series. Furthermore, we used eXtreme Gra-
dient Boosting (XGBoost) for extracting features from the data. After
feature selection we used XGBoost for the purpose of forecasting the elec-
tricity load for single time lag. XGBoost perform extremely well for time
series prediction with efficient computing time and memmory resources
usage. Our proposed scheme outperformed other schemes for mean aver-
age percentage error metric.
1 Introduction
Energy production and consumption difference minimization is a challenging
task these days. Efficient consumption of electricity is one good solution to this
problem. Researchers has done a lot of work in for introducing cost effective
and efficient energy utilization systems. Next generation Smart Grid (SG) is
the most attractive solution so far. SG is the integration of information and
communication technology in traditional grid which makes it intelligent power
grid supporting real-time information exchange between producer and consumer.
SG enables the energy efficiency optimization. More precisely, the SG needs an
accurate forecasting of the energy load for more productive application.
AN increasing attempt of deregulating strength markets to shape a more
dependable, green, and price-effective system with the aid of improving com-
petitions has been witnessed in world’s important economies [1,2]. In the lib-
eralized markets, the strength is commoditized and consequently its price is
dynamic. Due to the fee variant, pricing power correctly will become important
to generate profits, schedule strength productions, and plan load responses [3–
7]. The accurate power charge forecasting is helpful to decide the strength rate
and accordingly is precious. As liberalized electricity markets include types, day-
ahead and real-time [8], it’s miles meaningful to discuss both of the day-ahead
c Springer Nature Switzerland AG 2019
L. Barolli et al. (Eds.): WAINA 2019, AISC 927, pp. 1120–1131, 2019.
https://doi.org/10.1007/978-3-030-15035-8_108
STLF Using XGBoost 1121
and online forecasting of the power fee. The energy rate forecasting has been
vigorously studied inside the literature. From the application aspect, the fore-
casting of the energy rate in unique deregulated markets of essential economies
around the world has been stated [9–15].
The Rest of the paper is structured as defined next. The mostly used practi-
cle techniques for load forecasting are discussed in Sect. 2. The Sect. 3 enlightens
the different problems associated with the load prediction. The proposed model
for load prediction and the evaluation metrices are explained in Sect. 4. Exper-
imental results are depicted and highlighted in Sect. 5. Conclusion about the
work done in this paper is expressed in Sect. 6 at the end.
2 Related Work
Bayesian network forecasting, constant analysis of data through data mining and
unsupervised data accumulating for electricity consumption prediction. In [26]
authors proposed a short-lived electricity load prediction scheme for academic
buildings. This work used 2-stage forecasting analysis for the productive working
of their energy system. Energy consumption data is collected from different uni-
versities and moving average method is used for finding the energy load pattern
according to week day. Random Forest (RF) technique is used for forecasting
the daily energy load. RF performance is assessed using cross-validation on time
series.
González et al. predicted electricity price adopting functional time series
using a New Hilbertian ARMAX model in [27]. Suggested scheme has a lin-
ear regression structure, where functional variables are operated by functional
parameters. Where functional parameters are fundamental entities with linearly
combined kernels as sigmoid operations. Quasi-Newton model is used for param-
eters optimization in sigmoid which minimizes the sum of squared error. Data
integrity attacks affect the results of load prediction models i.e., artificial neural
network, multiple linear regression, support vector regression and fuzzy interac-
tion regression). Authors in [28] worked on exposing the consequences of these
attacks. We begin by simulating some knowledge integrity attacks through the
random injection of some multipliers that follow a traditional or uniform distri-
bution into the load series. Then, the four same load prognostication models are
used to generate one-year-ahead ex post purpose forecasts so as to supply a com-
parison of their forecast errors. The results show that the support vector regres-
sion model is most robust, followed closely by the multiple rectilinear regression
model, whereas the fuzzy interaction regression model is that the least sturdy of
the four. withal, all four models fail to supply satisfying forecasts once the size
of the info integrity attacks becomes giant. This presents a serious challenge to
each load forecasters and therefore the broader prognostication community: the
generation of correct forecasts beneath knowledge integrity attacks.
Dong et al. worked on the energy management in a microgrid. Bayesian-
optimization-algorithm (BOA) is used for a single SG using house. Authors
in [29] articulates the enhancement beyond the closed form equitable function
equation, and work out on it using BOA based data-driven technique. We can
consider the suggested technique as a black box function improving technique
as a whole. Furthermore, it has the ability to handle the microgrid working and
argument forecasting ambiguity.
We used various standards for the evaluation of our proposed prediction model
efficiency. The two most commonly used metrices for the measurement of predic-
tion accuracy are Mean Absolute Percentage Error (MAPE) and Mean Absolute
Error (MAE).
4.1.1 MAPE
The MAPE may be a live of prediction accuracy of a forecasting methodology for
constructing fitted statistic values in statistics, specifically in trend estimation.
it always expresses accuracy as a proportion of the error. as a result of this range
may be a percentage, it may be easier to know than the opposite statistics. The
MAPE is outlined as shown in (1). Here, and area unit the actual worth and
therefore the forecast worth, severally. Also, is the number of times discovered.
n
100 At − Ft
M AP E = (1)
n t=1 At
4.1.2 MAE
In statistics, the MAE is employed to measure however shut forecasts or predic-
tions area unit to the particular outcomes. It’s calculated by making a mean of
absolutely the variations between the prediction values and therefore the actual
ascertained values. The MAE is defined as shown in (2). Wherever nine is that
the prediction price, is the actual price.
n n
1 1
M AE = |fi − yi | = |ei | (2)
n i=1 n i=1
1126 R. A. Abbasi et al.
We used XGBoost for forecasting the load for a specific time lag in a week using
weekly data. Figure 5 shows the real load in the dataset and the XGBoost fore-
casted load. Here x-axis is depicting the time lags where y-axis is the load at
1128 R. A. Abbasi et al.
that specific time lag. Actual load is represented by the blue graph, where fore
casted load is represented by the green graph. We can see that the XGBoost
load prediction follows the real load at most of the time, however at some high
load instances XGBoost is not exactly following the real load. We can see that
XGBoost is not predicting well at the high loads.
The XGBoost load forecasting results for a time lag are displayed in Fig. 6.
We can see that XGBoost forecasting technique results in a low Mean Average
Percentage Error (MAPE), high accuracy and high Mean Average Error (MAE).
XGBoost load prediction resulted in a 10.08% MAPE, 97.21% accuracy and
88.90% MAE.
STLF Using XGBoost 1129
6 Conclusion
In this paper, we proposed a new scheme for electricity load forecasting. We con-
verted daily electricity load information into weekly load information. It increases
number of features available for predicting load for a lag variable. Then, we used
XGBoost, a recently dominant machine learning technique for time series pre-
diction, for feature selection from converted data. Once features are extracted
we train the model using XGBoost. After training we use trained model for load
prediction.
References
1. Mathaba, T., Xia, X., Zhang, J.: Analysing the economic benefit of electricity price
forecast in industrial load scheduling. Electr. Power Syst. Res. 116, 158–165 (2014)
2. Sarada, K., Bapiraju, V.: Comparison of day-ahead price forecasting in energy
market using Neural Network and Genetic Algorithm. In: Proceeding of the Inter-
national Conference on Smart Electric Grid, pp. 1–5 (2014)
3. Shafie-Khah, M., Moghaddam, M.P., Sheikh-El-Eslami, M.: Price forecasting of
day-ahead electricity markets using a hybrid forecast method. Energy Convers.
Manag. 52(5), 2165–2169 (2011)
4. Garcia, R.C., Contreras, J., Van Akkeren, M., Garcia, J.B.C.: A GARCH forecast-
ing model to predict day-ahead electricity prices. IEEE Trans. Power Syst. 20(2),
867–874 (2005)
5. Shahidehpour, M., Yamin, H., Li, Z.: Market overview in electric power systems. In:
Market Operations in Electric Power Systems, pp. 1–20. Wiley, New York (2002)
6. Ci-wei, G., Bompard, E., Napoli, R., Cheng, H.: Price forecast in the competitive
electricity market by support vector machine. Phys. A: Stat. Mech. Appl. 382(1),
98–113 (2007)
1130 R. A. Abbasi et al.
7. Cai, Y., Lin, J., Wan, C., Song, Y.: A stochastic Bi-level trading model for an
active distribution company with distributed generation and interruptible loads.
IET Renew. Power Gener. 11(2), 278–288 (2017)
8. Weron, R.: Electricity price forecasting: a review of the state-of-the-art with a look
into the future. Int. J. Forecast. 30(4), 1030–1081 (2014)
9. Hu, L., Taylor, G.: A novel hybrid technique for short-term electricity price fore-
casting in UK electricity markets. J. Int. Counc. Electr. Eng. 4(2), 114–120 (2014)
10. Voronin, S., Partanen, J.: Forecasting electricity price and demand using a hybrid
approach based on wavelet transform, ARIMA and neural networks. Int. J. Energy
Res. 38(5), 626–637 (2014)
11. Kou, P., Liang, D., Gao, L., Lou, J.: Probabilistic electricity price forecasting with
variational heteroscedastic Gaussian process and active learning. Energy Convers.
Manag. 89, 298–308 (2015)
12. Shrivastava, N.A., Panigrahi, B.K.: A hybrid wavelet-ELM based short term price
forecasting for electricity markets. Int. J. Electr. Power Energy Syst. 55, 41–50
(2014)
13. He, K., Xu, Y., Zou, Y., Tang, L.: Electricity price forecasts using a curvelet
denoising based approach. Phys. A: Stat. Mech. Appl. 425, 1–9 (2015)
14. Wan, C., Xu, Z., Wang, Y., Dong, Z.Y., Wong, K.P.: A hybrid approach for prob-
abilistic forecasting of electricity price. IEEE Trans. Smart Grid 5(1), 463–470
(2014)
15. Wan, C., Niu, M., Song, Y., Xu, Z.: Pareto optimal prediction intervals of electricity
price. IEEE Trans. Power Syst. 32(1), 817–819 (2017)
16. Liu, B., Nowotarski, J., Hong, T., Weron, R.: Probabilistic load forecasting via
quantile regression averaging on sister forecasts. IEEE Trans. Smart Grid 8(2), 1
(2017). https://doi.org/10.1109/tsg.2015.2437877
17. Wanga, P., Liu, B., Hongb, T.: Electric load forecasting with recency effect: a big
data approach. Int. J. Forecast. 32, 585–597 (2016)
18. Chou, J.-S., Ngo, N.-T.: Smart grid data analytics framework for increasing energy
savings in residential buildings. Autom. Constr. 72, 247–257 (2016)
19. Ludwig, N., Feuerriegel, S., Neumann, D.: Putting big data analytics to work:
feature selection for forecasting electricity prices using the LASSO and random
forests. ISSN 1246-0125 (Print) 2116-7052
20. Lago, J., De Ridder, F., De Schutter, B.: Forecasting spot electricity prices: deep
learning approaches and empirical comparison of traditional algorithms. Appl.
Energy 221, 386–405 (2018)
21. Wang, L., Zhang, Z., Chen, J.: Short-term electricity price forecasting with stacked
denoising autoencoders. IEEE Trans. Power Syst. 32(4), 2673–2681 (2017)
22. Lagoa, J., De Ridder, F., Vrancx, P., De Schutter, B.: Forecasting day-ahead elec-
tricity prices in Europe: the importance of considering market integration. Appl.
Energy 211(1), 890–903 (2018)
23. Raviv, E., Bouwman, K.E., van Dijk, D.: Forecasting day-ahead electricity prices:
utilizing hourly prices. Energy Econ. 50, 227–239 (2015)
24. Xiao, L., Jianzhou Wang, R., Hou, J.W.: A combined model based on data pre-
analysis and weight coefficients optimization for electrical load forecasting. Energy
82(15), 524–549 (2015)
25. Singh, S., Yassine, A.: Big data mining of energy time series for behavioral analytics
and energy consumption forecasting. Energies 11, 452 (2018)
26. Moon, J., Kim, K.-H., Kim, Y., Hwang, E.: A short-term electric load forecasting
scheme using 2-stage predictive analytics. In: 2018 IEEE International Conference
on Big Data and Smart Computing (2018)
STLF Using XGBoost 1131
27. González, J.P., San Roque, A.M., Perez, E.A.: Forecasting functional time series
with a new Hilbertian ARMAX model: application to electricity price forecasting.
IEEE Trans. Power Syst. 33(1), 545–556 (2018)
28. Luo, J., Hong, T., Fang, S.-C.: Benchmarking robustness of load forecasting models
under data integrity attacks. Int. J. Forecast. 34, 89–104 (2018)
29. Dong, G., Chen, Z.: Data driven energy management in a home microgrid based
on Bayesian optimal algorithm. IEEE Trans. Ind. Inform
30. Fausett, L.: Fundamentals of Neural Networks: Architectures, Algorithms, and
Applications. Pearson Education, Delhi (2006)
31. Shekhar, S., Amin, M.B.: Generalization by neural networks. IEEE Trans. Knowl.
Data Eng. 4, 177–185 (1992)