0% found this document useful (0 votes)

50 views12 pages

1 s2.0 S0960148124021232 Main

Uploaded by

txg0909

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

50 views12 pages

1 s2.0 S0960148124021232 Main

Uploaded by

txg0909

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 12

Renewable Energy 239 (2025) 122055

Contents lists available at ScienceDirect

Renewable Energy
journal homepage: www.elsevier.com/locate/renene

Wind and solar power forecasting based on hybrid CNN-ABiLSTM,

CNN-transformer-MLP models
Tasarruf Bashir a , Huifang Wang a,* , Mustafa Tahir a,b, Yixiang Zhang a
a
College of Electrical Engineering, Zhejiang University, Hangzhou, 310058, China
b
Shaoxing Institute of Zhejiang University, Shaoxing 312099, China

A R T I C L E I N F O A B S T R A C T

Keywords: Accurate prediction of solar and wind power output is crucial for effective integration into the electrical grid.
Renewable energy Existing methods, including conventional approaches, machine learning (ML), and hybrid models, have limi
Solar and wind power forecasting tations such as limited adaptability, narrow generalizability, and difficulty in forecasting multiple types of
Transformer model
renewable energy respectively. To address these challenges, this study introduces two novel hybrid models: the
Bidirectional long-short-term memory model
Hybrid model
CNN-ABiLSTM, which integrates Convolutional Neural Networks (CNN) with Attention-based Bidirectional Long
Short-Term Memory (ABiLSTM), and the CNN-Transformer-MLP, which integrates CNN with Transformers and
Multi-Layer Perceptrons (MLP). In both hybrid models, the CNN captures short-term patterns in solar and wind
power data, while the ABiLSTM and Transformer-MLP models address the long-term patterns. CNN, BiLSTM, and
Encoder-based Transformer were taken as baseline standalone models. The proposed hybrid models and
standalone baseline models were trained on quarter-hour-based real-time data. The hybrid models outperform
standalone baseline models in day, week, and month-ahead forecasting. The CNN-Transformer-MLP hybrid
provides more accurate day and week-ahead solar and wind power predictions with lower mean absolute error
(MAE), root mean square error (RMSE), and mean square error (MSE) values. For month-ahead forecasts, the
CNN-ABiLSTM hybrid excels in wind power prediction, demonstrating its strength in long-term forecasting.

1. Introduction renewable energy generation, their intermittent and stochastic nature

poses significant challenges for optimal integration into the power grid.
1.1. Motivation and incitement Accurate forecasting of wind and solar power generation is essential not
only for minimizing generation-demand mismatches but also for
The demand for renewable power in the power generation sector is enhancing grid stability, reducing reliance on reserve capacities, and
increasing alongside the growing demand for electricity [1]. The improving the efficiency of predictive dispatch. This forecasting capa
growing demand for renewable energy sources (RESs) in the grid is bility supports both short-term operational decisions, such as load
driven by their carbon neutrality and eco-friendliness compared to fossil balancing and real-time scheduling, and long-term strategic planning,
fuels [2]. Various studies [3–6], have empirically demonstrated an in including capacity expansion, infrastructure investments, and policy
verse proportional relationship between carbon emissions and per capita development. Furthermore, these insights contribute to the optimal
consumption of renewable energy, showing a significant decrease of allocation of resources, enhancing the overall reliability and economic
1.25 percent in carbon emissions for every one percentage point increase viability of renewable energy systems [8]. As a result, wind and solar
in renewable energy consumption per capita. Owing to these positive power generation forecasting remains an active area of research, driving
environmental impacts, most developed and developing countries are the need for innovative solutions, particularly in scenarios where access
adopting renewable energy sources as a primary energy source. Ac to high-resolution meteorological data is limited.
cording to the International Energy Agency (IEA), between 2023 and
2028, renewable energy sources are expected to supply 42 percent of 1.2. Literature review
global electricity generation, with solar and wind power contributing a
25 percent share [7]. Despite the critical role of wind and solar power in RES power generation forecasting is typically categorized based on

* Corresponding author.
E-mail address: [email protected] (H. Wang).

https://doi.org/10.1016/j.renene.2024.122055
Received 18 August 2024; Received in revised form 12 November 2024; Accepted 28 November 2024
Available online 29 November 2024
0960-1481/© 2024 Elsevier Ltd. All rights are reserved, including those for text and data mining, AI training, and similar technologies.
T. Bashir et al. Renewable Energy 239 (2025) 122055

the time span of the forecast, ranging from ultra-short-term (up to 1 h), day-ahead wind power generation data. Similarly, a hybrid CNN and
short-term (daily), medium-term (daily to weekly), to long-term fore Transformer model proposed in Ref. [33] focuses on forecasting wind
casting (weekly to yearly) [9]. Ultra-short-term forecasts are vital for power generation data across multiple farms for ultra-short-term and
grid management, short-term forecasts support real-time electricity short-term periods, thereby overlooking the need for a model that
dispatch and unit commitment, medium-term forecasts assist in grid handles both solar and wind power data. A hybrid Wavelet Packet
maintenance planning, and long-term forecasts help in planning grid Decomposition (WPD)-CNN-LSTM-MLP model was proposed in
expansion [10]. Regarding forecasting techniques, RES power genera Ref. [33] to forecast hourly-based solar irradiance data. The
tion prediction utilizes three main methods. Analytical equations are the WPD-CNN-LSTM-MLP hybrid exhibited superior performance compared
foundation of the ’White-Box’ or ’Physical’ methods, which investigate to other network combinations, indicating that a random combination of
the interaction between a variety of factors that affect the production of diverse networks for forecasting tasks is inadequate. This was notably
RES [11]. Conversely, ’Black-Box’ methods, which include statistical underscored by the efficacy of the WPD-CNN-LSTM combination. A
and machine learning approaches, rely on historical RES power gener hybrid model, comprising a CNN layer, fully connected neural network
ation data to uncover hidden relationships between output and input layers, and Gated Recurrent Unit (GRU) layers, was used to predict very
variables using mathematical models [12]. Statistical methods, such as short-term wind data in Ref. [34]. In Ref. [35], a Multi-Head-Attention
the Exponential Smoothing (ES), auto regressive moving average (MHA) probabilistic CNN-BiLSTM model was proposed to forecast wind
(ARMA), and autoregressive integrated moving average (ARIMA) [13], speed data. All these studies focused on short term forecasting for a
and regression models [14], are particularly effective for single RES generation but did not tackle the problem of combined solar
ultra-short-term forecasting of RES data. Machine learning (ML) [15] and wind forecasting for longer period of time.
techniques, such asartificial neural networks (ANN), decision trees [16, There is a dearth of published research on hybrid models that
17], and k-nearest neighbors (kNN) [18], are predominantly used for attempt to predict data from both solar and wind power sources. For
short-term forecasting [19]. Lastly, ’Grey-Box’ methods, which are hy example, in Ref. [36], a novel approach was introduced to forecast solar
brids of deep and ML models, have been shown to yield the most ac and wind power generation data by utilizing a hybrid deep learning
curate forecasting results compared to the aforementioned methods in model that integrates time2vec, wide-first layer kernels CNN, and
all time-frames [20,21].These methodologies profit from the advantages BiLSTM. Although this hybrid model utilized hourly based wind and
of both ’White-Box’ and ’Black-Box’ methodologies, thereby delivering solar data, it didn’t address the model’s forecasting accuracy over longer
more precise predictions [22]. Different RES forecasting models have periods. In Ref. [37], the author proposed using statistical Markov Chain
been compared in Ref. [23], highlighting research trend on hybrid Monte Carlo (MCMC) simulations to forecast wind and solar power
models due to their better performance than standalone counterparts. generation, aiming to optimize long-term energy contracts for pur
Various studies have employed diverse combinations of machine and chasing renewable energy. The forecasting results from the MCMC
deep learning-based hybrid models to predict the RES power generation simulations were compared with those derived from Bayesian estimates
data. In Ref. [24], the Transformer model’s forecasting capabilities were to evaluate their effectiveness. However, it didn’t address the model’s
investigated in light of the correlation between various wind farms in forecasting over shorter periods. In addition data from a single site was
order to forecast short-term wind power production. Although the used to evaluate the forecasting accuracy in above studies.
Transformer model excels at capturing long-term dependencies in se Each of the hybrid models mentioned above, including MHA-CNN-
quences through its self-attention mechanism [25], it may overlook local BiLSTM, CNN-LSTM-TF, WPD-CNN-LSTM-MLP and so on, surpasses
relations. Therefore, to thoroughly investigate the local relations within the performance of traditional (like ARIMA, ES, etc.) or ML models (such
sequences, the Transformer model has been utilized in a hybrid fashion. as ANN, kNN, LSTM, Transformer etc.) when used standalone. These
For example, in Ref. [26] Transformer (TF) model was combined for hybrid approaches improve prediction accuracy by combining the
accuracy improvement with CNN and LSTM in different sequences. By strengths of both methods and compensating for their individual
employing solar power generation data different model combinations weaknesses. However, most hybrid approaches are specifically designed
were tested and the CNN-LSTM-TF combination found to be optimal. and applied to data from single renewable energy sources (RES), either
Mostly, the combination of CNN with other deep learning models has solar or wind power. While some hybrid techniques utilize both solar
been employed to harness its capability to capture the intrinsic features and wind power data, their use has been confined to short-term fore
of RES power generation data, complemented by weather data [27,28]. casting. While short-term forecasts are essential for real-time electric
The integration of CNN with other models has been utilized for pre grid management and operational efficiency, the longer-intervals fore
dicting outcomes using both univariate and multivariate input datasets. casts are also crucial for tactical planning and ensuring reliability in
Research indicates that multivariate hybrid CNN models tend to perform energy supply. However, there is a significant gap for hybrid models that
better in initial time steps. Conversely, the performance of univariate are computationally efficient, free from manual features selection and
hybrid CNN models typically improves over later time steps [29,30]. can effectively forecast not only short term but also longer-periods of
This pattern suggests that hybrid CNN models are effectively adaptable time.
for forecasting with both types of datasets.
Most of hybrid models for short-term forecasting focused on single 1.3. Contribution and organization
RES power generation either solar power or wind power. In Ref. [31],
the author suggested a hybrid model for forecasting short-term solar To mitigate the aforementioned issues, this study proposes two
power generation data that consists of two noise-removal decomposition hybrid techniques: CNN-ABiLSTM, CNN-Transformer-MLP. The pro
layers: Variational Mode Decomposition (VMD) and Improved Com posed work addresses the technical gap in solar and wind power data
plementary Ensemble Empirical Mode Decomposition (ICEEMD). Then, forecasting by overcoming the limitations of recent studies. These ap
the Whale Optimization Algorithm (WOA) is used to determine the proaches were rigorously tested and evaluated using several evaluation
optimal parameters for tailoring a BiLSTM model. Finally, an attention metrics. The hybrid models outperform the standalone CNN, BiLSTM,
layer was introduced to focus on key information, enhancing the fore and Transformer methods in terms of performance accuracy, as evi
casting accuracy. The incorporation of diverse deep learning models denced by empirical findings. This confirms the competitiveness and
enhances the precision of solar power generation data prediction. efficacy of the proposed models in forecasting production data for solar
However, this integration also escalates the computing expense required and wind power. The main findings of this study are outlined as follows.
to execute the hybrid model. In Ref. [32], A hybrid model, utilizing the
combination of Weighted Extreme Learning Machine (ELM) and Particle ● Hybrid of CNN-ABiLSTM and CNN-Transformer-MLP models is pro
Swarm Optimization (PSO), was employed to predict hour and posed in this work for both solar and wind power forecasting. The

2
T. Bashir et al. Renewable Energy 239 (2025) 122055

proposed hybrid methods considered both short-term and long-term involving data collection, handling missing values and noise, data di
patterns in solar and wind power data. vision, and normalization. This study gathers renewable power gener
● Both hybrid models utilize the CNN model to address the short-term ation data from renewable energy sources, which often contain missing
patterns in solar and wind power data. The Transformer-MLP and values and noise due to sensor malfunctions. Methods like linear inter
ABiLSTM models capture the residual long-term patterns. polation and forward or backward fill are used to address missing values
● The hybrid models were trained on real-time, quarter-hourly uni [38]. Similarly, noise in RES data can be addressed using various
variate solar and wind power data sourced from the European methods, with normalization commonly applied. The dataset is subse
Network of Transmission System Operators for Electricity (ENTSO- quently divided into three subsets for the purpose of training, validating,
E), specifically covering Germany and Luxembourg. and evaluating the forecasting model [39]. The training set changes
● The proposed hybrid models have a forecasting horizon that spans model parameters, the validation set examines performance after each
from one day to a month and were evaluated and compared against epoch, and the testing set assesses accuracy.
standalone CNN, Encoder-based Transformer, and BiLSTM models Data normalization is essential for optimizing model performance
using evaluation metrics including MSE, RMSE, MAE, and the coef and accelerating convergence, particularly for neural network-based
ficient of determination (R2). models. The Min-Max normalization method scales data within the
range of 0–1 and is given in Eq. (1) below:
The remaining sections of the paper are organized in the following xo − xmin
manner: Section 2 provides a detailed explanation of the suggested xo =
̂ (1)
xmax − xmin
approach, including the methodology, data preparation, and an over
view of the model components, such as CNN, BiLSTM, Transformer, and Where ̂ x o is the normalized value of data and xo denotes original input.
hybrid models. Section 3 briefly delve into the evaluation metrics. The xmax and xmin denotes the maximum and minimum values of the original
experimental setup is described in Section 4, and the results are dis values respectively.
cussed in Section 5. Section 6 conclude the paper. In this study real-time renewable solar PV and wind power genera
tion data were downloaded from platform of ENTSO-E [40]. Data from
2. Methodology the ENTSO-E platform, covering German (DE) and Luxembourg (LU)
regions with 15-min intervals, was downloaded. The dataset spans four
This section outlines various prediction methodologies using deep years (2018–2023) with 178,272 samples. Data from 2018 to 2021 (119,
learning-based neural network models to extract patterns in solar and 712 observations) was used for training, while the following 13 months
wind power data. It concludes with the principles of the proposed hybrid (29,280 observations) were used for validation and testing. The data
models, which aim to enhance accuracy and efficiency in renewable shows minor short-term fluctuations, as each instance represents the
power prediction. The overall forecasting process is illustrated in Fig. 1. combined power generation from various solar and wind farms. Prior to
forecasting, the data was processed as per the mentioned criteria.
2.1. Data preparation

Data preparation is crucial in developing forecasting models,

Fig. 1. Overall procedure for RES power generation forecasting.

3
T. Bashir et al. Renewable Energy 239 (2025) 122055

2.2. Model components gates. The sigmoid activation function also enables an LSTM model to
capture time series data non-linear features. The output gate’s hyper
Model components subsection briefly describes the basic principle of bolic tangent activation function, conversely, controls the output be
each component employed in the proposed hybrid frameworks. tween − 1 and 1. it stands for the input gate, ft for the forget gate, ot for
the output gate, ct for the cell state, and ht for the hidden state.
2.2.1. Convolutional neural network In an LSTM model, the forget gate ft uses a sigmoid function to decide
A convolutional neural network (CNN) is a feedforward neural whether to retain or discard the previous cell content. Subsequently, the
network that uses convolutional operations to extract high-level features cell state is modified by merging the forget gate (ft) with the prior cell
and correlations from time series [41]. Incorporating a 1D convolutional state (ct-1) and the input gate (it) with the current input. The output gate,
layer enhances accuracy and reduces complexity, while CNNs are denoted as ot, utilizes a sigmoid function to ascertain whether to retain
effective at extracting informative features from noisy data [42]. or transmit the present output. The hidden state ht is obtained by
A typical CNN architecture consists of three primary components: a multiplying the controlled output of ot with the current cell state ct. Both
convolutional layer, a pooling layer, and a fully connected layer. The ct and ht are then transmitted to the next stage.
convolutional layer extracts contextual spatial information from data This process is illustrated in Fig. 2.
using convolutional kernels, with multiple kernels generating feature The BiLSTM model comprises two LSTM blocks: a forward LSTM
maps through their interactions. block processing the input sequence from t-m to t, and a backward LSTM
⎛ ⎞ block processing from t to t-m. Forward LSTM network block’s output
∑ (→o t ) and backward LSTM network block’s output (← o t ) are carried out by
ym
j =f
⎝ xm− 1 ⊙ wm + bm ⎠
i ij j (2)
i∈Mj utilizing the working principles of single LSTM model described above.
The mathematical dynamics of output of BiLSTM model (yt ) is given in
Where ym m− 1 Eq. (6).
j is the j th output of m th convolutional layer, xi is the i th
output of (m-1)th convolutional layer. While Mj is the selection of input →, ←
yt = σ(ot ot ) (6)
maps, and wmij denotes the weight between i the input and j th output. ⊙
The purpose of utilizing sigmoid activation function (σ ) is to unify
denotes the convolution operation and bm
j denotes the j th bias of m th
outputs of single LSTM models. To regulate overfitting of output of
layer. Finally, rectified linear unit (ReLU) activation function is denoted
BiLSTM model, drop-out technique is employed. The rate of the drop-out
by f.
layer is a hyperparameter need to be tuned accordingly while defining
A pooling layer in CNN architecture reduces parameters by either
BiLSTM model. The working operation of the BiLSTM model is shown in
taking the global average or selecting the maximum value from the
Fig. 3.
convolutional kernel outputs, as represented by Eq. (3).
( ( ) )
m 2.2.3. Transformer model
ym
j = f βj max xj
m− 1
+ bm
j (3)
The Transformer model, by means of its self-attention mechanism,
learns temporal and recurring patterns in RES data more efficiently than
Where max(.) denotes the maxpooling subsampling function, βm
j repre traditional RNNs. Unlike RNNs, Transformers do not require processing
sents the j th bias m th layer. data in a sequential order. The components of the Transformer model
The fully connected layer produces the final results of the CNN model are discussed below.
by utilizing the outputs from the convolutional and pooling layers as its
inputs, as specified in Eq. (4).
2.2.3.1. Encoder-decoder architecture. A Transformer model typically
( )
consists of stacked encoder-decoder layers. Each layer includes a posi
ym
j = f w xj
m m− 1
+ bm
j (4)
tional encoding, a self/multi-head-attention, and a fully connected
sublayer. Each sublayer, following layer normalization, employs a re
Where ym
j denotes the final j th output of m th layer of the CNN model, sidual structure. The mathematical dynamics of these sublayers are
xm−
j
1
denotes the j th input vector of (m-1) th layer, wm denotes weight described as follows.
matrix between m th layer and (m-1) th layer.
outputsublayer = LayerNorm(x + Sublayer(x)) (7)
2.2.2. BiLSTM
Where, outputsublayer denotes output of sublayer. Sublayer(x) represents
BiLSTM is an advanced version of the LSTM model. Before delving
the function implemented by sub-layer. To support residual connection
into the BiLSTM model, it is essential to understand the fundamental
structure and working principles of the LSTM model. LSTM enhances
traditional RNNs by addressing the vanishing gradient problem and
improving long-term sequence retention through a gating mechanism
[43,44]. The LSTM unit cell comprises three gates: input, forget, and
output. These gates regulate the model’s capacity to retain long-term
dependencies. The mathematical dynamics of LSTM can be explained
as follows:
it = σ (U
( i ⋅[ht− 1 , xt ] + bi ))
ft = σ Uf ⋅[ht− 1 , xt ] + bf
ot = σ (U0 ⋅[ht− 1 , xt ] + bo ) (5)
ct = ft ⊙ ct− 1 + it ⊙ tanh(Uia ⋅[ht− 1 , xt ] + bia )
ht = ot ⊙ tanh(ct )

Where, U denotes the weight matrix and the bias of corresponding gates
are denoted by b. ⊙ symbol denotes element-wise multiplication. σ
symbol is often known as the sigmoid activation function and is
responsible for controlling the opening and closing of the corresponding
Fig. 2. Single Cell of LSTM model [45].

4
T. Bashir et al. Renewable Energy 239 (2025) 122055

information is obtained by adding each of these vectors with their cor

responding embedding vectors. The mathematical dynamics of posi
tional encoding is given in Eq. (8).
( / )
Pe(pos,2id) = sin pos 100002id/dmod el
( / ) (8)
Po(pos,2id+1) = cos pos 10000 2id/dmod el

Where, Pe , Po represents the even and odd positional elements.

2.2.3.3. Self-attention and multi-head attention mechanisms. The atten

tion mechanism prioritizes important tasks amid information overload,
efficiently managing limited computational resources and its input in
formation is comprised of three vectors: key, value, and query vectors.
The key vector and corresponding value vector can be denoted in the
Fig. 3. Working principle of BiLSTM model. form of pairs, expressed as [(K1 ,V1 ),(K2 ,V2 ),⋯,(Kn ,Vn )]. The remaining
query vector encapsulate the target values, and value vectors weights
in the Transformer model, outputsublayer of all the sub-layers including are determined based on query and key similarity. The final attention
input embedding should have the same self-defined dimensionality value is the weighted sum of value vectors. The fundamental concept of
(dmod el ). the attention mechanism is mathematically represented as follows:
A Transformer model employ this architecture to map an input
Atten = U × V
sequence (is1 ,is2 ,⋯,isn ) into a continuous sequence of outputs represented (9)
U = f(Q, K)
by c = (ic1 , ic2 , ⋯, icn ). While, during the forecasting, a decoder for given
(
elements of c generates an output sequence of representations io1 , io2 , ⋯, Where, Atten represents the attention values, V is the value vector of the
)
ion . Where, isn , icn , and ion represents n th input sequence, n th continuous pairs of (Kn , Vn ). (Q, K) represents query and key vectors respectively. U
sequence, and n th output sequence respectively. A Transformer decoder represents the weights of corresponding V, and f represents the weight
has the same number of stacked layers as the encoder. Initially, the data transformation function.
for renewable power generation is transformed to a normalized range of The self-attention mechanism takes the benefit of three trainable
values between 0 and 1 using a Min-Max scaler. The data is then con parameter metrics, namely WK , WV and WQ for the transformation of
verted into sequences, each assigned a numerical token and processed input sequence x into, value (V), key (K) and query (Q) vectors respec
through an input embedding layer, which maps each integer to a pre- tively. For weight transformation SoftMax function is employed. The dot
√̅̅̅̅̅
trained continuous vector. This study utilizes only the encoder unit of product of K and Q divided by dk yields weight of the V. The funda
Transformer model, as depicted in Fig. 4. mentals of self-attention mechanism is elaborated using mathematical
dynamics in Eq. (10) and (11).
2.2.3.2. Positional encoding. Since Transformer model lacks inherent
the recursion phenomena, that’s why employment of positional encod K = WK x ∈ Rdk ×N
V = WV x ∈ Rdv ×N (10)
ing to inject positional information of time series into embedding is
Q = WQ x ∈ Rdk ×N
inevitable. In this paper, sine and cosine functions generated two
different vectors for even and odd time steps respectively. Positional

Fig. 4. Encoder-based Transformer model.

5
T. Bashir et al. Renewable Energy 239 (2025) 122055

( )
QKT MultiHeadAtten(Q, K, V) = Concat(h1 , ⋯, hh )Wo (12)
Atten(Q, K, V) = soft max √̅̅̅̅̅ V (11)
dK
Where, hi = Atten(Qi , Ki , Vi ).
Where dK denotes dimension of K.
The multi-head attention mechanism splits a single self-attention Qi = QWiQ , WiQ ∈ Rdmod el ×dK
/
mechanism into h parallel heads, each calculated using the formula in Ki = KWiK , WiK ∈ Rdmod el ×dK , dK = dV = dmod el h
(13)
Eq. (11) to generate different weight matrices. The individual weight Vi = VWiV , WiV ∈ Rdmod el ×dV
matrices WiQ , WiK and WiV , are responsible for transformation of Q, K, and i = 1, 2, 3, ⋯, h
V of dimension of dmod el into h vectors Qi , Ki and Vi , each of dimension
dmod
. Finally, the outputs of all heads of each parallel layer are
el Where, WiQ , WiK , and WiV are linearly projected from Q, K, and V. Wio is
h
concatenated and passed through a linear layer to get the final value. also a projected parameter matrix utilized to remap the result of
The mathematical dynamics of above explanation is given as follows: concatenation to dmod el dimensions.

Fig. 5. Proposed hybrid model.

6
T. Bashir et al. Renewable Energy 239 (2025) 122055

2.2.4. Hybrid model strategy the purpose of employing the CNN model is also same like the hybrid
This study explores various hybrid framework strategies for fore CNN-ABiLSTM that is extraction of high level features and correlation
casting renewable power generation data, including CNN-LSTM, CNN- present in the renewable power generation data. After the extraction of
BiLSTM, CNN-ABiLSTM, and CNN-Transformer-MLP techniques. How high level feature and correlation by the CNN, these features are passed
ever, only two of the hybrid techniques CNN-ABiLSTM and CNN- through the positional encoding layer. The purpose of employment of
Transformer-MLP outperformed other hybrid techniques. While, the the positional encoding is to assign a specific position to a specific time-
CNN-Transformer-MLP hybrid model outperformed the previously step for the processing of the Transformer layer. Although the Trans
mentioned CNN-ABiLSTM. Therefore, these two hybrid techniques are former does not possess inherent sequential data processing skills, it is
discussed in detail here in this section on hybrid model strategy. Fig. 5 adept at capturing long-term dependencies in RES data. The output of
elaborates on the two proposed hybrid models, which share the same the Transformer model is subsequently fed to multilayer perceptron to
preprocessing steps for wind and solar power datasets. Note that the generate the ultimate result.
models were trained separately for wind and solar power predictions, A key difference between the two hybrid techniques resides in the
using wind data for wind power forecasting and solar data for solar utilization of BiLSTM and Transformer models. The BiLSTM models may
power forecasting. The figure also highlights the structural differences be susceptible to the vanishing gradient issue. This issue can hinder its
between the proposed hybrid models. ability to model the long-range dependencies present in the RES data.
Because renewable power generation data is sequential in nature, the
2.2.4.1. CNN-ABiLSTM model. Hybrid of CNN-ABiLSTM model is a BiLSTM model requires additional training time compared to the
stacked fashion hybrid strategy. This hybrid strategy utilizes the Encoder-based Transformer model, which is also sequential. However,
strength of each model at different stages of forecasting process. The the Encoder-based Transformer models cope up with aforementioned
hybrid model structure begins with the 1D-CNN, followed by BiLSTM problems leveraging its strong parallel processing ability. Similarly, to
networks, and concludes with the incorporation of attention mechanism address the issue of capturing long-range dependencies of the BiLSTM
along with a dense layer. Initially, the rolling (sliding) window method model that Transformer model can easily capture with the help of in-
is essential for structuring the input data. This method allows the model built attention mechanism, different attention mechanism modules
to forecast the subsequent time step by utilizing the renewable power such as simple attention, self-attention, and multi-head attention
data from the previous time steps (particularly solar and wind power). mechanisms are employed. However, simple attention mechanism
The rolling window strategy utilizes the following time step as the modules gave the optimal results at the optimal training time. So, the
output variable and the preceding time steps as the input variables. simple attention mechanism is finally utilized with the hybrid strategy of
The width of the window in the rolling window method is defined by CNN-ABiLSTM model.
the number of previous time steps that were utilized. Following the se
lection of the ideal window width, CNN is fed renewable power data 3. Evaluation metrics
with the ideal window size. A CNN model has the strong capability to
capture the local (short-term) correlations between the previous and The performance of previously described forecasting models is
next time steps of univariate renewable power data. The extracted cor evaluated by employing four separate evaluation metrics: the mean
relations with the help of CNN are fed to the BiLSTM model. Thus absolute error (MAE), the root mean squared error (RMSE), the mean
BiLSTM model will utilize the new information extracted by the CNN for squared error (MSE), and finally coefficient of determination (R2 )
the identification of innate contextual temporal features (long-term method. Mathematically these evaluation metrics can be described as
correlation) to forecast renewable power generation data. In addition, follows:
the output of the BiLSTM model is inputted into the attention mecha
N ⃒
nism module. 1 ∑ ( )⃒
MAE = ⃒ ̂y R − yR ⃒ (15)
The primary function of the attention mechanism is to allocate more N L=1
weights in the output layer of the BiLSTM model to time steps that are √̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅
more meaningful. This allows the model to effectively adjust to shifting √
√1 ∑ N
( )2
patterns and variable lengths of temporal dependencies. It ensures that RMSE = √ y R − yR
̂ (16)
N L=1
BiLSTM layer can selectively attend to more relevant portions of time
series during forecasting.
N
The proposed hybrid framework of CNN-ABiLSTM employs the 1 ∑ ( )2
MSE = y R − yR
̂ (17)
simple attention mechanism, foregoing self-attention or multi-head N L=1
attention mechanism and mathematical dynamics of this simple atten
tion mechanism is given as follows: Rss
R2 = 1 − (18)
Tss
e = tanh(W • x + b)
β = soft max(e) Where N denotes total number of the data points, ̂ y R represents fore
∑n (14)
context = (xi ⋅βi ) casted value of renewable power generation data, and yR is the original
i=1 data. Finally, the Rss represents the residual sum of squares, and Tss is the
total sum of squares.
The set of new values, denoted by e, is created by calculating the dot
product of the input sequence x and the weight matrix W, and then
4. Experimental setup
passing the result through a hyperbolic tangent activation function. β is
the probability distribution of e. Finally, the context is weighted sum of
The open-source TensorFlow machine learning framework served as
the i th input sequence (xi ) and i th probability distribution (βi ).
the platform for coding in Python. In the hybrid CNN-ABiLSTM model,
the CNN layer is composed of two Con1D layers. The Con1D layers are
2.2.4.2. CNN-transformer-MLP model. The working principle of the
utilized by employing the built-in "keras.layers" package, with 64 filters
hybrid of CNN-Transformer-MLP is someway similar to the hybrid of
(convolutional kernels), a kernel size of 2, and ReLU as the activation
CNN-ABiLSTM. The input fed to the hybrid of CNN-Transformer-MLP
function. The BiLSTM layer consists of 50 LSTM units, each having a
model also employs same technique of the rolling or sliding window.
ReLU activation function. Finally, the mathematical formulation of Eq.
The CNN model is also an important part of this hybrid technique, and
(14) helped to build a customized attention mechanism function by

7
T. Bashir et al. Renewable Energy 239 (2025) 122055

employing the build-in “Layer” base-class of keras.layers. These con Table 2

figurations of different layers are also outlined in Table 1. Hyperparameters of CNN-Transformer-MLP model.
In the hybrid CNN-Transformer-MLP model, the CNN layer includes Model Parameter Value/Function
a Conv1D layer with 16 convolutional kernels, each with a kernel size of
Conv1D filters 16
1, and ReLU as an activation function. Positional encoding in the kernel size 1
Transformer’s Encoder layer is determined using Eq. (8) to identify the activation relu
positions of even and odd elements in the encoding vector. Encoder block of Transformer embeddings 16
The built-in “tensorflow.keras.layers.Layer” package served as a heads 2
ff_dim 32
building platform of encoder layer and implementing the multi-head GlobalAveragePooling1D – –
attention mechanism within the encoder layer. After the Encoder- MLP 1st layer nodes 64
based Transformer layer in this hybrid strategy, the global average 2nd layer noes 32
pooling layer with single dimension was called to get the fixed size activation relu
Dense output-nodes 1
output. Finally, the MLP layer was employed, comprising of two hidden
Optimizer adam –
layers with 64 neurons in the initial layer and 32 neurons in the sub
sequent layer. The configurations of the different layers are also
enumerated in Table 1. However, when it comes to predicting wind power data, the baseline
A rolling window-based function was created for data preprocessing CNN model stands out with poor results compared to the other models.
to construct sequences for optimal forecasting. The built-in “sklearn. The primary factor contributing to the poor performance of the CNN
preprocessing” module within “scikit-learn” helped to import the Min- model is its incapacity to effectively capture the intricate patterns
Max scaler function. The pre-built “adam” function and build-in mean inherent in the wind power data. The remaining baseline models exhibit
squared error function demonstrated its advantage in algorithm opti a similar performance pattern over a longer duration (7, 8). Meanwhile,
mization and in calculating the loss during training of both hybrid and the proposed hybrid strategies display an interesting behavior for the
single models. The LSTM based models underwent training for a cu predictions over a longer period of time. For week ahead predictions, the
mulative duration of 100 epochs, with a learning rate of 0.001. While, hybrid of CNN-Transformer-MLP model display comparatively better
the transformer based models used the learning rate of 0.00001. Early- performance for both datasets (7). While on the other hand, for the case
stopping with 10 epochs was used to prevent overfitting and determine of month ahead predictions, the CNN-ABiLSTM model outperform the
the trade-off between training performance and data generalization CNN-Transformer-MLP and other baseline models (8). Tables 3–5, de
during forecasting model training. The hyperparameters to get the pict the evaluation metrics in detail for the forecasting performance of
optimal performance of single base-line models and above mentioned proposed hybrid and baseline models over three different periods of
hybrid models are also presented in Table 2. time.
It is important to mention that the aforementioned investigations For the day ahead prediction of renewable power data (solar and
were conducted on a computer that met the following specifications: wind power datasets), both hybrid strategies outperform the baseline
Processor: Intel (R) Core (TM) i7-12700 CPU @ 2.10 GHz, RAM: 16.0 models with RMSE values of 36.94 and 31.32 for solar power data and
GB. The simulation runtime of the proposed hybrid CNN-ABiLSTM for wind power data with values of 44.82 and 43.07. It is evident from
model is longer than that of a standalone CNN model but shorter than Table 3 that CNN-Transformer-MLP model also outperformed CNN-
that of other baseline and hybrid models. ABiLSTM model with lower MSE, MAE, and RMSE, and R2 values for
both solar and wind power datasets. Fig. 9 (a) further illustrates the
5. Results and discussions dominance of the CNN-Transformer-MLP model through a spider plot.
As shown in the figure, the standalone CNN model had the worst per
This study aims to evaluate two hybrid strategies that incorporate a formance when compared to the CNN-Transformer-MLP model, with
CNN layer, using the renewable power production data discussed in RMSE values 64.79 % and 84.3 % higher, MAE values 79.03 % and 87.5
previous sections, to predict solar and wind power output. The baseline % higher, and MSE values 87.60 % and 97.5 % higher for solar and wind
models for assessing the efficacy and efficiency of the two proposed data, respectively. On the other hand, the CNN-ABiLSTM model per
hybrid strategies are the standalone CNN, BiLSTM, and Encoder-based formed significantly better, with RMSE values 15.2 % higher for solar
Transformer models. The obtained prediction results are analyzed and 3.9 % higher for wind, MAE values 35.9 % higher for solar and 10.4
through evaluation metrics as described in Section 3. % higher for wind, and MSE values 28.1 % higher for solar and 7.7 %
Figs. 6–8 depict the forecasting results of proposed hybrid strategies higher for wind, compared to the CNN-Transformer-MLP model.
and baseline models on both solar power and wind power datasets. Similarly, for the week ahead time horizon prediction hybrid of CNN-
These figures suggest that with the extension of the forecasting time Transformer-MLP model has proved its supremacy in performance and
horizon, the forecasting ability of baseline models in comparison to efficiency with respect to RMSE, MAE, and MSE, and R2 values
proposed hybrid strategies differs greatly. For day ahead prediction (6) compared to other hybrid and baseline models with RMSE values of
of solar power data, all models have yielded comparable outcomes. 32.28 and 117.74 for solar power and wind power data forecast
respectively. Fig. 9 (b) illustrates the percentage improvement in fore
casting accuracy of the hybrid CNN-Transformer-MLP model over other
Table 1 models. As shown in figure, the standalone CNN model had the worst
Hyperparameters of CNN-ABiLSTM model. performance when compared to the CNN-Transformer-MLP model. It
Model Parameter Value/Function performed higher RMSE values 53.18 % and 40.25 % higher, MAE
Conv1D (x2) filters 64 values of 68.46 % and 50.73 % higher, and MSE values of 78.06 % and
kernel size 2 64.21 % higher for solar and wind data, respectively. On the other hand,
activation relu the CNN-ABiLSTM model performed significantly better, with RMSE
BiLSTM hidden nodes 50
values 15.2 % higher for solar and 3.9 % higher for wind, MAE values
activation relu
return_sequences True 35.9 % higher for solar and 10.4 % higher for wind, and MSE values
Self-attention – – 28.1 % higher for solar and 7.7 % higher for wind, compared to the CNN-
Flatten – – Transformer-MLP model.
Dense output-nodes 1 For the case of month ahead predictions, the hybrid of CNN-
Optimizer adam
Transformer-MLP model proved itself more efficient while forecasting
–

8
T. Bashir et al. Renewable Energy 239 (2025) 122055

Fig. 6. Day ahead predictions (a) solar power data, (b) wind power data.

Fig. 7. Week ahead predictions (a) solar power data, (b) wind power data.

Fig. 8. Month ahead predictions (a) solar power data, (b) wind power data.

Table 3
Comparison of day-ahead forecasts across different models.
Model Name and Date Model’s Solar Power Generation Data Performance Evaluation Model’s Wind Power Generation Data Performance Evaluation

2022-11-30 RMSE (MW) MAE (MW) MSE (MW) R2 RMSE (MW) MAE (MW) MSE (MW) R2
CNN 88.95 81.60 7912.78 0.9953 274.48 270.88 75342.13 0.6471
BiLSTM 56.79 46.34 3225.51 0.9981 62.86 52.45 3952.02 0.9815
Encoder-based Transformer 48.09 29.02 2313.48 0.9986 46.48 36.82 2160.43 0.9898
CNN-ABiLSTM 36.94 26.70 1364.89 0.9992 44.82 37.68 2009.41 0.9905
CNN-Trans.-MLP 31.32 17.11 981.02 0.9994 43.07 33.75 1855.04 0.9913

for solar power dataset. The RMSE values for solar power data pre proved to be more efficient than CNN-Transformer-MLP with RMSE
dictions are lower than the CNN-ABiLSTM model that are 54.89. How value of 256.19. While, in terms of percentage performance, the CNN-
ever, during the forecast of wind power dataset CNN-ABiLSTM model ABiLSTM model outperformed the CNN-Transformer-MLP model for

9
T. Bashir et al. Renewable Energy 239 (2025) 122055

Table 4
Comparison of week-ahead forecasts across different models.
Model Name and Date Model’s Solar Power Generation Data Performance Evaluation Model’s Wind Power Generation Data Performance Evaluation

2022-12-02-09 RMSE (MW) MAE (MW) MSE (MW) R2 RMSE (MW) MAE (MW) MSE (MW) R2
CNN 68.92 48.68 4749.77 0.9972 196.96 170.88 38793.06 0.9966
BiLSTM 56.88 39.69 3235.38 0.9981 133.51 97.95 17824.25 0.9985
Encoder-based Transformer 49.68 33.42 2469.04 0.9985 140.59 103.16 19766.94 0.9983
CNN-ABiLSTM 32.89 16.13 1082.19 0.9993 125.60 89.37 15775.73 0.9986
CNN-Trans.-MLP 32.28 15.35 1042.20 0.9993 117.74 84.20 13882.27 0.9988

Table 5
Comparison of month-ahead forecasts across different models.
Model Name and Date Model’s Solar Power Generation Data Performance Evaluation Model’s Wind Power Generation Data Performance Evaluation

2023–01 RMSE (MW) MAE (MW) MSE (MW) R2 RMSE (MW) MAE (MW) MSE (MW) R2
CNN 131.90 83.96 17397.78 0.9965 294.27 220.36 86599.44 0.9994
BiLSTM 72.48 56.79 5254.72 0.9989 268.88 185.40 72297.18 0.9995
Encoder-based Transformer 83.91 50.33 7041.47 0.9986 368.40 262.34 135722.1 0.9992
CNN-ABiLSTM 63.81 46.26 4071.96 0.9992 256.19 178.01 65634.47 0.9996
CNN-Trans.-MLP 54.89 32.60 3013.95 0.9994 266.61 178.49 71085.26 0.9996

Fig. 9. CNN-Trans.-MLP performance comparison with other models: (a) day-ahead, (b) week-ahead.

wind power forecasting, achieving a 4.07 % reduction in RMSE, a 0.27 % strategies outperform standalone CNN, BiLSTM, and Encoder-based
reduction in MAE, and an 8.31 % reduction in MSE. It also confirms the Transformer models with respect to RMSE, MAE, MSE, and R2. The
necessity of testing many models for various time horizons, as demon proposed hybrid strategies significantly outperform the standalone
strated in the spider plot depicted in Fig. 10. The above discussion of counterparts for long-term predictions and marginally surpass in short-
results suggest that, the hybrid of CNN-Transformer-MLP model proved term predictions.
to be more effective in forecasting both solar and wind power dataset for The results analysis of renewable energy forecasting models has
short period of time. But for the long period of time, CNN-ABiLSTM revealed significant performance variances across diverse timeframes
model start to challenge this status for solar power data and also got and model architectures. For day-ahead forecasts, the hybrid CNN-
successful to beat CNN-Transformer-MLP model in performance for Transformer-MLP model significantly outperformed other models. It
wind power dataset. achieved average improvements of 31.94 % in RMSE, 43.81 % in MAE,
and 48.58 % in MSE for solar power data. Similarly, it enhanced wind
6. Conclusion power forecasting accuracy with improvements of 25.41 % in RMSE,
28.39 % in MAE, and 34.48 % in MSE. However, for week-ahead pre
Accurate forecasting of renewable power generation is crucial to dictions of solar power production, although the CNN-ABiLSTM model
ensure stable operation and effective administration of electric grid. As demonstrated improved performance compared to day-ahead pre
renewable energy is intermittent and occasionally unpredictable, the dictions, the CNN-Transformer-MLP model still managed to outperform
simple standalone machine learning-based models fail to provide effi it by a narrow margin. Specifically, the CNN-Transformer-MLP model
cient results. This article proposed two hybrid strategies, a hybrid of outperformed the CNN-ABiLSTM model, showing a 4.84 % improve
CNN-ABiLSTM and a CNN-Transformer-MLP model, for the forecast of ment in MAE, 1.85 % improvement in RMSE and a 3.69 % improvement
renewable power production, specifically wind and solar power pro in MSE. For week-ahead predictions of wind power data, the CNN-
duction. These hybrid strategies feature individual methods based on Transformer-MLP model outperformed other models, including the
their strength for extraction and forecasting complex and non-linear CNN-ABiLSTM. It achieved an average improvement of 18.64 % in
trends in renewable power data. The developed hybrid strategies are RMSE, 22.73 % in MAE, and 32.53 % in MSE. Nevertheless, for month-
validated on real-time renewable power production dataset for Ger ahead predictions of wind power data, the CNN-ABiLSTM model out
many. The simulation results confirm that the proposed hybrid performed other models, including the CNN-Transformer-MLP model. It

10
T. Bashir et al. Renewable Energy 239 (2025) 122055

approach, J. Clean. Prod. 434 (2024) 139655, https://doi.org/10.1016/j.

jclepro.2023.139655.
[4] C. Zhao, J. Wang, K. Dong, K. Wang, Is renewable energy technology innovation an
excellent strategy for reducing climate risk? The case of China, Renew. Energy 223
(2024) 120042, https://doi.org/10.1016/j.renene.2024.120042.
[5] M. Ilyas, Z. Mu, S. Akhtar, H. Hassan, K. Shahzad, B. Aslam, S. Maqsood,
Renewable energy, economic development, energy consumption and its impact on
environmental quality: new evidence from South East Asian countries, Renew.
Energy 223 (2024) 119961, https://doi.org/10.1016/j.renene.2024.119961.
[6] Configuration optimization of a wind-solar based net-zero emission tri-generation
energy system considering renewable power and carbon trading mechanisms,
Renew. Energy 232 (2024) 121086, https://doi.org/10.1016/j.
renene.2024.121086.
[7] Renewables 2023 – Analysis, IEA, 2024. https://www.iea.org/reports/renewable
s-2023. (Accessed 7 April 2024).
[8] X. Zheng, F. Bai, Z. Zhuang, Z. Chen, T. Jin, A new demand response management
strategy considering renewable energy prediction and filtering technology, Renew.
Energy 211 (2023) 656–668, https://doi.org/10.1016/j.renene.2023.04.106.
[9] J.E. Hammond, R.A.L. Orozco, M. Baldea, B.A. Korgel, Short-term solar irradiance
forecasting under data transmission constraints, Renew. Energy (2024) 121058,
https://doi.org/10.1016/j.renene.2024.121058.
[10] E. Ahn, J. Hur, A short-term forecasting of wind power outputs using the enhanced
wavelet transform and arimax techniques, Renew. Energy 212 (2023) 394–402,
https://doi.org/10.1016/j.renene.2023.05.048.
[11] Hybrid model for intra-day probabilistic PV power forecast, Renew. Energy 232
(2024) 121057, https://doi.org/10.1016/j.renene.2024.121057.
[12] S. Aslam, H. Herodotou, S.M. Mohsin, N. Javaid, N. Ashraf, S. Aslam, A survey on
deep learning methods for power load and renewable energy forecasting in smart
microgrids, Renew. Sustain. Energy Rev. 144 (2021) 110992, https://doi.org/
10.1016/j.rser.2021.110992.
[13] S.N. Singh Aasim, A. Mohapatra, Repeated wavelet transform based ARIMA model
Fig. 10. Comparison of CNN-Trans.-MLP forecasting performance with other for very short-term wind speed forecasting, Renew. Energy 136 (2019) 758–768,
models during a month-ahead horizon. https://doi.org/10.1016/j.renene.2019.01.031.
[14] K. Lee, S. Im, B. Lee, Prediction of renewable energy hosting capacity using
multiple linear regression in KEPCO system, Energy Rep. 9 (2023) 343–347,
achieved average improvements of 10.51 % in RMSE, 13.90 % in MAE, https://doi.org/10.1016/j.egyr.2023.09.121.
and 23.66 % in MSE. These findings underscore the significance of the [15] G. Narvaez, L.F. Giraldo, M. Bressan, A. Pantoja, Machine learning for site-
adaptation and solar radiation forecasting, Renew. Energy 167 (2021) 333–342,
proposed hybrid forecasting models tailored for solar and wind data. https://doi.org/10.1016/j.renene.2020.11.089.
The proposed models are adaptable to various sites with similar [16] Frontiers | Effective artificial neural network-based wind power generation and
meteorological trends; however, their efficacy can be further evaluated load demand forecasting for optimum energy management, (n.d.). https://www.
frontiersin.org/articles/10.3389/fenrg.2022.898413/full (accessed March 5,
for sites with significantly different patterns. Incorporating additional 2024).
data characteristics, such as wind speed and weather information, could [17] Simulation and forecasting of power by energy harvesting method in photovoltaic
improve the accuracy of the forecasting projections. The suggested panels using artificial neural network, Renew. Energy 222 (2024) 120017, https://
doi.org/10.1016/j.renene.2024.120017.
hybrid models have the capability to be utilized for a range of applica [18] G.S. Martins, M. Giesbrecht, Hybrid approaches based on Singular Spectrum
tions, such as sales, stock prices, and weather forecasts, in addition to Analysis and kk- Nearest Neighbors for clearness index forecasting, Renew. Energy
their primary function of predicting renewable power output. In addi 219 (2023) 119434, https://doi.org/10.1016/j.renene.2023.119434.
[19] Forecasting short-term renewable energy consumption of China using a novel
tion to evaluating parameters based on reduced error, limited processing
fractional nonlinear grey Bernoulli model, Renew. Energy 140 (2019) 70–87,
time, and fast convergence, such systems can also be evaluated for cost- https://doi.org/10.1016/j.renene.2019.03.006.
effectiveness. [20] Wind power generation prediction during the COVID-19 epidemic based on novel
hybrid deep learning techniques, Renew. Energy 222 (2024) 119863, https://doi.
org/10.1016/j.renene.2023.119863.
CRediT authorship contribution statement [21] H.-K. Wang, K. Song, Y. Cheng, A hybrid forecasting model based on CNN and
informer for short-term wind power, Front. Energy Res. 9 (2022), https://doi.org/
10.3389/fenrg.2021.788320.
Tasarruf Bashir: Writing – review & editing, Writing – original
[22] Z. Hajirahimi, M. Khashei, Hybrid structures in time series modeling and
draft, Validation, Formal analysis, Data curation, Conceptualization. forecasting: a review, Eng. Appl. Artif. Intell. 86 (2019) 83–106, https://doi.org/
Huifang Wang: Writing – review & editing, Supervision, Investigation, 10.1016/j.engappai.2019.08.018.
[23] G. Alkhayat, R. Mehmood, A review and taxonomy of wind and solar energy
Conceptualization. Mustafa Tahir: Writing – review & editing, Visual
forecasting methods based on deep learning, Energy and AI 4 (2021) 100060,
ization, Formal analysis. Yixiang Zhang: Writing – review & editing, https://doi.org/10.1016/j.egyai.2021.100060.
Validation. [24] K. Qu, G. Si, Z. Shan, X. Kong, X. Yang, Short-term forecasting for multiple wind
farms based on transformer model, Energy Rep. 8 (2022) 483–490, https://doi.
org/10.1016/j.egyr.2022.02.184.
Declaration of competing interest [25] A. Vaswani, N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. ukasz
Kaiser, I. Polosukhin, Attention is all you need, in: Advances in Neural Information
Processing Systems, Curran Associates, Inc., 2017, in: https://proceedings.neurips.
The authors declare that they have no known competing financial cc/paper_files/paper/2017/hash/3f5ee243547dee91fbd053c1c4a845aa-Abstract.
interests or personal relationships that could have appeared to influence html. (Accessed 27 May 2024).
[26] D. Salman, C. Direkoglu, M. Kusaf, M. Fahrioglu, Hybrid deep learning models for
the work reported in this paper.
time series forecasting of solar power, Neural Comput. Appl. (2024), https://doi.
org/10.1007/s00521-024-09558-5.
References [27] A CNN encoder decoder LSTM model for sustainable wind power predictive
analytics, Sustainable Computing: Informatics and Systems 38 (2023) 100869,
https://doi.org/10.1016/j.suscom.2023.100869.
[1] E. Ahmad, D. Khan, M.K. Anser, A.A. Nassani, S.A. Hassan, K. Zaman, The influence
[28] C. Zhang, H. Ma, L. Hua, W. Sun, M.S. Nazir, T. Peng, An evolutionary deep
of grid connectivity, electricity pricing, policy-driven power incentives, and carbon
learning model based on TVFEMD, improved sine cosine algorithm, CNN and
emissions on renewable energy adoption: exploring key factors, Renew. Energy
BiLSTM for wind speed prediction, Energy 254 (2022) 124250, https://doi.org/
232 (2024) 121108, https://doi.org/10.1016/j.renene.2024.121108.
10.1016/j.energy.2022.124250.
[2] S. Ullah, B. Lin, Green energy dynamics: analyzing the environmental impacts of
[29] S. Salehi, M. Kavgic, H. Bonakdari, L. Begnoche, Comparative study of univariate
renewable, hydro, and nuclear energy consumption in Pakistan, Renew. Energy
and multivariate strategy for short-term forecasting of heat demand density:
(2024) 121025, https://doi.org/10.1016/j.renene.2024.121025.
exploring single and hybrid deep learning models, Energy and AI 16 (2024)
[3] G. Perone, The relationship between renewable energy production and CO2
100343, https://doi.org/10.1016/j.egyai.2024.100343.
emissions in 27 OECD countries: a panel cointegration and Granger non-causality

11
T. Bashir et al. Renewable Energy 239 (2025) 122055

[30] S. Souha, Hybrid Deep Learning Model for Renewable Energy Forecasting, Master’s [38] A. Van Poecke, H. Tabari, P. Hellinckx, Unveiling the backbone of the renewable
Thesis, Leibniz University, 2021. https://www.iwi.uni-hannover.de/fileadmin/iw energy forecasting process: exploring direct and indirect methods and their
i/Abschlussarbeiten/MA_Soltani_K.pdf. (Accessed 27 May 2024). applications, Energy Rep. 11 (2024) 544–557, https://doi.org/10.1016/j.
[31] M. Yu, D. Niu, K. Wang, R. Du, X. Yu, L. Sun, F. Wang, Short-term photovoltaic egyr.2023.12.031.
power point-interval forecasting based on double-layer decomposition and WOA- [39] I. Karijadi, S.-Y. Chou, A. Dewabharata, Wind power forecasting based on hybrid
BiLSTM-Attention and considering weather classification, Energy 275 (2023) CEEMDAN-EWT deep learning method, Renew. Energy 218 (2023) 119357,
127348, https://doi.org/10.1016/j.energy.2023.127348. https://doi.org/10.1016/j.renene.2023.119357.
[32] R. Tahmasebifar, M.P. Moghaddam, M.K. Sheikh-El-Eslami, R. Kheirollahi, A new [40] F. Parreño, C. Parreño-Torres, R. Alvarez-Valdes, A matheuristic algorithm for the
hybrid model for point and probabilistic forecasting of wind power, Energy 211 maintenance planning problem at an electricity transmission system operator,
(2020) 119016, https://doi.org/10.1016/j.energy.2020.119016. Expert Syst. Appl. 230 (2023) 120583, https://doi.org/10.1016/j.
[33] X. Huang, Q. Li, Y. Tai, Z. Chen, J. Zhang, J. Shi, B. Gao, W. Liu, Hybrid deep eswa.2023.120583.
neural model for hourly solar irradiance forecasting, Renew. Energy 171 (2021) [41] Z. Liao, C.F.M. Coimbra, Hybrid solar irradiance nowcasting and forecasting with
1041–1060, https://doi.org/10.1016/j.renene.2021.02.161. the SCOPE method and convolutional neural networks, Renew. Energy 232 (2024)
[34] M.A. Hossain, R.K. Chakrabortty, S. Elsawah, M.J. Ryan, Very short-term 121055, https://doi.org/10.1016/j.renene.2024.121055.
forecasting of wind power generation using hybrid deep learning model, J. Clean. [42] S. Kiranyaz, O. Avci, O. Abdeljaber, T. Ince, M. Gabbouj, D.J. Inman, 1D
Prod. 296 (2021) 126564, https://doi.org/10.1016/j.jclepro.2021.126564. convolutional neural networks and applications: a survey, Mech. Syst. Signal
[35] Y.-M. Zhang, H. Wang, Multi-head attention-based probabilistic CNN-BiLSTM for Process. 151 (2021) 107398, https://doi.org/10.1016/j.ymssp.2020.107398.
day-ahead wind speed forecasting, Energy 278 (2023) 127865, https://doi.org/ [43] F. Shahid, A. Zameer, M. Muneeb, A novel genetic LSTM model for wind power
10.1016/j.energy.2023.127865. forecast, Energy 223 (2021) 120069, https://doi.org/10.1016/j.
[36] D. Geng, B. Wang, Q. Gao, A hybrid photovoltaic/wind power prediction model energy.2021.120069.
based on Time2Vec, WDCNN and BiLSTM, Energy Convers. Manag. 291 (2023) [44] M. Abdel-Nasser, K. Mahmoud, Accurate photovoltaic power forecasting models
117342, https://doi.org/10.1016/j.enconman.2023.117342. using deep LSTM-RNN, Neural Comput. Appl. 31 (2019) 2727–2740, https://doi.
[37] J.J. Mesa-Jiménez, A.L. Tzianoumis, L. Stokes, Q. Yang, V.N. Livina, Long-term org/10.1007/s00521-017-3225-z.
wind and solar energy generation forecasts, and optimisation of Power Purchase [45] T. Bashir, C. Haoyong, M.F. Tahir, Z. Liqiang, Short term electricity load
Agreements, Energy Rep. 9 (2023) 292–302, https://doi.org/10.1016/j. forecasting using hybrid prophet-LSTM model optimized by BPNN, Energy Rep. 8
egyr.2022.11.175. (2022) 1678–1686, https://doi.org/10.1016/j.egyr.2021.12.067.

AS & A Level Physics Coursebook Answers
68% (22)
AS & A Level Physics Coursebook Answers
91 pages
Great Grammar Practice 5
100% (11)
Great Grammar Practice 5
67 pages
The Lost E-Whoring Guide by Elite Cow
100% (3)
The Lost E-Whoring Guide by Elite Cow
5 pages
Improving Renewable Energy Forecasting With A Grid of Numerical Weather Predictions
No ratings yet
Improving Renewable Energy Forecasting With A Grid of Numerical Weather Predictions
10 pages
Forecasting of Solar and Wind Power Using
No ratings yet
Forecasting of Solar and Wind Power Using
14 pages
SCT Final Project Report
No ratings yet
SCT Final Project Report
20 pages
2 14 1615800902 3ijcseitrjun20213
No ratings yet
2 14 1615800902 3ijcseitrjun20213
8 pages
Energies: Solar Power Forecasting Using CNN-LSTM Hybrid Model
No ratings yet
Energies: Solar Power Forecasting Using CNN-LSTM Hybrid Model
17 pages
Unit 6: Test-Taking Skills Booster
0% (1)
Unit 6: Test-Taking Skills Booster
2 pages
A Hybrid Framework For Forecasting Power Generation of Multiple Renewable Energy Sources
No ratings yet
A Hybrid Framework For Forecasting Power Generation of Multiple Renewable Energy Sources
14 pages
A Comprehensive Study and Performance Analysis of Deep Neural Network-Based Approaches in Wind Time-Series Forecasting
No ratings yet
A Comprehensive Study and Performance Analysis of Deep Neural Network-Based Approaches in Wind Time-Series Forecasting
18 pages
ICE Evidence Format
No ratings yet
ICE Evidence Format
10 pages
IET Generation Trans Dist - 2020 - Frizzo Stefenon - Hybrid Deep Learning For Power Generation Forecasting in Active
No ratings yet
IET Generation Trans Dist - 2020 - Frizzo Stefenon - Hybrid Deep Learning For Power Generation Forecasting in Active
9 pages
Sustainability 15 07087 v2
No ratings yet
Sustainability 15 07087 v2
33 pages
1 s2.0 S0306261923010024 Main
No ratings yet
1 s2.0 S0306261923010024 Main
17 pages
Temporal Fusion VMD Windpower
No ratings yet
Temporal Fusion VMD Windpower
18 pages
Empowering Distributed Solutions in Renewable Energy Systems and Grid Optimization
No ratings yet
Empowering Distributed Solutions in Renewable Energy Systems and Grid Optimization
17 pages
Zhen - A Hybrid Deep Learning Model and Comparison For Wind Power Forecasting Considering Temporal-Spatial Feature Extraction
No ratings yet
Zhen - A Hybrid Deep Learning Model and Comparison For Wind Power Forecasting Considering Temporal-Spatial Feature Extraction
24 pages
Avant-Pays de La Meseta Marocaine
No ratings yet
Avant-Pays de La Meseta Marocaine
7 pages
The Effect of Online Game Addiction On Children and Adolescents' School Success
No ratings yet
The Effect of Online Game Addiction On Children and Adolescents' School Success
52 pages
Thesis S8
No ratings yet
Thesis S8
13 pages
Final
No ratings yet
Final
45 pages
Enhanced Forecasting Accuracy of A Grid-Connected
No ratings yet
Enhanced Forecasting Accuracy of A Grid-Connected
21 pages
Hybrid Deep Learning and Quantum Inspired Neural Networ - 2024 - Expert Systems
No ratings yet
Hybrid Deep Learning and Quantum Inspired Neural Networ - 2024 - Expert Systems
11 pages
LR042
No ratings yet
LR042
14 pages
Wind Power Prediction Using ML and DL Methodologies
No ratings yet
Wind Power Prediction Using ML and DL Methodologies
13 pages
STI 4.0 8346 Camera Ready
No ratings yet
STI 4.0 8346 Camera Ready
6 pages
Advancing Ultra-Short-Term Wind Power Forecasting With Multi-Channel ML Techniques
100% (1)
Advancing Ultra-Short-Term Wind Power Forecasting With Multi-Channel ML Techniques
4 pages
COMM 101: Fundamentals of Public Speaking - Ms. Wilde: Course Description
No ratings yet
COMM 101: Fundamentals of Public Speaking - Ms. Wilde: Course Description
5 pages
Assessment of Future Performance of Hybrid Solar-Wind Street Lamp Through Energy Generation Forecasting Using Artificial Neural Network
No ratings yet
Assessment of Future Performance of Hybrid Solar-Wind Street Lamp Through Energy Generation Forecasting Using Artificial Neural Network
18 pages
2023-24 Haven Pe Course Letter
No ratings yet
2023-24 Haven Pe Course Letter
5 pages
Energy Forecasting A Review and Outlook
No ratings yet
Energy Forecasting A Review and Outlook
13 pages
Authentic Assessment-2
No ratings yet
Authentic Assessment-2
2 pages
Career Resume Uncg
No ratings yet
Career Resume Uncg
1 page
Listening Lesson Plan Template 1
No ratings yet
Listening Lesson Plan Template 1
3 pages
Energies 13 06512
No ratings yet
Energies 13 06512
15 pages
IEEE Paper Format Template
No ratings yet
IEEE Paper Format Template
3 pages
Juvenile Delinquency: Analysis On The Causes of Delinquency and Programs of Barangay Officials in Barangay Kudanding Isulan, Sultan Kudarat
No ratings yet
Juvenile Delinquency: Analysis On The Causes of Delinquency and Programs of Barangay Officials in Barangay Kudanding Isulan, Sultan Kudarat
32 pages
Affidavit For Registered Pharmacist of Other State 2
No ratings yet
Affidavit For Registered Pharmacist of Other State 2
2 pages
Wind Power Forecasting System With Data Enhancement and
No ratings yet
Wind Power Forecasting System With Data Enhancement and
19 pages
Statement of Purpose Sop
No ratings yet
Statement of Purpose Sop
3 pages
25CSD09 Icrticc-2025
No ratings yet
25CSD09 Icrticc-2025
4 pages
WO Hybrid Report - SolarPowerPredictionModel
No ratings yet
WO Hybrid Report - SolarPowerPredictionModel
34 pages
Merit List of Recommended Candidates in Ntse Level-1 Exam Nov 2018 (As Per Quota) Haryana State
No ratings yet
Merit List of Recommended Candidates in Ntse Level-1 Exam Nov 2018 (As Per Quota) Haryana State
4 pages
25CSD09 Poster Presentation
No ratings yet
25CSD09 Poster Presentation
1 page
Assignment 3 - 3ES3
No ratings yet
Assignment 3 - 3ES3
7 pages
SBI Clerk 2024: Exam Date, Notification & Apply Online Link: Anangsha Patra
No ratings yet
SBI Clerk 2024: Exam Date, Notification & Apply Online Link: Anangsha Patra
8 pages
1 s2.0 S1364032117311620 Main
No ratings yet
1 s2.0 S1364032117311620 Main
17 pages
Predicting Wind Turbine Energy Production With Deep Learning Methods in GIS - A Study On HAWTs and VAWTs
No ratings yet
Predicting Wind Turbine Energy Production With Deep Learning Methods in GIS - A Study On HAWTs and VAWTs
12 pages
COC 102 Christian Ethics & Moral Maturity Introd & Module I - 111
No ratings yet
COC 102 Christian Ethics & Moral Maturity Introd & Module I - 111
91 pages
Dynamic Forecasting of Solar Energy Microgrid Systems Using Feature Engineering
No ratings yet
Dynamic Forecasting of Solar Energy Microgrid Systems Using Feature Engineering
13 pages
A Hybrid Model of CNN and LSTM Autoencoder-Based Short-Term PV Power Generation Forecasting
No ratings yet
A Hybrid Model of CNN and LSTM Autoencoder-Based Short-Term PV Power Generation Forecasting
18 pages
Review Transformer For Forcasting
No ratings yet
Review Transformer For Forcasting
29 pages
2009 Batch Regular
No ratings yet
2009 Batch Regular
2 pages
Mathematics P2 QP June 2022 Eng Eastern Cape
No ratings yet
Mathematics P2 QP June 2022 Eng Eastern Cape
13 pages
1 s2.0 S2096511722000226 Main
No ratings yet
1 s2.0 S2096511722000226 Main
22 pages
Easychair Preprint: Trung Van Nguyen
No ratings yet
Easychair Preprint: Trung Van Nguyen
6 pages
Proposal-Revised 3
No ratings yet
Proposal-Revised 3
26 pages
Applsci 13 11455 v2
No ratings yet
Applsci 13 11455 v2
19 pages
Revised Programme Schedule For Yuva Samvad
No ratings yet
Revised Programme Schedule For Yuva Samvad
2 pages
2935 5578 1 PB
No ratings yet
2935 5578 1 PB
5 pages
REF 3.0 Research - and - Application - of - Optimal - LSTM - Combinatorial - Model - Based - On - Convolutional - Neural - Network
No ratings yet
REF 3.0 Research - and - Application - of - Optimal - LSTM - Combinatorial - Model - Based - On - Convolutional - Neural - Network
5 pages
Compeleceng S 25 02696
No ratings yet
Compeleceng S 25 02696
23 pages
Enhancing Solar Forecasting Accuracy With Sequential Deep Artificial Neural Network and Hybrid Random Forest and Gradient Boosting Models Across Varied Terrains
100% (1)
Enhancing Solar Forecasting Accuracy With Sequential Deep Artificial Neural Network and Hybrid Random Forest and Gradient Boosting Models Across Varied Terrains
20 pages
Enhancing Power Quality in Solar-Wind Grid-Connected Systems Through Soft Computing Techniques
No ratings yet
Enhancing Power Quality in Solar-Wind Grid-Connected Systems Through Soft Computing Techniques
8 pages
Seminar Report
No ratings yet
Seminar Report
21 pages
Lesson Exemplar Template
No ratings yet
Lesson Exemplar Template
4 pages
Deep Learning For Wind and Sol
No ratings yet
Deep Learning For Wind and Sol
13 pages
Solar Power Prediction Using LSTM & RNN
No ratings yet
Solar Power Prediction Using LSTM & RNN
4 pages
Aarav - Impact of AI On Employment
No ratings yet
Aarav - Impact of AI On Employment
24 pages
The City of Berlin S Official Guide To Schools in English
No ratings yet
The City of Berlin S Official Guide To Schools in English
32 pages
A Machine Learning-Based Sustainable Energy Manage
No ratings yet
A Machine Learning-Based Sustainable Energy Manage
21 pages
(IJCST-V13I1P4) :DR - Snehal K Joshi
No ratings yet
(IJCST-V13I1P4) :DR - Snehal K Joshi
7 pages
ISAS - 2024 - SolarForecast - BiLSTM - Attn - Submitted
No ratings yet
ISAS - 2024 - SolarForecast - BiLSTM - Attn - Submitted
9 pages
Ebooks File (Ebook PDF) Empowered Popular Feminism and Popular Misogyny All Chapters
100% (1)
Ebooks File (Ebook PDF) Empowered Popular Feminism and Popular Misogyny All Chapters
62 pages
1 s2.0 S0140988324005929 Main
No ratings yet
1 s2.0 S0140988324005929 Main
12 pages
Major
No ratings yet
Major
6 pages
Sustainability 16 09535
No ratings yet
Sustainability 16 09535
22 pages
SOCI 426 Lecture One - 2024 - For Students
No ratings yet
SOCI 426 Lecture One - 2024 - For Students
30 pages
Paper Introduction
No ratings yet
Paper Introduction
2 pages
1 s2.0 S030626191931027X Main
No ratings yet
1 s2.0 S030626191931027X Main
17 pages
Energies 18 02108
No ratings yet
Energies 18 02108
28 pages
Short-Term Forecast of PV Power Using LSTM
No ratings yet
Short-Term Forecast of PV Power Using LSTM
19 pages
Aml g 通过网格搜索交叉验证优化rnn-lstm模型的风电预测
No ratings yet
Aml g 通过网格搜索交叉验证优化rnn-lstm模型的风电预测
21 pages
LITEARCY SYLABUS Grade 5
No ratings yet
LITEARCY SYLABUS Grade 5
5 pages
AI Group - 5
No ratings yet
AI Group - 5
22 pages
Forecasting Renewable Energy Production Using AI-Based Weather Prediction Models
No ratings yet
Forecasting Renewable Energy Production Using AI-Based Weather Prediction Models
8 pages
HRM Comprehensive Examination Form
No ratings yet
HRM Comprehensive Examination Form
1 page
Principles of Energy Storage Systems
From Everand
Principles of Energy Storage Systems
Jayarama P. Reddy
No ratings yet
Solar PV and Wind Energy Systems
From Everand
Solar PV and Wind Energy Systems
Amitabh Bhosale
No ratings yet

1 s2.0 S0960148124021232 Main

Uploaded by

1 s2.0 S0960148124021232 Main

Uploaded by

Renewable Energy 239 (2025) 122055

Contents lists available at ScienceDirect

Wind and solar power forecasting based on hybrid CNN-ABiLSTM,

1. Introduction renewable energy generation, their intermittent and stochastic nature

Data preparation is crucial in developing forecasting models,

Fig. 1. Overall procedure for RES power generation forecasting.

information is obtained by adding each of these vectors with their cor­

Where, Pe , Po represents the even and odd positional elements.

2.2.3.3. Self-attention and multi-head attention mechanisms. The atten­

Fig. 4. Encoder-based Transformer model.

Fig. 5. Proposed hybrid model.

employing the build-in “Layer” base-class of keras.layers. These con­ Table 2

approach, J. Clean. Prod. 434 (2024) 139655, https://doi.org/10.1016/j.

You might also like

information is obtained by adding each of these vectors with their cor

2.2.3.3. Self-attention and multi-head attention mechanisms. The atten

employing the build-in “Layer” base-class of keras.layers. These con Table 2