Int. J. Renew. Energy Dev. 2023, 12(2), 288-299 
    | 288 

https://doi.org/10.14710/ijred.2023.48672  
ISSN: 2252-4940/© 2023.The Author(s). Published by CBIORE 

 
Contents list available at IJRED website 
 

International Journal of Renewable Energy Development 
 

Journal homepage: https://ijred.undip.ac.id 

 
Wind Speed Prediction Based on Statistical and Deep Learning 
Models 

Ilham Tyassa* , Tajeddine Khalilib , Mohamed Rafika , Bellat Abdelouaheda , Abdelhadi 
Raihania , Khalifa Mansouria  

aENSET Mohammedia, EEIS Laboratory, Hassan II University of Casablanca, Morocco 
bLICPM Laboratory, ENSA Béni Mellal, Sultan Moulay Sliman University, Béni Mellal, Morocco. 

Abstract. Wind is a dominant source of renewable energy with a high sustainability potential. However, the intermittence and unstable nature of 
wind source affect the efficiency and reliability of wind energy conversion systems. The prediction of the available wind potential is also heavily flawed 
by its unstable nature. Thus, evaluating the wind energy trough wind speed prevision, is crucial for adapting energy production to load shifting and 
user demand rates. This work aims to forecast the wind speed using the statistical Seasonal Auto-Regressive Integrated Moving Average (SARIMA) 
model and the Deep Neural Network model of Long Short-Term Memory (LSTM). In order to shed light on these methods, a comparative analysis is 
conducted to select the most appropriate model for wind speed prediction. The errors metrics, mean square error (MSE), root mean square error 
(RMSE), mean absolute error (MAE), and mean absolute percentage error (MAPE) are used to evaluate the effectiveness of each model and are used 
to select the best prediction model. Overall, the obtained results showed that LSTM model, compared to SARIMA, has shown leading performance 
with an average of absolute percentage error (MAPE) of 14.05%.  

Keywords: Wind energy, Wind speed, Time series, Forecasting models, SARIMA, Deep learning, LSTM. 

@ The author(s). Published by CBIORE. This is an open access article under the CC BY-SA license 
 (http://creativecommons.org/licenses/by-sa/4.0/). 

Received: 1st Sept 2022; Revised: 16th Dec 2022; Accepted: 10th January 2023; Available online: 23rd January 2023   

 
1. Introduction 

Renewable energies are a serious alternative to fossil fuels. these 
type of energy sources are sustainable, extremely abundant 
within nature and can be a reliable alternative to classic energy 
generation methods. Wind energy, particularly, is one of the 
most exploited energy sources; it is a clean source, with a high-
performance coefficient, it assures a good energy 
independence. The most imposing drawback of wind energy is 
the high intermittence within the year cycle (Asari et al., 2002). 
Because of its fluctuating nature, wind speed is an extremely 
unpredictable meteorological factor. Increasing the accuracy of 
wind forecasting is necessary to optimize wind farm operations, 
maximize their yield, and ensure a steady development  (Devis 
et al., 2018). Wind energy production plants use special 
equipment to determine wind direction in order to control the 
rotor axis direction (Kodjo et al., 2008). However, The wind 
direction has a low influence on the energy production, 
especially since wind tends to have one or two dominant 
directions for which most of the energy is produced (Adekunle, 
2017). Thus, the unpredictable nature of wind potential, 
affecting the yield of wind turbines is related to the variation of 
wind speed itself for a given direction. 

Currently, there are two methods to forecast the wind speed. 
The first one, named indirect technique, involves predicting 
wind speed based on environmental data, such as wind 
direction, temperature, air humidity, etc. The second way is to 

 
*

Corresponding author 

  Email: ilham.tyass-etu@etu.univh2c.ma (I. Tyass) 

use past wind speed data to forecast wind speed for the next 
hours or days. According to the time horizon, the time series 
prediction is divided into three categories: (i) an extremely 
short-term prediction, which is very useful for intraday market 
trading and it represents a few minutes to an hour, (ii) A short-
term prediction approach that is suitable for maintenance 
planning and it is from one hour to 12 hours. Finally, (iii) a 
medium to long-term prediction, which is useful for maintaining 
non-renewable energy production. The prediction horizon 
ranges from several hours to several days. 

To grasp wind speed forecasting, several approaches have 
been proposed. These methods can be separated into physical 
and statistical methods. Generally, the physical models require 
ample physical background information, so that they are not 
appropriate for wind speed forecasting in electric power 
systems (Mi et al., 2019). In the statistical methods, the historical 
data are used to train time series models, e.g., auto-regression 
(AR) model, autoregressive moving average (ARMA) model 
(Erdem and Shi, 2011), Kalman filter, and artificial neural 
network (ANN) (Zhang et al., 2016). The purpose of this article 
is to compare the predicting results of the Seasonal Auto-
Regressive Integrated Moving Average (SARIMA) model with 
the deep neural network model of Long Short-Term Memory 
(LSTM) using the same dataset and diverse evaluation metrics. 

In the first section of this paper, various kinds of forecasting 
models are described, and then compared based on the analysis 
of several contents in the literature. In the second section, wind 

Research Article 

https://doi.org/10.14710/ijred.2023.48672
https://doi.org/10.14710/ijred.2023.48672
mailto:ilham.tyass-etu@etu.univh2c.ma
https://orcid.org/0000-0001-5803-6776
https://orcid.org/0000-0001-9559-585X
https://orcid.org/0000-0002-4044-9035
https://orcid.org/0000-0002-2137-2618
https://orcid.org/0000-0002-6295-274X
https://orcid.org/0000-0003-2242-0973
http://crossmark.crossref.org/dialog/?doi=10.14710/ijred.2023.48672&domain=pdf


I. Tyassl et al  Int. J. Renew. Energy Dev 2023, 12(2), 288-299 

| 289 

ISSN: 2252-4940/© 2023. The Author(s). Published by CBIORE 

speed time series data is studied using the time-series 
decomposition method. The third paragraph discusses a short-
term wind speed forecasting using two of the most frequently 
employed time-series forecasting methods: SARIMA and the 
LSTM models. In the last section of the paper, the obtained 
results are carefully analyzed. To evaluate the performance of 
the wind speed forecasting model, data from three different 
velocity month were used, and four kinds of error metrics were 
considered: The Mean Squared Error (MSE), Root Mean 
Squared Error (RMSE), Mean Absolute Error (MAE) and Mean 
Absolute Percentage Error (MAPE). 

2. Related work 

According to the methodology, there are three major types of 
weather forecasting systems namely, physical, deterministic 
and hybrid systems. Physical systems utilize probabilistic 
approaches to indicate weather event probability, Deterministic 
approaches produce more precise weather forecasts for a given 
location, and hybrid models which combine multiple individual 
prediction models to overcome several prediction limits 
(Jaseena and Kovoor, 2020). In the literature, all these 
approaches are applied for wind speed forecasting systems: 

2.1. Physical models 

In general, a physical model is an interpretable model, which 
permits the identification of causal relationships and has the 
possibility to generate data by performing various numerical 
simulation experiments. However, it can only approximate 
complex cases and only simulates well-known physical laws. 
Physical models are used to forecast the wind resource, 
beginning with meteorological data and adjusting it to local 
physical effects. The physical method does not require any 
training inputs from older data; they provide real-time physical 
information of wind farm, such as wind speed, wind direction, 
temperature, humidity, air pressure, air density (Bessac et al., 
2018). Combined with the information of topography around the 
wind farm (terrain, obstacle, contour etc.), the wind speed at the 
wind turbine’s mast height is calculated, and the output of the 
turbine may then be determined. The use of inaccurate 
meteorological parameters induces system faults, which 
subsequently causes a cumulative impact of error, and reduce 
the forecast accuracy. The physical model uses the 
mathematical approaches of the atmosphere, they are mainly 
based on numerical weather prediction model; the NWP model 
is a program that resolves complex equations and describes 
atmospheric processes and how they evolve over time. Authors 
in (Lorenc, 1986), have proposed a NWP model that uses 
Bayesian Probabilistic Arguments. However, the model does 
not exhibit accurate results in short-term prediction. Indeed, the 
performance of the physical model is relatively weak when the 
wind speed is very random, it is only used for long-term 
forecasting or it is considered as an input for the statistical 
method. Evidence illustrates that, to achieve the objective of 
accurate wind speed prediction, the statistical model built based 
on probability theory and mathematical statistical method is 
identified as the most widely used model. 

2.2. Deterministic models 

Deterministic models can be classified into statistical models 
and Artificial Intelligence models. (i) The statistical methods 
consider the training of history of wind speed data and produces 
a result without considering the effect physical phenomena. A 
statistical method contains Kalman filters model; in this method, 
wind speed is considered a state variable leading to the 

establishment of state space representation. Kalman filters 
algorithm is then used to forecast the future wind speed. This 
approach commonly increases the accuracy of NWP models 
predictions (Cassola and Burlando, 2012). The fuzzy logic 
model, is a polyvalent statistical approach for wind forecasting, 
which the values of variables are real numbers between 0 and 1. 
It is based on qualitative variables to model complex systems, it 
is generally best suited for systems and phenomenon, for which 
it’s not possible to establish an exact model (Damousis et al., 
2004). Another statistical method is the conventional statistical 
model; they are similar to the direct random time-series model. 
Model identity, parameter estimation, and model validation are 
used to develop a mathematical solution to the problem. These 
models can be categorized into the following categories: 
Autoregressive model (AR), moving average model (MA), 
autoregressive moving average model (ARMA), auto regressive 
integrated moving average model (ARIMA) (Yatiyana et al., 
2017). (ii) Artificial intelligence (AI) models have proven to be 
more accurate and efficient at handling nonlinear data sets and 
providing better forecasting results. These models are divided 
between machine learning (ML) approaches for forecasting and 
deep learning predictors. Among the most commonly used 
machine-learning predictors for forecasting are ANN and 
Support Vector Machine (SVM) models. Thanks to its ability to 
convey non-linear correlations between previous weather 
patterns and future weather conditions, ANN-based models 
(Faniband and Shaahid, 2020) are one of the most common 
approaches for weather prediction. Neural networks are non-
linear regression models that can rapidly process large amounts 
of data and efficiently match input and output variables. SVM 
are supervised machine learning models focused on solving 
mathematical discrimination and regression problems. SVMs 
are appreciated for their simplicity of application, they can 
handle classification and regression issues (Brereton and Lloyd, 
2010). Deep Learning is a subcategory of machine learning that 
uses an artificial neural network approach to extract intelligence 
from big data sets. This technique employs supervised or 
unsupervised methods in deep architectures to create 
hierarchical representations. RNNs are employed in deep 
learning and in the generation of models that simulate the 
activity of the human brain system. They are particularly 
powerful in forecasting results. For classical RNNs, the absence 
of the gradient is a severe issue, since the neural network, 
unable to be properly trained, would surely lose performance. 
LSTM (Long Short-Term Memory) units are one of the solutions 
to this problem (Tokgöz and Ünal, 2018). LSTM based 
architectures are capable of capturing long-term dependencies 
with much more precision. 

2.3. Hybrid models 

Hybrid predictive models were introduced to address the limits 
of single models and increasing the performance of wind speed 
forecasting, a single model is not sufficient to handle complex 
real-world systems with unknown mixed models. Such as the 
combination of ANN and ARIMA model forecasts proposed in 
(Li et al., 2011) (Nair et al., 2017), the combination of long short-
term memory (LSTM) network and the decomposition methods 
using the grey wolf optimizer (GWO) (Altan et al., 2021). 

In literature, the most common and extensively used hybrid 
models are parameter optimization and data preprocessing 
based models (Hajirahimi and Khashei, 2019). The data 
preprocessing-based hybrid models, generally rely on data 
preparation approaches, the time series is converted split into 
many subsets of data. Authors in (Nguyen and Novák, 2019) 
created a hybrid model based on preprocessing to forecast 
seasonal time series. 


I.Tyass et al  Int. J. Renew. Energy Dev 2023, 12(2), 288-299 
| 290 

 ISSN: 2252-4940/© 2023. The Author(s). Published by CBIORE 

Table 1 
Comparison of wind speed forecasting models 

Wind speed forecasting model Strengths  Weaknesses 

Models based on Bayesian approach. 
(Miranda and Dunn, 2006) 
(Tascikaraoglu and Uzunoglu, 2014). 

It offers a high flexibility in the modeling, 
allowing the inclusion of information from 
physical phenomena that may have an 
influence on the variable studied.  

This approach demands more work and time and 
requires a certain level of skill on the part of the 
user. 

NWP models (Yang et al., 2018) (Bennitt 
and Schueler, (2012).) (Martinez-García et 
al., 2021). 

Provides good results for longer prediction 
terms. The spatial resolution of numerical 
weather prediction models is continuously 
growing, providing a better representation 
of weather characteristics. 

Requires longer processing times and advanced 
computational resources. 
Not appropriate for short forecast times. 
Difficult to model. Requires the analysis and 
understanding of a variety of conditions. 

Kalman filter models (Wu et al., 2004) 
(Babazadeh et al., 2012) (Hide et al., 2003) 

Good performance in linear regression 
methods  
Because of its recursive form it is not 
needed to store historical data 

Needs extensive previous information of the 
system. 
Divergent filter estimates due to stability 
problems. 

Fuzzy logic models (Bououden et al., 
2012). 

Appropriate for structures that are harder to 
model precisely. 

Very complicated, and needs a considerable 
processing time when there are numerous rules.  

Time series models (AR, ARMA, ARIMA) 
(Torres et al., 2005) (Eldali et al., 2016) 

The structure is relatively simple, it’s 
possible to correct local trends in data,  

Requires a much longer historic records, difficult 
to model nonlinear problems.  

ANN-based models (Navas et al., 2020) 
(Nazir et al., 2020). 

Before analysis, there is no requirement to 
establish a mathematical model, not very 
susceptible to input data errors, better 
adaptability to online measures. 

Needs a training process and a large number of 
training data sets. 

SVM-based models (Ranganayaki and 
Deepa, 2017) (Pinto et al., 2014) 

Good efficiency in generalization 
Requires a good setting of parameters, using a 
complicated optimization approach, and 
long training time. 

LSTM based models (Araya et al., 2020)  
(Geng et al., 2020). 

Capable of capturing long-term 
dependencies more precisely. 

Needs a training process and a big number of 
training data sets. 

Hybrid models (Soman et al., 2010) 
(Zhang et al., 2020). 

Achieves a good prediction ability with 
higher precision, and maximizes the 
approximation of the actual value. 
Achieves excellent Stability performance. 

Difficulty of analyzing correlated observations and 
their temporal orders. 
Sequential nature of the data. Require complex 
and specific resolution techniques.  

 
The parameter optimization models are established using 
optimization algorithm, specifically Meta heuristic techniques, 
thanks to its ample searching benefits (Qian et al., 2019) 

2.4. Literature synthesis 

Different forecasting models have been used trough literature. 
Each model includes a special set of strengths and weaknesses. 
Table 1 summarizes numerous references with an explication of 
the strengths and weaknesses of the methods used. 

Based on this literature analysis, numerous researches work 
related to wind speed forecasting and power using various 
models have been used; these approaches often produce 
credible results, and each method has strengths and 
weaknesses. The appropriate model is chosen based on the 
specific data characteristics of the site and the application area 
of the method, though, the developed prediction models are 
typically site-specific and greatly influenced by the change in the 
prediction times required.

 
3.  Mathematical fundamentals 

3.1. Problem formulation 

The problem consists of predicting the future wind speed values 
from historical wind data. Assuming that 𝑊 denotes the input 
set that includes the recorded wind speed data in the past up to 

the time 𝑡, and �̂� means the output vector that refers to the 
predicted wind speed values over a given prediction horizon ℎ. 

Figure 1 shows the wind speed forecasting paradigm. 𝑊 And �̂� 
are defined below: 

 
{
𝑊 = [𝑤𝑡−𝑁 , 𝑤𝑡−(𝑁−1), … , 𝑤𝑡 ]                       

�̂� = [�̂�𝑡+1, �̂�𝑡+2, �̂�𝑡+3, … , �̂�𝑡+ℎ−1, �̂�𝑡+ℎ ]
   (1) 

 
Fig. 1 Wind speed forecasting paradigm 

𝑥𝑡−𝑁  𝑥1 𝑥𝑡  𝑥𝑡+ℎ  

𝑁 + 1 𝑣𝑎𝑙𝑢𝑒𝑠 ℎ 𝑣𝑎𝑙𝑢𝑒𝑠 

𝑇𝑖𝑚𝑒 

W
in

d
 s

p
e
e
d

 
I. Tyass et al  Int. J. Renew. Energy Dev 2023, 12(2), 288-299 

| 291 

ISSN: 2252-4940/© 2023. The Author(s). Published by CBIORE 

The prediction operation consists to forecast the �̂� values 
from the known 𝑊 records with the objective to minimize the 
prediction error (𝑤𝑡+𝑖 − �̂�𝑡+𝑖 ) for each 𝑖 = 0 𝑡𝑜 ℎ. Thus, the 
prediction problem can be formulated as follow: 
 
[�̂�𝑡+1, �̂�𝑡+2, … , �̂�𝑡+ℎ−1, �̂�𝑡+ℎ ] =  𝑓(𝑤𝑡−𝑁 , 𝑤𝑡−(𝑁−1), … , 𝑤𝑡 ) (2) 

 
Where 𝑓 denotes the forecasting method. 
In this study, SARIMA and LSTM methods are used to predict 
the time related wind speed. Those models would be compared 
based on several metrics as detailed below. 

3.2. ARIMA model 

ARIMA model, which means Auto-Regressive Integrated 
Moving Average is the most popular stochastic models in time 
series forecasting (Siami-Namini et al., 2018). ARIMA is a time 
series model that can be employed to analyze and forecast next 
values in the series. ARIMA models are determined by the 
combination of three different features (𝑝,𝑑 and 𝑞). The 
parameters 𝑝, 𝑑 and 𝑞 are integers superior or equals to 0 and 
are described as follow: 

p: represents the seasonality, it’s specifying the number or 
order of the AR terms. 

d: the trend, it’s the number or order of the differences  
q: the noise, it represents the number or order of the MA, 

moving average terms. 
 
The fundamental forecasting equation in terms of y is: 
 
�̂�𝑡 = ϕ1 y𝑡−1 + ⋯ + ϕ𝑝 y𝑡−𝑝 + 𝑒𝑡  − θ1𝑒𝑡−1  − ⋯ θ𝑞 𝑒𝑡−𝑞     (3) 

 
Where ϕi are the auto-regression coefficients, θj are the 

moving average coefficients of the model, et is the noise, and yt 
represents the wind speed value at time t. 

A more detailed version of the ARIMA model is seasonal 
ARIMA (SARIMA), this technic proposes to model the 
seasonality of the time series by adding the period parameter S 
and the coefficients P, D and Q equivalent to the parameters p, 
d and q of the differentiated time series (Farida and Zeghdoudi, 
2020)  

The factors p, d, q, S, P, D and Q are fixed by improving the 
Akaike information criteria (AIC). This measure is based on 
striking a balance between a model's complexity and its fit 
(Mantalos et al., 2010). 

3.3. Deep learning models 

3.3.1. Recurrent neural network (RNN) 

A recurrent neural network (RNN) is a developed branch of 
artificial neural network (ANN) that aids in sequence modeling. 
The recurrent neural networks have the capacity to build on 
previous types of networks. 

In RNN, an input sample is added to previously recorded 
outputs that are included as new inputs. Although RNNs are 
efficient, they are affected by the vanishing gradient problem, 
which makes learning large data sequences very difficult. The 
gradient transmits information used in updating the RNR 
parameters and when the gradient becomes significantly 
smaller, the parameter updates become inconsequential, 
indicating that no meaningful learning is done. Instead, better 
variation of RNNs is used: Long Short-Term Networks (LSTM) 
(Tian et al., 2018). 

3.3.2. Long Short-Term Memory (LSTM) 

LSTM networks are a category of RNN that employs a 
combination of special and standard units. A “memory cell” in a 

LSTM unit can store information and keep it for lengthy periods, 
and a collection of gates is used to manage information. As a 
result, they use an activation function to determine the 
activation of a weighted sum. The structure of LSTM cell is 
shown in Figure 2. The LSTM cell contains the following 
components: Forget Gate f, candidate layer c, input Gate i, 
output Gate o, hidden state h, memory state c, inputs of the 
LSTM cell at any step are 𝑋𝑡 (current input), ht-1 (previous 
hidden state) and 𝑐𝑡−1 (previous memory state).  

The LSTM cell produces two outputs: ht (current hidden 
state) and 𝑐𝑡  (current memory state). Firstly, the LSTM cell get 
the prior memory state  𝑐𝑡−1 and performs by element 
multiplication with the forget gate f. If the forget gate value is 0, 
the prior memory state is totally forgotten; if it is 1, the previous 
memory state is completely transferred to the cell. (f gate values 
are between 0 and 1). 

 
𝑐𝑡 = 𝑐𝑡−1 × 𝑓𝑡       (4) 

 
Calculating the new memory state: 

𝑐𝑡 = 𝑐𝑡 + (𝑖𝑡 × 𝑐
′
𝑡 )    (5) 

Thus, the output is  
 

ℎ𝑡 = 𝑡𝑎𝑛ℎ(𝑐𝑡 )  (6) 
 

3.4.  Evaluation metrics 

Validating a prediction model is indispensable to ensure that the 
model is indeed capable of accurately predicting the values of a 
variable of interest. The focus is on whether the values predicted 
by the model are close to the true values in the validation data 
set. The statistical error metrics described in the following  are 
the most commonly reported indicators: The mean square error 
(MSE), the root mean square error (RMSE), mean absolute error 
(MAE) and mean absolute percentage error (MAPE) 
(Botchkarev, 2019): 

• MSE and RMSE 

The mean square error of an estimator of a parameter is a 
measure characterizing the precision of this estimator; it 
measures the average of the square of the error’s deviancies. 
However, the RMSE is the square root of the second sampling 
moment of discrepancies between forecasted and observed 
values. Because the influence of each error is proportional to the 
amount of the error squared, greater errors have a 
disproportionate effect on RMSE. (Hossin and Sulaiman, 2015),  
(Kamble and Deshmukh, 2017). 

 
𝑅𝑀𝑆𝐸𝑦,�̂� = √
1

𝑛
∑ (𝑦𝑗 − �̂�𝑗 )

2𝑛
𝑗=1     (7) 

 
Fig. 2 The structure of LSTM cell 


I. Tyass et al  Int. J. Renew. Energy Dev 2023, 12(2), 288-299 

| 292 

ISSN: 2252-4940/© 2023. The Author(s). Published by CBIORE 

• MAE 

Mean absolute error (MAE) is a statistical measure for 
comparing two continuous variables. Given that, y and y1 are 
variables of paired observations expressing the same 
phenomena. Comparisons of expected against observed, 
subsequent time versus starting time, and one measuring 
technique versus another measurement technique are examples 
of y1 versus y (Botchkarev, 2019): 

  
𝑀𝐴𝐸𝑦,�̂� =
1

𝑛
∑ |𝑦𝑗 − �̂�𝑗 |

𝑛
𝑗=1    (8) 

 
The mean absolute error is one of the main methods used for 

comparing predictions with their eventual results. 

• MAPE 

The mean absolute percentage error (MAPE) is one of the most 
widely used indicators of prediction performance. It was used 
as the primary measure in the M-competition (Kim and Kim, 
2016). 

 
𝑀𝐴𝑃𝐸𝑦,�̂� =
1

𝑛
∑ |

𝑦𝑗−�̂�𝑗

𝑦𝑗
|𝑛𝑗=1        (9) 

 
MAPE has a severe disadvantage in generates undefined 

values when the real values are null or near to zero (Tayman 
and Swanson, 1999). 

4. Materials and methods 

4.1.Methodical approach for the work process 

A methodical approach was developed to better implement and 
assess the two methods for wind speed prediction; the 
work process is depicted in the diagram shown in Figure 3. 

 
Fig. 3 Methodical approach for the work process 
 

Fig. 4 Wind Data Visualization : (a) March 2018, (b) July 2018 and (c) 

October 2018 

4.2.Data preparation 

The meteorological data for this study was recorded at the site 
"Abdelkhalak Torres" in "Al Koudia Al Baida -Tetouen" located 
at (Latitude: 35° 45' 35.1, Longitude: -5° 41' 19.9''). The data 
used in this is recorded daily with a 10 𝑚𝑖𝑛 sampling period at 
a height of 100 m above the ground. The data that was 
specifically used for this study was recorded between January 
1st and December 31st of the year 2018. Records includes date, 
time of the record and the wind speed. The data that was 
considered for the validation of the prediction model is related 
to three months: March, July and October of the year. 

Data cleaning is a crucial step of data preparation. Missing 
data might be the result of an instrument malfunction or a 
discrepancy with previously recorded data. In the suggested 
method, the mean of the previous five observations is used to 
fill in the missing data. 

The analysis and processing of the data was carried out by 
the python 3.8 programming language, with the Spyder 5.0.0 
application available on the graphic interface of the Anaconda 
distribution. 

Data preprocessing 

Data cleaning 

Data scaling 

Data analysis 

Data visualization 

Time series decomposition 

Prediction model development 

SARIMA 

Model 

Data collection 

Performance metrics calculation 

LSTM Model 

(a) 

M
e
a

n
 s

p
e
e
d

 
m
/s

 
M

e
a

n
 s

p
e
e
d

 
m
/s

 
(b) 

M
e
a

n
 s

p
e
e
d

 
m
/s

 
(c)    


I. Tyass et al  Int. J. Renew. Energy Dev 2023, 12(2), 288-299 

| 293 

ISSN: 2252-4940/© 2023. The Author(s). Published by CBIORE 

4.3. Time series wind data analysis 

A time series is defined as a sequence of numerical values 
indexed in time, habitually with the same time step separating 
two successive observations. In this paper, the data is a 
univariate time-series data. After importing the data set from the 
recorded CSV file, and to better apprehend the data recording 
for each studied month, the data is plotted as a time series with 
the date along the x-axis and the wind speed on the y-axis. The 
data visualization is showed in Figure 4. 

Time-series forecasting is the process of fitting a model to 
previous data and employing it to forecast future observations. 
Decomposition is a time series analysis technique, it gives a 
systematic method of thinking about a time series forecasting 
issue, in terms of modeling complexity and in terms of how to 
effectively capture each of these components in a given model. 
A time series must be decomposed to systematic and non-
systematic components. Systematic components are time series 
components with consistency or recurrence that can be 
characterized and modeled. Non-systematic components are 
time series components that cannot be directly represented. 
Time series have three systematic components: level, trend, and 
seasonality, with an additional non-systematic noise 
component. 

 
Fig. 5. Time series decomposition 

 
The average value is called level; the trend in the series is the 
increasing or decreasing value. Seasonality is the recurring 
short-term cycle, while noise is the random fluctuation in time 
series (Athiyarath et al., 2020). Results obtained for each month 
are shown in Figure 5. 

For each month, the plot on Figure 4 shows clearly that the 
wind speed is unstable and extreme winds are rare. It is noted 
that the seasonal component is more significant on October and 
evolves slowly in time, because the data are from separate 
months of the year. However, for consecutive months these will 
be different seasonal patterns with more important seasonal 
components. In addition, random fluctuations are more frequent 
in October and appear to be approximately stable in size over 
time. According to the trend line, the data and the trend are on 
the same scale and there is no long-term evolution of the series. 

5. Simulations and results 

5.1 Wind speed forecasting based on SARIMA model 

5.1.1  Parameter’s selection and model fitting 

The objective of this subsection is to conduct a grid search to 
identify the best combination of parameters that generates the 
highest performance for the model. To accomplish this 
objective, time series predictive model SARIMA was used. The 
code's output indicates that respectively: SARIMA(1, 1, 1)x(0, 0, 
1, 2) , SARIMA(1, 1, 1)x(0, 0, 1, 2), and SARIMA(1, 0, 1)x(0, 1, 1, 
2) generate the lowest AIC (The Akaike Information Criteria ) 
value of 14 869,12 for march, 9 443,72 for July, and 12 864,92  
for October. Therefore, this optimal combination was adopted. 

After fitting seasonal ARIMA models, it is important to run 
model diagnostics to ensure that the assumptions made by the 
model are correct. Figure 6 illustrates the diagnostics produced 
by the proposed model for each month. 

The conducted study has shown that the model residuals are 
normally distributed. The correlogram reveals that the 
autocorrelation of time series residuals are near to zero. This is 
confirmed by the plot of the residuals over time, which shows 
no significant seasonality. The Kernel Density Estimator (KDE) 
tracks tightly the normal distribution N (0, 1); this demonstrates 
that the residuals are distributed regularly. From the Q-Q plot, 
it can be observed that in the middle of the plot, the distribution 
of residuals closely follows the linear trend of the data obtained 
from a standard distribution; however, the residuals distribution 
diverges at the extremities, which corresponds to the extreme 
values of the data. Subsequently, the model generates a 
satisfactory fit that can be used to predict future values. 

5.1.2 Forecasting validation  

In order to evaluate the accuracy of the proposed forecasts, 25% 
of the time series data was used as a test set. The forecasted 
wind speed targeted by the present model is related to the 
periods between the 24th and the end of the months March, July 
and October 2018. The forecasts were generated at one-step 
forward, meaning that the forecasts at each date are compiled 
using the complete history up to that time. The graph in Figure 
7, compares the actual values to the predicted values. Overall, 
the forecasted data tightly aligns with the actual values; this 
confirms that the proposed model produces reliable prediction 
results. This performance will be quantified using the errors 
metrics introduced previously.  

 
R
e
s
id

u
a

l 
  
 S

e
a

s
o
n

a
l 

  
 T

r
e
n

d
 

March 2018 

R
e
s
id

u
a

l 
  
 S

e
a

s
o
n

a
l 

  
 T

r
e
n

d
 

July 2018 

R
e
s
id

u
a

l 
  
 S

e
a

s
o
n

a
l 

  
 T

r
e
n

d
 

October 2018 


I. Tyass et al  Int. J. Renew. Energy Dev 2023, 12(2), 288-299 

| 294 

ISSN: 2252-4940/© 2023. The Author(s). Published by CBIORE 

 
Fig. 6. The model diagnostics: (a) March 2018, (b) July 2018 and 

(c) October 2018 
 
 
Fig. 7. SARIMA Forecasting validation: (a) March 2018, (b) July 2018 

and (c) October 2018 

 
5.2 Wind speed forecasting with the LSTM network 

5.2.1 Experimental test setup 

For each month, the global dataset is split in training dataset and 
test dataset of 75% and 25% of data respectively. A walk-
forward validation forecasting model is used. Each step of the 
dataset test is browsed one at a time. Afterwards, the model is 
ready to make predictions. The new estimated value from the 
test data set is obtained and made available to predict the next 
time step. 

5.2.2 . LSTM data preparation 

To create a LSTM model, the data must be processed. The time 
series data is therefore turned into a supervised learning model. 
Data is separated into input (X) and output (Y). The observation 
from the prior time step (t-1) is used as the input, while the 
observation from the current time step (t) is used as the output. 
The hyperbolic tangent function (𝑡𝑎𝑛ℎ), having an average 
range (-1, 1), is the default activation function for LSTMs. 
Therefore, the training dataset was employed to generate the 

(a) 

(b) 

(c) 

(a) 

M
e
a

n
 s

p
e
e
d

 m
/s

 
(b) 

M
e
a

n
 s

p
e
e
d

 m
/s

 
M

e
a

n
 s

p
e
e
d

 m
/s

 
(c) 


I. Tyass et al  Int. J. Renew. Energy Dev 2023, 12(2), 288-299 

| 295 

ISSN: 2252-4940/© 2023. The Author(s). Published by CBIORE 

scaling coefficients (min and max) and then used the results to 
scale the test dataset and the observations were transformed to 
get a certain scale: the scale was reversed in order to restore the 
values to the original scale while computing error score in order 
to interpret and compare the acquired results. 

5.2.3  LSTM model development 

The LSTM layer requires that the inputs be arranged in a matrix 
with three following dimensions: samples, time steps, and 
features. Samples are sets of data in rows, time steps are 
discrete periods inside an observation, and features are distinct 
measurements taken at the time of the observation. 
Accordingly, each time step in the original sequence was treated 
as a distinct sample with a single time step and feature. The 
LSTM layer specifies the projected number of observations to 
be received in each batch. The frequency of updating the 
weights per time step is determined by the batch size and the 
number of epochs. The number of neurons is the last 
characteristic to define the LSTM layer, a number between 1 
and 5 is sufficient. To forecast the wind speed at the following 
time step, the network requires a single neuron in the output 

layer. Once the network is described, it must be compiled into 
an appropriate symbolic representation. ADAM optimization 
algorithm was applied to compile the network since it is a 
realistic approach with advantages over other stochastic 
optimization methods (Chang et al., 2019) (Zhou et al., 2019). 
After, the network is fitted to the training data, which is taken in 
a supervised learning format. As be needed to make one-step 
predictions, the batch size is fixed to 1, the best configuration 
found is: Batch Size: 1, number of epochs: 50, number of 
neurons: 4. The number of epochs specifies how many times the 
learning algorithm will work on the training data set. 

The result will be a two-dimensional array with a single value 
because they are supplied with a single input. At this point, the 
LSTM Network model for the wind speed dataset can be 
submitted for performance evaluation.  

Table 2 shows an example of the expected and predicted 
values in the test dataset for each month studied. Figure 8 
illustrates a line graph of the test values versus the predicted 
values. As observed, the forecasted data firmly aligns with the 
observed values, this, further confirms the model’s forecasting 
capability. 

 
Table 2 
The expected values versus predicted values 

March July October 

H=10, Pred= 8.523, Expec=11.068 
H=11, Pred=11.109, Expec=11.534 
H=12, Pred=11.545, Expec=10.359 
H=13, Pred=10.248, Expec=10.107 
H=14, Pred= 9.981, Expec=12.074 
H=15, Pred=12.072, Expec=11.480 

H=10, Pred=4.226, Expec=5.700 
H=11, Pred=5.820, Expec=6.169 
H=12, Pred=6.173, Expec=5.354 
H=13, Pred=5.164, Expec=7.654 
H=14, Pred=7.901, Expec=8.754 
H=15, Pred=8.851, Expec=7.180 

H=10, Pred=10.553, Expec=13.057 
H=11, Pred=13.191, Expec=12.621 
H=12, Pred=12.745, Expec=11.590 
H=13, Pred=11.738, Expec=11.424 
H=14, Pred=11.589, Expec=11.612 
H=15, Pred=11.787, Expec= 9.983 

 
(a) 
 

(b) 

 
(c) 

 
Fig. 8. Expected values vs predicted values: (a) March 2018, (b) 
July 2018 and (c) October 2018 

 
M
e
a

n
 s

p
e
e
d

 m
/s

 
M
e
a

n
 s

p
e
e
d

 m
/s

 
M
e
a

n
 s

p
e
e
d

 m
/s

 
I.Tyass et al  Int. J. Renew. Energy Dev 2023, 12(2), 288-299 
| 296 

 ISSN: 2252-4940/© 2023. The Author(s). Published by CBIORE 

5.2.4 Reinforcement of the model 

The outputs of neural networks vary according to the initial 
conditions. For this specific problem, the experiment from the 
previous section were repeatedly conducted and the average of 
each error metric as a measure of the configuration performance 
envisaged on unknown data was used. In a loop with a set 
number of repetitions, the model's fit and walk-forward 
performance were validated. Every iteration, the executions of 
error metrics are captured. Next, a summary of the scores for all 
of the error metrics (MSE, RMSE, MAE, and MAPE) is produced. 
Table 3 shows the errors metrics results for 15 iterations for the 
three periods studied. The box and whisker graph of the 
distribution of test MAPE results for each of the 15 iterations is 
shown in Figure 9. MAPE values obtained with different periods 

for the same number of iterations show significant variances. 
However, it is interesting to note that the model produced the 
lowest MAPE value in JULY when compared to the 

other months. Looking at the three boxes medians, the median 
for March is closest to the minimum error, but for October, 50% 

of MAPE values are close to the maximum value. The lowest 
interquartile range (Q3-Q1) calculated, IQR = 0.060%, 
corresponds to the month of JULY, which indicates that the 
MAPE value distribution is the most homogeneous and the 
errors values are evenly distributed around the median. 

5.3 Results discussion 

The objective of this analysis section is to evaluate the 
numerical predictive models and determine which one 
performed better in terms of producing the lowest metrics error 
and minimal computation time. Overall, both, LSTM and 
SARIMA  models offers good accuracy values of errors 
metrics comparing to results cited in the literature, for short 
term wind speed forecast (Liu et al., 2021) (Haddad et al., 2019) 
(Duan et al., 2021). As shown in Table 4, besides the 
improvement, LSTM model still provides the highest predictive 
performance in terms of minimal metrics error in March and 
October the months with a high velocity and more extreme 
values. 

 
Table 3  
Error metrics values resulting from 15 iterations 
 March  July  October 

 MSE RMSE MAE MAPE  MSE RMSE MAE MAPE  MSE RMSE MAE MAPE 

Minimum 1.1050 1.0449 0.7102 15.9400%  0.3947 0.6234 0.4544 10.5000%  1.5131 1.2298 0.9300 15.7000% 

Q1 1.1177 1.0493 0.7147 16.0094%  0.4022 0.6270 0.4573 10.6704%  1.5240 1.2398 0.9350 15.7667% 

Median 1.1203 1.0556 0.7164 16.0346%  0.4050 0.6310 0.4587 10.7006%  1.5295 1.2420 0.9370 15.8133% 

Q3 1.1286 1.0567 0.7177 16.2518%  0.4070 0.6352 0.4599 10.7307%  1.5414 1.2442 0.9372 15.8309% 

Maximum 1.1449 1.0586 0.7231 16.3737%  0.4089 0.6377 0.4889 10.9224%  1.5546 1.2464 0.9399 15.8926% 

Mean 1.1238 1.0525 0.7168 16.1607%  0.4042 0.6313 0.4606 10.6982%  1.5318 1.2413 0.9360 15.8025% 

Range 0.0399 0.0137 0.0129 0.4337%  0.0142 0.0143 0.0345 0.4224%  0.0415 0.0166 0.0099 0.1926% 

 
Fig. 9. LSTM Repeated Experiment Box and Whisker graph 

15.69%

15.71%

15.73%

15.75%

15.77%

15.79%

15.81%

15.83%

15.85%

15.87%

15.89%

MAPE -OCTOBER-

10.48%

10.53%

10.58%

10.63%

10.68%

10.73%

10.78%

10.83%

10.88%

10.93%

MAPE -JULY-

15.90%

15.95%

16.00%

16.05%

16.10%

16.15%

16.20%

16.25%

16.30%

16.35%

16.40%

MAPE -MARCH-


I.Tyass et al  Int. J. Renew. Energy Dev 2023, 12(2), 288-299 
| 297 

 ISSN: 2252-4940/© 2023. The Author(s). Published by CBIORE 

Table 4 
Errors comparison 

 
The remarkable performance observed through LSTM -based 
approaches are related to the used iterative optimization 
algorithm, with the goal of finding the best results, Furthermore, 
taking a look at the RMSE values, the statistical SARIMA model 
gives slightly better results on July, than the LSTM model. 
Therefore, in periods with small fluctuations, the neural network 
achieved comparable performances to the traditional statistical 
models. It should be mentioned that LSTM would give more 
accurate results if the data contained structural changes with 
frequent fluctuation. In effect, the SARIMA method is unable to 
interpret the non-linear component of the wind speed time 
series data, thus, the model cannot capture all of the available 
details of the dataset. Contrarily, LSTM is a deep learning tool 
developed to learn temporal patterns, capture non-linear 
relations, and store relevant memory for a longer period. 
Furthermore, the required computational time for forecasting 
becomes critical to make any comparison. The LSTM based 
model is most computationally demanding than SARIMA, 
because of the optimization algorithm, 50 epochs were trained 
for each optimizer for 15 times and found that training and 
testing time ranges from 16 to 20 minutes, while the compilation 
time using SARIMA does not exceed 6 min. Previous study has 
also supported this (Shivani et al., 2019). Indeed, authors in 
(Makridakis et al., 2018), confirm that the complexity reported 
by ML approaches still substantially higher than statistical 
methods. The utility and advantage of a large data-learning 
model such as LSTM, is more solicited when the size of data is 
much larger (Liu et al., 2021). In this study, relatively small series 
data sets (one month) were used for predictive model training 
ant testing. Otherwise, the performance differences favoring the 
LSTM model would have been considerably more interesting. 
In previous academic research, the ARIMA model provided 
favorable performance with a smaller volume of data (Elsaraiti 
and Merabet, 2021). 

6. Conclusion 

The present study is a comparative analysis of SARIMA and 
LSTM forecasting models. The main goal is to evaluate the 
prediction performance of each one. The study uses wind speed 
time series recorded at three different months as common data. 
Furthermore, SARIMA and LSTM were compared based on the 
same evaluation metrics: MSE, RMSE, MAE and MAPE. The 
MAPE values ranges from 10.5% to 16.10% for both models 
which indicates that performance is acceptable in both cases. 
The results analysis shows that the LSTM exudes higher 
performance than the SARIMA model. This distinction is 
basically due to the use of iterative optimization algorithm in 
LSTM model. Practically speaking, The SARIMA model is more 
functional, because it requires the tuning of six unique 
parameters (p, q, d, P, Q, D), while the LSTM requires the 
evaluation of numerous additional hyper parameters, such as 
the number of units in each layer, the number of layers, the 

batch size, the number of epochs and the activation function. 
Regarding the forecasting operation conducted in this study, 
related to wind speed in particular, SARIMA model was faster 
to train and less complicated to implement. As a perspective 
study, the obtained results can be supported by extending the 
comparative analysis. Thus, in the future work, other machine 
learning prediction approaches are to be considered such as the 
support vector machine. The upcoming study will also use 
larger datasets that extend over a year, allowing us to assess the 
impact of seasons on the forecasting ability. 

References 

Adekunle, S.A., (2017). Prédiction de la moyenne horaire de la vitesse 
du vent sur le site de Lomé par réseau de neurones. Sci. Appliquées 
Ing. 2, 1–12. 
http://publication.lecames.org/index.php/ing/article/view/10
76 

Altan, A., Karasu, S., Zio, E., (2021). A new hybrid model for wind speed 
forecasting combining long short-term memory neural network, 
decomposition methods and grey wolf optimizer. Appl. Soft 
Comput. 100, 106996. 
https://doi.org/10.1016/j.asoc.2020.106996 

Araya, I.A., Valle, C., Allende, H., (2020). A Multi-Scale Model based on 
the Long Short-Term Memory for day ahead hourly wind speed 
forecasting. Pattern Recognit. Lett. 136, 333–340. 
https://doi.org/10.1016/j.patrec.2019.10.011 

Asari, M., Nanahara, T., Maejima, T., Yamaguchi, K., Sato, T., (2002). A 
study on smoothing effect on output fluctuation of distributed 
wind power generation, in: IEEE/PES Transmission and 
Distribution Conference and Exhibition. Presented at the 
IEEE/PES Transmission and Distribution Conference and Exhibition, 
pp. 938–943 vol.2. https://doi.org/10.1109/TDC.2002.1177602 

Athiyarath, S., Paul, M., Krishnaswamy, S., (2020). A Comparative Study 
and Analysis of Time Series Forecasting Techniques. SN Comput. 
Sci. 1, 175. https://doi.org/10.1007/s42979-020-00180-5 

Babazadeh, H., Gao, W., Cheng, L., Lin, J., (2012). An hour ahead wind 
speed prediction by Kalman filter, in: 2012 IEEE Power 
Electronics and Machines in Wind Applications. Presented at the 
2012 IEEE Power Electronics and Machines in Wind Applications, pp. 
1–6. https://doi.org/10.1109/PEMWA.2012.6316394 

Bennitt, G.V., Schueler, T., (2012) An assessment of zenith total delay 
corrections from numerical weather prediction models 1. 
Geophysical Research Abstracts 14, EGU2012-11292, 
https://meetingorganizer.copernicus.org/EGU2012/EGU2012-
11292.pdf 

Bessac, J., Constantinescu, E., Anitescu, M., (2018). Stochastic 
simulation of predictive space–time scenarios of wind speed using 
observations and physical model outputs. Ann. Appl. Stat. 12. 
https://doi.org/10.1214/17-AOAS1099 

Botchkarev, A., (2019). A New Typology Design of Performance Metrics 
to Measure Errors in Machine Learning Regression Algorithms. 
Interdiscip. J. Inf. Knowl. Manag. 14, 045–076. 
https://doi.org/10.28945/4184 

Bououden, S., Chadli, M., Filali, S., El Hajjaji, A., (2012). Fuzzy model 
based multivariable predictive control of a variable speed wind 
turbine: LMI approach. Renew. Energy 37, 434–439. 
https://doi.org/10.1016/j.renene.2011.06.025 

Brereton, R.G., Lloyd, G.R., (2010). Support Vector Machines for 
classification and regression. Analyst 135, 230–267. 
https://doi.org/10.1039/B918972F 

Cassola, F., Burlando, M., (2012). Wind speed and wind energy forecast 
through Kalman filtering of Numerical Weather Prediction model 
output. Appl. Energy 99, 154–166. 
https://doi.org/10.1016/j.apenergy.2012.03.054 

Chang, Z., Zhang, Y., Chen, W., (2019). Electricity price prediction based 
on hybrid model of adam optimized LSTM neural network and 
wavelet transform. Energy 187, 115804. 
https://doi.org/10.1016/j.energy.2019.07.134 

Damousis, I.G., Alexiadis, M.C., Theocharis, J.B., Dokopoulos, P.S., 
(2004). A fuzzy model for wind speed prediction and power 
generation in wind parks using spatial correlation. IEEE Trans. 

 March  July  October 

 SARIMA LSTM  SARIMA LSTM  SARIMA LSTM 

MSE 1.19 1.10  0.38 0.39  1.54 1.51 

RMSE 1.09 1.04 
 

0.62 0.62 
 

1.24 1.23 

MAE 0.72 0.71  0.44 0.45  0.93 0.93 

MAPE 16.10% 15.94%  10.67% 10.50%  16.03% 15.70% 

http://publication.lecames.org/index.php/ing/article/view/1076
http://publication.lecames.org/index.php/ing/article/view/1076
https://doi.org/10.1016/j.asoc.2020.106996
https://doi.org/10.1016/j.patrec.2019.10.011
https://doi.org/10.1109/TDC.2002.1177602
https://doi.org/10.1007/s42979-020-00180-5
https://doi.org/10.1109/PEMWA.2012.6316394
https://meetingorganizer.copernicus.org/EGU2012/EGU2012-11292.pdf
https://meetingorganizer.copernicus.org/EGU2012/EGU2012-11292.pdf
https://doi.org/10.1214/17-AOAS1099
https://doi.org/10.28945/4184
https://doi.org/10.1016/j.renene.2011.06.025
https://doi.org/10.1039/B918972F
https://doi.org/10.1016/j.apenergy.2012.03.054
https://doi.org/10.1016/j.energy.2019.07.134


I. Tyass et al  Int. J. Renew. Energy Dev 2023, 12(2), 288-299 

| 298 

ISSN: 2252-4940/© 2023. The Author(s). Published by CBIORE 

Energy Convers. 19, 352–361. 
https://doi.org/10.1109/TEC.2003.821865 

Devis, A., Lipzig, N.P.M.V., Demuzere, M., (2018). Should future wind 
speed changes be taken into account in wind farm development? 
Environ. Res. Lett. 13, 064012. https://doi.org/10.1088/1748-
9326/aabff7 

Duan, Jikai, Zuo, H., Bai, Y., Duan, Jizheng, Chang, M., Chen, B., (2021). 
Short-term wind speed forecasting using recurrent neural 
networks with error correction. Energy 217, 119397. 
https://doi.org/10.1016/j.energy.2020.119397 

Eldali, F.A., Hansen, T.M., Suryanarayanan, S., Chong, E.K.P., (2016). 
Employing ARIMA models to improve wind power forecasts: A 
case study in ERCOT, in: 2016 North American Power 
Symposium (NAPS). Presented at the 2016 North American Power 
Symposium (NAPS), pp. 1–6. 
https://doi.org/10.1109/NAPS.2016.7747861 

Elsaraiti, M., Merabet, A., (2021). A Comparative Analysis of the ARIMA 
and LSTM Predictive Models and Their Effectiveness for 
Predicting Wind Speed. Energies 14, 6782. 
https://doi.org/10.3390/en14206782 

Erdem, E., Shi, J., (2011). ARMA based approaches for forecasting the 
tuple of wind speed and direction. Appl. Energy 88, 1405–1414. 
https://doi.org/10.1016/j.apenergy.2010.10.031 

Faniband, Y.P., Shaahid, S.M., (2020). Forecasting Wind Speed using 
Artificial Neural Networks – A Case Study of a Potential Location 
of Saudi Arabia. E3S Web Conf. 173, 01004. 
https://doi.org/10.1051/e3sconf/202017301004  

Farida, M., Zeghdoudi, H., (2020). On Modelling seasonal ARIMA series: 
Comparison, Application and Forecast (Number of Injured in 
Road Accidents in Northeast Algeria). WSEAS Trans. Syst. Control 
15, 235–246. https://doi.org/10.37394/23203.2020.15.25 

Geng, D., Zhang, H., Wu, H., (2020). Short-Term Wind Speed Prediction 
Based on Principal Component Analysis and LSTM. Appl. Sci. 10, 
4416. https://doi.org/10.3390/app10134416 

Haddad, M., Nicod, J., Boubacar Mainassara, Y., Rabehasaina, L., Al 
Masry, Z., Péra, M., (2019). Wind and Solar Forecasting for 
Renewable Energy System using SARIMA-based Model, in: 
International Conference on Time Series and Forecasting. Gran 
Canaria, Spain. 

Hajirahimi, Z., Khashei, M., (2019). Hybrid structures in time series 
modeling and forecasting: A review. Eng. Appl. Artif. Intell. 86, 83–
106. https://doi.org/10.1016/j.engappai.2019.08.018 

Hide, C., Moore, T., Smith, M., (2003). Adaptive Kalman Filtering for 
Low-cost INS/GPS. J. Navig. 56, 143–152. 
https://doi.org/10.1017/S0373463302002151 

Hossin, M. and Sulaiman, M.N. (2015). A Review on Evaluation Metrics 
for Data Classification Evaluations. Int. J. Data Min. Knowl. Manag. 
Process 5, 01–11. https://doi.org/10.5121/ijdkp.2015.5201 

Jaseena, K.U., Kovoor, B.C., (2020). Deterministic weather forecasting 
models based on intelligent predictors: A survey. J. King Saud 
Univ. - Comput. Inf. Sci. S1319157820304729. 
https://doi.org/10.1016/j.jksuci.2020.09.009 

Kamble, V.B., Deshmukh, S.N., (2017). Comparision Between Accuracy 
and MSE,RMSE by Using Proposed Method with Imputation 
Technique. Orient. J. Comput. Sci. Technol. 10, 773–779. 
https://doi.org/10.13005/ojcst/10.04.11 

Kim, S., Kim, H., (2016). A new metric of absolute percentage error for 
intermittent demand forecasts. Int. J. Forecast. 32, 669–679. 
https://doi.org/10.1016/j.ijforecast.2015.12.003 

Kodjo, M.K., Bédja, K., Ajavon, A.S.A., Faye, R.M., Lishou, C., (2008). 
Neural networks for predictive control of the mechanism of 
orientation of a wind turbine. J. Sci. Pour Ing. 9, 75–85. 
https://doi.org/10.4314/jspi.v9i1.30061 

Li, G., Shi, J., Zhou, J., (2011). Bayesian adaptive combination of short-
term wind speed forecasts from neural network models. Renew. 
Energy 36, 352–359. 
https://doi.org/10.1016/j.renene.2010.06.049 

Liu, X., Lin, Z., Feng, Z., (2021). Short-term offshore wind speed forecast 
by seasonal ARIMA - A comparison against GRU and LSTM. 
Energy 227, 120492. 
https://doi.org/10.1016/j.energy.2021.120492 

Lorenc, A.C., (1986). Analysis methods for numerical weather 
prediction. Q. J. R. Meteorol. Soc. 112, 1177–1194. 
https://doi.org/10.1002/qj.49711247414 

Makridakis, S., Spiliotis, E., Assimakopoulos, V., (2018). Statistical and 
Machine Learning forecasting methods: Concerns and ways 
forward. PLOS ONE 13, e0194889. 
https://doi.org/10.1371/journal.pone.0194889 

Mantalos, P., Mattheou, K., Karagrigoriou, A., (2010). Forecasting 
ARMA models: a comparative study of information criteria 
focusing on MDIC. J. Stat. Comput. Simul. 80, 61–73. 
https://doi.org/10.1080/00949650802464137 

Martinez-García, F.P., Contreras-de-Villar, A., Muñoz-Perez, J.J., (2021). 
Review of Wind Models at a Local Scale: Advantages and 
Disadvantages. J. Mar. Sci. Eng. 9, 318. 
https://doi.org/10.3390/jmse9030318 

Mi, X., Liu, H., Li, Y., (2019). Wind speed prediction model using 
singular spectrum analysis, empirical mode decomposition and 
convolutional support vector machine. Energy Convers. Manag. 
180, 196–205. https://doi.org/10.1016/j.enconman.2018.11.006 

Miranda, M.S., Dunn, R.W., (2006). One-hour-ahead wind speed 
prediction using a Bayesian methodology, in: 2006 IEEE Power 
Engineering Society General Meeting. Presented at the 2006 IEEE 
Power Engineering Society General Meeting, p. 6 pp.-. 
https://doi.org/10.1109/PES.2006.1709479 

Nair, K.R., Vanitha, V., Jisma, M., (2017). Forecasting of wind speed 
using ANN, ARIMA and Hybrid models, in: 2017 International 
Conference on Intelligent Computing, Instrumentation and 
Control Technologies (ICICICT). Presented at the 2017 
International Conference on Intelligent Computing, Instrumentation 
and Control Technologies (ICICICT), IEEE, Kerala State,Kannur, 
India, pp. 170–175. 
https://doi.org/10.1109/ICICICT1.2017.8342555 

Navas, R.K.B., Prakash, S., Sasipraba, T., 2020. Artificial Neural Network 
based computing model for wind speed prediction: A case study 
of Coimbatore, Tamil Nadu, India. Phys. Stat. Mech. Its Appl. 542, 
123383. https://doi.org/10.1016/j.physa.2019.123383 

Nazir, M.S., Alturise, F., Alshmrany, S., Nazir, H.M.J., Bilal, M., Abdalla, 
A.N., Sanjeevikumar, P., M. Ali, Z., (2020). Wind Generation 
Forecasting Methods and Proliferation of Artificial Neural 
Network: A Review of Five Years Research Trend. Sustainability 
12, 3778. https://doi.org/10.3390/su12093778 

Nguyen, L., Novák, V., (2019). Forecasting seasonal time series based 
on fuzzy techniques. Fuzzy Sets Syst. 361, 114–129. 
https://doi.org/10.1016/j.fss.2018.09.010 

Pinto, T., Ramos, S., Sousa, T.M., Vale, Z., (2014). Short-term wind 
speed forecasting using Support Vector Machines, in: 2014 IEEE 
Symposium on Computational Intelligence in Dynamic and 
Uncertain Environments (CIDUE). Presented at the 2014 IEEE 
Symposium on Computational Intelligence in Dynamic and Uncertain 
Environments (CIDUE), pp. 40–46. 
https://doi.org/10.1109/CIDUE.2014.7007865 

Qian, Z., Pei, Y., Zareipour, H., Chen, N., (2019). A review and discussion 
of decomposition-based hybrid models for wind energy 
forecasting applications. Appl. Energy 235, 939–953. 
https://doi.org/10.1016/j.apenergy.2018.10.080 

Ranganayaki, V., Deepa, S.N., (2017). SVM Based Neuro Fuzzy Model 
for Short Term Wind Power Forecasting. Natl. Acad. Sci. Lett. 40, 
131–134. https://doi.org/10.1007/s40009-016-0521-6 

Shivani, Sandhu, K.S., Ramachandran Nair, A., (2019). A Comparative 
Study of ARIMA and RNN for Short Term Wind Speed 
Forecasting, in: 2019 10th International Conference on 
Computing, Communication and Networking Technologies 
(ICCCNT). Presented at the 2019 10th International Conference 
on Computing, Communication and Networking Technologies 
(ICCCNT), IEEE, Kanpur, India, pp. 1–7. 
https://doi.org/10.1109/ICCCNT45670.2019.8944466 

Siami-Namini, S., Tavakoli, N., Siami Namin, A., (2018). A Comparison 
of ARIMA and LSTM in Forecasting Time Series, in: 2018 17th 
IEEE International Conference on Machine Learning and 
Applications (ICMLA). Presented at the 2018 17th IEEE 
International Conference on Machine Learning and Applications 
(ICMLA), IEEE, Orlando, FL, pp. 1394–1401. 
https://doi.org/10.1109/ICMLA.2018.00227 

Soman, S.S., Zareipour, H., Malik, O., Mandal, P., (2010). A review of 
wind power and wind speed forecasting methods with different 
time horizons, in: North American Power Symposium 2010. 

https://doi.org/10.1109/TEC.2003.821865
https://doi.org/10.1088/1748-9326/aabff7
https://doi.org/10.1088/1748-9326/aabff7
https://doi.org/10.1016/j.energy.2020.119397
https://doi.org/10.1109/NAPS.2016.7747861
https://doi.org/10.3390/en14206782
https://doi.org/10.1016/j.apenergy.2010.10.031
https://doi.org/10.1051/e3sconf/202017301004
https://doi.org/10.37394/23203.2020.15.25
https://doi.org/10.3390/app10134416
https://doi.org/10.1016/j.engappai.2019.08.018
https://doi.org/10.1017/S0373463302002151
https://doi.org/10.5121/ijdkp.2015.5201
https://doi.org/10.1016/j.jksuci.2020.09.009
https://doi.org/10.13005/ojcst/10.04.11
https://doi.org/10.1016/j.ijforecast.2015.12.003
https://doi.org/10.4314/jspi.v9i1.30061
https://doi.org/10.1016/j.renene.2010.06.049
https://doi.org/10.1016/j.energy.2021.120492
https://doi.org/10.1002/qj.49711247414
https://doi.org/10.1371/journal.pone.0194889
https://doi.org/10.1080/00949650802464137
https://doi.org/10.3390/jmse9030318
https://doi.org/10.1016/j.enconman.2018.11.006
https://doi.org/10.1109/PES.2006.1709479
https://doi.org/10.1109/ICICICT1.2017.8342555
https://doi.org/10.1016/j.physa.2019.123383
https://doi.org/10.3390/su12093778
https://doi.org/10.1016/j.fss.2018.09.010
https://doi.org/10.1109/CIDUE.2014.7007865
https://doi.org/10.1016/j.apenergy.2018.10.080
https://doi.org/10.1007/s40009-016-0521-6
https://doi.org/10.1109/ICCCNT45670.2019.8944466
https://doi.org/10.1109/ICMLA.2018.00227


I. Tyass et al  Int. J. Renew. Energy Dev 2023, 12(2), 288-299 

| 299 

ISSN: 2252-4940/© 2023. The Author(s). Published by CBIORE 

Presented at the North American Power Symposium 2010, pp. 1–
8. https://doi.org/10.1109/NAPS.2010.5619586 

Tascikaraoglu, A., Uzunoglu, M., (2014). A review of combined 
approaches for prediction of short-term wind speed and power. 
Renew. Sustain. Energy Rev. 34, 243–254. 
https://doi.org/10.1016/j.rser.2014.03.033 

Tayman, J., Swanson, D.A., 1999. On the validity of MAPE as a measure 
of population forecast accuracy. Popul. Res. Policy Rev. 18, 299–
322. https://doi.org/10.1023/A:1006166418051 

Tian, Y., Xu, Y.-P., Yang, Z., Wang, G., Zhu, Q., 2018. Integration of a 
Parsimonious Hydrological Model with Recurrent Neural 
Networks for Improved Streamflow Forecasting. Water 10, 1655. 
https://doi.org/10.3390/w10111655 

Tokgöz, A., Ünal, G., 2018. A RNN based time series approach for 
forecasting turkish electricity load, in: 2018 26th Signal 
Processing and Communications Applications Conference (SIU). 
Presented at the 2018 26th Signal Processing and Communications 
Applications Conference (SIU), pp. 1–4. 
https://doi.org/10.1109/SIU.2018.8404313 

Torres, J.L., García, A., De Blas, M., De Francisco, A., 2005. Forecast of 
hourly average wind speed with ARMA models in Navarre 
(Spain). Sol. Energy 79, 65–77. 
https://doi.org/10.1016/j.solener.2004.09.013 

Wu, W., Shaikhouni, A., Donoghue, J.R., Black, M.J., 2004. Closed-loop 
neural control of cursor motion using a Kalman filter, in: The 26th 
Annual International Conference of the IEEE Engineering in 
Medicine and Biology Society. Presented at the The 26th Annual 

International Conference of the IEEE Engineering in Medicine and 
Biology Society, pp. 4126–4129. 
https://doi.org/10.1109/IEMBS.2004.1404151 

Yang, J., Astitha, M., Monache, L.D., Alessandrini, S., 2018. An Analog 
Technique to Improve Storm Wind Speed Prediction Using a Dual 
NWP Model Approach. Mon. Weather Rev. 146, 4057–4077. 
https://doi.org/10.1175/MWR-D-17-0198.1 

Yatiyana, E., Rajakaruna, S., Ghosh, A., 2017. Wind speed and direction 
forecasting for wind power generation using ARIMA model, in: 
2017 Australasian Universities Power Engineering Conference 
(AUPEC). Presented at the 2017 Australasian Universities Power 
Engineering Conference (AUPEC), pp. 1–6. 
https://doi.org/10.1109/AUPEC.2017.8282494 

Zhang, C., Wei, H., Xie, L., Shen, Y., Zhang, K., 2016. Direct interval 
forecasting of wind speed using radial basis function neural 
networks in a multi-objective optimization framework. 
Neurocomputing 205, 53–63. 
https://doi.org/10.1016/j.neucom.2016.03.061 

Zhang, W., Zhang, L., Wang, J., Niu, X., 2020. Hybrid system based on 
a multi-objective optimization and kernel approximation for 
multi-scale wind speed forecasting. Appl. Energy 277, 115561. 
https://doi.org/10.1016/j.apenergy.2020.115561 

Zhou, B., Ma, X., Luo, Y., Yang, D., 2019. Wind Power Prediction Based 
on LSTM Networks and Nonparametric Kernel Density 
Estimation. IEEE Access 7, 165279–165292. 
https://doi.org/10.1109/ACCESS.2019.2952555 

 
© 2023. The Author(s). This article is an open access article distributed under the terms and conditions of the Creative Commons 
Attribution-ShareAlike 4.0 (CC BY-SA) International License (http://creativecommons.org/licenses/by-sa/4.0/) 

https://doi.org/10.1109/NAPS.2010.5619586
https://doi.org/10.1016/j.rser.2014.03.033
https://doi.org/10.1023/A:1006166418051
https://doi.org/10.3390/w10111655
https://doi.org/10.1109/SIU.2018.8404313
https://doi.org/10.1016/j.solener.2004.09.013
https://doi.org/10.1109/IEMBS.2004.1404151
https://doi.org/10.1175/MWR-D-17-0198.1
https://doi.org/10.1109/AUPEC.2017.8282494
https://doi.org/10.1016/j.neucom.2016.03.061
https://doi.org/10.1016/j.apenergy.2020.115561
https://doi.org/10.1109/ACCESS.2019.2952555