Microsoft Word - 24-3278_s1_ETASR_V10_N1_pp5228-5232 Engineering, Technology & Applied Science Research Vol. 10, No. 1, 2020, 5228-5232 5228 www.etasr.com Boubaker et al.: Prediction of Daily Global Solar Radiation using Resilient-propagation Artificial … Prediction of Daily Global Solar Radiation using Resilient-propagation Artificial Neural Network and Historical Data: A Case Study of Hail, Saudi Arabia Sahbi Boubaker Department of Electronics Engineering University of Hail Hail City, Saudi Arabia s.boubaker@uoh.edu.sa Souad Kamel Department of Computer & Network Engineering, University of Jeddah Jeddah, Saudi Arabia skamel@uj.edu.sa Mourad Kchaou Department of Electrical Engineering University of Hail Hail, Saudi Arabia m.kchaou@uoh.edu.sa Abstract—In this paper, several different Feed Forward Artificial Neural Networks (FFANNs) were used for forecasting the one- day-ahead Global Horizontal Irradiation (GHI) in Hail region, Saudi Arabia. The main motivation behind predicting GHI is that it is a critical parameter in sizing and planning photovoltaic water pumping systems. The novelty of the proposed approach is that it employs only the historical values of the GHI itself as explanatory variables and a fast training algorithm (resilient- propagation). In terms of performance metrics, the rp-trained FFANNs provided better results than Quasi-Newton (bfg) algorithm trained FFANNs for almost all the studied combinations of the FFANN structure. It has been also shown that increasing the number of neurons per layer didn’t improve the performance. Medium structures with fast training algorithms are recommended. Keywords-global horizontal irradiation (GHI); forecasting; feed-forward artificial neural network (FFANN); resilient- propagation (rp) training algorithm I. INTRODUCTION Hail region (Saudi Arabia) has a semi-arid climate where solar energy is abundant, while water needs to be pumped from relatively deep wells. Due to the new electricity pricing policy adopted in Saudi Arabia at the beginning of 2018, where the price of one KWh increased from 0.05SAR to 0.18SAR (1USD=3.75SAR), solar energy can constitute an alternative solution replacing classical electricity. Predicting as accurately as possible the future amounts of solar energy is of high importance in designing stand-alone or grid-connected solar plants. Given the fact that solar energy resources measurement stations remain relatively expensive, one solution is the design of computerized numerical forecasters. Moreover, global horizontal irradiation (GHI) is characterized by a high level of variability due to the climatic and geographical factors affecting it. Accordingly, researchers have resorted to mathematical models to describe such complex and uncertain relationships. An increasing interest in more reliable and accurate forecasting approaches has been observed among the solar energy research community during the last few years. Various techniques have been used in the literature to forecast GHI including time-series (TS) and machine learning (ML). Artificial Neural Networks (ANNs) as flexible tools having high capabilities to map inputs (patterns) to outputs (targets) are popular in the domain of forecasting. However, accuracy is found to be varying from one study to another and to be highly related to the nature and size of the available datasets. The present paper focuses on forecasting the GHI in Hail region using different structures of a Feed-Forward ANN (FFANN) based only on the historical records of the GHI itself. Thus, the GHI is expressed as a symbolic form GHI(t)=f(GHI(t-1), GHI(t-2), …, GHI(t-d)). This form is a dynamic description of the next day GHI. Once this form is identified, it will be used in a recursive manner to predict the future value of the GHI. Based on trial-and-error procedure, different FFANN structures, input combinations, and training algorithms have been explored in order to improve the accuracy and reduce the computational burden. It should be noted that the results presented in this paper are a part of a complete study aiming at sizing, implementing and controlling a pilot photovoltaic water pumping system (PVWPS) in Hail, Saudi Arabia. II. LITERATURE REVIEW In this section, a critical discussion is presented in order to situate the contribution of our paper. For predicting daily global solar radiation in arid Northwest of China, a back-propagation ANN tuned by particle swarm optimization (BP-PSO) has been investigated in [1], using data from eight meteorological stations. The BP-PSO has shown good performance with a coefficient of determination R 2 varying between 0.5630 and 0.9678. Sunshine duration has been found to be the most affecting global solar radiation. However, the study presented a kind of now-casting (assessment) exercise rather than a forecasting (prediction) one. Based on multiple neural networks and using a photovoltaic panel (PV) model associated with an irradiance forecast, a PV yield prediction system has been presented in [2]. The designed GHI forecaster has shown a mean absolute percentage error (MAPE) of 3.4% on a sunny day and 23% on a cloudy for Stuttgart, Germany. The developed approach is found to Corresponding author: Sahbi Boubaker Engineering, Technology & Applied Science Research Vol. 10, No. 1, 2020, 5228-5232 5229 www.etasr.com Boubaker et al.: Prediction of Daily Global Solar Radiation using Resilient-propagation Artificial … require a large meteorological data set which is usually unavailable. A quaternion-valued neural network (QVNN) based method has been developed for forecasting GHI in [3]. Presenting the meteorological variables to the quaternion domain in a unique variable including the latitude, the longitude and the time indexes has resulted on three times reduction of the input-output layers which decreased computation time. This method has been successfully applied on real datasets from Tamarnasset, Algeria. Several empirical equations and ANN-based algorithms were used for estimating the solar radiation for stations of Aristotle University, Greece [4]. Using daily meteorological data such as temperature, radiation and humidity as explanatory variables, ANN and multi-linear regression (MLR) have been found able to improve the accuracy. Prediction and forecasting of monthly mean daily GHI has been implemented using ANNs. Nonlinear autoregressive (NAR) with exogenous input (NARX) and hybrid time-series have been evaluated in selected regions from Nigeria. The hybrid model involving the current month number has been found to be the most accurate and reliable. In fact, the coefficient of determination (R 2 ) has reached 0.96 in Abuja when using the hybrid model [5]. Since solar radiation is scarcely measured in Turkey, an approach using ANN, adaptive neuro-fuzzy inference system (ANFIS), and MLR was implemented in order to forecast GHI in [6]. Using a variance factor analysis procedure, calendar month number, average temperature, average relative humidity and extraterrestrial radiation have been found the most affecting GHI variables. ANN was more accurate outperforming ANFIS, MLR and a set of empirical equations. A hybrid approach including wavelet multi-resolution procedure combined to ANN has been used in [7] for modeling solar radiation. Wavelets have been applied to the original time-series and decomposed them into simple parts. After that, various ANN types (MLP, ANFIS, NARX and generalized ANN (GRNN)) have been used to reconstruct the original signals. The coefficient of determination R 2 has been improved (6.84%) for the MLP and the RMSE (2.78%) for the GRNN without wavelet transformation. The case study was Abudhabi, United Arab Emirates. Further testing was declared to be interesting in other geographical locations. For sizing a stand-alone PV system at Almadinah, Saudi Arabia [8], a radial-basis ANN has been utilized for predicting global solar radiation. The RBF using the sunshine duration and the temperature as explanatory variables has provided a coefficient of determination of 98.80%. A set of artificial intelligence methods including ANNs have been compared when used for assessing global solar radiation in 12 locations over Iran in [9]. The group method handling data (GMHD) has been found to outperform the other methods in terms of R 2 , RMSE and MSE. ANN-based methods combined to several feature selection techniques have been devised for predicting one-day-ahead GHI in different locations in Saudi Arabia [10]. Further improvements will be obtained when taking into consideration the uncertainties of the used data. A back- propagation ANN (BPNN) optimized by genetic algorithm (GA) and PSO for predicting daily diffuse solar radiation in Beijing, China has been implemented in [11]. Based on several external variables such as temperature and humidity, the PSO- BPNN has been found to be more accurate than GA-BPNN and BPNN. Combining BPNN to a global optimization algorithm has helped in overcoming the problem of local search algorithms based on gradient descent. One can conclude that the accuracy of the obtained results is strongly linked to factors such as the size and quality of datasets, the geographic location, the climatic conditions and the design of the method itself. It could be also noted that combined methods look to be more suitable since they allow superposing the advantages of each one of the individual methods. III. MATERIALS AND METHODS The data used to perform this study were collected from King Abdullah City for Atomic and Renewable Energy (KACARE) in Saudi Arabia. About two years of daily GHI covering the period from January 1, 2015 to November 30, 2016 from a measurement station located in Hail City (Figure 1) were utilized. The data were divided into subsets: 85% were used for training and 15% for testing of the ANN models. Fig. 1. Hail region location A. ANN Pprinciples Basically, ANNs are computing agents that have the ability to imitate the human brain functions. Structurally, an ANN is composed of nodes, weights and activation functions. In its basic form, an ANN includes three layers (input, hidden and output). The main strength of an ANN is that given a set of inputs (patterns) and outputs (targets), it performs a training process to map the inputs to the outputs. This process is called training which can be supervised or unsupervised. B. ANNs Used in our Study In this study, we considered the case where the one-day- ahead GHI is affected only by its own previous values. Thus, the ANN inputs are respectively GHI(t-1), GHI(t-2), …, GHI(t-d). The output is the GHI(t) and d is the model order (Figure 2). The number of neurons in each layer is determined by trial-and-error procedure. The basic multi-layer perceptron (MLP) [14] was used first with a small number of neurons. If the ANN fails to provide good performance, the number of hidden layers and the number of their neurons are increased in a reasonable manner. Weighted-inputs are summed with biases Engineering, Technology & Applied Science Research Vol. 10, No. 1, 2020, 5228-5232 5230 www.etasr.com Boubaker et al.: Prediction of Daily Global Solar Radiation using Resilient-propagation Artificial … and fed to a transfer function which converts them to an output of the input layer. This output plays after the role of input to the next layer. This process is repeated until reaching the last layer (commonly called output layer). In general, an ANN operates following seven steps: data collection, dataset pre-processing, creation of the network structure, configuration of the network, definition of the ANN training parameters, training and testing of the ANN. Fig. 2. Proposed FFANN for predicting GHI The multi-layer FFANN is described by the following set of equations: a1 = f1 (W1,1X+b1) a2 = f2 (W2,1a1+b2) . aj = fj (Wj,j-1aj-1+bj) . aN = fN (WN,N-1aN-1+bN) (1) where: a j , j=1:N are the outputs of the respective layers. a 1 is the output of the input layer and a N is the output of the output layer), W j,j-1 are the weights of the j th layer, b j are the biases of the j th layer, f j is the transfer function of the j th layer. Tangent sigmoid (tansig) and logsig are commonly used for input and hidden layers whereas purelin is usually used for the output layer. It should be noted that N=3 layers (1 input, 1 hidden and 1 output) are used in most cases. If the number of hidden layers is more than one, we will be in the case of deep learning (a special class of machine learning). During the training phase, the training algorithm updates iteratively the ANN weights as in (2): W(k) = W(k-1) + 2α∆W(k-1)X T (2) where k is the iteration index, α the learning rate and ∆W(k-1) is the error function related to the weights. In practice, it is very difficult to choose the suitable training algorithm in terms of accuracy and computation time. The choice is conditioned depending on many factors including the quality and size of the datasets, the number of weights and biases to be adjusted, the level of accuracy and the complexity of the problem to be solved. Commonly, Levenberg-Marquardt (LM), scaled-conjugate gradient (SCG) and resilient propagation (rp) are used. IV. RESULTS AND DISCUSSION All computations are carried out using Matlab2013a run on an Intel Core i3 processor with a CPU of 2.40Ghz with 4GB RAM. First, the collected data were pre-processed by dividing them by the maximum value of the daily GHI. The aim of this operation is to transform them into values within the interval [0, 1] which is most suitable for the ANN activation functions. At the end of the training and the validation phases, the results are multiplied by the same maximum value to return-back to their regular values. To evaluate the performance of the developed ANN algorithms, three performance metrics, namely the mean absolute percentage error (MAPE), the coefficient of determination (R 2 ) and the RMSE were used (see [13, 14] for definitions and details). After several runs, a three-layer (1 input, 1 hidden and 1 output) ANN was adopted. The number of neurons in each layer varied. The resilient-propagation (rp) and the Quasi-Newton (bfg) are used as training algorithms. The tansig and the linear transfer functions (purelin) have been used respectively for the input, hidden and output layers. Several combinations of inputs have been roughly tested. It has been found that better results were obtained when using the six previous daily GHIs as inputs for forecasting the one-day- ahead GHI. The obtained results for some combinations of ANN structures are summarized in Table I for the training and testing phase, in Table II for the training phase and in Table III for the testing phase. TABLE I. PERFORMANCE O F THE TRAININ G AND TESTIN G PH ASES Case # ANN structure Training algorithm MAPE R 2 RMSE 1 (10, 5, 1) bfg 6.8554 0.8594 6.7894 2 (10, 5, 1) rp 7.7818 0.8112 7.8657 3 (20, 15, 1) bfg 5.1914 0.9166 5.2281 4 (20, 15, 1) rp 6.9134 0.8576 6.8316 5 (50, 30, 1) bfg 7.9008 0.8069 7.9563 6 (50, 30, 1) rp 5.6850 0.9058 5.5566 Jan. 1, 2015 to Nov. 30, 2016 TABLE II. PERFORMAN CE O F THE TRAIN IN G PHASE Case # ANN structure Training algorithm MAPE R 2 RMSE 1 (10, 5, 1) bfg 6.9162 0.8606 6.8871 2 (10, 5, 1) rp 7.9192 0.8129 7.9784 3 (20, 15, 1) bfg 4.7008 0.9363 4.6561 4 (20, 15, 1) rp 6.9653 0.8597 6.9087 5 (50, 30, 1) bfg 8.1239 0.8050 8.1450 6 (50, 30, 1) rp 5.3614 0.9157 5.3549 Jan. 1, 2015 to Aug. 26, 2016 TABLE III. PERFORMANCE O F THE TESTIN G PHASE Case # ANN structure Training algorithm MAPE R 2 RMSE 1 (10, 5, 1) bfg 6.3815 0.7840 6.0898 2 (10, 5, 1) rp 6.7113 0.7098 7.0590 3 (20, 15, 1) bfg 9.0127 0.6188 8.0899 4 (20, 15, 1) rp 6.5091 0.7698 6.2873 5 (50, 30, 1) bfg 6.1631 0.7506 6.5435 6 (50, 30, 1) rp 7.5339 0.7337 6.7616 Aug. 27, 2016 to Nov. 30, 2016 To investigate the performance of the FFANN for forecasting daily GHI in Hail region, six cases have been Engineering, Technology & Applied Science Research Vol. 10, No. 1, 2020, 5228-5232 5231 www.etasr.com Boubaker et al.: Prediction of Daily Global Solar Radiation using Resilient-propagation Artificial … considered. Three levels of investigations have been drawn. At first, the effect of the ANN structure was studied (more particularly the number of neurons per layer). Then, the used training algorithm in terms of convergence characteristics, result accuracy, and training time and finally, the performance of the ANN for different data subsets (training and testing subset, training subset, and testing subset). By analyzing the results of case#1 and case#2 where a relatively small number of neurons have been utilized, we can conclude that the bfg algorithm performs better than the rp algorithm. By increasing the number of neurons per layer (case#3 and case#4), an improvement of performance for both training algorithms for the three data subsets has been obtained. However, the rp training algorithm has provided better results during the testing phase (MAPE=6.5091, R 2 =0.7698 and RMSE=6.2873). This result is very interesting since it concerns the testing phase because naturally the training dataset has to provide good agreement when used in testing phase. The relatively high number of neurons (case#5 and case#6) didn’t improve the results for the bfg algorithm in the three subsets. In addition, the bfg algorithm took a lot of time in training the ANN (in case#6, the training time was more than one hour). The rp algorithm was way faster than the bfg. Based on the above described results, it can be concluded that for a small number of neurons per layer it is better to use the Quasi-Newton (bfg) algorithm. For a medium number of neurons, rp can be chosen for its fastness and bfg can be chosen for its accuracy. In the case of high numbers of neurons, the rp training algorithm performs better in terms of computation time. Figures 3 and 4 show the graphs of the forecasted GHI, the observed (real) GHI, and the scattered plot of the obtained results during the testing phase. Fig. 3. Forecasted and observed GHI for the testing phase (FFANN trained by resilient-propagation algorithm) V. CONCLUSIONS The suitability of FFANNs to predict daily GHI was evaluated using data from the King Abdullah City for Atomic and Renewable Energy (KACARE) station located in Hail College of Technology. Fig. 4. Scattered plot (case#6) The obtained results were compared with the use of statistical criteria such as MAPE, R2, and RMSE. The best results were recorded when using a medium structure (20, 15, 1) of neurons for the input, hidden and output layers and the Quasi-Newton (bfg) training algorithm. However, the resilient-propagation (rp) algorithm is recommended for structures with high number of neurons in different layers. Further improvements will be achieved in future works by using deep learning (DL) algorithms with more data. Also, other explanatory variables such as climatic conditions and storm effects will be investigated. The results of this study can be extended to photovoltaic power system modeling as in [15]. ACKNOWLEDGMENT The authors would like to thank Hail University for providing the grant number 160811 to support this study. REFERENCES [1] Y. Zhang, N. Cui, Y. Feng, D. Gong, X. Hu, “Comparison of BP, PSO- BP and statistical models for predicting daily global solar radiation in arid Northwest China”, Computers and Electronics in Agriculture, Vol. 164, Article ID 104905, 2019 [2] S. P. Durrani, S. Balluff, L. Wurzer, S. Krautter, “Photovoltaic yield prediction using an irradiance forecast model based on multiple neural networks”, Journal of Modern Power Systems and Clean Energy, Vol. 6, No. 2, pp. 255–267, 2018 [3] L. S. Saoud, F. Rahmoune, V. Tourtchine, K. Baddari, “A novel method to forecast 24 h of global solar Irradiation”, Energy Systems, Vol. 9, pp. 171–193, 2018 [4] V. Z. Antonopoulos, D. M. Papamichail, V. G. Aschonitis, A. V. Antonopoulos, “Solar radiation estimation methods using ANN and empirical models”, Computers and Electronics in Agriculture, Vol. 160, pp. 160–167, 2019 [5] C. G. Ozoegwu, “Artificial neural network forecast of monthly mean daily global solar radiation of selected locations based on time series and month number”, Journal of Cleaner Production, Vol. 216, pp. 1-13, 2019 Engineering, Technology & Applied Science Research Vol. 10, No. 1, 2020, 5228-5232 5232 www.etasr.com Boubaker et al.: Prediction of Daily Global Solar Radiation using Resilient-propagation Artificial … [6] H. Citakoglu, “Comparison of artificial intelligence techniques via empirical equations for prediction of solar radiation”, Computers and Electronics in Agriculture, Vol. 118, pp. 28-37, 2015 [7] S. Hussain, A. AlAlili, “A hybrid solar radiation modeling approach using wavelet multiresolution analysis and artificial neural networks”, Applied Energy, Vol. 208, pp. 540-550, 2017 [8] M. Benghanem, A. Mellit, “Radial basis function network-based prediction of global solar radiation data: application for sizing of a stand-alone photovoltaic system at al-Madinah, Saudi Arabia”, Energy, Vol. 35, pp. 3751-3762, 2010 [9] A. Khosravi , R. O. Nunes, M. A. H. Assad, L. Machado, “Comparison of artificial intelligence methods in estimation of daily global solar radiation”, Journal of Cleaner Production, Vol. 194, pp. 342-358, 2018 [10] M. Almaraachi, “Investigating the impact of feature selection on the prediction of solar radiation in different loactions in Saudi Arabia”, Applied Soft Computing, Vol. 66, pp. 250-263, 2018 [11] X. Xue, “Prediction of daily diffuse solar radiation using artificial neural networks”, International Journal of Hydrogen Energy, Vol. 42, pp. 28214-28221, 2017 [12] S. Boubaker, “Identification of nonlinear Hammerstein system using mixed integer-real coded particle swarm optimization: application to the electric daily peak-load forecasting”, Nonlinear Dynamics, Vol. 90, No. 2, pp. 797-814, 2017 [13] S. Boubaker, “Identification of monthly municipal water demand system based on Autoregressive Integrated Moving Average model tuned by Particle Swarm Optimization”, Journal of Hydroinformatics, Vol. 19, No. 2, pp. 261-281, 2017 [14] S. Shah, H. N. Nagraja, J. Chakravorty, “ANN and ANFIS for short term load forecasting”, Engineering, Technology & Applied Science Research, Vol. 8, No. 2, pp. 2818-2820, 2018 [15] F. Mavromatakis, Y. Franghiadakis, F. Vignola, “Modeling Photovoltaic Power”, Engineering, Technology & Applied Science Research, Vol. 6, No. 5, pp. 1115-1118, 2016