APPLICATION OF DIGITAL CELLULAR RADIO FOR MOBILE LOCATION ESTIMATION IIUM Engineering Journal, Vol. 23, No. 2, 2022 Tahyudin et al. https://doi.org/10.31436/iiumej.v23i2.2134 SARIMA-LSTM COMBINATION FOR COVID-19 CASE MODELING IMAM TAHYUDIN1, RIZKI WAHYUDI2*AND HIDETAKA NAMBO3 1 Department of Information System, Universitas Amikom Purwokerto, Purwokerto, Indonesia 2 Department of Informatics, Universitas Amikom Purwokerto, Purwokerto, Indonesia 3 Artificial Intelligence Laboratory, Kanazawa University, Japan * Corresponding author: rizkiw@amikompurwokerto.ac.id (Received: 4th July 2021; Accepted: 3rd December 2021; Published on-line: 4th July 2022) ABSTRACT: The study of SARIMA method in combination with LSTM is interesting to do. This combination method can be convincing and significant because the data collected is numerical and saved based on time. In addition, the proposed method can anticipate datasets, either linear or non-linear. Based on several previous studies, the SARIMA method has the advantage of completing linear datasets while the LSTM method excels in achieving non-linear datasets. Also, both methods have been shown to have an accuracy value compared to some other methods. This study tried to combine the two through several stages of the first stage of applying the SARIMA method using fit datasets (linear data) then residual Dataset (non-linear data) analysed using the LSTM method. The result of the combination methods will be checked for the accuracy value. This research will be compared by using SARIMA and LSTM methods separately. The Dataset used as a trial is COVID-19 patient data in the United States. The results showed that the combination of SARIMA-LSTM method is better than either SARIMA or LSTM alone with RMSE of 0.33905765 and MAE of 0.29077017. ABSTRAK: Gabungan kaedah kajian SARIMA dengan LSTM adalah menarik untuk dikaji. Gabungan kaedah ini meyakinkan dan penting kerana data yang dikumpulkan bersifat numerik dan disimpan berdasarkan waktu. Selain itu, kaedah yang diusulkan ini dapat menerima set data, samada berkadar langsung atau tidak langsung. Berdasarkan beberapa penelitian sebelumnya, kaedah SARIMA mempunyai faedah dalam melengkapi set data linear, sedangkan kaedah LSTM berguna dalam mencapai set data tidak-linear. Tambahan, kedua-dua kaedah ini terbukti memiliki nilai ketepatan lebih baik berbanding beberapa kaedah lain. Kajian ini cuba menggabungkan keduanya melalui beberapa tahap. Tahap pertama mengunakan kaedah SARIMA secara set data (data linear) kemudian baki set data (data tidak-linear) dianalisa menggunakan kaedah LSTM. Dapatan dari gabungan kedua-dua kaedah tersebut akan diperiksa nilai ketepatannya. Kajian ini akan dibandingkan melalui kaedah SARIMA dan LSTM secara berasingan. Set data yang digunakan adalah merupakan data pesakit COVID-19 dari Amerika Syarikat. Dapatan kajian menunjukkan gabungan kaedah SARIMA-LSTM memiliki nilai ketepatan yang lebih baik berbanding kaedah SARIMA secara berasingan, dan LSTM dengan RMSE adalah sebanyak 0.33905765 dan MAE sebanyak 0.29077017. KEYWORDS: SARIMA; LSTM; SARIMA-LSTM; COVID-19 patients 1. INTRODUCTION The term SARIMA designates a seasonal autoregressive integrated moving average. One of the time series method's topics is this model. To handle problems involving time- run results, time series are frequently used. The time-series methodology is used in a 171 IIUM Engineering Journal, Vol. 23, No. 2, 2022 Tahyudin et al. https://doi.org/10.31436/iiumej.v23i2.2134 variety of domains, including the economics and financial turnaround results at hospitals [1-3]. A discussion of the time series method was held to complete the modelling of potential bioelectric plant data. Using autoregressive (AR), moving average (MA), and ARIMA models are one example. However, some of these models have not produced the best value. The average error rate for mean square error (MSE) and the mean absolute error (MAE) continues to be strong. Although the average prediction accuracy is still about 75% [4-6]. Therefore, this study aims to increase accuracy by using another time series model, SARIMA. In this study, the SARIMA method was combined with the LSTM method. The combination of SARIMA with other methods is proven to have better accuracy results. Among them is the combination of the SARIMA method with other methods, including the SVM method, to predict the production value of the machine industry in Taiwan [7]. The results showed that SARIMA hybrid accuracy with SVM is better than with each method. Other studies used hybrid ARIMA with ANN for forecasting pollution index in cities throughout Southeast Asia and further research used the same approach to predict tourists coming at Minangkabau international airport [8]. The results showed the accuracy value of hybrid methods is better. Subsequent research compared SARIMA and ANN methods to predict power absorption in Turkey’s electricity users [9]. After 12 weeks, the results showed that the ANN method's MAPE value was 1.8% better than SARIMA because it had a MAPE of 2.6%. However, in certain conditions, such as the time after a holiday, the result is the opposite. Another study combined SARIMA with SVM, and then in analysis using clustering [10], this research was used to predict passengers at northern Iranian stations. The result is a better mix of these approaches than the individual methods. As a result, the integrated approach outperforms the respective processes. Additional research on the combination of SVR-SARIMA models was done for tourist forecasting [11], for the best model's determination using the decision support system PROMETHEE II. The result is the same combination method is better. Thus, this study combined the SARIMA method with LSTM. The SARIMA model successfully predicts a person's position for linear data set type better than the deep learning method [12], and also the SARIMA model has been tested with high accuracy of about 80% [13]. According to research, the Long-Short Term Memory (LSTM) Recurrent Neural Network on Workload Forecasting Models for Cloud Datacentres has generated empirical results. The proposed method achieves high accuracy in prediction by reducing average squared errors by up to 3.17 x 10-3 [14]. Therefore, we use both methods because both can solve problems for linear and non-linear data sets. In addition, this study will compare the SARIMA-LSTM combination with each method separately. 2. METHOD 2.1 ARIMA Model ARIMA is a term derived from its parts: autoregressive (AR), integration (I), and moving average (MA) shape. In general, the ARIMA models are classified into two types, namely non-traditional (non-seasonal) ARIMA and Seasonal ARIMA models [15-17]. The ARIMA model is as follows: ARIMA (p,d,q). p represents the sum of AR values, d is the value of integration (I), and q is the MA value. In general, the ARIMA model (p,d,q) can be seen from the model as follows: (1 - 1B.... pB p )(1 - B) d Yt = c + (1 + 1B..... + qB q ) et (1) 172 IIUM Engineering Journal, Vol. 23, No. 2, 2022 Tahyudin et al. https://doi.org/10.31436/iiumej.v23i2.2134 There are 3 main components in the model, the first being AR (p), (1-1B.... - pB p ) (2) The second is differentiation through I (d), (1 - B) d Yt (3) The third was indeed MA (q), (1 + 1B..... + qB q )et (4) c, on the other side, is a constant value. 2.2 Seasonal ARIMA Model The seasonal ARIMA, or SARIMA model, is a model or shape that repeats itself at regular intervals. For stationary datasets, seasonality can be detected from the ACF plot. If the ACF visualization shows seasonal patterns, it will be done with a different solution [18-20]. In general, the seasonal ARIMA equation is shown in eqn. 5. ARIMA(p,d,q)(P,D,Q)s (5) where (p,d,q) is the non-seasonal ARIMA model index, while (P, D, Q) is the seasonal ARIMA model, and S is the number of periods on the seasonal model. For example, if ARIMA (1,0,0), then the model follows the following eqns. 6 and 7: (1 - 1B)Yt = c (6) where is BYt = Yt-1. So Yt = c + 1Yt-1 (7) To detect seasonal datasets, there are several chart techniques including sequential plots, seasonal plot subseries, multiple box plots, and autocorrelated plots. The study will use autocorrelated schemes to detect seasonality. One of the solutions for this autocorrelation plot is to use seasonal differential operators. 2.3 LSTM Long-short-term Memory (LSTM) is a form of RNN that consists of a collection of cells with features that allow them to memorize data sequences. Data streams are captured and stored in cells. The cell then connects one module from the past to another, allowing data to be transmitted from several previous instances to the present. The data in every cell can be rejected, screened or started adding as a result of the gates in every cell in preparation for the cells that come after [19,21]. The gates focus on a neural network with sigmoidal shape layers, and the active cells either transfer data or discard it. Each sigmoid layer generates a number between 0 and 1, indicating the sum of each data segment that must be permitted in every cell. More precisely, The estimated low value assumes that "nothing should be allowed to pass"; while forecast one shows that "let it all pass". Every LSTM has three types of gates that regulate a state for every cell: 1) Forget Gate produces a value between 0 and 1, with 1 denoting success. "fully save this"; whereas 0 says "ignore this." 2) Memory Gate The sigmoid layer, following either by the tanh layer, determines which of the cell's most recent data must be kept. The first sigmoid layer, known as 173 IIUM Engineering Journal, Vol. 23, No. 2, 2022 Tahyudin et al. https://doi.org/10.31436/iiumej.v23i2.2134 the "doorway layer," selects which values to modify. The tanh layer then generates a new candidate value vector, which can be added to the state. 3) The Output Gate determines what to make from each cell. The final value will be determined by the cell state as well as newly added filtered data. If the distance is vast, the RNN will be unable to predict the next result. Consider the following text: "I go to work every day" and "I work hard at the office." The location's name is the next possible word for current knowledge, but deciding what kind of location to use is difficult. Since there was some related knowledge in the previous period, RNN cannot learn to relate information. As a result, LSTM is a solution for overcoming these flaws. LSTMs can study long-term dependencies. Remembering information for a long time is the default behavior. Some of the equations show this module as follows [22], [23]. 𝑟𝑡 = 𝑠𝑖𝑔𝑚(𝑊𝑥𝑟𝑥𝑡 + 𝑊ℎ𝑟 ℎ𝑡−1 + 𝑏𝑟 ) (8) 𝑧𝑡 = 𝑠𝑖𝑔𝑚(𝑊𝑥𝑧𝑥𝑡 + 𝑊ℎ𝑧ℎ𝑡−1 + 𝑏𝑧 ) (9) ℎ̃𝑡 = 𝑡𝑎𝑛ℎ(𝑊𝑥ℎ 𝑥𝑡 + 𝑊ℎℎ (𝑟𝑡 ⨀ℎ𝑡−1) + 𝑏ℎ ) (10) ℎ𝑡 = 𝑧𝑡 ⨀ℎ𝑡−1 + (1 − 𝑧𝑡 )⨀ℎ̃𝑡 (11) 2.4 Combination of SARIMA Method with LSTM The steps of this combination are described in Fig. 1. Fig. 1: Combination SARIMA-LSTM Model of COVID-19 patient. Based on Fig. 1, the combination of the two methods is as follows: (1) Prepare datasets; in this case, the Dataset used is the COVID-19 patient death data in a country affected by COVID-19. (2) Analysis process using the SARIMA model. The method includes the identification process. This process is checked using the Box-Jenkins process. That is the process of determining static data. The seasonal Dataset is then examined by looking at the ACF and PACF values. (3) If the Dataset is seasonal, then the next step determines the best SARIMA model. (4) SARIMA Fit model is forwarded for the forecasting process for linear components. While residual data is used for the forecasting process using the LSTM method for non-linear components. (5) The last 174 IIUM Engineering Journal, Vol. 23, No. 2, 2022 Tahyudin et al. https://doi.org/10.31436/iiumej.v23i2.2134 step is the combined result of forecasting the two methods, further measuring MSE values. 3. RESULTS AND DISCUSSION 3.1 SARIMA Model Implementation In implementing the SARIMA method, COVID-19 patients who died based on gender in the United States were used. The original data used as many as 1008 datasets but because many values were empty (missing value), the empty data was deleted. The net data was 711 datasets. The analysis process used R and Minitab software. 3.1.1 Identification As a dataset trial, male COVID-19 patients' datasets were used. A plot of male COVID-19 patients who died in the United States is shown in Fig. 2. Fig. 2: Visual Dataset of male COVID-19 patients who died in the USA. Following stationary check dataset of male COVID-19 patients. a) Stationary check of variety with Cox Box test Results obtained as follows: Fig. 3: Visual Dataset of male COVID-19 patients who died in the USA. 24 /0 8/ 20 20 17 /0 8/ 20 20 03 /0 8/ 20 20 27 /0 7/ 20 20 13 /0 7/ 20 20 29 /0 6/ 20 20 15 /0 6/ 20 20 01 /0 6/ 20 20 18 /0 5/ 20 20 04 /0 5/ 20 20 13 /0 4/ 20 20 16000 14000 12000 10000 8000 6000 4000 2000 0 Tanggal M a le Time Series Plot of Male 1.51.00.50.0-0.5-1.0 5000 4000 3000 2000 1000 Lambda S t D e v Lower C L Upper C L Limit Estimate 0.11 Lower C L 0.06 Upper C L 0.15 Rounded Value 0.11 (using 95.0% confidence) Lambda Box-Cox Plot of Male 175 IIUM Engineering Journal, Vol. 23, No. 2, 2022 Tahyudin et al. https://doi.org/10.31436/iiumej.v23i2.2134 Because the value is still 0.11 should be worth 1. So, the transformation process is carried out as follows: Fig. 4: Cox Box Transformation. Since the lambda value = 1 is stationary. It then checked stationary against the average by looking at its ACF and PACF scores. b) Stationary checks against averages by checking ACF and PACF values Based on ACF and PACF lag, 1-3 images are still inside the significance interval. Then it has been declared stationary against the average. Fig. 5: ACF. 5.02.50.0-2.5-5.0 0.55 0.50 0.45 0.40 0.35 0.30 Lambda S t D e v Lower C L Upper C L Limit Estimate 1.00 Lower C L 0.55 Upper C L 1.39 Rounded Value 1.00 (using 95.0% confidence) Lambda Box-Cox Plot of MaleTrans1 7065605550454035302520151051 1.0 0.8 0.6 0.4 0.2 0.0 -0.2 -0.4 -0.6 -0.8 -1.0 Lag A u t o c o r r e la t io n Autocorrelation Function for MaleTrans1 (with 5% significance limits for the autocorrelations) 176 IIUM Engineering Journal, Vol. 23, No. 2, 2022 Tahyudin et al. https://doi.org/10.31436/iiumej.v23i2.2134 Fig. 6: PACF. Based on Fig. 12, it appears that there is a trend. That is, there is an increase in the number of COVID-19 patients who die per 10 datasets. Therefore, the identification process is carried out further by differencing 3.1.2 Estimation In this process, an analysis is carried out based on the sex of the male. Fig. 7: DATASET of COVID-19 patients in the USA of the male gender. Once the first differencing process trend is known lag=1, The model is formed with a non-seasonal model first by inspecting the graph autocorrelation function (ACF) and partial autocorrelation function (PACF). The following ACF and PACF charts were obtained: 1009080706050403020101 1.0 0.8 0.6 0.4 0.2 0.0 -0.2 -0.4 -0.6 -0.8 -1.0 Lag P a r t ia l A u t o c o r r e la t io n Partial Autocorrelation Function for MaleTrans1 (with 5% significance limits for the partial autocorrelations) 11 /0 5/ 20 20 11 /0 5/ 20 20 04 /0 5/ 20 20 04 /0 5/ 20 20 04 /0 5/ 20 20 04 /0 5/ 20 20 27 /0 4/ 20 20 27 /0 4/ 20 20 27 /0 4/ 20 20 13 /0 4/ 20 20 13 /0 4/ 20 20 14000 12000 10000 8000 6000 4000 2000 0 Tanggal D a t a Male Female Variable Time Series Plot of Male, Female 177 IIUM Engineering Journal, Vol. 23, No. 2, 2022 Tahyudin et al. https://doi.org/10.31436/iiumej.v23i2.2134 Fig. 8: ACF graph of male COVID-19 patients in America non-seasonal ARIMA models Fig. 9: PACF graph of male COVID-19 patients in America non-seasonal ARIMA model. The ACF chart shows the dying down as much as five lags. Meanwhile, the PACF chart shows the cut-off pattern. Thus, the non-seasonal ARIMA model formed is ARIMA (5,1,0). They are furthermore checking the non-seasonal ARIMA model. The step to determine the seasonal ARIMA model is the same as that performed to find the best model in non-seasonal ARIMA by determining its ACF and PACF charts. The seasonal value determines the difference. In this case, it is 10 because for every 10 datasets, there is a significant increase in COVID-19 patient deaths. 3.1.3 Model Evaluation At this stage, checking the error value and other values. Based on the output of the auto_arima, the evaluation value as follows. SARIMA (2,1,2)(0,0,2)12 Next, check the error value and accuracy. Here are the evaluation values obtained: 1009080706050403020101 1.0 0.8 0.6 0.4 0.2 0.0 -0.2 -0.4 -0.6 -0.8 -1.0 Lag A u t o c o r r e la t io n ACF Male 7065605550454035302520151051 1.0 0.8 0.6 0.4 0.2 0.0 -0.2 -0.4 -0.6 -0.8 -1.0 Lag P a r t ia l A u t o c o r r e la t io n Partial Autocorrelation Function for Male_lag1 (with 5% significance limits for the partial autocorrelations) 178 IIUM Engineering Journal, Vol. 23, No. 2, 2022 Tahyudin et al. https://doi.org/10.31436/iiumej.v23i2.2134 AIC = 340.08 BIC = 374.8 RMSE = 0.44236361 MAE = 0.33180744 These values are the best value when compared to other models. 3.1.4 Forecasting Based on Fig. 10, the number of COVID-19 patients who died of male gender in the USA, in general, decreased for the following forecasting result. The average difference is about 500 people. Fig. 10: Male COVID-19 patient forecasting graph using SARIMA. 3.2 Implementation of LSTM and Combination of SARIMA-LSTM method This analysis used the parameters of batch size of 100, the look_back of seven data to previous, and the epoch for learning of 100. Furthermore, for the forecasting process we split the data set with the composition of 80% training data and 20% testing data from the same data: male COVID-19 patients in the USA. Based on the analysis using LSTM obtained an RMSE value of 0.35847506 and MAE of 0.29837463. Based on RMSE and MAE results, the values are smaller than the SARIMA result, so it can be said that LSTM is better than SARIMA. From Fig 11, it appears that the number of COVID-19 patients who died on average tends to decrease. Fig. 11: Graph forecasting male COVID-19 patients using LSTM method. 179 IIUM Engineering Journal, Vol. 23, No. 2, 2022 Tahyudin et al. https://doi.org/10.31436/iiumej.v23i2.2134 For the next test, the SARIMA method combined with LSTM was done to determine how much accuracy through RMSE and MAE value obtained and prediction result. Based on the calculation resulted that an RMSE value was 0.33905765 and MAE was 0.29077017. Those results presented that the RMSE and MAE of combination SARIMA-LSTM was better than the SARIMA and LSTM methods (Table 1). The results of combination SARIMA and LSTM are seen in Fig 12. This figure performed that the result is the best result because the predicted data is almost similar with real data. The number of COVID- 19 suspected deaths decreased to near-zero. In addition, the table of comparison of RMSE and MAE are presented in Table 1. This result presented that the combination of SARIMA and LSTM is the best method for predicting the death of COVID-19 patients in the USA. Fig. 12: Male COVID-19 patient forecasting using SARIMA-LSTM combination method Table 1: The comparison of RMSE and MAE from SARIMA, LSTM, and combination SARIMA-LSTM Parameters SARIMA LSTM SARIMA-LSTM RMSE 0.44236361 0.35847506 0.33905765 MAE 0.33180744 0.29837463 0.29077017 4. CONCLUSION Based on the results of the study, the combination SARIMA-LSTM method is the best one. It performed better than the SARIMA or LSTM methods separately. Based on the results of general predictions using all three methods, there was a decrease in the number of male COVID-19 patients who died in the USA on average. This research has the limitation of not explaining the mortality number of patients in every state within the USA. For future work, the analysis will be carried out using a combination of SARIMA – PARCD methods. REFERENCES [1] Davis RA. (2014) Introduction to statistical analysis of time series. Department of Statistics Columbia University, pp. 1-24. 180 IIUM Engineering Journal, Vol. 23, No. 2, 2022 Tahyudin et al. https://doi.org/10.31436/iiumej.v23i2.2134 [2] Borkowf CB. (2002) Time-Series Forecasting. Technometrics, 44(2): 194-195. https://doi.org/10.1198/tech.2002.s718. [3] Schlüter T. (2012) Knowledge discovery from time series (Doctoral dissertation, Universitäts-und Landesbibliothek der Heinrich-Heine-Universität Düsseldorf). [4] Chen KY, Wang CH. (2007) A hybrid SARIMA and support vector machines in forecasting the production values of the machinery industry in Taiwan. Expert Systems with Applications, 32(1): 254-264. https://doi.org/10.1016/j.eswa.2005.11.027 [5] Chi YN. (2021) Time Series Forecasting of Global Price of Soybeans using a Hybrid SARIMA and NARNN Model: Time Series Forecasting of Global Price of Soybeans. Data Science: Journal of Computing and Applied Informatics, 5(2): 85-101. https://doi.org/10.4108/eai.2-8-2019.2290473 [6] Ozozen A, Kayakutlu G, Ketterer M, Kayalica O. (2016) A combined seasonal ARIMA and ANN model for improved results in electricity spot price forecasting: Case study in Turkey. In 2016 Portland International Conference on Management of Engineering and Technology (PICMET) (pp. 2681-2690). IEEE. https://doi.org/10.1109/PICMET.2016.7806831. [7] Parviz L. (2020) Comparative evaluation of hybrid SARIMA and machine learning techniques based on time varying and decomposition of precipitation time series. Journal of Agricultural Science and Technology, 22(2): 563-578. Retrieved from: http://jast.modares.ac.ir/article-23-26018-en.html [8] Abellana DPM, Rivero DMC, Aparente ME, Rivero, A. (2020) Hybrid SVR-SARIMA model for tourism forecasting using PROMETHEE II as a selection methodology: a Philippine scenario. Journal of Tourism Futures. https://doi.org/10.1108/JTF-07-2019-0070 [9] Tahyudin I, Nambo H. (2018) Comparison Study of Deep Learning and Time Series for Bioelectric Potential Analysis. In 2018 3rd International Conference on Information Technology, Information System and Electrical Engineering (ICITISEE) (pp. 79-83). IEEE. https://doi.org/10.1109/ICITISEE.2018.8720998 [10] Tahyudin I, Nambo H. (2018) SARIMA Model of Bioelectic Potential Dataset. In International Conference on Big Data, Cloud and Applications (pp. 367-378). Springer, Cham. https://doi.org/10.1007/978-3-319-96292-4_29 [11] Kumar J, Goomer R, Singh AK. (2018) Long short term memory recurrent neural network (LSTM-RNN) based workload forecasting model for cloud datacenters. Procedia Computer Science, 125: 676-682. https://doi.org/10.1016/j.procs.2017.12.087 [12] Benvenuto D, Giovanetti M, Vassallo L, Angeletti S, Ciccozzi M. (2020) Application of the ARIMA model on the COVID-2019 epidemic dataset. Data in Brief, 29: 105340. https://doi.org/10.1016/j.dib.2020.105340 [13] Ceylan Z. (2020) Estimation of COVID-19 prevalence in Italy, Spain, and France. Science of The Total Environment, 729:138817. [14] Zeroual A, Harrou F, Dairi A, Sun Y. (2020) Deep learning methods for forecasting COVID-19 time-Series data: A Comparative study. Chaos, Solitons & Fractals, 140: 110121 [15] NIST/SEMATECH: Seasonality (2012). http://www.itl.nist.gov/div898/ handbook/pmc/section4/pmc443.htm. Accessed 23 September 2020 [16] Qi C, Zhang D, Zhu Y, Liu L, Li C, Wang Z, Li X. (2020) SARFIMA model prediction for infectious diseases: application to hemorrhagic fever with renal syndrome and comparing with SARIMA. BMC medical research methodology, 20(1): 1-7. https://doi.org/10.1186/s12874-020-01130-8 [17] Hamilton JD. (2020) Time series analysis. Princeton university press. [18] Sherstinsky A. (2020) Fundamentals of recurrent neural network (RNN) and long short-term memory (LSTM) network. Physica D: Nonlinear Phenomena, 404: 132306. https://doi.org/10.1016/j.physd.2019.132306 [19] Reddy BK, Delen D. (2018) Predicting hospital readmission for lupus patients: An RNN- LSTM-based deep-learning methodology. Computers in biology and medicine, 101: 199- 209. https://doi.org/10.1016/j.compbiomed.2018.08.029 [20] Qi J, Liu X, Tejedor J. (2020) Variational inference-based Dropout in recurrent neural networks for slot filling in spoken language understanding. arXiv Preprint arXiv:2009.01003 181 IIUM Engineering Journal, Vol. 23, No. 2, 2022 Tahyudin et al. https://doi.org/10.31436/iiumej.v23i2.2134 [21] Li C, Zhao L, Cai B. (2020) Size prediction of railway switch gap based on RegARIMA model and LSTM network. IEEE Access, 8, 198188-198200. https://doi.org/10.1109/ACCESS.2020.3034687 [22] Z. Liu et al., “Entity recognition from clinical texts via recurrent neural network,” BMC Med. Inform. Decis. Mak., vol. 17, no. Suppl 2, 2017, doi: 10.1186/s12911- 017-0468-7 [23] M. A. Jishan, K. R. Mahmud, A. K. Al Azad, M. S. Alam, and A. M. Khan, “Hybrid deep neural network for bangla automated image descriptor,” Int. J. Adv. Intell. Informatics, vol. 6, no. 2, pp. 109–122, 2020. https://doi.org/10.26555/ijain.v6i2.499 182