CHEMICAL ENGINEERING TRANSACTIONS, VOL. 46, 2015
A publication of The Italian Association of Chemical Engineering. Online at www.aidic.it/cet
Guest Editors: Peiyu Ren, Yancang Li, Huiping Song
Copyright © 2015, AIDIC Servizi S.r.l., ISBN 978-88-95608-37-2; ISSN 2283-9216

The Research on the Prediction of the Network Traffic Based on the Improved PSOSVM Algorithm

Xu Yan
Xi’an Physical Education University, Shaanxi, China, 710068.
xuyanmini13@163.com

Predicting network traffic helps us understand the dynamic state of network operation and forecast the actual traffic. It also supports adjusting and managing service quality assurance to meet business needs, and it helps maintain network performance. PSO and SVM are two commonly used methods. In this paper, we combine PSO with SVM, using PSO to optimize the SVM parameters, and propose an improved PSOSVM method to predict network traffic. Compared with other prediction methods, this method achieves higher accuracy and is an effective network traffic prediction method.

1. Introduction

Prediction is an important part of our lives (Mingyue Zhao et al., 2015). With the continuous expansion of the Internet and the continuous emergence of diversified network services, network traffic data show increasingly complex behavior and characteristics. How to manage the network effectively so that it provides better service has become a problem of growing concern. Many scholars have studied network traffic prediction. In 2012, Yuehui Chen, Bin Yang and Qingfang Meng used the flexible neural tree (FNT) model to predict network traffic. The authors optimized the parameters with the Particle Swarm Optimization (PSO) algorithm to predict small-time scale traffic measurement data. Their experiment showed that the method was very effective for forecasting small-time scale network traffic.
In 2013, Dong-Chul Park (2013) successfully applied the BiLinear Recurrent Neural Network (BLRNN) to network traffic prediction. Through a structure simplification procedure and the application of a genetic algorithm (GA), the BLRNN reduced the number of weights by about half while preserving its generalization ability. The experiment showed that the method halved the training time and also improved the prediction accuracy substantially. Ji Yimu et al. (2015) then put forward a combined method, based on local mean decomposition (LMD) and the generalized autoregressive conditional heteroscedasticity (GARCH) model, to forecast the traffic rate of Flash P2P. In addition, many other researchers have studied network traffic prediction, such as Jun Jiang and Symeon Papavassiliou (2006), Chang and Tsai (2009), and Yingxu Lai et al. (2015).

In this paper, we combine PSO with SVM and propose an improved PSOSVM algorithm, which we then apply to network traffic prediction. The paper is structured as follows. The first part is the introduction. The second part introduces the basic knowledge of the SVM. The third part combines PSO with the SVM and proposes the improved PSOSVM algorithm. The fourth part is the experiment and the last part is the conclusion.

2. SVM

The research group led by Professor V. Vapnik at Bell Laboratories proposed statistical learning theory, which is aimed at the small-sample problem. The development of statistical learning theory produced the support vector machine (SVM). We assume that $(X_i, y_i)$, $i = 1, 2, \ldots, n$, $X_i \in R^d$, is a linearly separable sample set. The general form of the linear discriminant function in $d$ dimensions is $g(X) = W \cdot X + b$, and the classification plane is $W \cdot X + b = 0$.
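The linear discriminant function above can be illustrated with a minimal Python sketch. The function names, the example weights and the label convention (+1 on or above the plane, -1 below it) are assumptions made for illustration; the text itself does not fix the label values.

```python
from typing import Sequence


def discriminant(w: Sequence[float], x: Sequence[float], b: float) -> float:
    """Evaluate g(X) = W . X + b for one d-dimensional sample."""
    if len(w) != len(x):
        raise ValueError("W and X must have the same dimension d")
    return sum(wi * xi for wi, xi in zip(w, x)) + b


def classify(w: Sequence[float], x: Sequence[float], b: float) -> int:
    """Return +1 or -1 according to the side of the plane W . X + b = 0.

    Assumption: points exactly on the plane are assigned to class +1.
    """
    return 1 if discriminant(w, x, b) >= 0 else -1


# Hypothetical 2-D example: the classification plane x1 + x2 - 1 = 0.
W = [1.0, 1.0]
B = -1.0
print(classify(W, [2.0, 2.0], B))  # sample on the positive side
print(classify(W, [0.0, 0.0], B))  # sample on the negative side
```

In a trained SVM, W and b would come from solving the optimization problem rather than being chosen by hand as they are here.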
Please cite this article as: Yan X., 2015, The research on the prediction of the network traffic based on the improved psosvm algorithm, Chemical Engineering Transactions, 46, 259-264, DOI: 10.3303/CET1546044

For a non-support vector, the Lagrange multiplier satisfies $\alpha_i = 0$. Therefore, the optimal classification plane function only involves the support vectors, and $b^*$ can be solved from the constraint condition of any support vector. For the linearly non-separable problem, we usually introduce a kernel function and use it to replace the inner product operation in the above function. That is,

\[
\begin{aligned}
\min_{\alpha}\ & Q(\alpha) = \frac{1}{2} \sum_{i,j=1}^{n} \alpha_i \alpha_j y_i y_j K(X_i, X_j) - \sum_{i=1}^{n} \alpha_i \\
\text{s.t.}\ & \alpha_i \ge 0, \quad i = 1, 2, \ldots, n, \\
& \sum_{i=1}^{n} y_i \alpha_i = 0
\end{aligned}
\tag{1}
\]

Accordingly, the classification function becomes

\[
f(X) = \operatorname{sgn}\left\{ \sum_{i=1}^{n} \alpha_i^* y_i K(X_i, X) + b^* \right\}
\tag{2}
\]

We can select any support vector $X_j$ and find $b^*$ from the following equation.

\[
y_j \left[ \sum_{i=1}^{n} \alpha_i^* y_i K(X_i, X_j) + b^* \right] = 1
\tag{3}
\]

The training set is $(x_i, y_i)$, $i = 1, 2, \ldots, N$, where $x_i$ is the input vector and $y_i$ is the corresponding output. Each sample is mapped to a high-dimensional feature space by the nonlinear function $\phi$. The regression function can be expressed as follows.

\[
f(x) = \omega \cdot \phi(x) + b
\tag{4}
\]

To determine the parameters $\omega$ and $b$, we adopt the structural risk minimization (SRM) principle and transform the original problem into the following problem, where $C$ is the penalty factor, $\varepsilon$ is the width of the insensitive band, and $\xi_i^+$, $\xi_i^-$ are slack variables.

\[
\begin{aligned}
\min_{\omega, \xi}\ & \frac{1}{2} \|\omega\|^2 + C \sum_{i=1}^{L} \left( \xi_i^+ + \xi_i^- \right) \\
\text{s.t.}\ & y_i - \omega \cdot \phi(x_i) - b \le \varepsilon + \xi_i^+, \\
& \omega \cdot \phi(x_i) + b - y_i \le \varepsilon + \xi_i^-, \\
& \xi_i^+,\ \xi_i^- \ge 0
\end{aligned}
\tag{5}
\]

To solve this convex optimization problem, the core idea is to use the Lagrange multiplier method to transform it into its dual form, with Lagrange multipliers $\alpha_i$ and $\alpha_i^*$.

\[
\begin{aligned}
\min_{\alpha, \alpha^*}\ & \frac{1}{2} \sum_{i,j=1}^{L} (\alpha_i - \alpha_i^*)(\alpha_j - \alpha_j^*) K(x_i, x_j) + \varepsilon \sum_{i=1}^{L} (\alpha_i + \alpha_i^*) - \sum_{i=1}^{L} y_i (\alpha_i - \alpha_i^*) \\
\text{s.t.}\ & \sum_{i=1}^{L} (\alpha_i - \alpha_i^*) = 0, \\
& 0 \le \alpha_i,\ \alpha_i^* \le C
\end{aligned}
\tag{6}
\]

In general, the kernel function is selected from the following.
(1) Radial Basis Function

\[
K(x_i, x_j) = \exp\left( -\frac{\|x_i - x_j\|^2}{2\sigma^2} \right)
\tag{7}
\]

(2) Exponential Radial Basis Function

\[
K(x_i, x_j) = \exp\left( -\frac{\|x_i - x_j\|}{2\sigma^2} \right)
\tag{8}
\]

3. The improved PSOSVM algorithm

The PSO algorithm is a kind of evolutionary algorithm (Tao Lin et al., 2015). Its advantages are that it is simple to implement and has few parameters to adjust. In the PSO algorithm, the location vector of particle $i$ is $u_i = (u_{i1}, u_{i2}, \ldots, u_{im})$ and its speed is $v_i = (v_{i1}, v_{i2}, \ldots, v_{im})$. The best location found so far by particle $i$ is $p_i = (p_{i1}, p_{i2}, \ldots, p_{im})$, and the best location found by the whole swarm is $p_g = (p_{g1}, p_{g2}, \ldots, p_{gm})$. The speed and location update formulas for the particle are as follows.

\[
v_{im}^{k+1} = w v_{im}^{k} + c_1 r_1 \left( p_{im} - u_{im}^{k} \right) + c_2 r_2 \left( p_{gm} - u_{im}^{k} \right)
\tag{9}
\]

\[
u_{im}^{k+1} = u_{im}^{k} + v_{im}^{k+1}
\tag{10}
\]

where $r_1$ and $r_2$ are random numbers in $[0, 1]$ and $w$ is the inertia weight. The inertia weight is a key parameter of the particle swarm algorithm: it balances the global search ability and the local search ability. In the initial stage of the evolution, we want the particles to have better exploration ability. As the iterations increase, in the later stage of the evolution, we want the particles to have better exploitation ability. The inertia weight therefore needs to be adjusted dynamically during the evolution. The inertia weight adopted in this paper is as follows.

\[
w = w_{\max} - (w_{\max} - w_{\min}) \frac{iter}{iter_{\max}}
\tag{11}
\]

where $iter$ is the current iteration number, $iter_{\max}$ is the maximum iteration number, and $w_{\max}$ and $w_{\min}$ are the maximum and minimum values of the inertia weight. In the particle swarm optimization algorithm, $c_1$ and $c_2$ are the self-learning factor and the social learning factor. In general, both are set to 2. Few studies have examined the influence of $c_1$ and $c_2$ on the particle swarm optimization algorithm. In the early stage of the algorithm, we require the particles to have larger self-learning ability and smaller social learning ability, so that they fly through the whole search space.
However, in the later stage of the algorithm, we require the particles to have smaller self-learning ability and larger social learning ability, so that they fly toward the global optimal solution. To improve the convergence performance of the algorithm, this paper dynamically adjusts the self-learning factor $c_1$ and the social learning factor $c_2$.

\[
c_1 = (c_{1e} - c_{1s}) \frac{iter}{iter_{\max}} + c_{1s}
\tag{12}
\]

\[
c_2 = (c_{2e} - c_{2s}) \frac{iter}{iter_{\max}} + c_{2s}
\tag{13}
\]

where $c_{1s}$ and $c_{1e}$ are the initial and final values of $c_1$, and $c_{2s}$ and $c_{2e}$ are the initial and final values of $c_2$.

Let $t_i$ denote the observed value and $y_i$ the forecast value, $i = 1, 2, \ldots, n$, and let $\bar{t} = \frac{1}{n} \sum_{i=1}^{n} t_i$ be the mean of the observed values. We then obtain the following quantities.

\[
SS_{tot} = \sum_{i=1}^{n} (t_i - \bar{t})^2
\tag{14}
\]

\[
SS_{reg} = \sum_{i=1}^{n} (y_i - \bar{t})^2
\tag{15}
\]

\[
SS_{err} = \sum_{i=1}^{n} (t_i - y_i)^2
\tag{16}
\]

$SS_{tot}$ is the total sum of squares, proportional to the sample variance. $SS_{reg}$ is the regression sum of squares, namely the explained sum of squares. $SS_{err}$ is the residual sum of squares. The coefficient of determination is

\[
R^2 = 1 - \frac{SS_{err}}{SS_{tot}}
\tag{17}
\]

The flow chart of the improved PSOSVM algorithm is shown in Figure 1.

Figure 1: The flow chart of the PSOSVM. Box labels: Construct the SVM model; Initialize $C$, $\varepsilon$, $\sigma$; SVM training process; Determine the coefficient; Termination criteria (Yes/No); Initialize the position and the speed of the particle; Calculate the fitness of particles; Update the position and the speed of the particle; The optimized parameters $C$, $\varepsilon$, $\sigma$.

4. Experiment

In the experimental part, we apply the improved PSOSVM algorithm to forecast network traffic. We select the network traffic of one college from July 1, 2014 to July 20, 2014 as the experimental data. The data of the first 19 days serve as the training set, and the data of the last day serve as the prediction set. We use the improved PSOSVM method to predict, and the results are shown in Figure 2.
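As an aside on the procedure of Section 3, the linear schedules of Eqs. (11)-(13), the update rules of Eqs. (9)-(10) and the coefficient of Eq. (17) can be sketched in Python as follows. This is a minimal sketch under stated assumptions, not the author's implementation: the function names, the default values (w from 0.9 to 0.4, c1 from 2.5 to 0.5, c2 from 0.5 to 2.5), the clamping of positions to the bounds, and the toy fitness that stands in for SVM training are all illustrative choices that do not come from the paper. In the PSOSVM setting each particle would presumably encode (C, ε, σ) and the fitness would be computed from a trained SVM.

```python
import random
from typing import Callable, List, Sequence, Tuple


def inertia_weight(it: int, it_max: int, w_max: float = 0.9, w_min: float = 0.4) -> float:
    """Eq. (11): w falls linearly from w_max to w_min (defaults are assumptions)."""
    return w_max - (w_max - w_min) * it / it_max


def learning_factor(it: int, it_max: int, c_start: float, c_end: float) -> float:
    """Eqs. (12)-(13): c moves linearly from its initial to its final value."""
    return (c_end - c_start) * it / it_max + c_start


def r_squared(t: Sequence[float], y: Sequence[float]) -> float:
    """Eq. (17): R^2 = 1 - SS_err / SS_tot for observed t and forecast y."""
    t_mean = sum(t) / len(t)
    ss_tot = sum((ti - t_mean) ** 2 for ti in t)
    ss_err = sum((ti - yi) ** 2 for ti, yi in zip(t, y))
    return 1.0 - ss_err / ss_tot


def pso_maximize(
    fitness: Callable[[List[float]], float],
    bounds: Sequence[Tuple[float, float]],
    n_particles: int = 20,
    it_max: int = 50,
    seed: int = 0,
) -> Tuple[List[float], float]:
    """Maximize `fitness` with PSO using time-varying w, c1 and c2."""
    rng = random.Random(seed)
    dim = len(bounds)
    pos = [[rng.uniform(lo, hi) for lo, hi in bounds] for _ in range(n_particles)]
    vel = [[0.0] * dim for _ in range(n_particles)]
    pbest = [p[:] for p in pos]
    pbest_fit = [fitness(p) for p in pos]
    g = max(range(n_particles), key=lambda i: pbest_fit[i])
    gbest, gbest_fit = pbest[g][:], pbest_fit[g]

    for it in range(1, it_max + 1):
        w = inertia_weight(it, it_max)
        c1 = learning_factor(it, it_max, 2.5, 0.5)  # self-learning shrinks
        c2 = learning_factor(it, it_max, 0.5, 2.5)  # social learning grows
        for i in range(n_particles):
            for m, (lo, hi) in enumerate(bounds):
                r1, r2 = rng.random(), rng.random()
                # Eq. (9): speed update.
                vel[i][m] = (w * vel[i][m]
                             + c1 * r1 * (pbest[i][m] - pos[i][m])
                             + c2 * r2 * (gbest[m] - pos[i][m]))
                # Eq. (10): location update, clamped to the bounds (assumption).
                pos[i][m] = min(max(pos[i][m] + vel[i][m], lo), hi)
            fit = fitness(pos[i])
            if fit > pbest_fit[i]:
                pbest[i], pbest_fit[i] = pos[i][:], fit
                if fit > gbest_fit:
                    gbest, gbest_fit = pos[i][:], fit
    return gbest, gbest_fit


# Toy stand-in for SVM training (assumption): tune (a, b) of y = a * x + b
# so that R^2 against synthetic observations is as large as possible.
XS = [float(x) for x in range(10)]
TS = [2.0 * x + 1.0 for x in XS]


def toy_fitness(params: List[float]) -> float:
    a, b = params
    return r_squared(TS, [a * x + b for x in XS])


best, best_fit = pso_maximize(toy_fitness, [(-5.0, 5.0), (-5.0, 5.0)])
print(best, best_fit)
```

Replacing `toy_fitness` with a function that trains an SVM for a given (C, ε, σ) and returns R² on held-out data would give the loop outlined in Figure 1; that part is left out here because it depends on the SVM library in use.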
Figure 2: The predicted result

From Figure 2, we can see that the predicted results of the improved PSOSVM model are very similar to the actual results, which shows that the method is effective. To verify the prediction accuracy of the improved PSOSVM algorithm, we compare its predicted results with those of other algorithms. The comparison results are as follows.

Table 1: The comparison of PSOSVM, SVM and linear regression

                          PSOSVM    SVM       Linear regression
Average relative error    7.28%     21.05%    28.32%
Prediction accuracy       92.72%    78.95%    71.38%

From the above table, we can see that the predicted results of the PSOSVM model are very good. Compared with the other prediction algorithms, the accuracy of the PSOSVM network traffic model is higher. Therefore, the model can be used to predict network traffic, and it achieves better prediction accuracy.

5. Conclusions

Predicting network traffic, controlling network congestion and securing services are of great significance. This paper combines the SVM with PSO and proposes an improved method, which is then used to predict network traffic. The main work of this paper is as follows. Firstly, this paper introduces the research status of network traffic prediction. Secondly, this paper introduces the basic knowledge of the SVM. Thirdly, this paper proposes the improved PSOSVM method. The experiment shows that the method achieves higher accuracy and is an effective network traffic prediction model.

References

Aytar Y., Zisserman A., 2015, Part level transfer regularization for enhancing exemplar SVMs [J], Computer Vision and Image Understanding, 138, 114-123.
Bai Y.C., Tang M., 2014, Robust visual tracking via augmented kernel SVM [J], Image and Vision Computing, 32(8), 465-475.
Chang B.R., Tsai H.F., 2009, Novel hybrid approach to data-packet-flow prediction for improving network traffic analysis [J], Applied Soft Computing, 9(3), 1177-1183.
doi: 10.1016/j.asoc.2009.03.003
Chen Y.H., Yang B., Meng Q.F., 2012, Small-time scale network traffic prediction based on flexible neural tree [J], Applied Soft Computing, 12(1), 274-279, doi: 10.1016/j.asoc.2011.08.045.
Ji Y.M., Yuan Y.G., Zhao C.X., Jiang C.C., Wang R.C., 2015, Research of a Novel Flash P2P Network Traffic Prediction Algorithm [J], Information Technology and Quantitative Management, 55, 1293-1301, doi: 10.1016/j.procs.2015.07.140.
Jiang J., Papavassiliou S., 2006, Enhancing network traffic prediction and anomaly detection via statistical network traffic separation and combination strategies [J], Computer Communications, 29(10), 1627-1638, doi: 10.1016/j.comcom.2005.07.030.
Khadanga R.K., Satapathy J.K., 2015, Time delay approach for PSS and SSSC based coordinated controller design using hybrid PSO–GSA algorithm [J], International Journal of Electrical Power & Energy Systems, 71, 262-273.
Lai Y.X., Chen Y.N., Liu Z.H., Yang Z., Li X.L., 2015, On monitoring and predicting mobile network traffic abnormality [J], Simulation Modelling Practice and Theory, 50, 176-188.
Lin T., Wu P., Gao F.M., Yu Y., 2015, Study on SVM temperature compensation of liquid ammonia volumetric flowmeter based on variable weight PSO [J], International Journal of Heat and Technology, 33(2), 151-156.
Nayak R.K., Mishra D., Rath A.K., 2015, A Naïve SVM-KNN based stock market trend reversal analysis for Indian benchmark indices [J], Applied Soft Computing, 35, 670-680.
Park D.C., 2013, Structure optimization of BiLinear Recurrent Neural Networks and its application to Ethernet network traffic prediction [J], Information Sciences, 237(10), 18-28, doi: 10.1016/j.ins.2009.10.005.
Santos L.C., Tavares F.W., Ahón V.R.R., Kontogeorgis G.M., 2015, Modeling MEA with the CPA equation of state: A parameter estimation study adding local search to PSO algorithm [J], Fluid Phase Equilibria, 400(25), 76-86.
Yu D.J., Hu J., Tang Z.M., Shen H.B., Yang J., Yang J.Y., 2013, Improving protein-ATP binding residues prediction by boosting SVMs with random under-sampling [J], Neurocomputing, 104(15), 180-190.
Zhang Z.L., Yang J.G., Wang Y.L., Dou D.Y., Xia W.C., 2014, Ash content prediction of coarse coal by image analysis and GA-SVM [J], Powder Technology, 268, 429-435.
Zhao M.Y., Zhao D.X., Jiang Z.X., Cui D.M., Li J., Shi X.Y., 2015, The gray prediction GM (1,1) modeling traffic forecast application [J], Mathematical modeling and engineering problems, 2(1), 17-20.