Microsoft Word - 1-626-ed.doc Engineering, Technology & Applied Science Research Vol. 6, No. 2, 2016, 923-926 923 www.etasr.com Radhika and Arumugam: Variable Step Size Maximum Correntropy Criteria Based… Variable Step Size Maximum Correntropy Criteria Based Adaptive Filtering Algorithm S. Radhika Research Scholar, Faculty of Electrical and Electronics Engineering Sathyabama University, Chennai, India radhikachandru79@gmail.com A. Sivabalan Manager, Research NEC Mobile Networks Excellence Centre Chennai, India sivabalan.a@gmail.com Abstract—Maximum correntropy criterion (MCC) based adaptive filters are found to be robust against impulsive interference. This paper proposes a novel MCC based adaptive filter with variable step size in order to obtain improved performance in terms of both convergence rate and steady state error with robustness against impulsive interference. The optimal variable step size is obtained by minimizing the Mean Square Deviation (MSD) error from one iteration to the other. Simulation results in the context of a highly impulsive system identification scenario show that the proposed algorithm has faster convergence and lesser steady state error than the conventional MCC based adaptive filters. Keywords-Maximum correntropy criteria; adaptive filter; variable step size; mean square deviation. I. INTRODUCTION The minimum mean square error (MSE) criterion is a widely used criterion for adaptive filters due to its simplicity and easy implementation [1]. Recently, information theory criterion based adaptive filters have received much attention due to their robustness against impulsive noises, in which case MSE criterion performs poorly. Correntropy is one such information theory criterion which is a non linear measure of similarity between two random variables [2, 3]. The correntropy is defined as: ( , ) [ ( , )] ( , ) ( , ) (1) XY V X Y E X Y x y dF x yk k= = ò Where κ is a shift–invariant Mercer kernel and ( , )XYF x y denotes the joint distribution function of (X, Y). If τηε kernel is Gaussian, then it is defined as: 2 2 1 ( , ) ( ) (2) 22 e e x y G e expk ss p - = = æ ö÷ç ÷ç ÷ç ÷è ø where e x y  and 0 s > is the kernel width. The cost function under MCC criterion is defined as 2 2 exp (3) 2 MCC e J E s - = é ùæ ö÷çê ú÷ç ÷ç ÷ê úè øë û The adaptive filter using MCCJ as the cost function is defined [4] as: 2 2 ( ) ( 1) ( ) exp ( ) ( ) (4) 2 e n W n W n e n X nm s - + = + æ ö÷ç ÷ç ÷ç ÷è ø The steady state MSE of MCC algorithm is derived in [5] using energy conservation argument. From the analysis it is evident that MCC based algorithm is an excellent candidate for impulsive interference and its performance is dependent on the step size and kernel width. Also from [5] it is found that a large value of step size increases the convergence at the cost of more steady state error and vise versa. This conflicting tradeoff between convergence speed and steady state error is resolved in [6] using a combinational approach. But the major disadvantage with this approach is that the complexity is higher as two adaptive filters are used in parallel. Adaptive kernel width is proposed in [7] where lesser steady state error is obtained but convergence speed is not improved especially when the kernel width is larger. The variable step size approach has long been used for better performance as it gives a more appreciable solution than the fixed step size approach [8-10]. Hence, in this paper a variable step size for MCC based adaptive filter is proposed so as to have better performance in terms of both faster convergence and lesser steady state error. The criterion for variation is based on the minimization of MSD as it indicates how close the filter is to optimal performance. Thus the step size is increased or decreased based on the square of the instantaneous error obtained. Finally the performance of the proposed algorithm is verified through simulation results for the identification of highly impulsive unknown system. II. PROPOSED ALGORITHM Consider an unknown system with input signal vector given by ( ) [ ( ), ( 1) , . ( 1)] T X n x n x n x n N= - ¼ - + where ( )x n is the input signal. Let ,1 ,2 ,3 , [ , , . ]T o o o o o N W w w w w= ¼ be the optimal weights of length N. The desired response ( )d n is given by 0 ( ) ( ) ( ) T d n W X n v n  where ( )v n is a noise source with variance 2 vs which is made of both background noise ( ) b v n and impulsive noise ( ) i v n . The error signal e(n) is given as: ( ) ( ) ( ) ( ) T e n d n W n X n  Engineering, Technology & Applied Science Research Vol. 6, No. 2, 2016, 923-926 924 www.etasr.com Radhika and Arumugam: Variable Step Size Maximum Correntropy Criteria Based… where 1 2 3 ( ) [ ( ), ( ), ( ) . ( )] T N W n w n w n w n w n= ¼ is the estimate of 0 W at the nth iteration . If the weight error vector be defined as  0 ( ) ( )W n W W n  then (4) can be written in terms of weight error vector as:   2 2 ( ) (n 1) (n) exp ( ) ( ) (5) 2 e n W W e n X nm s - + = - æ ö÷ç ÷ç ÷ç ÷è ø Let 2 2 ( ) ( ( )) 2 e n f e n exp s - = æ ö÷ç ÷ç ÷ç ÷è ø .Thus (5) becomes:  (n 1) (n) f ( ( )) ( ) ( ) (6)W W e n e n X n   Squaring on both sides of (6) we get:    2 2 2 [ (n 1) ] E[ (n) ] 2 Re[ ( ( )f ( ( )) ( ) ( ))] (7) ( ( ) ( )f ( ( ))f ( ( )) ( ) ( )) T T T T E W W E W n e n e n X n E X n e n e n e n e n X n m m + = = - + We can write (7) further as:  2 2[ (n 1) ] E[ (n) ] ( ) (8)E W W m+ = -D where  2 ( ) 2 Re[ ( ( )f ( ( )) ( ) ( ))] ( ( ) ( )f ( ( ))f ( ( )) ( ) ( )) T T T T E W n e n e n X n E X n e n e n e n e n X n m m m D = - Thus in order to reduce the MSD from one iteration to the other, ( ) should be maximized. Thus differentiating ( ) respect to  and equating it to zero gives optimal value of step size as ( ) ( ) Re ( )f ( ( )) ( ) ( ) ( ) (9) ( ) ( ) ( ( )) ( ( )) ( ) ( ) T o T T T E W n e n e n X n n E X n e n f e n f e n e n X n m = In order to further simplify (9) we use of the following assumptions: 1. The input ( )X n is independent and identically distributed (i.i.d) with zero mean and ( ( ) ( )) T E X n X n R . 2. The noise v(n) is i.i.d with zero mean and variance 2 v s and is independent of the input ( )X n 3. The error ( )e n is independent of ( )X n 4. The error non linearity f ( ( ))e n is independent of ( )X n Assumption 1 and 2 are commonly used to analyze adaptive filters. Assumption 3 is used in [11] and Assumption 4 is used in [12] and they are reasonable when the actual weights converge to the optimal value. Using assumptions 1 to 4 we can write (9) as: 2 2 2 ( ( ( ))) ( ( ) ) ( ) (10) ( ( ( ))) ( ) ( ( ) ) a E f e n E e n n E f e n Tr R E e n m = If ( ) ( ) ( ) ( ) ( ) ( ) T a e n n X n vW n e n v n    where ( ) a e n is the a priori error ,then squaring on both sides of ( )e n and it expectation is taken we get 2 2 2 ( ) ( ) a v E e n E e n s= + Thus (10) is written in terms of ( )ne as: 2 2 22 ( ( ) )(f ( ( ))) ( ) (11) (f ( ( ))) ( ) ( ( ) ) v E e nE e n n E e n Tr R E e n s m - = ì üï ïï ïí ý ï ïï ïî þ If we assume that the weights ( )W n are independent of the input ( )X n , then: 2 2 1 ( ) 1 (12) (f ( ( ))) ( ) ( ( )) vn E e n Tr R E e n s m = - ì üï ïï ïí ý ï ïï ïî þ A. Practical step size From (12) it is found that variable step size may prove difficult because of the presence of expectation. Therefore we replace the expectation by an instantaneous approximate value. Finally using time averaging as adopted in [9], the update equation becomes: ( ) ( 1) (1 ) ( ) (13)n n nm am a b= - + - where 2 2 1 ( ) 1 ( ( )) ( ) ( ) vn f e n Tr R e n s b = - ì üï ïï ïí ý ï ïï ïî þ and a is the smoothing factor that ranges between 0 1a< < .Thus we can say that whenever the instantaneous error is more or during the initial stage of the algorithm , the quantity 2 2 ( ) v e n s becomes smaller as 2 22 ( ) ( ) v a e n e ns= + and 2 2 1 ( ) v e n s - ì üï ïï ïí ý ï ïï ïî þ becomes larger (nearer to but less than unity). During the same time 2 2 ( ) f ( ( )) exp 2 e n e n s - = æ ö÷ç ÷ç ÷÷çè ø becomes smaller 2 2 ( ) (exp 1) 2 e n s - < æ ö÷ç ÷ç ÷÷çè ø so that 1/f ( ( ))e n is more. Therefore the overall step size becomes larger so as to have faster convergence. On the other hand during the steady state condition or whenever the instantaneous error becomes smaller, and 2 2 2 2 lim ( ) lim ( ) a v v n n e n e n s e s ¥ ¥ = + = + where e is the value Engineering, Technology & Applied Science Research Vol. 6, No. 2, 2016, 923-926 925 www.etasr.com Radhika and Arumugam: Variable Step Size Maximum Correntropy Criteria Based… of steady state excess mean square error. Therefore 2 2 ( ) v e n s becomes larger such that 2 2 1 ( ) v e n s < . Similarly, under the same condition f ( ( )e n becomes larger than the value obtained during the initial stage with ( ( )) 1.f e n < Thus 2 2 1 ( ) v e n s - ì üï ïï ïí ý ï ïï ïî þ becomes very small and 1 ( ( ))f e n also becomes small. As a result, the step size becomes smaller so as to have a smaller steady state error. During the intermediate value of error, the step size is increased or decreased based on the value of the instantaneous error. Thus both improvement in convergence and steady state error can be obtained simultaneously. III. SIMULATION RESULTS In this section, we present the simulation results to prove the performance improvement of the proposed algorithm against the fixed step size MCC based algorithm. For this purpose an unknown system with 16 coefficients is taken. It is assumed that both the filter and the system have the same number of coefficients. The input is a zero mean white Gaussian noise with unity variance. The noise is made of both background noise and impulsive noise. The background noise is Gaussian with variance 2 bv s .The impulsive noise is generated ( ) ( ) k n A n where ( )k n is a Bernoulli process with probability of success [ ( ) 1] r P k n p= = .The performance is evaluated by normalized MSD (NMSD) which is given as ( )0 2220 10 ( ) /NMSD log W n W=  .The kernel width is fixed as 2s = .The performance curves are obtained by ensemble averages over 50 independent runs. It is assumed that the variance of the noise source is known [ 9]. Figure 1 examines the performance of the proposed algorithm with signal to noise ratio (SNR) taken as 30 dB, signal to interference ratio (SIR) as -10 dB and 0.001, 0.01, 0.1 r p = respectively. For comparison, fixed step size MCC algorithm with 0.05, 0.01m = and 0.001m = with the same parameters were also plotted. The plot confirms that the proposed algorithm can achieve both faster convergence and smaller steady state error than the fixed step size counter parts. Also it is evident from Figure 1 that the variable step size approach is robust against changing values of pr. Similar conclusions can be drawn from Figure 2 and Figure 3 which were simulated for different values of SNR, SIR with pr as 0.001. The convergence behavior of our proposed algorithm can be analyzed if the time evolution of [ ( )]E nm is made. Fig 4 shows the plot of [ ( )]E nm Vs n. As per (13) when the instantaneous error is larger, the value of m is expected to be larger in order to obtain faster convergence .On the other hand, when the instantaneous error is smaller, then m is expected to become smaller in order to have lesser steady state error. Fig. 1. Performance of proposed variable step size MCC and fixed step size MCC for different values of pr.The input is white with SNR=30 dB, SIR=-10 dB. Fig. 2. Performance of proposed variable step size MCC and fixed step size MCC for different values of SNR.The input is white with SIR=-10 dB and pr=0.001 Fig. 3. Performance of proposed variable step size MCC and fixed step size MCC for different values of SIR.The input is white with SNR=30 dB and pr=0.001 Engineering, Technology & Applied Science Research Vol. 6, No. 2, 2016, 923-926 926 www.etasr.com Radhika and Arumugam: Variable Step Size Maximum Correntropy Criteria Based… Thus, Figure 4 confirms the predicted results where [ ( )]E nm is initially larger and starts to decrease and reaches its minimum during steady state which is required for improved performance. In Figure 5, the performance of the proposed algorithm is tested for colored input for the same system as above with SNR=30 dB, SIR=-10 dB and pr=0.001. The input is obtained by passing white Gaussian noise through a first order system with pole at 0.8.Thus, it is found from Figure 5 that the proposed algorithm works well even for colored input. The tracking performance of the proposed algorithm is analyzed in Figure 6 where the system is made to change abruptly after 10,000 samples with pr=0.001, SNR=30dB, SIR=-10 dB. Fig. 4. Time evolution of E[μ(n)] Fig. 5. Performance of proposed variable step size MCC and fixed step size MCC for colored input with pole at 0.8 with SNR=30 dB, SIR=-10 dB and pr=0.001. IV. CONCLUSION Step size is an important parameter which affects the performance of MCC based adaptive filters. A fixed step size gives rise to the tradeoff between convergence speed and steady state error. This paper presented a variable step size approach for MCC based adaptive filters based on largest decrease in MSD. Here the step size is increased or decreased based on the instantaneous error square obtained. Therefore improved filter performance in terms of convergence speed and steady state error is obtained. The simulations done in the context of a system identification scenario that includes impulsive noises confirms that the proposed algorithm has better performance than the fixed step size MCC based adaptive filters. Fig. 6. Tracking performance of proposed variable step size MCC and fixed step size MCC for white input with SNR=30 dB, SIR=-10 dB and pr=0.001 REFERENCES [1] S. Haykin, Adaptive filtering theory, Prentice-Hall, New York, NY, USA, 1996. [2] J. C. Principe, Information theoretic learning: Renyi’s entropy and kernel perspectives, Springer Verlag, Berlin, Germany, 2010 [3] W. Liu, P. P. Pokharel, and J. C. Principe, “Correntropy: Properties, and applications in non-Gaussian signal processing”, IEEE Trans. Signal Process., Vol. 55, No. 11, pp. 5286–5298, 2007 [4] A. Singh, J. C. Principe, “Using correntropy as a cost function in adaptive filters”, International Joint Conference on Neural Networks, Atlanta, USA, pp. 2950–2955, June 14-19, 2009 [5] B. Chen, L. Xing, J. Liang, N. Zheng, J. C. Príncipe, “Steady-state mean-square error analysis for adaptive filtering under the maximum correntropy criterion”, IEEE Signal Processing Letters, Vol. 21, No. 7, pp. 880-884, 2014 [6] L. Shi, Y. Lin, “Convex combination of adaptive filters under the maximum correntropy criterion in impulsive interference,” IEEE Signal Processing Letters, Vol. 21, No. 11, pp. 1385-1388, 2014 [7] S. Zhao, B. Chen, J. C. Principe, “An adaptive kernel width update for correntropy” , The 2012 International Joint Conference on Neural networks, Brisbane, Australia, pp. 1–5, June 10-15, 2012 [8] R. H. Kwong, E. W. Johnston, “A variable step size LMS algorithm”, IEEE Transactions on Signal Processing, Vol. 40,No. 7, pp. 1633-1642, 1992 [9] H. C. Shin, A. H. Sayed, W. J. Song, “Variable step-size NLMS and affine projection algorithms”, IEEE Signal Processing Letters, Vol. 11, No. 2, pp. 132-135, 2004 [10] X. D. Luo, Z. H. Jia, Q. Wang, “A new variable step size LMS adaptive filtering algorithm”, Acta Electronica Sinica, Vol. 34, No. 6, pp. 1123- 1126, 2006 [11] A. H. Sayed, Adaptive filters, John Wiley & Sons, 2008 [12] W. Ma, H. Qua, G. Gui, L. Xu, J. Zhaoa, B. Chen, “Maximum correntropy criterion based sparse adaptive filtering algorithms for robust channel estimation under non-Gaussian environments”, Journal of the Franklin Institute, Vol. 352, No. 7, pp. 2708-2727, 2015