Microsoft Word - cet-01.docx CHEMICAL ENGINEERING TRANSACTIONS VOL. 46, 2015 A publication of The Italian Association of Chemical Engineering Online at www.aidic.it/cet Guest Editors: Peiyu Ren, Yancang Li, Huiping Song Copyright © 2015, AIDIC Servizi S.r.l., ISBN 978-88-95608-37-2; ISSN 2283-9216 An Improved Sparse Representation Model for Robust Image Denoising Zhi Cui*, Xianpu Cui School of Communication and Electronic Engineering, Hunan City University, Yiyang, 413000, China. zhicui@yeah.net Though the sparse representation has demonstrated to be a very effective tool to de-noise the images with low levels of noise, it usually losses the power to well preserve structural features in images with high levels of noise. In this paper, we propose an improved de-noising method for images with low signal-to noise ratio. Specifically, the proposed method takes the histogram structural similarity (HSSIM) as similarity factor to replace the reconstruction error as the new fidelity term, and finds the most appropriate sparse coefficients by using the modified orthogonal matching pursuit (OMP) algorithm which enables structures in the recons tructed image run as close as possible to the ideal image. In addition, the proposed method adaptively trains the initialized dictionary by using the K-singular value decomposition (K-SVD) algorithm based on HSSIM to assure the image structures can be well reconstructed under the high noise circumstance. Experiment results have shown that the proposed method is better than some well-known de-noising methods in terms of PSNR and edge-preserved index (EPI) in high noise condition. 1. Introduction De-noising is a fundamental work in image processing. In the past few years, m any approaches have been investigated to against this problem such as curvelet transformation (Ma and Plonka (2007)), total variation diffusion (Easley, et al (2009)), wavelet threshholding (Donoho (1995)), quaternion wavelet (Yin, et al (2012)), and inter-scale orthonormal wavelet thresholding (Luisier and Blu (2008)). Recently, sparse and redundant representation has attracted more and more attention. It is based on the theory that the given non-zero vector can be linear combined by minimum column in a predefined row-full-rank matrix with constraint of row number is much less than column number (Mallat and Zhang (1993)). Generally, the row-full-rank matrix is called dictionary and the columns in this matrix are called atoms. Sparse and redundant representation model has broken up the limit of orthogonal basis and maximize the advantages of redundant basis, and has been widely used in image fusion (Li, et al (2013)), image restoration (Ning, et al (2013)), and image Super-resolution (Kim and Kwon (2010)). Application of sparse representation in image de-noising has also garnered considerable attention. Because of the satisfaction of sparsity and separability for signal to noise, the sparse representation method based on redundant dictionary can achieve better de-noised effect than the transform domain based method (Li, et al (2012)). One of the most successful technique is the K-SVD algorithm reported by Elad and Aharon (2006). This approach extends the work by reported by Aharon and Elad (2006) in many ways, and the results have demonstrated that a dictionary which can adaptively updated would cause improved de -noising performance. On the foundation of work by Elad and Aharon (2006), Z. Zhou, et al (2012) proposed a K-least mean square algorithm (K-LMS) for the dictionary learning and image representation, and the de-noising could be achieved by combine the adaptive image sparse decomposition in a learned over-complete dictionary and the threshold process. However, the fixed step length adopted in the sparse decomposition would cause a large level of steady-state error. From the research course of the image de-noising based on the sparse representing, it is known that most of the current de-noising methods consider the reconstruction error of the images before and after de-noising as the fidelity term. However, affected by work environment and monitoring objects, image collected in the real condition always contains rich details and low signal-to-noise ratio. If we continue using the above criteria, the noise components introduced in the reconstructing process will have a larger impact on the image DOI: 10.3303/CET1546030 Please cite this article as: Cui Z., Cui X.P., 2015, An improved sparse representation model for robust image denoising, Chemical Engineering Transactions, 46, 175-180 DOI:10.3303/CET1546030 175 reconstruction accuracy, and the image's structural similarity also could not be well preserved. In order to keep the structural features information as much as possible when the image is effectively de -noised, in this paper, we focus on the removal of additive Gaussian white noise from standard images and propose an im proved method based on the foundational work by Elad and Aharon (2006). Our method has two specific parts. The first is to replace the reconstructed error with the histogram structural similarity as the new fidelity term. This is motivated by the conclusion that HSSIM aligns to the features of visual cortex and thus tends to produce better results that agree with the human visual system (Wang and Bovik (2004)). The second relates to the training of redundant dictionary. With our method, the initialized dictionary is trained by using K-SVD with constraint of HSSIM so that the image structures can be well reconstructed under the high noise circumstance. 2. Related Work Fang et al (2012) reported that any ideal image could be described by the model of y R n N in which the image is separated into n n parts. According to the sparse representation, a dictionary matrix D R n N is defined to represent all the im age parts as follows: 2 0 2 ˆ arg min . .  a a Da xs t  (1) where a Rn is the sparse representation coefficient, 0 a is the number of the non-zero values which means the sparsity of a , D is the dictionary. Transform the constraints to the penalty term. According to the regularization optimization, the equation (1) can be changed to: 2 2 0 , , , ˆ ˆ{ , } argmin ( )      a y a y y x a Da R y ij ij ij ij ij ij i j i j   (2) where  is the Lagrange multiplier, ij  is a coefficient, the first component represent the integral similarity between the image with noise and the clear image, and it should be less than the convolution of 2C  (C is a constant). The second one is the sparse constraints. R ij is the n N matrix to pick up the image block in ( , )i j from the image of. D is the over-completed dictionary. Finally, the denoised image y is updated by 22 2 2 , ˆ argmin    y y y x Da R y ij ij ij i j   (3) 3. The proposed method In this section, we describe the proposed method. The core of our method is to replace the reconstruction error with histogram structure similarity and make it as a new fidelity term. In addition, when the ways of initial over-complete dictionary and orthogonal matching pursuit algorithm are applied in sparse decomposition, we incorporate HSSIM into the constraint condition. In practical terms, the demonstration of the proposed method can be shown in Fig. 1. 176 javascript:void(0); Image patches Clustering K-SVD based on HSSIM Updated dictionary OMP based on HSSIM The preliminary de-noised image DCT dictionary - + The compensation image The final de-noised image De-noising by using DCT dictionary The de-noised Compensation image The observed Image Merging Figure 1: Flowchart of the proposed method Here is the improved sparse representation de-noising model that we have proposed: 2 2 0 , , , ˆ ˆ{ , } argmin (1 H( ))       a y a y y x a Da R y ij ij ij ij ij ij i j i j   (4) In equation (4), the first and the second parts on the right side are the constraints, and the third one is the similarity factor which replaces the reconstruction error of K-SVD as the new computational fidelity term. HSSIM is defined as follows: ( , ) ( , ) ( , ) ( , )H x y l x y q x y h x y (5) where ( , )l x y , ( , )q x y and ( , )h x y represents intensity, contrast and am biguity, respectively . Assume that the initial condition is y = x , the sparse coefficient of each image patch can be calculated as follows: 0 ˆ argmin (1 H( ))    a a a Da R y ij ij ij ij ij ij  (6) Then the result of de-noising can be acquired by solving y partial derivatives of equation (5): T 1 T , , ˆˆ ( )     y I R R x R Da( )ij ij ij ij i j i j   (7) From the above analysis, the procedure of our method is summarized as follows: (1) We adopt the completed DCT dictionary as the initial dictionary, satisfying y = x , and cluster each sample into K sub-groups. (2) W e divide the observed image into overlapping blocks, and take these overlapping image patches as the training data set for the K-SVD algorithm. Specifically, we take 1 HSSIM( ) Da R y ij ij to replace 2 2 Da R y ij ij as the new fidelity term in order to well preserve the image structural features. Through equation (13), we can obtain all the sparse coefficients. (3) In this stage, we define the error matrix as T   y d ak j j j k E , and define the atom used for the sparse separation in the dictionary as T { 1 , ( ) 0}   a i j i i k i , where d j is the first column of dictionary atom, and T a j is the first row of the sparse coefficient matrix.. Then the dictionary updating is switched to the following model: 177 , T T min(1 ( , )) . .    d a y d a a j j j j j i j k H s t  (8) Repeat the stage (3) J times until the stop criterion. (4) W hen the above stages are completed, we can now obtain the preliminary de-noised image y . However, the details such as edge in the image might be lost. To solve it, we first obtain the compensation image y dif by calculating the difference between the preliminary de-noised image y and the noisy image x , then obtain the de-noised compensation image ŷ dif by using the DCT redundant dictionary, after that we add the de-noised compensation image ŷ dif to the preliminary de-noised image y and obtain the final de-noised image y t . The solution is formulated as follows: ˆy y + y t dif (9) 4. Experiments In this section, we carry out experiments with the purpose of testing the performance of the proposed method. We used other three different methods for a fair comparative analysis. The parameters of the proposed method are set as 10J and 8 8 n , respectively. Since in [25], Elad and Aharon set  and C as 30  and 1.15 respectively, we adopt the same value, but default value for the parameters in the other methods. Considering the objectivity of the effect evaluation, PSNR and EPI were adopted as the objective indices to evaluate the quality of de-noised images. All the experiments are promoted on a Core i5(R) 2.6 GHz PC with 4 GB RAM. EPI is defined as follows: ( , )      i j i j x x y y p p E x y p p (10) where x p and yp represents the gray level in observed im age and denoise im age . First, we compared the EPI and PSNR results obtained by the proposed method and some other methods, such as contourlet, K-SVD, and K-LMS. As is shown that, the tests were performed on three images: “Man”, “Babara” and “Boat”. AWGN is superimposed on them with  20, 30 and 40. The results are given in Tab. 1. Table 1: Performance of the de-noising methods by HSSIM and PSNR Image Noise σ EPI PSNR/dB Contourlet K-SVD K-LMS Proposed Contourlet K-SVD K-LMS Proposed Man 20 0.546 0.603 0. 617 0.689 26.45 28.74 28.83 28.91 30 0.527 0.582 0.594 0.632 25.97 28.13 28.24 28.47 40 0.503 0.541 0.545 0.599 24.44 26.69 26.71 27.85 Babara 20 0.573 0.612 0.633 0.681 24.42 27.48 27.54 27.83 30 0.554 0.593 0.605 0.624 23.57 26.60 26.63 26.97 40 0.539 0.567 0.575 0.591 21.31 24.26 24.39 25.65 Boat 20 0.534 0.601 0.627 0.673 22.63 25.54 25.60 25.89 30 0.516 0.587 0.593 0.616 21.48 24.21 24.59 24.70 40 0.489 0.551 0.572 0.584 20.19 22.48 22.57 23.94 The differences of de-noising results obtained by the proposed method and the others for the same image with the same  are significant. Specifically, by adding the white noise with the variance of 30 to “Man”, the EPI 178 result obtained by the proposed method is 0.632, with the increased ratio by 19.92%%, 8.59% and 6.39% respectively compared with the values by using Contourlet, K-SVD, and K-LMS, and the PSNR result obtained by the proposed method is 28.47 dB, with increased value by 2.50dB, 0.34dB and 0.23dB respectively compared with the values in Contourlet, K-SVD and K-LMS. It also can be found that the results obtained by the proposed method are better than the others for different images with different  . For instance, the PSNR result obtained by the proposed method is increased by 3.31dB, 0.67dB and 0.56dB on average respectively than in Contourlet, K-SVD and K-LMS, and the EPI result obtained by the proposed method is increased by 19.11%, 8.64% and 6.14% on average respectively than in Contourlet, K-SVD and K-LMS. The comparison of the de-noising effects obtained by the proposed method and the others for all the three images are given in Fig. 2, with the high noise variance of 70, because under the low noise condition ( 20 ), the de-noising results are almost perfect, just with some slight differences. From the aspect of subjective visual effect, it can be found that the proposed method has some advantages in keeping the structure features compared with the other methods. Figure 2: Visual comparison of the reconstructed results on three images (Man, Babara and Boat), with σ=70. The first column: noise-free images. The second column: reconstructed results obtained by Contourlet. The third column: reconstructed results obtained by K-SVD. The fourth column: reconstructed results obtained by K-LMS. The fifth column: reconstructed results obtained by the proposed method. 5. Conclusions In this paper, we have proposed an improved de-noising method for low-SNR images. The proposed method makes use of structure information in image in order to keep the de-noising result meet the human visual system. Traditional sparse representation based methods take the reconstruction error as fidelity term, which leads to the shortage of structure preserving. This regret motivated us to replace the reconstruction error with similarity factor as a new fidelity term. Detail compensation is also used for improve the effect of de -noising. Experiment results show that the proposed method outperforms Contourlet, K-SVD and K-LMS in terms of EPI and PSNR. In the future, we will consider the dictionary learning method. Acknowledgments This work is supported by Scientific Research Fund of Hunan Provincial Education Department under Grant of 13C122. 179 References Donoho, D. L., 1995, De-noising by soft threshholding, IEEE Transactions on Information Theory, 41(3), 613-627, 10.1109/APCCAS.2000.913631 Easley, G. R., Labate, D., Colonna, F., 2009, Shearlet based total variation diffusion for denoising, IEEE Transactions on Image Processing, 18(2), 260-268, 10.1109/tip.2008.2008070 Elad, M., Aharon, M., 2006, Image denoising via learned dictionaries and sparse representation, Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, 895-900, 10.1109/CVPR.2006.142 Fang, L. Y., Li, S. T., Nie, Q., Izatt, J. A., Toth, C. A., Farsiu, S., 2012, Sparsity based denoising of spectral domain optical coherence tomography images, Biomedical Optics Express, 3(5), 927-942, 10.1364/boe.3.000927 Kim, K. I., Kwon, Y. H., 2010, Single-image super-resolution using sparse regression and natural image prior, IEEE Transactions on Pattern Analysis and Machine Intelligence, 32(6), 1127-1133, 10.1109/tpami.2010.25 Li, S.T., Yin, H.T., Fang, L.Y., 2013, Remote sensing image fusion via sparse representations over learned dictionaries, IEEE Transactions on Geoscience and Remote sensing, 51(9), 4779 –4789, 10.1109/tgrs.2012.2230332 Li, S. T., Fang, L.Y., Yin, H. T., 2012, An efficient dictionary learning algorithm and its application to 3-D medical image denoising, IEEE Transactions on Biomedical Engineering, 59(2), 417-427, 10.1109/tbme.2011.2173935 Luisier, F., Blu, T., 2008, SURE-LET multichannel image denoising: Interscale orthonormal wavelet thresholding, IEEE Transactions on Image Processing,17(4), 1057-7149, M. Aharon, M. Elad, A.M. Bruckstein, 2006, K-SVD: An algorithm for designing of overcomplete dictionaries for sparse representation, IEEE Transactions on Signal Processing, 54(11), 4311-4322, 10.1109/tsp.2006.881199 Ma, J., Plonka, G., 2007, Combined curvelet shrinkage and nonlinear anisotropic diffusion, IEEE Transactions on Image Processing, 16(9), 2198-2206, 10.1109/tip.2007.902333 Mallat, S., Zhang, Z., 1993, Matching pursuits with time-frequency dictionaries, IEEE Transactions on Image Processing, 41(12), 3397–3415, 10.1109/tip.2008.919370 Wang, Z., Bovic, A. C., 2009, Mean square error: Love it or leave it? A new look at signal fidelity measures, IEEE Signal Processing Magazine, 26(1), 98-117, 10.1109/msp.2008.930649 Wang, Z., Bovik, A. C., Sheikh, H. R., Simoncelli, E. P., 2004, Image quality assessment: From error visibility to structural similarity, IEEE Transactions on Image Processing, 13(4), 600–612, 10.1109/tip.2003.819861 Ning, Q., Chen, K., Yi, L., Fan, C. C., Lu, Y., Wen, J. T., 2013, Image super-resolution via analysis sparse prior, IEEE Signal Processing Letters, 20(4), 399-402, 10.1109/lsp.2013.2242198 Yin, M., Liu, W ., Shui, J., Wu, J. M., 2012, Quaternion wavelet analysis and application in image denoising, Mathematical Problems in Engineering, 2012, 1-21, 10.3724/sp.j.1187.2012.00338 Zhou, Z., Luo, L. M., 2012, Research on Image Denoising Algorithm based on Adaptive Overcomplete Sparse Representation Theories, Journal of Convergence and Information Technology, 7(16), 315 -321, 10.4156/jcit.vol7.issue16.38 180 http://dx.doi.org/10.1109/APCCAS.2000.913631 http://dx.doi.org/10.1109/CVPR.2006.142