ap-6-10.dvi


Acta Polytechnica Vol. 50 No. 6/2010

Ensemble Empirical Mode Decomposition: Image Data Analysis with
White-noise Reflection

M. Kopecký

Abstract

During the last decade, Zhaohua Wu and Norden E. Huang announced a new improvement of the original Empirical
Mode Decomposition method (EMD). Ensemble Empirical Mode Decomposition and its abbreviation EEMD represents
a major improvement with great versatility and robustness in noisy data filtering. EEMD consists of sifting and making
an ensemble of a white noise-added signal, and treats the mean value as the final true result. This is due to the use of a
finite, not infinitesimal, amplitude of white noise which forces the ensemble to exhaust all possible solutions in the sifting
process. These steps collate signals of different scale in a proper intrinsic mode function (IMF) dictated by the dyadic
filter bank. As EEMD is a time–space analysis method, the added white noise is averaged out with a sufficient number
of trials. Here, the only persistent part that survives the averaging process is the signal component (original data),
which is then treated as the true and more physically meaningful answer. The main purpose of adding white noise was
to provide a uniform reference frame in the time–frequency space. The added noise collates the portion of the signal of
comparable scale in a single IMF. Image data taken as time series is a non-stationary and nonlinear process to which the
new proposed EEMD method can be fitted out. This paper reviews the new approach of using EEMD and demonstrates
its use on the example of image data analysis, making use of some advantages of the statistical characteristics of white
noise. This approach helps to deal with omnipresent noise.

Keywords: EmpiricalModeDecomposition (EMD),EnsembleEmpiricalModeDecomposition (EEMD), image analysis,
sifting, Noise-Assigned Data Analysis (NADA).

1 Introduction
EmpiricalModeDecomposition(EMD)hasbeenpro-
posed as an adaptive time-frequency data analysis
method. It has been proved in applications for ex-
tracting signals from data generated in noisy non-
linear and non-stationary processes [1, 2]. However,
there are still several known unresolved difficulties
with EMD. The first major weakness of the origi-
nal EMD is the frequent occurrence of mode mix-
ing, which is defined as a single IntrinsicMode Func-
tion (IMF). IMFs can also consist ofwidely disparate
scales, or can consist of a similar signal residing in
different IMF components. Mode mixing is often a
consequence of an intermittent signal, as discussed
by Huang et al. [2, 3, 4]. An intermittent signal can-
not only cause serious aliasing in the time–frequency
distribution, but can also make the physical mean-
ing of individual IMFs seriously unclear. To alle-
viate this drawback, Huang proposed the intermit-
tence test [3, 2, 4]. This test aimed to avoid sev-
eral difficulties. However the test caused its own is-
sues, which were also illustrated by Huang and Wu.
The first issue is the subjectively selected scale of
the test. By this intervention EMD became totally
adaptive. The second issue concerns separable and
definable data selection from timescales, which was

discussed by Huang and Wu et al. [4]. To overcome
the scale separation issue without introducing a sub-
jective intermittence test, Huang andWuproposed a
newNoise-AssignedDataAnalysis (NADA) method,
knownasEnsembleEMD(EEMD),whichdefines the
true IMFcomponents as themeanvalue of an ensem-
ble of trials [2]. Each trial consists of the signal plus a
white noise of finite amplitude. Binding white noise
helps to better cover real cases, as is common in cur-
rent papers on image data analysis [1]. Addingwhite
noise has shownEMD to be an adaptive dyadic filter
bank. Tomake suchan improvement,WuandHuang
were inspired byFlandring andGledhill and their re-
search in adding white noise to data analysis, which
helps to improve EMD results [5, 4]. The main prin-
ciple of EEMD was defined simply: “the added white
noise would populate the whole time–frequency space
uniformly with the constituting components of differ-
ent scales. When a signal is added to this uniformly
distributed white background, the bits of signal of dif-
ferent scales are automatically projected onto proper
scales of reference established by the white noise in
the background” [4, p. 2].
This approach was adopted on the basis of the

following observations:
1. A collection of white noise cancels itself in the
ensemble mean if averaged in a time domain;

49


Acta Polytechnica Vol. 50 No. 6/2010

therefore, only the signal can survive andpersist
in thefinalnoise-addedsignal ensemblewhenav-
eraged.

2. Finite, not infinitesimal, amplitude white noise
is needed to force the ensemble to exhaust all
possible solutions. Finitemagnitudenoisemakes
the different scale signals reside in the cor-
responding IMF, dictated by the dyadic filter
banks, and renders the resulting ensemble mean
more meaningful.

3. The true and physically meaningful answer to
EMD is not an answerwithout noise. It is desig-
nated tobe the ensemblemeanof a largenumber
of trials consisting of the noise-added signal.
The proposed EEMD method utilizes many im-

portant statistical definitions of noise [4]. The follow-
ing sections describe a part of the research done by
Huang and Wu on the relation between white noise
and a real signal. Image data is used in this paper.
Section 2 provides a brief introduction to Ensemble
Empirical Mode Decomposition, and several details
of drawbacks associated with mode mixing are de-
scribed. Section 3 describes the usefulness and ca-
pability of EEMD through examples in Image Data
Analysis and Optical Flow Assignment.

2 Introduction to Ensemble
Empirical Mode
Decomposition over EMD

As an introduction to a more detailed description of
the EEMD method, we begin with a short review
of EMD [2, 6]. The EMD method is an adaptive
method, with the decomposition based on data and
derived from data. In the EMD approach, the data,
time series x(t), is decomposed in terms of IMFs, as
has been described in [4] by Huang,

x(t)=
n∑

j=1

cn + rn

where rn is the residue of the original data x(t) and
n is the number of steps for extracting IMFs. IMFs
are simple oscillatory functions with varying ampli-
tudes and frequency. The extracted IMFs have the
following properties:
1. Throughout the length of a single IMF, thenum-
ber of extremes and thenumber of zero crossings
must either be equal or must differ at most by
one (although these numbers can differ signifi-
cantly for the original data set).

2. At any data location, the mean value of the en-
velope defined by the local maxima and the en-
velope defined by the local minimum is zero.

Fig. 1: The sine wave signal with three crests is used as
an example in an introduction to EEMD (Fig. 2)

Fig. 2: The first step of the sifting process. Panel (a)
is the input. Panel (b) identifies local highs (red dots).
Panel (c)plots theupperenvelope (upper reddashed line)
and the lower envelope (lower red dashed line) and their
input signal (blue line), and panel (d) are the difference
between the input and the mean mi(t) of the envelopes

In common practice, EMD is implemented
through a sifting process using only local extrema [2,
6]. Foranydata rj−1 the followingprocedureapplies:
1. identify all the local extrema (a combination of
highs and lows) and connect all these local highs
(lows) with a cubic spline as the upper (lower)
envelope

2. obtain the first component h(t) by taking the
mean m(t) of the upper and lower envelopes

3. Treat h(t)= x(t)− m(t) as the data, and repeat
steps 1 and 2 as many times as is required until
the envelopes are symmetric with respect to the
zero mean under certain criteria.
The final h(t) is designated as cj. The complete

sifting process stops when the residue rn becomes a
monotone function fromwhich nomore IMFs can be

50


Acta Polytechnica Vol. 50 No. 6/2010

extracted. The process is stopped using a stoppage
criterion. Two types of stoppage criteria have com-
monly been used. The first type was used by Huang
in 1998. It is based on a Cauchy type of conver-
gence test. The test requires the normalized squared
difference between two successive sifting operations
defined as Eq. 1

SDk =

∑T
t=0 |hk−1(t) − hk(t)|

2∑T
t=0 h

2
k−1

(1)

to be small. If this squared difference SDk is smaller
than the desirednumber, the processwill be stopped.
Setting up the right SDk value seems tobeaverydif-
ficult task, because no acceptable definition is avail-
able [2]. The second criterion is to set up a pre-
selected S-number, which deals with other issues
about how to select the appropriate number. These
difficulties led authors [1] to develop a new approach
to obtaining IMFs from the acquired signal (data),
which will be described and applied below.
Based on the research of Fladrin and Gledhill,

Wu andHuang [4] used their results as a background
for improving the definition of the EMD method.
Bearing the definition of EMD in mind, they showed
that EMD behaves as a dyadic filter bank when
white noise is populateduniformly through thewhole
timescale or time-frequency space [4].
Their description postulated that the total num-

ber of IMFs in a data set is close to log2(N), with
N as the total number of data points. This fact en-
sures a total number of valid IMFs with difficulties
around. When the signal is not merged with pure
noise, some scales can be missing, and this involves
the total number of IMFs, which can be lower than
the expectation log2(N). Another issue is caused by
modemixing, as in the previousEMD.Anadvantage
of this approach is knowledge of the expected IMFs
expressed in N number.

Fig. 3: The intrinsic mode functions of the input dis-
played in Fig. 1(a)

2.1 A definition of mode mixing
based on EMD

Modemixing has been defined as any IMFconsisting
of oscillation of dramatically disparate scales. When
mode mixing occurs, IMF can cease to have a differ-
ent physical meaning by itself. The signal can sug-
gest an invalid physical representation. This known
drawback was mentioned by Huang in [2, 3, 4]. An
example of the sifting process is presented in Fig. 2,
and its decomposition is shown in Fig. 3, as illus-
trated by Huang and Wu [4]. The fundamental part
of the data is the sine wave with unit amplitude.
At the three middle crests of the sine wave, high-
frequency intermittent oscillations with amplitudes
close to 0.2 ride on the fundamental sine wave, in
panel (a) of Fig. 2. This signal is denoted as the test
signal. Panels (b) and (c) show the sifting process
of the identifying highs (lows), as described above.
Panel (d) displays the result, which is affected by
visible mode mixing. This is because one envelope
was a mixture of envelopes of the fundamental sine
waveandthe intermittent signal, leading toa severely
distorted envelope mean, as shown in Fig. 2 and
Fig. 4. Fig. 4 shows the main issue caused by ap-
propriate localization of the highs and lows. An un-
pleasant implication of mode mixing leads to disad-
vantages of the original EMD method with the stop-
page criterion, as described byHuang [1, 2, 4]. Mode
mixing is also the main reason why the EMD al-
gorithm is unstable (Fig. 3). Any small perturba-
tion may result in a new set of IMFs, as reported
by Gledhill [4]. Obviously, the intermittence pre-
vents EMD from extracting any signal with similar
scales. EEMD [4], which will be briefly described
in the following sections, has been proposed to solve
these problems.

Fig. 4: Test signal (red dashed line) and the locations of
highs (blue circle) and lows (green circle)

51


Acta Polytechnica Vol. 50 No. 6/2010

2.2 Ensemble empirical mode
decomposition

As shown in theprevious test signal, the data (Fig. 2,
Fig. 3) comprises collections of the main signal and
some noise. To improve the precision of the mea-
surements, the idea of the ensemble mean becomes
a powerful approach. Data is collected by separate
observations, each ofwhich contains a differentwhite
noise realization [4].
To generalize this ensemble idea, the noise is pop-

ulated to a single data set x(t). The added white
noise is treated as possible random noise that would
be encountered in the measurement process. Under
such conditions, the equation of any observation can
be written as

Xt(t)= x(t)+ wi(t) (2)

In thisway, adifferent realizationofwhite noise wi(t)
will be added when there is only one observation.
(Eq. 2)
We will now make a brief review of the proper-

ties of EMD before proceeding to a more detailed
description of the EEMD method:
1. EMD is an adaptive data analysismethod based
on local characteristics of the data, and it there-
fore captures nonlinear, nonstationary oscilla-
tions more effectively

2. EMD is a dyadic filter bank for any Gaussian
white noise-only series

3. when the data is intermittent, the dyadic prop-
erty is often compromised in the original EMD,
as shown in the example in Fig. 3

4. adding noise to the data canprovide auniformly
distributed reference scale, which enables EMD
to repair the compromised dyadic property

5. the corresponding IMFs of different series of
white noise have no correlationwith each other.
Therefore, the means mi(t) of the corresponding

IMFs of different white noise series are likely to can-
cel each other.
Bearing in mind all these properties of the EMD

method, the proposed EEMD has been developed in
the following way:
1. add white noise series to the targeted data;
2. decompose the datawith addedwhite noise into
the IMFs;

3. repeat step 1 and 2, based on the realization of
different white noise series each time;

4. perform steps 1 and 2 until the IMFs of the data
set are close to log2(N), with N as the number
of total data points;

5. obtain the (ensemble) means of the correspond-
ing IMFs of the decompositions as the final re-
sult.

Fig. 5: Test signal with white noise added (red dashed
line) and the location of its highs (blue circle) and lows
(green circle)

Fig. 6: The input top panel, its intrinsic mode functions
(C1–6), and the trend (R). In panel C5, the original in-
put is plotted as the bold dashed red line, for purposes of
comparison

The main effect of decomposition using EEMD
is that the added white noise series cancel each
other in the final mean of the corresponding IMFs.
This means that the IMFs stay within the natural
dyadic filter windows, and thus significantly reduce
the chance of mode mixing and preserve the dyadic
property.
An example of the use of the EEMD method is

illustrated in Fig. 6, in a similar meaning as for the
previous EMD in Fig. 2 and Fig. 3. Clearly, the fun-
damental signalC5 is representedalmostperfectly, as
are the intermittent signals, if C2 and C3 are added
together. The fact that the intermittent signal ac-
tually resides in two EEMD components is due to
the average spectra of the neighboring IMFs of white
noise overlapping, as revealed by Wu and Huang. It

52


Acta Polytechnica Vol. 50 No. 6/2010

is necessary to combine the two adjacent components
to one IMF. The need for this type of adjustment
is easily determined through an orthogonal check.
Whenever two IMF components become grossly non-
orthogonal, we should consider combining the two
components to form a single IMF component [4].

3 EEMD method
implemented in image data
analysis

The previous theoretical example shows the main
concept of NADA, using the EEMD method as a
tool. Based on previous text discussion, the question
arises how the method could be used in image data
analysis. Every frame of an image comprises a time-
sequence of data that is processed by the brain. Our
brain gives an appropriate meaning to this ensemble
of data. In this way, synapses can work as a dyadic
filter bank. Data can be and mostly is affected by
all kinds of noise. There can be various reasons for
this noise. Examples are short-sightedness or day-
light. The observed object is not sharply displayed
on our retina and the data is treated in a damaged
way. This can affect itsmeaning or its values. In this
section, we will outline EEMD usage and how it can
be used in analyzing such data. An example of cube
rotation is displayed in Fig 7.
The brightness of the pixels over two images of

the same cube in line height 120 pixels forms the
test signal. The general size of the two images is
320×240 pixels in gray scale colormode. Gray scale
mode was chosen for the sake of simplicity to better
understand themean value. In the first step, the two
images were analyzed in a similar way. The line val-
ues were treated and the IMFs were extracted. As
the deviation, the amount of white noise was set to
0.2 and the EEMD ensemble number was set to 50.

Fig. 7: Cube rotation: the cube rotates fromamisaligned
position to frontal position. Theblackarrow indicates the
direction of rotation

Fig. 8: Cube 1 – signal line

Fig. 9: Cube 2 – signal line

Fig. 10: Cube 1 – All IMF functions extracted by EEMD

53


Acta Polytechnica Vol. 50 No. 6/2010

In theCube 1 trial, the input (the original bright-
ness data) is a visible jump around a width value
of 100, which corresponds with the edge of the real
cube (Fig. 7). Cube 1 is slightly dipped to the left
side from the center, as shown in the original image
(Fig. 8). The next peak inFig. 10 ofwidth of around
118 units can represent the second cube edge, which
is coherent with reality. Consequent perturbation in
thefirst IMFcan reflect the footmark. Thefirstmain
point here is the edge projection as a short pertur-
bation with definable peaks over a zero value. Other
IMFs can represent surface deviances. The last line
is the residuum of the data. The residuum is mostly
explained as a trend. The trend explanation can also
be used for our case, because the last line shows a
background change caused by the box. The original
data defines the camber in the real scope. This jump
is performed in the same way as in the trend line.
Only eight different IMFs were extracted, which cor-
responds to the EEMD basics of the finite data set
explained above. The data forCube 2 (Fig. 9) is also
treated and displayed in a similar sense as Cube 1.
The image of Cube 2 is taken from the front, which
is also visible in the original. The data jump and the
additional pit are smaller. This fact is caused by the
projection of Cube 2, which is logical. The surface
texture is alsodisplayed in the originalfigure. Aswas
described above, the EEMD method works with the
IMF summary.

Fig. 11: Cube 2 – All IMF functions extracted byEEMD

Fig. 12: Comparison between the original data set and
the IMF summary

All IMFs should have no mutual correlation.
IMFs should be independent of white-noise trials.
This means that a summary of all IMFs should give
us the original signal (Fig. 12). Finally, when we
make a comparison of the two cubes, we find some
kind of data shifts in the inputs and IMFs.
These shifts are treated by cube rotation from

right to left, as indicated above (Fig. 7). There is
no doubt that the two original figures are affected
by different kinds of noise. The noise was treated
during capture time. EEMD enables us to extract
data from images properly, even in different layers.
The main intention in edge detection is to find ap-
propriate differences over all image layers, which is
enabled by the use of the EEMD method presented
here. Another important point is the logically dif-
ferent image width of cube rotation. Fig. 13 points
out several areas where the cube edge can be pro-
jected on to the specific IMFs. Possible edge fluctu-
ations are marked in subplot C1. These fluctuations
are displayed as second fluctuations, because of the
main transition. The first and the last fluctuations
directly indicate a background change to/from the
cube. Now we see that we have two limited areas
where everything can be projected. In C2, the IMFs
still show similar behavior over cubes with a logical
width shift. The underlying change is shown in IMF
C3. The Cube 1 line is projected with no underly-
ing fluctuations, and it corresponds to realitywithno
additional cube edge. In Cube 2, however, the line
fluctuation reaches a smaller peak and some kind of
promontory is treated. This can be explained as a
cube surface projection. When we look back to the
original images, we see the true reality (Fig. 8 and
Fig. 9). We consider here that C3 corresponds with
the surface of the object where no inner edge fluctu-
ation is displayed. This assertion is not proven, and

54


Acta Polytechnica Vol. 50 No. 6/2010

Fig. 13: Comparison of cube EEMDs — the colored cir-
cles indicate important edge detection areas

Fig. 14: Cube 1 — A comparison between EMD and
EEMD methods and instability in analysis

will be a topic for further study. This result can be
taken as a valid conclusion here. In our analysis of
theCube 1 signal, we found three edges. In the anal-
ysis of the Cube 2 signal we found only two edges,
which correspond to the front view. The retrieved
result is considered to be the cube rotation from the
left dipped state to the right frontal view.

4 A comparison between
EMD methods and EEMD
methods

Fig. 14 presents a comparison between the original
EMDmethod and the newEEMDmethod, using im-
age data analysis.

5 Conclusion
A major drawback of EMD is the instability of the
method, see Fig. 14. The EMD method has a prob-
lem evenwith edge detection. Edge fluctuations over
the signal are more widely spread and are not as
strict as inEEMD.EMD lines showa strongmixture
of modes, where lines dramatically change direction
with no valid reasons. Any edge projected on to the
retina is awidth limited area,which causes strict sen-
sor stimulations. This assertion is closer to EEMD
behavior, where narrower fluctuations are depicted.
In brief, results produced by the EMD method

can be less valid thanEEMD lines, which offermuch
stricter behaviors. This instability can have a dra-
matic effect on further studies of any signal, not nec-
essary from an image [1, 2, 4, 6]. The reason for the
drawbackis linkedwith the ideaof acleardata signal.
This ideadoesnot correspondto realworldexamples.
The EEMD method uses a white noise-added signal.
This approach exhausts all possible solutions during
the shifting process (Fig. 4, Fig. 5). The comparative
test of the instability of the twomethods displays the
mainbenefit ofusing theEEMDmethod. The ability
todivide the incoming signal data intomore indepen-
dent IMFs functions and then analyze it gives power
to the EEMD method, where this signal, not nec-
essary an image, can be interpreted and processed
by layers. The natural fact of the independence of
IMFs provides an opportunity to treat any signals as
a summary of their layers, and this could be helpful
in future medical research. There is potential and
motivation for improving the proposed method.

Acknowledgement

This work was supported by the Grant Agency of
the Czech Technical University in Prague, grant
No. 2B06023 and by CTU grant SGS OHK2-021/10.

References
[1] Kopecký,M.: Hilbert-HuangTransformationand
its application in optical flow evaluation, STC,
2009.

[2] Huang,N.E., Samuel, S. P. Shen: Hilbert-Huang
Transformand ItsApplication,Politecnico diMi-
lano, Vol. 25. ISSN 0262-8856.

55


Acta Polytechnica Vol. 50 No. 6/2010

[3] Huang, N. E., Shen, Z., Long, S. R., Wu, M. C.,
Shih, E. H., Zheng, Q., Tung, C. C., Liu, H. H.:
The empirical mode decomposition method and
the Hilbert spectrum for non-stationary time se-
ries analysis,Proc. Roy. Soc. London 454A, 1998,
p. 903–995.

[4] Wuand, Z., Huang, N. E.: Ensemble Empiri-
calModeDecomposition andNoise-assistedData
Analysis Method, World Scientific Publishing
Copany, Advances in Adaptive Data Analysis,
Vol. 1, No. 1 (2009), p. 1–41.

[5] Flandrin, P., Goncalves, P., Rilling, G.: EMD
equivalent filter banks, from interpretation to ap-
plications, in Hilbert-Huang Transform: Intro-

duction and Applications, eds. N. E. Huang and
S. S. P. Shen, World Scientific, Singapore, 2005.

[6] Kokeš, J.: Okamžitá frekvence a Hilbert-
Huangova transformace. Sborńık konference In-
teligentńı systémy pro praxi, Lázně Bohdaneč,
2006. ISBN 80-239-6535-2.

Miroslav Kopecký
E-mail: Miroslav.Kopecky@fs.cvut.cz
Department of Instrumentation
and Control Engineering
Czech Technical University in Prague
Faculty of Mechanical Engineering
Technická 2, 166 27 Praha, Czech Republic

56