

 CHEMICAL ENGINEERING TRANSACTIONS  
 

VOL. 46, 2015 

A publication of 

 
The Italian Association 

of Chemical Engineering 
Online at www.aidic.it/cet 

Guest Editors: Peiyu Ren, Yancang Li, Huiping Song 
Copyright © 2015, AIDIC Servizi S.r.l., 
ISBN 978-88-95608-37-2; ISSN 2283-9216 

Features Extraction of Moving Target Image Based on 
Nuclear Method and Spatio-Temporal Correlation Theory 

Baoli Yuan  

ShijiaZhuang Vocational Technology Institute, Shijiazhuang, Hebei, China  
stevenybl@qq.com 

For the detection of moving targets, we first discuss the traditional detection methods (the background difference method, the inter-frame difference method and the optical flow method) and point out their respective scopes of application, advantages and disadvantages. On this foundation we propose a moving target detection scheme based on temporal and spatial correlation, which overcomes some defects of the traditional moving object detection methods and improves the quality of target detection. This paper describes background estimation for target detection and extraction and several common background estimation algorithms, presents an algorithm that recovers the background of a fixed scene by combining multiple frames, and gives the corresponding experimental results. Coarse contour tracking is performed with a tracking template and the difference contour of consecutive frames, which avoids the heavy computation of matching and searching in traditional tracking methods and the false detections caused by fixed-size matching templates when tracking non-rigid objects.

1. Introduction  

With the rapid development of networks, video has gradually become one of the main means of information dissemination. Because of the special status of vision in human perception, video has long been a focus of research. Moving target detection and tracking in video is the basis for other studies; it is a comprehensive application of pattern recognition, image processing, computer vision and artificial intelligence, and has strong theoretical research value. In visual motion analysis, object detection and segmentation means separating the moving target from the background in an image sequence. It relies on relevant characteristics of the moving target, for example image edges, shape, texture and a variety of moment features. Among the processes involved in visual motion analysis, moving object detection and segmentation lies at the bottom of the visual surveillance system, and all subsequent processing, such as target classification and behavior understanding, depends on it. It is therefore of prominent importance.
According to the relationship between the target and the camera, moving target detection algorithms can be divided into static background motion detection and dynamic background motion detection. In static background motion detection, the camera does not move during monitoring; only the monitored target moves within the camera's field of view, so only the target moves relative to the camera. In dynamic background motion detection, the camera itself moves during monitoring (for example by translation, rotation or multi-degree-of-freedom movement) while the monitored target also moves within the field of view, which produces complex relative movement between the target and the camera.
For locating the tracked target, the traditional region-based image matching algorithm has higher accuracy but is very time-consuming. A compromise is to extract the target contour information from an image region and then match the object contours; this lowers the matching quality but greatly reduces the time spent on tracking. At the same time, calculating the motion area first can improve contour matching and further reduce the search time. Based on spatio-temporal correlation motion detection and differencing, the contour matching location tracking scheme can extract the moving object quickly without great loss of quality, and can meet the real-time requirements of practical applications.

                               
DOI: 10.3303/CET1546065

 
Please cite this article as: Yuan B.L., 2015, Features extraction of moving target image based on nuclear method and spatio-temporal 
correlation theory, Chemical Engineering Transactions, 46, 385-390  DOI:10.3303/CET1546065  



2. Related theory  

2.1 Morphological and sequence analysis of digital image 
An image sequence is composed of a series of images acquired at given or assumed relative times, with the time interval between adjacent image acquisitions specified. It can generally be expressed as follows:

$$B(x, y) \sim N(\mu_0, \sigma_0^2) \tag{1}$$

$$\sigma_0^2(x, y) = \frac{1}{M} \sum_{k=0}^{M-1} \left[ f_k(x, y) - \mu_0(x, y) \right]^2 \tag{2}$$

$$\mu_0(x, y) = \frac{1}{M} \sum_{k=0}^{M-1} f_k(x, y) \tag{3}$$
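As an illustration of Eqs. (2) and (3), the following minimal sketch computes the per-pixel mean and variance over M frames with NumPy. The function names, the threshold factor `k` and the floor `min_sigma` are assumptions introduced here for illustration; the paper does not specify how the Gaussian model is thresholded.

```python
import numpy as np

def gaussian_background(frames):
    """Per-pixel mean mu_0 (Eq. 3) and variance sigma_0^2 (Eq. 2) over M frames."""
    stack = np.asarray(frames, dtype=np.float64)   # shape (M, H, W)
    mu0 = stack.mean(axis=0)                       # (1/M) * sum_k f_k
    var0 = ((stack - mu0) ** 2).mean(axis=0)       # (1/M) * sum_k (f_k - mu_0)^2
    return mu0, var0

def foreground_mask(frame, mu0, var0, k=2.5, min_sigma=1.0):
    """Hypothetical decision rule: a pixel is foreground when it lies more
    than k standard deviations from the background mean. k and min_sigma
    are illustrative choices, not values from the paper."""
    sigma = np.maximum(np.sqrt(var0), min_sigma)   # avoid a zero threshold
    diff = np.abs(np.asarray(frame, dtype=np.float64) - mu0)
    return diff > k * sigma
```

A typical use would be to call `gaussian_background` once on M background frames and then `foreground_mask` on each new frame.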

Moving target tracking uses one or more features that effectively and uniquely represent the target and, with an appropriate matching algorithm, searches the continuous image sequence for the candidate target most similar to the target template in position, velocity, shape and other aspects. Put simply, it is the localization of the moving target in a continuous image sequence.

2.2 Moving target detection and tracking 
a. Background compensation method 
The specific approach is to select the image at time K (a), which contains the moving target and the background, and then subtract it from the image at time K+1 to obtain a contour map (b). The gray area in the map is occupied by the moving target at time K but is background at time K+1. If this region at time K is replaced by the background region from time K+1, the part of the background covered by the target at time K is reduced. Continuing in this way through N frames, the target area is removed completely. Figure 1 shows the target in the current frame (a) and the target area (b); the detected target coincides completely with the target area.
 

(a)                                    (b)                                                                

Figure 1: The target in the current frame and area 

b. The statistical average method 
The statistical average method takes the pixel-by-pixel statistical average of a continuous image sequence and uses the average values to approximate the background image, that is, the cumulative average of N continuously acquired images:

$$B_k = \frac{1}{N} \left( f_k + f_{k-1} + \cdots + f_{k-N+1} \right) \tag{4}$$

where N is the number of image frames. The value of N depends on the speed and the size of the moving target: the faster the target moves and the smaller it is, the fewer frames are needed to obtain the background. In general, a larger N gives a more realistic background estimate. At 25 frames per second, N = 250 frames corresponds to 10 seconds of video, so real-time requirements limit the use of the statistical average method.
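The cumulative average of Eq. (4) can be maintained incrementally, so that each new frame costs one addition and one subtraction per pixel. The sketch below is one possible arrangement; the class name `AverageBackground` is hypothetical, and averaging over fewer than N frames during start-up is an assumption made here, not something the paper states.

```python
from collections import deque

import numpy as np

class AverageBackground:
    """Sliding-window average of the last n frames (Eq. 4)."""

    def __init__(self, n):
        self.n = n
        self.frames = deque(maxlen=n)   # holds at most n frames
        self.total = None               # running per-pixel sum

    def update(self, frame):
        f = np.asarray(frame, dtype=np.float64)
        if self.total is None:
            self.total = np.zeros_like(f)
        if len(self.frames) == self.n:
            # The oldest frame is about to leave the window.
            self.total -= self.frames[0]
        self.frames.append(f)
        self.total += f
        # Assumption: before n frames are available, average what we have.
        return self.total / len(self.frames)

# Latency implied by the window: N frames at a given frame rate.
n_frames, fps = 250, 25
seconds_needed = n_frames / fps   # 10 seconds, as stated in the text
```

The window length trades background quality against delay: the estimate only reflects a scene change after the old frames have left the window.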

3. Kernel function and common kernel method 

3.1 Kernel function 
The kernel function is an important part of support vector machine theory and has attracted the interest of many researchers. The kernel functions commonly used that satisfy the Mercer condition are the linear, polynomial, radial basis and Sigmoid functions; selecting different kernels constructs different support vector machines. These kernel functions are briefly introduced as follows:
   

Linear function: $$K(x, x_i) = \langle x, x_i \rangle \tag{5}$$

Sigmoid function: $$K(x, x_i) = \tanh\left[ v \langle x, x_i \rangle + a \right] \tag{6}$$
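For concreteness, the linear and Sigmoid kernels of Eqs. (5) and (6) can be written directly with NumPy. This is a minimal sketch; the function names and the default values of `v` and `a` are illustrative assumptions, since the paper gives no parameter values for these two kernels.

```python
import numpy as np

def linear_kernel(x, xi):
    """Eq. (5): the inner product <x, x_i>."""
    return float(np.dot(x, xi))

def sigmoid_kernel(x, xi, v=1.0, a=0.0):
    """Eq. (6): tanh(v * <x, x_i> + a). The defaults for v and a are
    placeholders chosen for illustration only."""
    return float(np.tanh(v * np.dot(x, xi) + a))
```

Both functions take two feature vectors of equal length and return a scalar similarity.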

The mapping of a linear separable problem into a 3D feature space, in which it is linearly separable, is shown in Figure 2.

 
Figure 2: Linear separable problem mapping to 3D feature space linear separable 

The discriminant function is normalized so that all samples of both classes satisfy | f(x) | ≥ 1; the samples nearest to the classification surface then have | f(x) | = 1. For all samples to be classified correctly, the following must hold:

$$y_i \left[ (w \cdot x_i) + b \right] - 1 \ge 0, \quad i = 1, 2, \ldots, N \tag{7}$$

The optimal classification surface problem can then be expressed as the following minimization subject to the constraints above:

$$\min \Phi(w) = \frac{1}{2} \| w \|^2 \tag{8}$$
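The constraint in Eq. (7) and the objective in Eq. (8) can be checked numerically for a candidate hyperplane. The sketch below only evaluates them; it does not solve the optimization problem. The function names and the two-sample toy data in the usage note are assumptions made for illustration.

```python
import numpy as np

def satisfies_constraints(w, b, X, y):
    """Eq. (7): y_i * ((w . x_i) + b) - 1 >= 0 for every sample."""
    w = np.asarray(w, dtype=np.float64)
    margins = np.asarray(y) * (np.asarray(X) @ w + b)
    return bool(np.all(margins - 1 >= 0))

def objective(w):
    """Eq. (8): Phi(w) = 0.5 * ||w||^2."""
    w = np.asarray(w, dtype=np.float64)
    return 0.5 * float(np.dot(w, w))
```

For two samples at x = (2, 0) with label +1 and x = (-2, 0) with label -1, the hyperplane w = (0.5, 0), b = 0 meets Eq. (7) with equality, while w = (1, 0) is also feasible but has a larger objective, so the minimization prefers the former.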

3.2 GF space and new polynomial kernel 

Let $P_n^1$ be the space of real polynomials of degree at most n in a single variable u. Then an arbitrary polynomial f in $P_n^1$ can be uniquely determined by n+1 coefficients $f_0, f_1, \ldots, f_n$:

$$f(u) = \sum_{k=0}^{n} \frac{f_k}{k!} u^k$$

That is, the polynomial f is completely defined by its n+1 coefficients.

Construction of the GF space $F_n^1$ containing $P_n^1$ requires the following two steps:

(a) Let $\rho = (\rho_0, \rho_1, \ldots, \rho_n)$ be a set of positive, bounded weighting constants, determined by the prior information of the problem to be solved.

(b) Let f and g be any members of the GF space $F_n^1$, with $g_k$ the coefficients of g. Then the dot product on $F_n^1$ can be defined as:

$$\langle f, g \rangle_{F_n^1} = \sum_{k=0}^{n} \frac{\rho_k}{k!} f_k g_k \tag{9}$$

The real kernel K of two real variables u, v can be expressed as:  

$$K(u, v) = \sum_{k=0}^{n} \frac{1}{k! \, \rho_k} (uv)^k \tag{10}$$
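The dot product of Eq. (9) and the kernel of Eq. (10) are short enough to evaluate directly. The sketch below does so for scalar u and v, with a polynomial represented by its coefficient list. The function names are hypothetical. The usage note shows one relation that follows algebraically from the two equations; it is offered as a consistency check, not as a statement of the properties the paper has in mind.

```python
from math import factorial

def evaluate(f, u):
    """f(u) = sum_k f_k / k! * u^k for a coefficient list f = [f_0, ..., f_n]."""
    return sum(f[k] / factorial(k) * u ** k for k in range(len(f)))

def gf_inner(f, g, rho):
    """Eq. (9): <f, g> = sum_k rho_k / k! * f_k * g_k."""
    return sum(rho[k] / factorial(k) * f[k] * g[k] for k in range(len(rho)))

def gf_kernel(u, v, rho):
    """Eq. (10): K(u, v) = sum_k (u v)^k / (k! * rho_k)."""
    return sum((u * v) ** k / (factorial(k) * rho[k]) for k in range(len(rho)))

def kernel_section(v, rho):
    """Coefficients g_k = v^k / rho_k of the polynomial u -> K(u, v)."""
    return [v ** k / rho[k] for k in range(len(rho))]
```

With these definitions, `gf_inner(f, kernel_section(v, rho), rho)` equals `evaluate(f, v)`, because the weights $\rho_k$ cancel term by term. Choosing larger $\rho_k$ for a given k reduces the contribution of the k-th power to the kernel.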

The kernel function K has two important properties.  



4. Experiment and analysis

4.1 Indoor scene experiment 
The experimental data is a video of an indoor environment with an image size of 240x320 and a frame rate of 10 frames per second. There are windows in the scene, so the indoor illumination is influenced by both the outdoor natural light and changes in the indoor lighting. The outdoor natural light changes the illumination gradually over time, while the indoor illumination is changed by switching different numbers of lights on and off. This creates a scene with changing light, which is used to test how the algorithm detects foreground objects under different indoor illumination conditions.
In the training phase, the reference background image and a known image under the indoor lighting are used as the training images. To facilitate comparison and operation of the algorithm, the default kernel function of KICA is the Gaussian kernel, the kernel width is 1, and the normalized estimate is 0.02. The training phase images are shown in Figure 3: (a) is the reference background, (b) is the known image containing the foreground, (c) is the whitened reference background, (d) is the whitened foreground image, (e) is the image separated by KICA, and (f) is the foreground object obtained by binarizing the foreground image separated by KICA.

   
(a)                                           (b)                                         (c)                                         

   
(d)                                           (e)                                               (f) 

Figure 3: Indoor training phase image 

Besides the movement of the object itself, the changed area of the image also includes background areas that are newly covered or uncovered; combining the changed area with other features of the moving image allows the moving target to be extracted effectively. The key to this class of methods is to detect the motion region first, and whether the motion-changed region is detected accurately determines the accuracy of the final moving object. The moving target area image generator, composed of bit-plane analysis, a noise filter, the bit difference operation and binarization, is shown in Figure 4.

[Figure 4 diagram. Block labels: input image I(x, y, k); frame memory; gray level image bit plane (layer) analysis; bit plane delamination into bp_0(x, y, k), bp_1(x, y, k), bp_2(x, y, k), ..., bp_7(x, y, k); XOR of each bit plane; (x, y) domain blocks of the bit plane; ratio of the number of connected regions among the blocks; filtering criterion; filtering threshold; filtering out the details and noise of the (x, y) domain image by the proportion of connected regions; OR operation merge of each bit plane; binary image of moving target area.]
 

Figure 4: The image generator of bit difference operation 



The most important step of the system is extracting the motion sub-regions. The contribution of this paper is a method that filters out noise in the image bit planes based on a parameter α, and thereby detects the motion regions automatically.
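The bit difference operation described for Figure 4 (bit-plane decomposition, XOR of corresponding planes of two frames, OR merge into a binary image) can be sketched as follows. The α-based filter on connected regions is not reproduced, because the text does not define it; in its place this sketch simply ignores the low-order planes through a hypothetical parameter `min_plane`, which is a stand-in and not the paper's method.

```python
import numpy as np

def bit_planes(img):
    """Split an 8-bit gray image into its 8 bit planes, least significant first."""
    img = np.asarray(img, dtype=np.uint8)
    return [(img >> b) & 1 for b in range(8)]

def motion_mask(prev, curr, min_plane=4):
    """XOR corresponding bit planes of two frames and OR-merge the results.

    Planes below min_plane are skipped as a crude noise filter. This is an
    illustrative substitute for the connected-region filter of Figure 4.
    """
    prev_planes = bit_planes(prev)
    curr_planes = bit_planes(curr)
    mask = np.zeros(np.shape(curr), dtype=np.uint8)
    for b in range(min_plane, 8):
        mask |= prev_planes[b] ^ curr_planes[b]   # 1 where the plane changed
    return mask.astype(bool)
```

One limitation of plain binary bit planes is worth noting: a small gray-level change that crosses a power-of-two boundary (for example 15 to 16) flips several planes at once, so some additional filtering is needed in practice.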

4.2 Outdoor scene experiment 
Table 1 reports vehicle identification results on an infrared image recognition database. The database was collected with infrared camera equipment on the road; Figure 5 shows two of its images, one infrared image without a car and one with a car.
Features are then extracted from the collected images, and each sample has 10 feature attributes. The first attribute is the aspect ratio of the target, which is needed because objects appear small when far away and large when near. The second attribute concerns the gray values: because the wheels rub against the ground and rotate fast, the wheels emit more infrared energy than the body. This attribute takes one of two values: 0 if the target shows distinct dark and light parts, and 1 if its dark and light gray levels are almost the same. The third to sixth attributes are speed features: four feature points that are easy to select are chosen in the image, and the number of pixels each point moves between two adjacent frames is calculated. The final four attributes are shape features: the proportions of the direction histogram in the horizontal, vertical, 45-degree and 135-degree directions. Because a car is generally box-shaped, with relatively more content in the horizontal and vertical directions, this characteristic makes it easy to distinguish cars from other objects. There are 548 samples in the database, of which 83 are positive (automobile) samples and 465 are negative samples. 17 positive samples and 93 negative samples are taken as the training sample set, and the remaining 438 samples form the test sample set.
 

Figure 5: The two images of infrared image 

Table 1: Database classification of infrared image recognition 

Kernel function          Gauss kernel   Conventional         MKL      GF Space polynomial   GF Space polynomial
                                        polynomial kernel             kernel (limit 1)      kernel (limit 2)
Kernel parameters        94             9                    -        β                     r
Positive detection rate  0.8563         0.8522               0.8702   0.8644                0.8699
Support vector number    8              8                    62       88                    23
Training time            9.1409         7.2446               1.7369   37.7623               1.1787

where β = 10^{-8} × 1 and r = [0.1927 0.1407 0.1009 0.2813 0.2844].
The polynomial kernel function of the GF space takes more time and has higher computational complexity. This is especially true under limit condition 1: because the problem is not determined to be convex, PSO is used for the update, the computational complexity depends on the preset number of update generations, and only a suboptimal solution can be obtained. In exchange for this complexity, the positive detection rate is slightly higher than that of the Gauss kernel and the conventional polynomial kernel. The difference in positive detection rate between the GF space polynomial kernel under limit conditions 1 and 2 can only be attributed to the different orders or the same order of the database.



Any line through the origin can reflect the information characteristics of the frequency domain (see Figure 6): although the local jitter is severe, the overall trend is still one of increase. Because deviations are likely to appear, this paper puts forward another idea for improvement: making full use of the fitting errors of the lines through the origin to balance the information, which can improve the precision of the parameters.

 
Figure 6: The information characteristics of frequency domain

5. Conclusions 

Robustly recovering the body's movement information from video is an important area of human motion analysis research. Since the human body is a non-rigid structure, the dynamics model of its movement is very complex, and self-occlusion and mutual occlusion of bodies occur in video; this makes video-based human motion tracking and reconstruction very difficult, and its accuracy is hard to ensure. Current research results simplify the problem considerably: the motion video is acquired only under certain conditions, and most work handles only simple periodic motion types such as walking and jogging. In this paper, we propose and implement a new video-based human motion reconstruction approach whose technical steps include reconstruction of 3D human body posture, video-content-based refinement of body posture, and key temporal modeling based on motion libraries. With it, accurate 3D motion information can be recovered from any given motion video.


 