Microsoft Word - ETASR_V12_N4_pp8872-8877 Engineering, Technology & Applied Science Research Vol. 12, No. 4, 2022, 8872-8877 8872 www.etasr.com Ozturk & Cayiroglu: A Real-Time Application of Singular Spectrum Analysis to Object Tracking with … A Real-Time Application of Singular Spectrum Analysis to Object Tracking with SIFT Ali Ozturk Department of Mechatronics Vocational School of Technical Sciences Mus Alparslan University Mus, Turkey a.ozturk@alparslan.edu.tr Ibrahim Cayiroglu Department of Mechatronics Faculty of Engineering Karabuk University Karabuk, Turkey icayiroglu@karabuk.edu.tr Received: 27 April 2022 | Revised: 31 May 2022 | Accepted: 1 June 2022 Abstract- This study combined SIFT and SSA to propose a novel algorithm for real-time object tracking. The proposed algorithm utilizes an intermediate fixed-size buffer and a modified SSA algorithm. Since the complete reconstruction step of the SSA algorithm was unnecessary, it was considerably simplified. In addition, the execution time of a Matlab implementation of the SSA algorithm was compared with a respective C++ implementation. Moreover, the performance of the two different matching algorithms in the detection, the FlannBasedMatcher and Brute-Force matcher algorithms of the OpenCV library, was compared. Keywords-object tracking; object detection; computer vision; SIFT; SSA I. INTRODUCTION Object tracking or visual tracking has been employed, studied, and commercialized in a broad spectrum of applications such as Simultaneous Localization And Mapping (SLAM), traffic surveillance, pedestrian counting, hand gesture recognition, tracking the path of protein stress granules in cells, and video compression [1]. Object tracking can be defined as dynamic object detection [2]. The main problems of object tracking are common with object detection, such as occlusion, clutter in the background, viewpoint changes due to affine transformation such as translation and rotation, scale changes (zoom in and out), and photometric deformation such as distortion, blur, and illumination changes. The most important challenge is still the speed or real-time performance of the online tracker, such as the tracker used in visual servoing in robotics [3]. On the other hand, object tracking has some problems of its own. An optimal algorithm for visual tracking can vary depending on the camera’s position, the number of cameras, and the number of moving objects. In the case of a stationary camera, background removal can simplify the task by eliminating the stationary pixels [4]. If the number of moving objects and the target object increase, then the task becomes complicated. Object tracking consists of two important steps: detecting the object of interest and tracking it in the following frames [5]. Object detection can be defined as template matching in which a template or a model is detected within a scene. Typically, a template is the picture of an object of interest. Detection can be defined as enclosing a given template in the scene in a rectangle. In object tracking, the basic task is to track the object as long as it exists in the scene. Scale Invariant Feature Transform (SIFT) is one of the most robust local descriptors which can be used in object tracking [6-8]. It has been utilized in a broad spectrum of applications, such as face recognition, gender recognition [9], and smoke detection [10]. It is a good candidate to tackle the problems of tracking, such as translation, rotation, occlusion, and scale. A method was proposed in [11] to track objects by combining SIFT with mean shift. In [12], SIFT was utilized with Kanade- Lucas-Tomasi Tracker (KLT) [13] for UAV-based applications. In this approach, the object was detected with SIFT, and its location was computed with KLT. Since the algorithms are not perfectly robust to all image transformations and deformations, the object of interest can be lost or detected with low accuracy during any object tracking task. In such cases, the pose can be recovered or improved by using filters or smoothing. Singular Spectrum Analysis (SSA) is a method of time series analysis based on multivariate statistics. It is also known as the principal component analysis of time series [14]. SSA is considered to be a non-parametric (model-free) filtering technique. However, it can also be used for modeling [15]. It has been mainly used in applications that require post- processing [16, 17]. SSA was combined with the Kalman filter to smooth real-time positioning data obtained from the Global Navigation Satellite System (GNSS) in [18]. However, there is no description of how SSA was implemented in real-time. A rank-one eigenvector update was proposed in a recursive framework in [19] for the identification of structural damage detection in real-time by utilizing First-Order Perturbation (FOP) for the Henkel matrix. This paper proposes an object-tracking algorithm utilizing SIFT and SSA. To the best of our knowledge, this is the first study that employs SIFT and SSA in object tracking. As SSA always requires a batch of data, its real-time implementation was carried out using a fixed-sized buffer. Since the real-time implementation of SAA does not require a full reconstruction step by diagonal averaging, the reconstruction algorithm was considerably simplified. Corresponding author: Ali Ozturk Engineering, Technology & Applied Science Research Vol. 12, No. 4, 2022, 8872-8877 8873 www.etasr.com Ozturk & Cayiroglu: A Real-Time Application of Singular Spectrum Analysis to Object Tracking with … II. METHOD This section investigates the SSA algorithm, provides the real-time implementation, describes object tracking with a template matching algorithm, and provides the details of the experiments. A. Singular Spectrum Analysis The SSA algorithm can be summarized in four steps: embedding, Singular Value Decomposition (SVD), grouping, and finally, reconstruction. In the embedding step, a Henkel matrix was constructed from the input signal. Each column of the Henkel matrix is a portion of the signal obtained by windowing with a selected window length. This process is also called a caterpillar since it behaves as such. If the length of the raw signal is n and the window length is l, the size of the Henkel matrix is l×k, where l is the rows, � is the columns, and k=n-l+1. Figure 1 shows an example of the embedding step. In this example, a Henkel matrix is created from an input signal. The size of the signal is 10, and the window length is 4. Therefore, the size of the resulting matrix is 7×4. Fig. 1. An example of the embedding step. In the SVD step, the singular value decomposition of the trajectory matrix is computed. If X is the Henkel matrix, then the trajectory matrix is XX T , where X T is the transpose of X. The Henkel matrix can be written as a sum of different and independent matrices with the aid of SVD. This decomposition can be stated as follows: � � �� � �� � ⋯ � �� (1) where each Xi is called an elementary matrix. Each elementary matrix Xi is calculated as follows: � � � � � , � � 1, ⋯ , � (2) where Ui denotes the eigenvectors of SVD, and each vector Vi is calculated as follows: � �� �� �� , � � 1, ⋯ � (3) Each elementary matrix � corresponds to one of the eigenvalues of SVD. It should be noted that: ‖� ‖� � � , � � 1, ⋯ , � (4) The contribution of each elementary matrix to X is proportional to its corresponding eigenvalue λi. The eigenvalues are sorted in decreasing order. In SSA, elementary matrices corresponding to small eigenvalues are considered to be the noise portion of the signal. This portion is excluded in the grouping step: �� � ∑ � � �� (5) where r