5 Motor Imagery signal Classification for BCI System Using Empirical Mode Décomposition and Bandpower Feature Extraction Dalila Trad ESSTT, LaTICE 5, Avenue Taha Hussein, B. P. : 56, Bab Menara, 1008 Tunisia trad.dalila@gmail.com Tarik Al-Ani Dep. Informatique, Cit Descartes-BP 99, 93162 Noisy-Le-Grand, UVSQ, LISV, 10/12 Avenue de /Europe,78140 Velizy, France and ESIEE-Paris, France. t.alani@esiee.fr Mohamed Jemni ESSTT, LaTICE 5, Avenue Taha Hussein, B. P. : 56, Bab Menara, 1008 Tunisia mohamed.jemni@alecso.org.tn Abstract The idea that brain activity could be used as a communication channel has rapidly developed. Indeed, Electroencephalography (EEG) is the most common technique to measure the brain activity on the scalp and in real-time. In this study we examine the use of EEG signals in Brain Computer Interface (BCI). This approach consists of combining the Empirical Mode Decomposition (EMD) and band power (BP) for the extraction of EEG signals in order to classify motor imagery (MI). This new feature extraction approach is intended for non-stationary and non-linear characteristics MI EEG. The EMD method is proposed to decompose the EEG signal into a set of stationary time series called Intrinsic Mode Functions (IMF). These IMFs are analyzed with the bandpower (BP) to detect the characteristics of sensorimotor rhythms (mu and beta) when a subject imagines a left or right hand movement. Finally, the data were just reconstructed with the specific IMFs and the bandpower is applied on the new database. Once the new feature vector is rebuilt, the classification of MI is performed using two types of classifiers: generative and discriminant. The results obtained show that the EMD allows the most reliable features to be extracted from EEG and that the classification rate obtained is higher and better than using the direct BP approach only. Such a system is a promising communication channel for people suffering from severe paralysis, for instance, people with myopathic diseases or muscular dystrophy (MD) in order to help them move a joystick to a desired direction corresponding to the specific motor imagery. Keywords: Brain Computer Interface, motor imagery, Bandpower, Empirical Mode Decomposition, Hidden Markov Model, Support Vector Machines, Cohens kappa coefficient. 1. Introduction A BCI system is a communication and control pathway between a brain and an external device. It does not require any external devices or muscle intervention to issue commands and complete the interaction (Abdulkader et al., 2015). This system primarily concerns the medical domain and mainly the field of disability. So, the major goal of the BCI research is to provide severely disabled people with a new communication channel, which is not based on the traditional motor output channels (Van Erp et al., 2012). A BCI system is represented as a system in a continuous closed, generally composed of six steps, (see fig 1): 1. Brain activity measurement, 2. Preprocessing, 3. Feature Extraction, 4. Classification, 5. Translation into a command and Feedback. BRAIN. Broad Research in Artificial Intelligence and Neuroscience Volume 7, Issue 2, June 2016, ISSN 2067-3957 (online), ISSN 2068 - 0473 (print) 6 Figure 1. General architecture of an online (BCI) One major challenge of our BCI system is to describe the signals EEG by a few relevant values called features i.e. step 3 in Fig (1). The success of the mental imagery classification depends on the choice of features used to characterize the raw EEG signals. These features can then be used in step 4 in order to classify the user’s mental state. Several approaches for feature extraction have been proposed in literature. These approaches are based on time, frequency or time-frequency methods (Al-ani & Trad, 2010; Pfurtscheller, 2004). A common way to gain BCI control is to use motor imagery of left and right hand movement, which is based on Event-related desynchronization/synchronisation (ERD/ERS) in specific frequency bands. For instance, imagination of left or right hand movement results in amplitude causes an attenuation (Event-related desynchronization (ERD)) of (8-13Hz) and central (13-30Hz) rhythms at the contra-lateral sensorimotor representation area and, in an amplitude increase (event-related synchronization (ERS)) within the band (30-40Hz) at the ipsi- lateral hemisphere (Pfurtscheller, 1999; Neuper & Pfurtscheller, 1999). Several common band power techniques were employed in the BCI literature. Herman et al. (Herman et al., 2008) demonstrated that the Yule and Welch PSD approaches, mainly dominate the other studied ones. These approaches are based essentially on some linearity and stationary hypothesis such as the use of fast Fourier transform (FFT) spectrum in a short-time segment of data. The accuracy of the FFT calculation is closely related to the choice of the duration of the signal segment. However, the nature of the EEG signal is non stationary and nonlinear. The main non-stationary and nonlinear feature extraction technique is the Wavelet Transform (WT) (Samar et al., 1999). Various kinds of wavelets have been used for BCI, such as Morlet wavelets (Lemm et al., 2004) and wavelet packet decomposition (Hettiarachchi et al. 2014). Despite being more effective than the FFT, WT approach shows a much bigger ambiguity in signal decomposition. However, it cannot provide higher resolution both in time and frequency domain, besides, the decomposition of signal is not adaptive. In this paper, we applied a recent technique proposed by Huang et al. (Huang et al. 1998), called the empirical mode decomposition (EMD) for nonlinear and non-stationary time series data for pattern extraction from motor imagery EEG of left and right hand imaginary movement. This method EMD is a data driven approach (i.e. one does not need to define a mother wavelet beforehand) that can be used to decompose adaptively a signal into a finite number of mono- component signals, which are known as intrinsic mode functions (IMF s) or modes. It considers signals at their local oscillations, but they are not necessarily considered in the sense of Fourier harmonics. Their extraction is non-linear, whereas their recombination for exact reconstruction of the signal is linear. The IMFs admit well-behaved Hilbert transforms (HT) (Long et al. 1995) and they satisfy the following properties: they are symmetric, different IMFs yield different instantaneous local frequencies as functions of time that give sharp identifications of embedded structures. In this work, we propose a hybrid approach combining the EMD and BP for feature extraction from the EEG signals. We first apply the EMD to select only the IMFs corresponding to sensorimotor rhythms (mu and beta) using Welch-based PSD to extract the reliable information of EEG during left and right hand movement imagination. Based on these new features, the experimental using two classifiers hidden Markov models (HMMs) and Support Vector Machines (SVM) results show that the proposed method improves recognition rate greatly. D. Trad, T. Al-Ani, M. Jemni - Motor Imagery signal Classification for BCI System Using Empirical Mode Décomposition and Bandpower Feature Extraction 7 2. Methods 2.1. Participants and experimental paradigm The EEG signals used for this experiment were recorded by a real-time EEG acquisition hardware g.GAMMAsys active electrode System along with a g.USBamp amplifier g.tec, Guger Technologies. In this study, the EEG data of ten subjects (two young females and seven young males with ages ranging from about 22 to 35 years), recorded during imagination of the left and right hand movements. A subject training session in our work consisted of one experimental run of 40 trials with randomized direction of the cues (20 left-hand imagination and 20 right-hand imagination). For each subject, we used only 4 out of the best sessions (i.e. 160 trials/subject). Therefore, each subject is seated in a comfortable armchair 150 cm in front of a computer screen. At the beginning of each trial (t = 0 s), a fixation cross-appeared on the black screen. After two seconds a warning stimulus was given in the form of a beep. From 3 to 4.25s, an arrow (cue stimulus), pointing to the left or right, was shown on the screen. The subject was instructed to imagine a left or right hand movement until the end of the trial, depending on the direction of the arrow. The EEG was sampled and classified on-line throughout the session. Between 4.25 and 8s, the classification result was used to give a continuously updated feedback stimulus in the form of a horizontal bar that appeared in the center of the screen. The paradigm is illustrated in fig (2). Figure 2. Timing of one trial of the experiment with continuous feedback (Guger et al, 2001) 2.2. Preprocessing Two bipolar EEG channels were measured over C3 and C4 according to the international 10- 20 System (Jasper 1958). The left ear served as reference and the FPz as ground. The signals were sampled at 256 Hz and bandpass-filtered between 0.5 Hz and 30 Hz. An additional 50 Hz notch filter was enabled to suppress line noise. 2.3. Feature extraction Once the raw EEG data is recorded and pre-processed, the following step is to extract its intrinsic features. This step is essential in the functioning of BCIs due to the important measured amount of brain activities. The aim of this step is then to find a better representation of the EEG signal while keeping the most relevant properties corresponding to the performed mental imagery. This information is called features. In this work, we propose a robust method bases on BP and EMD to extract the relevant EEG features corresponding to MI. 2.3.1. Bandpower The features may be extracted from the EEG signals by estimating the power distribution of the EEG in predefined frequency bands. In general, the band power is estimated by digitally band- BRAIN. Broad Research in Artificial Intelligence and Neuroscience Volume 7, Issue 2, June 2016, ISSN 2067-3957 (online), ISSN 2068 - 0473 (print) 8 pass filtering a signal in a given frequency band, then squaring the filtered signal and finally averaging the obtained values over a given time window. Pfurtscheller et al. (Pfurtscheller et al.,1997) used the BP and demonstrated that for each subject, different frequency components were found in the and bands which provided best discrimination between left and right hand movement imagination. These frequency bands varied between 9 and 14 Hz and between 18 and 26 Hz. Such features have been notably used with success for motor imagery classification. 2.3.2. Empirical mode decomposition (EMD): The traditional EMD was recently proposed as an adaptive time-frequency data analysis method (Huang et al., 1998). An algorithm based on an empirical framework defines it. The basic EMD is defined by a process called sifting to break down any multimodal signal to a sum of basis components called intrinsic mode functions (IMFs). The IMFs are zero-mean AM-FM signals which must satisfy two conditions: the first one is that the number of extrema and that of zero-crossing must differ at most by one; the second one is that the mean value between the upper and lower envelopes are equal to zero at any point. Conceptually, the establishment of this method is quite simple: one needs to consider a signal at its local oscillation level, remove the fastest oscillation and iterate the process on the residue considered as a new signal. At the end of the sifting processes, a given signal x(t) can be written as a sum of a finite number of IMFs, Im(t), m = 1. 2, .... M, and a final residue rM(t) (Equation1): (1) The decomposition is stopped at step M, if either the residue is a mono-component signal or has only 2 extrema. The stopping criterion must be set to ensure that the obtained signal satisfies the properties of an IMF while limiting the number of iterations. For more details about the different steps of the sifting process for the calculation of the IMFi as well as the stopping criterion definition see (Huang et al., 1998). Since the decomposition into IMFs is based on the local characteristic time scale of the data, it applies to nonlinear and non-stationary processes. The IMFs admit well-behaved Hilbert transforms (HT) and they satisfy the following properties: they are symmetric, different IMFs yield different instantaneous local frequencies as functions of time that give sharp identifications of embedded structures. The decomposition is done linearly or non-linearly depending on the data. This complete and almost orthogonal decomposition is empirically realized by identifying the physical local characteristic time scales intrinsic to these data, which is the lapse between successive extrema. 2.3.3. EMD and BP for motor imagery In this work, we propose a direct nonlinear approach to extract the more relevant IMFs corresponding to the different frequency components in the and bands and then obtain the BP in order to use them as features for mental task classification (see Fig. 3). The feature vector pi used for the demonstration in this paper is composed, for each sample I, 1 < i < 2048, in a given trial (among a total of 160 trials) of four bandpower, calculated of the rhythms and in positions C3 and C4 (Trad et al., 2011). Fig. 4 shows the EMD decomposition result of one-trial (left hand movement imagination) for subject 2 in the channels C3 and C4 respectively (the pre-filtered EEG signal used for this illustration is not corrupted by blinking artifact.). Each channel is decomposed into ten IMFs and one residue. )()()( 1 trtItx M M m m D. Trad, T. Al-Ani, M. Jemni - Motor Imagery signal Classification for BCI System Using Empirical Mode Décomposition and Bandpower Feature Extraction 9 Welch based PSD method was applied to analyze the different characteristics of each IMF. This method was applied to each IMF to find the active frequency bands such as the and rhythms. Fig. 5 shows the PSD in each IMF shown in Fig.4. The major advantage of EMD is that the input signal may be decomposed directly and adaptively to basic functions (IMFs), each with a distinct time scale. The IMFs are ordered in increasing time scales, i.e., decreasing frequency. Based on this property, we can notice that the characteristics of the active frequency bands corresponding to [8-12Hz] and [13-30Hz] are located only in IMF 1, IMF 2 on C3 and C4. Therefore, the new signal is reconstructed by keeping only the two first IMFs. EMD also allows eliminating the artifacts in the EEG during the recording sessions like eye blinks and eyeball movements. In Fig 5, we noted that ocular artifact frequency is generally low around 5Hz with high amplitude. This artifact appears mainly in IMF3 and IMF4. Finally, band power was applied for the new signal. As a last step, the logarithm of the BP is calculated in order to transform the distribution of this feature to a more Gaussian like shape, because the classifiers we used, such as HMMs and SVM assume normally distributed features. a) Figure 3. Hybrid EMD-BP approach for one trail feature extraction BRAIN. Broad Research in Artificial Intelligence and Neuroscience Volume 7, Issue 2, June 2016, ISSN 2067-3957 (online), ISSN 2068 - 0473 (print) 10 b) Figure 4. The EMD decomposition results for subject 2 when he imagines left hand movement. From top-left to down-right: the raw signal, the ten IMFs and the residue in channel C4 (a) and in C3 (b) a) D. Trad, T. Al-Ani, M. Jemni - Motor Imagery signal Classification for BCI System Using Empirical Mode Décomposition and Bandpower Feature Extraction 11 b) Figure 5. PSD (dB/Hz) vs freaquency (Hz) of each IMF showen in fig 4 in channel C4 (a) and in C3 (b) 2.4. Classification The goal of the classification step is to assign automatically a class to the feature vector previously extracted. In this study, four feature sets for each feature extraction approach, BP and EMD+BP, were used for classification and for test. Two sets of feature vectors; one for LMI and the other for RMI were used to train the classifier (creating one model for each MI) while the two other sets of feature vectors, one for left and the other for right MI were used for test. Each set contains 40 feature matrices each of dimension 4x2048. We evaluated our feature extraction approach by using 2 classifiers. 2.4.1. Hidden Markov Model (HMM) The HMMs approach is very efficient nonlinear technique used for the classification of time series (Rabiner, 1989). It necessitates two stages: a training stage where the stochastic process models are estimated through extensive observation corpus and decoding or detection stage where the model may be used off/on-line to obtain the likelihoods of the given test sequence evaluated by each model. A HMM is defined by the following compact notation to indicate the complete parameter set of the model = (II, A, B), where II, A and B are the initial State distribution vector, matrix of State transition probabilities and the set of the observation probability distribution in each State, respectively: II = [II 1, II 2, ..., II Ns], II i = P(q1 = si), A = [aij ], aij = P(qt+1 = sj | qt = si). Where 1 ≤ i, j ≤ N, si, sj S, S = {s1, s2, ..., sN}, t {1, 2, ..., T }. The observation at time (or index) t, Ot, is considered in this paper as continuous or real-valued K-dimensional vector Ot R K , 1 ≤ t ≤ T, 1 ≤ t ≤ T. For a continuous observation, the State conditional probability of the observation bi(Ot) may be defined by a finite mixture of any log-concave or elliptically symmetric probability density function (pdf), e.g. Gaussian pdf, with State conditional observation mean vector µi and State conditional observation covariance matrix Si. In this paper we consider only a single multivariate Gaussian pdf, so B may be defined as B = {µi, Si} i = 1,2, ... ,Ns. At each instant of time t, the model is in one of the States i, 1 ≤ i ≤ Ns. It outputs Ot according to a density function bi(Ot) and then jumps to State j, 1 ≤ j ≤ Ns with probability aij. The State transition matrix defines the structure of the HMM (Rabiner, 1989). The model may be obtained off-line by a given training procedure. In practice, given an BRAIN. Broad Research in Artificial Intelligence and Neuroscience Volume 7, Issue 2, June 2016, ISSN 2067-3957 (online), ISSN 2068 - 0473 (print) 12 observation sequence O = {01, O2, ..., Ot}, and an initial model , the HMMs need three fundamental problems to be solved: 1. How to calculate the likelihood P(O|)? The solution to this problem provides a score of how O belongs to . 2. How to determine the most likely State sequence that corresponds to a given observation sequence O? The solution to this problem provides the sequence of the hidden States corresponding to the given observation sequence O. 3. How to adjust the model in order to maximize P(O|)? This is the problem of estimating the model parameters given a corpus of training observations sequences. Problems 1 and 2 are solved in the decoding or classification stage using the forward or the Viterbi algorithms (Rabiner, 1989), while problem 3 is solved during the training phase using either a conventional algorithm such as the Baum-Welch algorithm (Rabiner, 1989) or other optimization- based algorithm, e.g., (Al-ani & Hamam, 1978). Our training scheme is based on Baum-Welch training algorithm and the Bayesian Inference Criterion (BIC). This scheme makes the training procedure independent of the a priori knowledge of the structure of each HMM needed in the Baum-Welch algorithm. The decoding or classification may be realised on-line using the stochastic dynamic programming based Viterbi algorithm. This algorithm needs all the constructed HMMs to classify on-line a given observation sequence. In our work, two HMMs were built for RMI and LMI using 40 training feature sequences from each MI. Each sequence is composed of T = 2048 feature vectors of dimension K = 4 each. 2.4.2. Support Vector Machines (SVM) SVM was first introduced by Vapnik (Vapnik, 1998) (Vapnik, 2000). It is based on two fundamental principles: 1. Maximum margin which is the distance between the border of separation and the closest samples. In SVMs, the boundary of separation is chosen as the one that maximizes the margin. This is justified by the theory of Vapnik-Chervonenkis (or statistical theory of learning) (Vapnik, 1998). The problem is to find the optimal separating boundary from a given training set. This is done by formulating the problem as a quadratic optimization problem for which there are known algorithms. 2. In order to deal with cases where the data are not linearly separable, we increase of the size of the input space. To be able to handle nonlinear separable data, we transform the space of input data to another space with larger size in which the probability that a linear boundary exists is higher. This transformation is realised by using a nonlinear kernel function. Several kernel functions may be used to design the SVMs. In our work, the Gaussian Radial Basis Function (RBF): K(x,y) exp ||xy || 2 2 (2) With a scaling factor, , of 1 was selected as it gives the most accurate results. In our work, two classes SVM was built for RMI and LMI using 40 training feature sequences from each MI. Each sequence is composed of T = 2048 feature vectors of dimension K = 4 each. 2.5. Statistical performance evaluation method: Cohen’s kappa coefficient Training the classifiers presented above was designed to minimize the classification error or to increase the classification rate usually measured by the rate of correctly classified trials. To interpret the classification performance, we take as reference the percentage rate reached by a random classification. Indeed, evaluating the performance of a classifier, based on a given features, is an important issue since its performance may be used for simplifying human training. The evaluation the performance of our feature extraction is based on the performance of the D. Trad, T. Al-Ani, M. Jemni - Motor Imagery signal Classification for BCI System Using Empirical Mode Décomposition and Bandpower Feature Extraction 13 classification results on RMI and LMI. In order to compare our new feature extraction method results with the traditional feature extraction method results, we used the Cohen’s kappa coefficient (Cohen, 1960). It allows to measure the agreement between two classifiers in classification into two categories. The value of the Cohens K ranges between 1 and -1, where 1 corresponds to perfectly correct classification and values less than 1 imply less than perfect agreement. Let P0 the observed agreement among two classifiers and Pe the expected agreement. The Cohens kappa K is defined by: K Po Pe 1Pe (3) 3. Results and discussion 3.1. Confusion matrices The recognition rate calculated by HMM and SVM for the 10 subjects is represented by the confusion matrix (Table 1) for the two feature extraction methods direct BP and EMD + BP. It is clearly seen that the combination of the two features methods, EMD and BP gives the best classification rates for almost all subjects. Based on these matrices, the performance of our approach was evaluated using K scores. Table 1. Confusion matrix: recognition rates calculated by HMMs and SVM in the case of RMI and LMI. These results are showen for both methods of feature extraction (BP and EMD+BP) for all subjects. Subject FE true class HMM - predicted class SVM –predicted class predicted class 1 RMI LMI RMI LMI BP RMI 70 30 86.25 13.75 LMI 2.5 97.5 36.25 63.75 EMD+BP RMI 90 10 85 15 LMI 2.5 97.5 2.50 97.50 2 BP RMI 77.5 22.5 76.25 23.75 LMI 7.5 923 40 60 EMD+BP RMI 75 25 75 25 LMI 2.5 97.5 25 75 3 BP RMI 57.5 42.5 61.25 38.75 LMI 35 65 33.75 66.25 EMD+BP RMI 72.5 27.5 61.87 38.13 LMI 40 60 25 75 4 BP RMI 71.42 28.58 55.62 44.38 LMI 37.15 62.85 42.50 75.50 EMD+BP RMI 71.42 28.58 77.50 22.50 LMI 42.85 57.15 82.75 71.25 5 BP RMI 52.5 47.5 76.25 23.75 LMI 42.5 57.5 33.13 61.87 EMD+BP RMI 72.5 27.5 76.25 23.75 LMI 40 60 27.50 72.50 6 BP RMI 80 20 67.50 32.50 LMI 50 50 32.50 76.50 EMD+BP RMI 85 15 68.12 31.88 LMI 47.5 523 17.50 8230 7 BP RMI 80 20 70.62 29.38 LMI 35 65 28.75 71.25 EMD+BP RMI 80 20 78.12 21.88 LMI 32.5 67.5 29.38 70.62 8 BP RMI 87.5 12.5 57.50 42.50 LMI 32.5 67.5 45.63 54.37 EMD+BP RMI 90 10 38.75 16.25 LMI 37.5 62.5 33.75 66.25 BRAIN. Broad Research in Artificial Intelligence and Neuroscience Volume 7, Issue 2, June 2016, ISSN 2067-3957 (online), ISSN 2068 - 0473 (print) 14 9 BP RMI 60 40 78.12 21.88 LMI 45 55 28.13 71.87 EMD+BP RMI 82.5 17.5 77.50 22.50 LMI 25 75 7.50 72.50 10 BP RMI 60 40 66.87 33.13 LMI 42.5 57.5 36.88 63.12 EMD+BP RMI 75 25 75 25 LMI 35 65 25 75 3.2. Cohen’s Kappa coefficient Table 2 gives the Kappa- values for each one of the ten subjects in order to evaluate our method of feature extraction based on EMD+BP. The average score obtained by the HMM classifier when the feature vectors are generated by EMD+BP is equal to 0.54 while that obtained when the feature vectors are generated by BP is equal to 0.4, i.e., an increase of 35%. For the SVM classifier, the average score increases from 0.34 to 0.52, i.e., an increase of 53% using the methods BP and EMD+BP respectively. Therefore, the use of the EMD+BP approach with the two classifiers gives to the Kappa scores a significant superiority compared to the direct BP approach. Table 2. K-values obtained by the two types of classifiers (C) HMMs and SVM with the two types of features extraction (FE) methods: BP and EMD+BP for 10 subjects during RMI and LMI as well as their mean values C FE 1 2 3 4 5 6 7 8 9 10 Mean k HMM BP 0.6 0.8 0.4 0.2 0.2 0.4 0.4 0.6 0.2 0.2 0.40 EMD+BP 0.8 0.8 0.6 0.4 0.4 0.4 0.4 0.6 0.6 0.4 034 SVM BP 0.6 0.4 0.2 0.2 0.4 0.4 0.4 0.2 0.4 0.2 0.34 EMD+BP 0.8 0.6 0.4 0.2 0.4 0.6 0.4 0.6 0.6 0.6 032 3.3. Translation into a command Once the motor imagery is identified, a command may be associated to this mental task in order to control a machine (Prataksita et al., (2014)) (Guger et al., 1999). In this work, we constructed a new Simuhnk/MathWork model to translate on-line the EEG signals into low-level commands. Fig. 6 shows our experimental EEG-based BCI System (Trad et al., 2015). 1. Off-line phase: In the first step, the EEG signals are recorded while subjects imagine a right and left hand movement. The second step is the preprocessing and feature extraction of EEG data. In this step, we implemented our method to extract the relevant features of the EEG. This method is based on the combination of EMD and BP. In the third step, we implemented the HMM classifier to assign a model to each motor imagery task: l for LMI and 2 for RMI. 2. On-line phase: Once the motor imagery is identified by one of the two models, a low- level command can be then associated to this mental task. This mechanism was implemented with Simulink/Mathworks. EMD + BP and Viterbi algorithm (Rabiner 1989) are implemented as an embedded function in Simulink in order to identify the motor imagery on-line. Viterbi algorithm has two inputs data, the first input is the EEG data and the second is the two models already constructed in the first step (offline). The Viterbi-based recognition result is translated into a command to reinforce the movement of the joystick (right or left) in order to help persons with myopathic diseases or muscular dystrophy to move this joystick to a desired direction. D. Trad, T. Al-Ani, M. Jemni - Motor Imagery signal Classification for BCI System Using Empirical Mode Décomposition and Bandpower Feature Extraction 15 4. Conclusion The work presented in this paper concerns the steps of feature extraction and classification for motor imagery in BCI framework. These steps are essential for the design of BCI Systems since these Systems represent a direct communication channel between the brain of a subject and a machine without any direct muscular intervention. Our approach is based on indirect, independent and asynchronous BCI Systems. Ten healthy subjects participated in the realization of this process by imagining two hand movements. During our work, we have studied the changes in frequency and amplitude of the EEG from each subject participating in our experiment. Changes in frequency distribution within the bands of sensorimotor and rhythms vary from one individual to another and evolve strongly over time. For this reason it is essential to carry a preliminary study for each subject. Once the values of sensorimotor rhythms are determined for each subject, it is easier to control subject's brain activity when one realizes motor imagery. This is achieved through the feedback returned to subject. EMD is particularly important in extracting directly from the EEGs the time courses of and rhythms. This facilitates the detection of the reactive ERD/ERS bands for individual subjects, i.e., no need to choose an individual subject specific frequency band. Another advantage of EMD is not only that it is a data driven approach but also it has the advantage of removing or reducing the artifacts and the noise affecting the EEG signal. Therefore, we deduce that the classification rate in the two movements imagination are better using EMD+BP approach than the direct BP approach. When subject motor imagery is recognized, we translated the EEG signal to a low-level command. This System could allow subjects who suffer from severe motor disabilities to better reinforce a joysticks movement to right or left. References Abdulkader, S. N., Atia, A., & Mostafa, M.-S. (2015). Brain computer interfacing: Applications and challenges. Egyptian Informatics Journal 16, 213–230 Van Erp, J., Lotte, F., & Tangermann, M. (2012). Brain-computer interfaces: Beyond medical applications. IEEE Computer 45, 26–34 Al-ani, T. & Trad, D. (2010). Signal processing and classification approaches for brain-computer interface. In Intelligent and Biosensors, ed. V. S. Somerset (InTech), 25–66 Pfurtscheller, G. (2004). Brain-computer interface - state of the art and future prospects. In Proceedings of European Signal Processing Conference (Vienna, Austria), 509–510 Figure 6. The general conception of our asynchronous system BCI (offline - online) for reinforcement of a joystick movement BRAIN. Broad Research in Artificial Intelligence and Neuroscience Volume 7, Issue 2, June 2016, ISSN 2067-3957 (online), ISSN 2068 - 0473 (print) 16 Pfurtscheller, G. (1999). Eeg event-related desynchronization (erd) and eventrelated synchronization (ers). Electroencephalography: Basic Principles, Clinical Applications and Related Fields, 4th ed , 958–967 Neuper, C. & Pfurtscheller, G. (1999). Motor imagery and erd related desyncronization. In Eventrelated Synchronisation, eds. G. Pfurtscheller and E. F. H. Lopes da Silva (Amsterdam Handbook of Electroencephalography and Clinical Neurophysiology. Revised Edition: Elsevier), 303–525 Herman, P., Prasad, G., McGinnity, T., & Coyle, D. (2008). Comparative analysis of spectral approaches to feature extraction for eeg-based motor imagery classification. IEEE Transactions on Neural Systems and Rehabilitation Engineering 16, 317–326 Samar, V., Bopardikar, A., Rao, R., & Swartz., K. (1999). Wavelet analysis of neuroelectric waveforms: A conceptual tutorial. Brain and Language 66, 7–60 Lemm, S., Schafer, C., & Curio, G. (2004). BCI competition 2003data set iii: probabilistic modeling of sensorimotor mu rhythms for classification of imaginary hand movements. IEEE Transactions on Biomedical Engineering 51, 10771080 Hettiarachchi, I., Nguyen, T., & Nahavandi, S. (2014). Motor imagery data classification for bci application using wavelet packet feature extraction. In Neural Information Processing (Springer International Publishing), 519–526 Huang, N. E., Shen, Z., Long, S. R., Wu, M. C., Shih, H. H., Zheng, Q., et al. (1998). The empirical mode decomposition and the hilbert spectrum for nonlinear and non-stationary time series analysis. In Proceedings of the Royal Society. vol. 454, 903–995 Long, S. R., Huang, N., Tung, C., Wu, M., Lin, R., & Mollo-Christensen, Y., E Yuan (1995). The Hilbert techniques: an alternate approach for non-steady time series analysis. IEEE Geoscience Remote Sensing Soc. Lett. 3, 6–11 Guger, C., Schlogl, A., Neuper, C., Walterspacher, D., Strein, T. & Pfurtscheller, G. (2001) Rapid prototyping of an eeg-based brain-computer interface (bci), Electroencephalography and Clinical Neurophysiology 9, 49-58. Jasper, H. (1958). The ten-twenty electrode system of the international federation. In Electroencephalography and Clinical Neurophysiology. vol. 10, 370–375 Pfurtscheller, G., Neuper, C., Flotzinger, D. & Pregenzer, M. (1997). Eegbased discrimination between imagination of right and left hand movement. In Electroencephalography and Clinical Neurophysiology. vol. 103, 642–651 Trad, D., Al-ani, T., Monacelli, E. & Jemni, M. (2011). Nonlinear and nonstationary framework for feature extraction and classification of motor imagery. International Conference on Rehabilitation Robotics, IEEE, 1-6 Rabiner, L. R. (1989). A tutorial on hidden markov models and selected application in speech recognition. In Proceedings IEEE. vol. 77, 267–296 Al-ani, T. & Hamam, Y. (1978). A low complexity simulated annealing approach for training hidden markov models. International Journal of Operational Research (IJOR) 8, 483–510 Vapnik, V. N. (1998). Computational Learning Theory (John Wiley & Sons) Vapnik, V. N. (2000). The Nature of Statistical Learning Theory (2nd Ed., Springer) Cohen, J. (1960). A coefficient of agreement for nominal scales. Educational and Psychological Measurement 20, 37–46 Prataksita, N., Lin, Y-T., Chou, H-C. & Kuo, C-H. (2014). Brain-robot control interface: Development and application. Bioelectronics and Bioinformatics (ISBB), IEEE, 1-4 Guger, C., Harkam, W., Hertnaes, C. & Pfurtscheller, G. (1999). Prosthetic control by an EEG- based brain-computer interface (BCI). Proceedings AAATE, vol 45, 799-804. Trad, D., Al-ani, T., & Jemni, M. (2015). A feature extraction technique of eeg based on emd-bp for motor imagery classification in bci. In 2015 5th International Conference on Information Communication Technology and Accessibility (ICTA) (IEEE), 1–6