Estimating depths and dimensions of gravity sources through optimized support vector classifier (SVC) ANNALS OF GEOPHYSICS, 59, 3, 2016, S0319; doi:10.4401/ag-7029 S0319 Estimating depths and dimensions of gravity sources through optimized support vector classifier (SVC) Mohammad Ehsan Hekmatian1,2,*, Vahid Ebrahimzadeh Ardestani3, Mohammad Ali Riahi3, Ayyub Memar Koucheh Bagh2 1 Islamic Azad University, Faculty of Basic Sciences, Science and Research Branch, Tehran, Iran 2 Nuclear Fuel Cycle Research School, Nuclear Science and Technology Research Institute (NSTRI), Tehran, Iran 3 Institute of Geophysics, University of Tehran, Tehran, Iran ABSTRACT By researching and applying new methods we will be able to improve sig- nificantly estimation of shapes, dimensions and depths of gravity sources. After shapes estimation of gravity sources through support vector classi- fier (SVC) in our last research [Hekmatian et al. 2015], in this paper SVC is applied for estimating depths and dimensions of gravity sources. These estimations give us logical and complete initial guesses regarding shapes, depths and dimensions of gravity sources which are needed in more pre- cise interpretations and inversions of gravity sources. Also for better ap- plication of SVC, we selected more proper features using the technique called feature selection (FS). In this paper, we trained SVC with 320 syn- thetic gravity profiles for estimation of dimensions and depths of gravity sources. We tested the trained SVC codes by about 200 other synthetic and some real gravity profiles. The depths and dimensions of a well along with two ore bodies (three real gravity sources) are estimated during the testing process. 1. Introduction The dimensions and depths estimation of the gravity sources contains errors. Applying new methods can decrease the estimation errors. In this research, we show that SVC algorithm is a proper method for the mentioned estimations. Support vector classifier (SVC) algorithm is one of the pattern recognition (PR) algorithms and is consid- ered as a supervised learning algorithm (that should be trained) of PR [Van der Heijden et al. 2004]. Regarding producing training data in gravity, it should be men- tioned that we need to create several synthetic gravity profiles caused by synthetic gravity sources with differ- ent definite shapes, depths and dimensions. Then, val- ues of proper features from each of synthetic gravity profiles will be extracted (see Sections 2.2 and 3). These values are used to train the supervised learning algo- rithm (SVC) and this trained algorithm would be able to estimate dimensions and depths of gravity sources which their approximate shapes was estimated earlier. Neural network has been used frequently for grav- ity interpretation and other geophysical applications [Gret and Klingele 1998, Van der Baan and Christian 2000, Osman et al. 2006] unlike the SVC. SVC algorithm has very suitable properties such as single global solution and ability to use different ker- nels. It should be mentioned that one of these kernels is Gaussian (or RBF) kernel [see Van der Heijden et al. 2004]. It is proper for separating the objects of one class Article history Received April 11, 2016; accepted June 14, 2016. Subject classification: Gravity sources depths and dimensions, Gravity profile, SVC, Feature, FS. Figure 1. The boundary between star shape points and cross shape points shows an example of Gaussian (or RBF) kernel of SVC. Hor- izontal and vertical axes show 2 features used in the classification. that its features are distributed in feature space in a non- regular condition (see Figure 1 as an example). More usually the features of the objects related to each shape of gravity sources are distributed in the mentioned man- ner in the feature space. So this nonlinear kernel has been applied for shape estimation of gravity sources [Hekmatian et al. 2015]. Also according to [Moran and Lavrenko 2011], Minkowski kernel (see Section 2.1) mimics logical AND of variation in feature values, whilst Gaussian (or RBF) kernel is closer to logical OR. Estimating the shapes of gravity sources (separating the class of objects related to each gravity source shape) is a logical OR, since each shape is not related to other shapes. On the other hand, for estimation of dimensions or depths of different gravity sources, we are obliged to select two different classes of depths or dimensions. For example, suppose that we want to estimate the depths of gravity sources with spherical shapes with a radius of about 20 m, and we know that these gravity sources are located at a depth between 5 m to 1000 m. In these conditions we have two classes: one class is the spherical gravity sources with depth equal to 5 m, and the other class is the same gravity sources but with depth equal to 1000 m. So we should use a proper train- ing set consisting of these two classes. Also we should use a nonlinear kernel of SVC which can estimate the depth of the gravity sources by detecting the probability of each of the two mentioned classes for the gravity sources. For example, if one of the gravity sources is related to class 1 (with a depth of 5 m) with a probability equal to 0.2 and the other gravity source is related to class 2 (with a depth of 1000 m) with a probability equal to 0.8, then its esti- mated depth is (0.2*5) + (0.8*1000) = 801 m. So a kernel that mimics logical AND could be a proper one for es- timating depths and dimensions of gravity sources. Therefore, we applied Minkowski kernel for estima- tions of the depths and dimensions of gravity sources. For better application of SVC, we need to opti- mize SVC by selecting proper features. We have done that by applying FS. 2. Theories and algorithms 2.1. Support vector classifier (SVC) algorithm 2.1.1. Linear SVC Prior to linear SVC description we should mention that support vector machine (SVM) is a concept in com- puter science for a set of related supervised learning methods that analyse data and recognize patterns used for classification and regression analysis, also the origi- nal SVM formulations for classification is named SVC. SVC is clarified briefly in this and the next section ac- cording to Van der Heijden et al. [2004]. Linear (and nonlinear) SVC is considering only 2 classes of information that can be separated by a linear border. One class will be located on the one side of the linear border and the other will be located on the other side of the linear border. If there are more than 2 classes, all classes except one of them should consider as one class and the remaining one as the other class. We should know that in SVC the border between the 2 classes is not a line but a margin and this border line is in the centre of the margin. The linear discriminant function that is used in linear SVC is [Van der Heijden et al. 2004]: g(Zn)= w* Zn + b, (1) where Zn is the matrix of the values of the features (measurement vectors), w is the vector of coefficient and b is a scalar value. The rows of Zn are the features of each object (each gravity profile) and the columns of this matrix show the kind of features in all objects. Two classes (Cn = 1 and Cn = −1 are labels of two classes) are defined by Van der Heijden et al. [2004] as: wTZn+ b≥1 if Cn=+1 (2) wTZn+ b≤−1 if Cn=−1, (3) where wT means transpose of w. The length of the mar- gin between these two classes is shown in Figure 2. For more explanations regarding linear SVC, one of proper references is Van der Heijden et al. [2004]. 2.1.2. Nonlinear SVC At first some definitions should be mentioned. It should be mentioned that decision function is a func- tion that maps the measurements space (features space) HEKMATIAN ET AL. 2 Figure 2. The linear support vector classifier. The boundary be- tween 2 classes is not only a margin but also is a margin with max- imum width and this shows the single global solution of SVC [Van der Heijden et al. 2004]. 3 on to the set of possible classes. It should emphasized that in SVC only 2 classes exist. Therefore in linear and nonlinear SVC, decision function is consist of only 2 discriminant function (g(Z)≥1 and g(Z)≥−1 and notice to relations 1 to 3). To have nonlinear boundaries in SVC, the dis- criminant function should be defined as [Van der Heij- den et al. 2004]: g(Z) = wTy(Z) =∑ NSn=1 k(Z, Zn), (4) where k(Z, Zn) is the kernel (such as polynomial one) and NS is the number of samples in the training set. In relation 4, g(Z) = 1 and g(Z) = −1 show upper limit and lower limit of the margin of nonlinear SVC re- spectively (see Figure 2 for comparing with linear SVC). Minkowski kernel is a kind of nonlinear kernel [Duin et al. 2007]: k(Zn,Zm) = (∑ NS m=1|Zn− Zm| p) 1−p , (5) where in Equation (5) p is a parameter for increasing or decreasing the power. As we have mentioned earlier, we used this kernel in the depth and dimension esti- mations. 2.2. The features and features selection (FS) algorithm Feature is a kind of measurement value which is obtained from each object. In this paper, object is a grav- ity profile which is created by a specific gravity source. Features that can be used for defining the depths and di- mensions of the gravity sources are derivable from a principal gravity profile. Gret and Klingele [1998] noted that the principal gravity profile is the gravity profile passing through the maximum value along the anomaly and crossing the anomaly lines perpendicularly. Gravity profiles in this research mean principal gravity profiles. Some examples of the different features used to characterize the gravity anomalies are defined as [Gret and Klingele 1998]: F1 = Xg50/Xg75, (6) F2 = Xg75/Xg50, (7) F3 = Xginf/Xg75, (8) F4 = (Xg50− Xg66)/(Xg80− Xg66), (9) where in the above equations, gy is the value of gravity (g) at y% of maximum value of gravity gmax and Xgy is the value of X at gy (for more explanations we should say that in our research X is the horizontal distance in the direction of the profile from the location of gmax in the profile). Also Xginf is the value of X at ginf and ginf is the value of gravity (g) at inflection point of the grav- ity profile. In Figure 3 an example of a gravity profile and some of the features describing the depths and di- mensions of gravity sources are shown. Choosing proper features (features selection - FS) is very important while dealing with high misclassifica- tion in testing our classifier (more proper features are those features that when are used in classification leads to better separation of the classes). The high misclassi- fication means in our selected features space, the different classes are not separate properly. One of the enable FS al- gorithms is “inter/intra class distance” (IICD) and we have used this algorithm in our research. IICD algo- rithm like all other FS algorithms, has a criterion for se- lection of more proper features. This criterion is called performance measure (PM). PM is increased when the suitability of the features for classification is increased and vice versa in other words selection of more proper features is done when their related PM is bigger. PM of IICD is based on Euclidean distance between each of the two objects in the training set (all objects used for training SVC) in the features space. For more explana- tion about “inter/intra class distance” (IICD) algorithm, we should mention that as Van der Heijden et al. [2004] noted Ts is a training set with Ns samples (objects). The classes wk are represented by subsets Tk (subsets Tk are subsets of objects of Ts), each class having Nk samples. Measurement vectors in Ts without reference to their class are denoted by Zn (for each object we measure a DEPTH AND DIMENSION ESTIMATION USING SVC Figure 3. Some of the features (F6, F7, F8 and F9) describing the depths and dimensions of gravity sources [Gret and Klingele 1998]. set of features, this set of features is called measure- ment vector; the point location of each object in fea- tures space is defined with its measurement vector). Measurement vectors in Tk (i.e. vectors coming from class wk) are denoted by Zk,n. The sample mean of class wk is [Van der Heijden et al. 2004]: The sample mean of the entire training set is [Van der Heijden et al. 2004]: We can represent the distances between objects or samples (of the training set) by means of scatter matrices. A scatter matrix gives some information about the dis- persion of a population of samples around their mean. For instance, the matrix that describes the scattering of vectors from class wk is [Van der Heijden et al. 2004]: Sk supplies information about the average distance of the scattering in class wk. Averaged over all classes, the scatter matrix is [Van der Heijden et al. 2004]: This matrix is “the within-scatter matrix” as it de- scribes the average scattering within classes. In addition to this is “the between-scatter matrix” Sb that describes the scattering of the class-dependent sample means around the overall average [Van der Heijden et al. 2004]: We know that an individual number is a kind of matrix. So that when “the within-scatter matrix” (Sw) is an individual number, it shows the squared average dis- tance of the location of each object in one class from the average location of all objects of that class in fea- tures space for all the classes. In this case we name Sw as intraclass distance, and when “the between-scatter matrix” (Sb) is an individual number, it refers to the squared average distance of the average location of all objects of each class from the average location of all objects of all classes in features space. In this case we name Sb as interclass distance. The features that are more suitable for classification are the ones that their interclass distance is greater than their intraclass dis- tance. Such that a PM suited to express the separability of classes is the ratio between interclass and intraclass distance. Therefore features with greater PM are pre- ferred for classification (according to IICD algorithm). Also we should take into consider that the num- ber of possible subsets, q(D), consisting of D features between N features is [Van der Heijden et al. 2004]: We have used the FS approach as will be described in the following of the paper. 3. The procedure To conduct this research, we needed to program a number of software for producing synthetic gravity profiles, extracting features from them, selecting proper features and training SVC for the shape estimation of the gravity sources. There is more explanation in our last research [Hekmatian et al. 2015] regarding these software. Also the needed SVC codes (for the depth or dimension estimation) for this research are implied dur- ing the procedure explanations in the following. The overall procedure which we used is as the following: (1) Producing more than 500 synthetic gravity pro- files for training and testing the SVC codes which we produced for estimating the depths and dimensions of gravity sources, having one of the six supposed shapes (in our previous research [Hekmatian et al. 2015], we ap- plied SVC in estimating the approximate gravity sources shapes between six possible shapes: sphere, horizontal cylinder, vertical cylinder, rectangular prism, syncline and anticline). From these profiles, about 320 were used for training and about 200 were used for testing. (2) Extracting 3200 features from the 320 profiles which we used for training. In other word, we extracted 10 features from each of the mentioned 320 profiles. These features are shown in Table 1 (for more explana- tions regarding the features, see Section 2.2). These fea- tures have good characters: first, they are related to the depth and dimension of the gravity sources and are in- dependent from the shape of the sources; second, they consist of all Xgy from Xg50 to Xg85 systematically; third, they are actually obtainable from common real gravity profiles (for example, in real gravity profiles usually Xg50 exists but Xg10 does not exist). (3) Selecting the most proper 2 features between the mentioned ten features (by inter/intra class dis- tance FS algorithm; see Section 2.2). We used the most proper 2 features of the training sets to train our SVC codes (for depths or dimensions estimations). Accord- ing to the shape of gravity sources and also the kind of depth and dimension estimations respectively, the most ( ) ( ) . S S Z Z NS Nk N 1 1 T w k k, n k, n NS Nkk K S n N k kk K 1 11 N k n n = = - - = == t t | || .( ) ( )S N N 1 T b S kk K k k1 n n n n= --= t t t t| ( ) ( ) .Z ZS N 1 T k, n k, nk k n N k k k1 N k n n= - -= t t| NS 1 NS n N 1 S n= =t | .Zn1 Nk 1 k Nk n N k n = =t | .Zk, n1 ( ) (q D N D N = - ) ! ! ! .D D HEKMATIAN ET AL. 4 (15) (10) (11) (12) (13) (14) 5 proper 2 features were different. This is shown in Table 2. Here it should be mentioned that it is possible to se- lect the most proper other number than 2 features (for example the most proper 3 features) between the men- tioned ten features by inter/intra class distance FS al- gorithm. But by selecting the most proper 2 features, we have received good and proper results (as will be showed in the following of this paper). Therefore it was not needed to select the most proper other number than 2 features. In any way, we have showed the proce- dures for selecting any number of features and other researchers can select any number of features in their researches. (4) Extracting the most proper 2 features (related to Table 2) from each of about 200 synthetic gravity profiles which were produced for testing our trained SVC codes (for depths or dimensions estimations). We tested our trained SVC codes by the mentioned most proper 2 features extracted from testing sets. Using the mentioned most proper 2 features, depth and dimension estimation was done properly. As you see in Table 2 the number of different kinds of depth or dimension esti- mations of the gravity sources is 16. It should be men- tioned that for testing each of these 16 different kinds, we produced 10 to 20 synthetic gravity profiles and for training each of the mentioned 16 different kinds, we produced about 20 synthetic gravity profiles too. (5) Testing our trained SVC codes (for depth or di- mension estimations) by some real gravity profiles and the proper features that we extracted from them. 4. Results and discussion 4.1. Synthetic data (and some other discussions) For depth estimation through SVC, we should know the approximate shape and the range or approx- imate dimensions of the gravity source; it is the same case for dimension estimation. Different kinds of depth and dimension estimations that we did using SVC are listed in Table 2. We show some examples of these dif- ferent kinds of depth and dimension estimations (with synthetic data in the following). Suppose we have estimated the shape of the grav- ity source to be a vertical cylinder. Also regarding its minimum depth using other methods we have reached to a range of 2 m to 9 m and a range of 2 m to 14 m re- garding its height. Its density difference (with the back- ground) is 2.5 gr/cm3. In this condition, we can train a SVC code by proper features extracted from a training set. This training set could consist of 20 synthetic ver- tical cylinder gravity sources which have density differ- ence (with the background) the same as the gravity source and have the mentioned ranges of minimum depths and heights. The specifications of this training set are shown in Table 3. As it has shown in Table 2, the best 2 features for training SVC code for radius estimation of vertical cylin- der gravity sources are feature 1 (Xg50) and feature 9 (gmax). DEPTH AND DIMENSION ESTIMATION USING SVC Number of the extracted features for depth and dimension estimation Extracted features 1 Xg50 2 Xg55 3 Xg60 4 Xg65 5 Xg70 6 Xg75 7 Xg80 8 Xg85 9 gmax 10 Integral of g(x) in central 100 m of profile The kind of depth or dimension estimation of the gravity source Name or number of 2 proper features (see Table 1) Radius of sphere 8, 9 Minimum depth of sphere 7, 9 Radius of horizontal cylinder 9, 10 Minimum depth of horizontal cylinder 8, 9 Radius of vertical cylinder 1, 9 Minimum depth of vertical cylinder 7, 9 Height of vertical cylinder 9, 10 Minimum depth of rectangular prism 6, 9 Vertical extension of rectangular prism 9, 0 Horizontal extension of rectangular prism 5, 9 Minimum depth of anticline 8, 9 Vertical extension of anticline 9, 10 Horizontal extension of anticline 9, 10 Minimum depth of syncline 1, 9 Vertical extension of syncline 9, 10 Horizontal extension of syncline 9, 10 Table 1. The extracted features and their numbers for depths and di- mensions estimations of gravity sources. Table 2. The most proper 2 features selected for different kinds of depth or dimension estimations of the gravity sources. For testing the mentioned trained SVC code, we used a testing set with 12 synthetic gravity profiles cre- ated by vertical cylinder gravity sources which have density difference (with the background) equal to 2.5 gr/cm3 and with minimum depths and heights equal to 5 m and 7 m respectively. Table 4 shows the real radii, feature 1 (Xg50) and feature 9 (gmax) of the mentioned synthetic gravity profiles and the radii which are esti- mated by the trained SVC code (proper for radius esti- mation of vertical cylinder gravity sources) and the errors of the radii estimations. If we want to estimate the radius of vertical cylin- der gravity sources with other depth, height, density dif- ference and other range of radius, we should use other training set. The results of training the mentioned SVC code and testing it in graphic style are shown in Figures 4 and 5. The radii of vertical cylinder sources are estimated using Minkowski kernel (Equation (5) with setting its parameter to 0.1) based on the following formula: ((The probability of having radius equal to 10m calculated by the trained SVC)*10) + ((The probability of having radius equal to 1m calculated by the trained SVC)*1). We should mention that the range of radius that can be estimated by the mentioned trained SVC is from 1 m to 10 m. Also, as it has been noted before, estimat- ing the radius of vertical cylinder gravity sources with other range of radius needs using other training set for proper training SVC. Estimation of the depth and dimension of sources HEKMATIAN ET AL. 6 Numbers of files of the synthetic gravity profiles used as the training set Radius (m) Minimum depth (m) Height (m) 1 1 2 2 2 1 5 5 3 1 2 8 4 1 5 11 5 1 2 14 6 1 4 2 7 1 9 5 8 1 4 8 9 1 9 11 10 1 4 14 11 10 2 2 12 10 5 5 13 10 2 8 14 10 5 11 15 10 2 14 16 10 4 2 17 10 9 5 18 10 4 8 19 10 9 11 20 10 4 14 Numbers of files of the synthetic gravity profiles used as the testing set Feature 11 (m) Feature 19 (mGal) Real radius (m) Estimated radius (m) Real error (absolute of real radius minus estimated radius) (m) Relative error (real error/ real radius) 1 6.5 0.06378 3.5 5.14 1.64 0.47 2 6.7 0.07971 4 5.22 1.22 0.31 3 7 0.09629 4.5 5.32 0.82 0.18 4 7.3 0.11323 5 5.41 0.41 0.08 5 7.6 0.13031 5.5 5.50 0.00 0.00 6 7.9 0.14735 6 5.57 0.43 0.07 7 8.3 0.1642 6.5 5.63 0.87 0.13 8 8.6 0.18076 7 5.70 1.30 0.19 9 9 0.19694 7.5 5.79 1.71 0.23 10 9.4 0.21267 8 5.89 2.11 0.26 11 9.7 0.22793 8.5 5.96 2.54 0.30 12 10.1 0.24268 9 6.05 2.95 0.33 Table 3. The specifications of the training set which we used for training SVC code for radius estimation of vertical cylinder gravity sources. Table 4. The real radii, feature 1 (Xg50) and feature 9 (gmax) of the synthetic gravity profiles (of the testing set) and the radii which are esti- mated by the trained SVC code (proper for radius estimation of vertical cylinder gravity sources) and the errors of the radii estimations. 7 with other shapes (see Table 2) were, in general, in the same manner with little differences. In continuation, three other examples including, minimum depth and height of vertical cylinder source estimation in addition to estimation of minimum depth of rectangular prism source by their related trained SVC on their testing sets are presented. Regarding the minimum depth of vertical cylinder estimation, the training set includes 20 synthetic grav- ity profiles which are created by vertical cylinder gravity DEPTH AND DIMENSION ESTIMATION USING SVC Figure 4. Minkowski kernel (of SVC) in a 2 dimensional space (hor- izontal axis is showing Xg50 and vertical axis is showing gmax) which is trained by training set of Table 3. The Cross points are showing the 10 vertical cylinder gravity sources with radius equal to 10 m. And the star points are showing the 10 vertical cylinder gravity sources with radius equal to 1 m. These 20 points are training set. Figure 5. This figure have the same parameters and kernl as Figure 4, but The Minkowski kernel (of SVC) separate vertical cylinder gravity sources of the testing set (Table 4) according to their radius. The points are showing the 12 vertical cylinder gravity sources of the testing set. Numbers of files of the synthetic gravity profiles used as the testing set Real minimum depth (m) Estimated minimum depth (m) Real error (absolute of real minimum depth minus estimated minimum depth) (m) Relative error (real error/ real minimum depth) 1 5.5 7.84 2.34 0.43 2 6 7.95 1.95 0.33 3 6.5 8.01 1.51 0.23 4 7 8.15 1.15 0.16 5 7.5 8.29 0.79 0.11 6 8 8.36 0.36 0.05 7 8.5 8.50 0.00 0.00 8 9 8.64 0.36 0.04 9 9.5 8.71 0.79 0.08 10 10 8.85 1.15 0.11 11 10.5 8.99 1.51 0.14 12 11 9.13 1.87 0.17 13 11.5 9.27 2.23 0.19 14 12 9.41 2.59 0.22 15 12.5 9.54 2.96 0.24 16 13 9.60 3.40 0.26 Table 5. The real minimum depths of the synthetic gravity profiles of the testing set and the minimum depths estimated by the trained SVC code (proper for minimum depths estimations of vertical cylinder gravity sources) and the errors of minimum depths estimations. sources. 10 of these profiles have sources at minimum depth equal to 3 m and 10 other profiles have sources at minimum depth 15 m. The radii of all the 20 profiles are equal to 5 m or 6 m and their height range is 1 m to 36 m. The density difference with background for all of them is equal to 2.5 gr/cm3. With such training set, we would be able to estimate minimum depth of those vertical cylinder gravity sources which their minimum depths are between 3 m and 15 m and have the men- tioned ranges or values of height, radius and density difference. The estimated minimum depths of vertical cylinder sources of the testing set are shown in Table 5. The training set regarding the height of vertical cylinder sources consists of 20 synthetic gravity profiles with 2 classes. One class is related to height equal to 2m and consists of 10 synthetic gravity profiles created by vertical cylinder sources with height equal to 2 m and the other class is related to height equal to 25 m and con- sists of 10 other profiles with the height of their sources equal to 25 m. Radii of the mentioned 20 profiles are 5 m or 6 m and the range of their depths are between 5 m to 7 m. Their density difference with background is 2.5 gr/cm3. The estimated heights of vertical cylin- der sources of the testing set are shown in Table 6. According to the 3 mentioned testing sets (see Table 4 to Table 6), if the radii, depths and heights of vertical cylinder sources are about in the middle of the used 2 classes of each of them, then they can be estimated with low errors. For example the lowest error pertain- ing to radius estimation is related to radius equal to 5 m or 6 m (see Table 4). The reason is that as we mentioned in Section 1 according to [Moran and Lavrenko 2011], Minkowski kernel (relation 5) mimics logical AND of variation in feature values. Regarding estimation of minimum depth of rec- tangular prism sources, the training set consists of 20 rectangular prism sources with density difference with background equal to 2.5 gr/cm3. Ten of them have the minimum depths equal to 2 m and depths of other ten are equal to 25 m. Horizontal and vertical thickness of all of the 20 rectangular prism sources is between 14 m and 18 m and between 20 m and 30 m respectively. After training by the mentioned training set, we examined the trained SVC codes with the testing set. The results are shown in Table 7. 4.2. Real data We have two sets of real gravity data. The approxi- mate location of these 2 sets is shown in Figure 6 by dark spots. The first set of real data belongs to the small grid gravity network at Institute of Geophysics of Univer- sity of Tehran. For gravity data acquisition, a Scintrex (CG3M) gravimeter with 1 micro-gal resolution is used and the coordinates of the points were measured by the Total Station Trimple 5601 with accuracy about 1 cm in x, y and h coordinates. The goal was to detect and model the tunnels of an old buried channel. The grid space of 2*2 square meters is used for measuring the data. In the measured data, by using the gravity cor- HEKMATIAN ET AL. 8 Numbers of files of the synthetic gravity profiles used as the testing set Real height (m) Estimated height (m) Real error (absolute of real height minus estimated height) (m) Relative error (real error/ real height) 1 11 14.87 3.87 0.35 2 12 15.08 3.08 0.26 3 13 15.26 2.26 0.17 4 14 15.42 1.42 0.10 5 15 15.55 0.55 0.04 6 16 15.66 0.34 0.02 7 17 15.77 1.23 0.07 8 18 15.86 2.14 0.12 9 19 15.94 3.06 0.16 10 20 16.01 3.99 0.20 11 21 16.07 4.93 0.23 12 22 16.13 5.87 0.27 13 23 16.19 6.81 0.30 Table 6. The real heights of the synthetic gravity profiles of the testing set and the heights estimated by the trained SVC code (proper for heights estimations of vertical cylinder gravity sources) and the errors of heights estimations. 9 DEPTH AND DIMENSION ESTIMATION USING SVC Names of files of the synthetic gravity profiles used as the testing set Real minimum depth (m) Estimated minimum depth (m) Real error (absolute of real minimum depth minus estimated minimum depth) (m) Relative error (real error/ real minimum depth) 1 6 8.48 2.48 0.41 2 7 9.38 2.38 0.34 3 8 9.42 1.42 0.18 4 9 9.45 0.45 0.05 5 10 10.81 0.81 0.08 6 11 10.83 0.17 0.02 7 12 10.86 1.14 0.10 8 13 12.29 0.71 0.05 9 14 12.31 1.69 0.12 10 15 12.33 2.67 0.18 11 16 13.79 2.21 0.14 12 17 13.81 3.19 0.19 13 18 15.26 2.74 0.15 14 19 15.27 3.73 0.20 15 20 15.27 4.73 0.24 16 21 16.67 4.33 0.21 17 22 16.67 5.33 0.24 Figure 6. The approximate location in Iran of the 2 sets of real data is specified by dark spots. The geographic map of Iran is extracted from Ardestani [2008]. Table 7. The real minimum depths of gravity profiles of the testing set and minimum depths estimated by the trained SVC code (proper for minimum depths estimations of rectangular prism gravity sources) and the errors of minimum depths estimations. rections due to standard formulas, the Bouguer gravity anomalies are computed and the trend effects are re- moved by polynomial fitting method, and finally the residual anomalies are calculated and shown in Figure 7. We should confirm that contour interval of 1 mi- crogal is the default presentation of the software used for producing Figure 7 (geosoft) and it is only for better presentation of the figure and does not show the real accuracy of the mentioned figure. A gravity profile (line AB in Figure 7) is selected for testing. The mentioned profile is shown in Figure 7-1. This profile was collected above a well i.e. the gravity source of this gravity profile was a well full of air. We knew that the well was very similar in shape to a vertical cylinder or a long vertical rectangular prism. The second set of real data belongs to the grid HEKMATIAN ET AL. 10 Figure 7. Location of the gravity profile AB over the well. The map of this figure is a residual gravity map of a part of Institute of Geophysics of University of Tehran and the coordinates are the local one [Ardestani 2009]. Figure 7-1. The gravity profile AB extracted from the map of Fig- ure 7 [Ardestani 2009]. 11 gravity network carried out in a mining site close to Zanjan (a city in Iran). The grid space of 10*10 square meters is used for measuring the data. For this set of real data, a Scintrex (CG3M) gravimeter with 1 micro-gal resolution is used and the coordinates of the points in the second set of real data were measured by the Total Station (Leika750) with accuracy to a few cm in x, y and h coordinates. After gravity corrections due to standard formulas, the Bouguer gravity anomalies are computed. After removing the trend effect by polynomial fitting method, the residual anomalies are computed and for the second set of real data shown in Figure 8. Three real gravity profiles that we used in this set of real data are presented in Figure 8 as the profiles CD, EF and GH. CD, EF and GH profiles are shown in Figures 8-1 to 8-3. Finally the estimated shapes of the gravity sources of the mentioned real gravity profiles by our different trained SVC codes (the shapes estimations have been explained in detail in our last research: Hekmatian et al. [2015]) are given in Table 8. It has been shown in the last research [Hekmatian et al. 2015] that the estimation of the shapes of the real gravity sources with SVC is compatible with the reality. After estimating shapes of the real gravity profiles, now we can estimate their minimum depths and di- mensions. As we mentioned before, the source of AB profile is a well that can be seen in the surface of the Earth, so that we know its depth and dimensions. Regarding the radius of the source of AB profile (see Figure 7 and Table 8), now we know that the shape of this source is DEPTH AND DIMENSION ESTIMATION USING SVC Figure 8. Location of the gravity profiles CD, EF and GH. The map of this figure is a residual gravity map of a mining site near Zanjan (a city in Iran) and the coordinates are UTM. a vertical cylinder. For radius estimation, we should not only know the shape of the source, but also the range of the minimum depth and the height of the vertical cylinder. The minimum depth of this vertical cylinder according to the prior knowledge is less than 2 m (we take this range between 0 m and 2 m). Also we knew that the height is less than 8 meters (we take the range of height between 2 m and 7 m). In addition, the source is a well full of air, so that after measuring the density in the field, we take its density difference with the back- ground equal to −2.0 gr/cm3. By these range of possi- ble specifications, we trained the proper SVC code for this case. By the mentioned trained SVC code, we esti- mated the radius of the source of AB profile approxi- mately equal to 1.05 m. The real radius of the mentioned source is at maximum 1 m (is between 0.6 m to 1 m and therefore the average radius is about 0.8 m). Similarly, for minimum depth estimation of the source of AB profile, we should know shape, radius and height of the source. So far, we estimated shape and ra- dius; we also know the range of height of the source. So we produced proper trained SVC by which we estimated the minimum depth approximately equal to 0.3 m which is about the same as real minimum depth (zero meter). Finally, we estimated height of the mentioned source approximately equal to 5.8 m. The real height is 5 m. Regarding EF profile, the estimated shape of the gravity source is a vertical cylinder (see Figure 8 and Table 8), and its radius, height an minimum depth are estimated by SVC equal to 6 m, 17 m and 6 m respec- tively. For GH profile, we estimated its shape as a rec- tangular prism (see Figure 8 and Table 8), and its mini- mum depth, horizontal thickness (in the direction of the profile) and vertical thickness are estimated by SVC equal to 16 m, 17 m and 18 m respectively. Regarding the sources of the profiles EF and GH, we have no real values of their depths and dimensions. But regarding their depths, we should mention that they are consistent with the results of Euler estimations (see Figure 9). Regarding their horizontal thickness (or diameter), they are consistent with the analytic signals map (see Figure 10). HEKMATIAN ET AL. 12 Name of real gravity profile Shape of gravity source Estimated shape of the source by the trained SVC codes AB similar to vertical cylinder vertical cylinder CD possibly rectangular prism not interpretable because of non being principal profile EF possibly vertical cylinder or rectangular prism vertical cylinder GH possibly rectangular prism rectangular prism Figure 8-1. The gravity profile CD extracted from the map of Fig- ure 8. Figure 8-2. The gravity profile EF extracted from the map of Fig- ure 8. Figure 8-3. The gravity profile GH extracted from the map of Fig- ure 8. Table 8. Shapes of gravity sources of the real gravity profiles and the results of their testing by our different trained SVC codes. 13 DEPTH AND DIMENSION ESTIMATION USING SVC Figure 9. Location of the gravity profiles EF and GH on Euler depth estimations of residual gravity map consists of a mining site near Zan- jan and the coordinates are UTM. Figure 10. Location of the gravity profiles EF and GH on analytic signals of residual gravity map consists of a mining site near Zanjan and the coordinates are UTM. 5. Conclusion Estimations of depths and dimensions of gravity sources through SVC are possible. Therefore, estima- tions by SVC give us initial guesses regarding shape [Hekmatian et al. 2015], depth and dimension of grav- ity sources which is needed in more precise interpreta- tion and inversion of gravity sources. Finally, we propose this approach to be applied for geo-scientific investigations as a new approach for the estimation of the depths and dimensions of the gravity sources in exploration projects. We suggest that the above mentioned approach can be used in other geo- physical branches. Acknowledgements. We thank all the faculty members who assisted us in preparing this paper. Our special thanks goes to Dr. Hashemi, for introducing us to the Pattern Recognition Laboratory, Delft University of Technology (whose group produced and de- signed PRTools which we used in this research). Also by studying his papers [Hashemi et al. 2008, Hashemi 2010], we decided to perform this research. We thank Prof. Duin (Pattern Recognition Labora- tory, Delft University of Technology), because of his helpful guid- ance at the beginning of this research. We thank the geophysics group of the Faculty of Basic Sciences of Science and Research Branch, Is- lamic Azad University, for guiding us so well during this research. Also we thank the Research Council of the University of Tehran. References Ardestani, V.E. (2008). Modelling the Karst zones in a dam site through micro-gravity data, Explor. Geo- phys., 39, 204-209. Ardestani, V.E. (2009). Residual gravity map of a part of Institute of Geophysics of University of Tehran, Tehran, Iran. Duin, R.P.W., P. Juszczak, P. Paclik, E. Pekalska, D. de Ridder, D.M.J. Tax and S. Verzakov (2007). PRTools4. A Matlab Toolbox for Pattern Recognition. Version 4.1, Delft University of Technology, Netherlands; http://www.prtools.org/ (accessed June 22, 2011). Gret, A.A., and E.E. Klingele (1998). Application of Ar- tificial Neural Networks for Gravity Interpretation in two Dimensions, Institute of Geodesy and Pho- togrammetry, Swiss Federal Institute of Technol- ogy, Zurich. Hashemi, H., D.M.J. Tax, R.P.W. Duin, A. Javaherian and P. Groot (2008). Gas chimney detection based on improving the performance of combined multi- layer perceptron and support vector classifier, Non- linear Proc. Geoph., 15, 863-871. Hashemi, H. (2010). Logical considerations in applying pattern recognition techniques on seismic data: Pre- cise ruling, realistic solutions, CSEG Recorder, 35, 47-50. Hekmatian, M.E., V.E. Ardestani, M.A. Riahi, A. Memar Koucheh Bagh and J. Amini (2015). Estimating the shapes of gravity sources through optimized sup- port vector classifier (SVC), Acta Geophys., 63 (4), 1000-1024. Moran, S., and V. Lavrenko (2011). Optimal Tag Sets for Automatic Image Annotation, In: J. Hoey, S. McKenna and E. Trucco, Proceedings of the British Machine Vision Conference, BMVA Press, September 2011, 1.1-1.11; http://dx.doi.org/10.5244/C.25.1. Osman, O., A.M. Albora and O.N. Ucan (2006). A new approach for residual gravity anomaly profile inter- pretations: Forced Neural Network (FNN), Annals of Geophysics, 49 (6), 1201-1208. Van der Baan, M., and J. Christian (2000). Neural ne two rks in geophysical applications, Geophysics, 65 (4), 1032-1047. Van der Heijden, F., R.P.W. Duin, D. de Ridder and D.M.J. Tax (2004). Classification, Parameter Esti- mation and State Estimation, John Wiley & Sons, Ltd., Chichester, England, 440 p. * Corresponding author: Mohammad Ehsan Hekmatian, Islamic Azad University, Faculty of Basic Sciences, Science and Research Branch, Tehran, Iran; email: mhekmatian@aeoi.org.ir. © 2016 by the Istituto Nazionale di Geofisica e Vulcanologia. All rights reserved. HEKMATIAN ET AL. 14 << /ASCII85EncodePages false /AllowTransparency false /AutoPositionEPSFiles false /AutoRotatePages /None /Binding /Left /CalGrayProfile (Dot Gain 20%) /CalRGBProfile (sRGB IEC61966-2.1) /CalCMYKProfile (U.S. Web Coated \050SWOP\051 v2) /sRGBProfile (sRGB IEC61966-2.1) /CannotEmbedFontPolicy /Warning /CompatibilityLevel 1.3 /CompressObjects /Tags /CompressPages true /ConvertImagesToIndexed true /PassThroughJPEGImages true /CreateJobTicket false /DefaultRenderingIntent /Default /DetectBlends true /DetectCurves 0.1000 /ColorConversionStrategy /LeaveColorUnchanged /DoThumbnails false /EmbedAllFonts true /EmbedOpenType false /ParseICCProfilesInComments true /EmbedJobOptions true /DSCReportingLevel 0 /EmitDSCWarnings false /EndPage -1 /ImageMemory 1048576 /LockDistillerParams true /MaxSubsetPct 100 /Optimize false /OPM 1 /ParseDSCComments true /ParseDSCCommentsForDocInfo true /PreserveCopyPage true /PreserveDICMYKValues true /PreserveEPSInfo true /PreserveFlatness true /PreserveHalftoneInfo false /PreserveOPIComments false /PreserveOverprintSettings true /StartPage 1 /SubsetFonts true /TransferFunctionInfo /Apply /UCRandBGInfo /Preserve /UsePrologue false /ColorSettingsFile (None) /AlwaysEmbed [ true /AndaleMono /Apple-Chancery /Arial-Black /Arial-BoldItalicMT /Arial-BoldMT /Arial-ItalicMT /ArialMT /CapitalsRegular /Charcoal /Chicago /ComicSansMS /ComicSansMS-Bold /Courier /Courier-Bold /CourierNewPS-BoldItalicMT /CourierNewPS-BoldMT /CourierNewPS-ItalicMT /CourierNewPSMT /GadgetRegular /Geneva /Georgia /Georgia-Bold /Georgia-BoldItalic /Georgia-Italic /Helvetica /Helvetica-Bold /HelveticaInserat-Roman /HoeflerText-Black /HoeflerText-BlackItalic /HoeflerText-Italic /HoeflerText-Ornaments /HoeflerText-Regular /Impact /Monaco /NewYork /Palatino-Bold /Palatino-BoldItalic /Palatino-Italic /Palatino-Roman /SandRegular /Skia-Regular /Symbol /TechnoRegular /TextileRegular /Times-Bold /Times-BoldItalic /Times-Italic /Times-Roman /TimesNewRomanPS-BoldItalicMT /TimesNewRomanPS-BoldMT /TimesNewRomanPS-ItalicMT /TimesNewRomanPSMT /Trebuchet-BoldItalic /TrebuchetMS /TrebuchetMS-Bold /TrebuchetMS-Italic /Verdana /Verdana-Bold /Verdana-BoldItalic /Verdana-Italic /Webdings ] /NeverEmbed [ true ] /AntiAliasColorImages false /CropColorImages true /ColorImageMinResolution 150 /ColorImageMinResolutionPolicy /OK /DownsampleColorImages true /ColorImageDownsampleType /Bicubic /ColorImageResolution 300 /ColorImageDepth -1 /ColorImageMinDownsampleDepth 1 /ColorImageDownsampleThreshold 1.10000 /EncodeColorImages true /ColorImageFilter /DCTEncode /AutoFilterColorImages true /ColorImageAutoFilterStrategy /JPEG /ColorACSImageDict << /QFactor 0.15 /HSamples [1 1 1 1] /VSamples [1 1 1 1] >> /ColorImageDict << /QFactor 0.15 /HSamples [1 1 1 1] /VSamples [1 1 1 1] >> /JPEG2000ColorACSImageDict << /TileWidth 256 /TileHeight 256 /Quality 30 >> /JPEG2000ColorImageDict << /TileWidth 256 /TileHeight 256 /Quality 30 >> /AntiAliasGrayImages false /CropGrayImages true /GrayImageMinResolution 150 /GrayImageMinResolutionPolicy /OK /DownsampleGrayImages true /GrayImageDownsampleType /Bicubic /GrayImageResolution 300 /GrayImageDepth -1 /GrayImageMinDownsampleDepth 2 /GrayImageDownsampleThreshold 1.10000 /EncodeGrayImages true /GrayImageFilter /DCTEncode /AutoFilterGrayImages true /GrayImageAutoFilterStrategy /JPEG /GrayACSImageDict << /QFactor 0.15 /HSamples [1 1 1 1] /VSamples [1 1 1 1] >> /GrayImageDict << /QFactor 0.15 /HSamples [1 1 1 1] /VSamples [1 1 1 1] >> /JPEG2000GrayACSImageDict << /TileWidth 256 /TileHeight 256 /Quality 30 >> /JPEG2000GrayImageDict << /TileWidth 256 /TileHeight 256 /Quality 30 >> /AntiAliasMonoImages false /CropMonoImages true /MonoImageMinResolution 1200 /MonoImageMinResolutionPolicy /OK /DownsampleMonoImages true /MonoImageDownsampleType /Bicubic /MonoImageResolution 1200 /MonoImageDepth -1 /MonoImageDownsampleThreshold 1.08250 /EncodeMonoImages true /MonoImageFilter /CCITTFaxEncode /MonoImageDict << /K -1 >> /AllowPSXObjects false /CheckCompliance [ /None ] /PDFX1aCheck false /PDFX3Check false /PDFXCompliantPDFOnly false /PDFXNoTrimBoxError true /PDFXTrimBoxToMediaBoxOffset [ 0.00000 0.00000 0.00000 0.00000 ] /PDFXSetBleedBoxToMediaBox true /PDFXBleedBoxToTrimBoxOffset [ 0.00000 0.00000 0.00000 0.00000 ] /PDFXOutputIntentProfile (None) /PDFXOutputConditionIdentifier () /PDFXOutputCondition () /PDFXRegistryName (http://www.color.org) /PDFXTrapped /Unknown /CreateJDFFile false /SyntheticBoldness 1.000000 /Description << /ENU (Use these settings to create PDF documents with higher image resolution for high quality pre-press printing. The PDF documents can be opened with Acrobat and Reader 5.0 and later. These settings require font embedding.) /JPN /FRA /DEU /PTB /DAN /NLD /ESP /SUO /NOR /SVE /KOR /CHS /CHT /ITA >> >> setdistillerparams << /HWResolution [2400 2400] /PageSize [595.000 842.000] >> setpagedevice