JOURNAL OF THEORETICAL AND APPLIED MECHANICS 42, 3, pp. 629-649, Warsaw 2004 NEURAL NETWORK ASSISTED CRACK AND FLAW IDENTIFICATION IN TRANSIENT DYNAMICS Georgios E. Stavroulakis Department of Applied Mathematics and Mechanics, University of Ioannina, Ioannina, Greece and TU Braunschweig, Germany; e-mail: gestavr@cc.uoi.gr Marek Engelhardt Institute of Applied Mechanics, Carolo Wilhelmina Technical University, Braunschweig, Germany e-mail: marek.engelhardt@tu-bs.de Aristidis Likas Department of Computer Science, University of Ioannina, Ioannina, Greece e-mail: arly@cs.uoi.gr Rafael Gallego Department of Structural Mechanics, University of Granada, Granada, Spain e-mail: gallego@ugr.es Heinz Antes Institute of Applied Mechanics, Carolo Wilhelmina Technical University, Braunschweig, Germany e-mmail: h.antes@tu-bs.de Crack and flaw identification problems in two-dimensional elastomechanics arenumerically studied in this paper.Themechanicalmodelling is basedon boundary element techniques, with special care of hypersingular issues for the cracks. The possibility of partially or totally closed cracks (unilateral contact effects) is taken into accountby linear complementarity techniques. Backpropagation neural networks are used for the solution of the inverse problems. For dynamical problems, a suitable preprocessing of the input data enhances the effectiveness of the procedure. For the two-dimensional examples presented here, the proposed method has similar performance for classical crack and flaw identification problems. The identification of unilateral cracks is a considerably more difficult task, which nevertheless, can also be solved by the samemethod, provided that a suitable dynamical test loading is applied. Key words: crack identification, neural networks, unilateral contact, wave propagation, nondestructive evaluation 630 G.E.Stavroulakis et al. 1. Introduction Crack and flaw identification problems are solved in the paper. Accurate modelling of static and dynamic problems in two-dimensional linearly elastic bodies is done by means of the boundary element method, based on Gallego andDomı́nguez (1996, 1997).Unilateral effects, which allow for partial or total closure of cracks, are also included, following a linear complementarity problem formulation of the unilateral problem, see Antes and Panagiotopoulos (1992). For the numerical solution of the inverse problem, usually the problem is transformed into an optimization task, following an output errorminimization formulation. It is solved subsequently by numerical optimization (see, among others, Stavroulakis, 2000; Rus and Carlborg, 2001; Rus and Gallego, 2002). Neural networks are able to directly approximate the relation betweenmeasu- rements and unknown parameters. They have been tested for the solution of crack identification problems in statics (Stavroulakis andAntes, 1997), and in dynamics (both steady state, harmonic elastodynamics, in Stavroulakis and Antes (1998), and time domain elastodynamics inStavroulakis (1999), see also Stavroulakis (2000)). The mentioned optimization task is now hidden in the training phase of the neural network. This generalmethodology is extended in the present paper for the solution of two-dimensional problems. From the mechanical point of view, this paper presents a comparison of the identifiability of flaws (holes) with respect to cracks andanumerical studyof the effect of non-classical, due to theunilateral contact phenomenon, cracks. From the data processing point of view, themain task is the reduction of the size of data used for the training of the neural network. A partial answer to this problem is proposed and tested here. The modelling of back propagationneural networks is based on theNeuralNetwork Toolbox of MATLAB and on homemade programs (Likas et al., 1998). 2. Static and dynamic identification problems Based on the loading excitation, which is used for the testing of a structu- re, damage and crack identification problems can be divided into static pro- blems, harmonic dynamic or modal analysis problems and transient dynamic problems. For solving the inverse problem, first, the unknown quantities li- ke number and type of defects, their position and other shape constants are expressed with the help of certain variables. In the next step, a number of Neural network assisted crack... 631 mechanical tests are considered, where for each value of the unknown defect parameters, the corresponding responses of the structure are compared with the wished (measured) ones. Several methods for the effective numerical solution to this problem have been tested (among others, numerical optimization, genetic algorithms, soft computing, see Stavroulakis (2000)). Themethod used here employs a neural network to solve the inverse problem, and supports on-line implementation. Analogous results were published in statics (Liang and Hwu, 2001) and in dynamics (one-dimensional problems in Ziemianski and Piatkowski (2000), two-dimensional problems in Oishi et al. (1995)). Moreover, the evaluation of ultrasonic data has been performed in several cases by means of neural network models, see, among others, Zgonc and Achenbach (1996) and the review articles Yagawa andOkuda (1996), Zeng (1998). In the author’s previous work, dynamical problems on relatively simple layered structures with cracks or defects, which were parallel to the layers, were considered (Stavroulakis, 1999). In the present paper, the method is extended to general two-dimensional structures with measurements of dyna- mical responses on several different parts of the boundaries. The size of the data (measured waveforms) increases dramatically, therefore a data reduction schememust be used to allow for effective treatment by neural networks. Con- cerning the identification of unilateral cracks, except for the previous work by the authors summarized in Stavroulakis (2000), and the paper Alessandri and Mallardo (1999) which was based on the classical optimization for the solution to statical problems, no other published work is available to the best knowledge of the authors. 3. Boundary element modelling of the mechanical problem 3.1. Boundary integral equations for elastostatics The formulation of the direct elastomechanical problem is based on the basic equations of the theory of elasticity, i.e., on Navier’s equation µui,jj(x)+(µ+λ)uj,ij(x)+ bi(x)= 0 (3.1) where the Lamé constants µ and λ are used. Further, one formulates the weighted residuum ∫ Ω [ui,jj(x)+(µ+λ)uj,ij(x)+ bi(x)]u ∗ ik(x,ξ) dΩ=0 632 G.E.Stavroulakis et al. where u∗ik(x,ξ) is the so called fundamental solution, i.e., the displacement field at the point x (observation point), due to a unit force applied at the point ξ (collocation or source point), in an infinite domain. Integrating byparts and taking into account that the fundamental solution fulfills governing equations (3.1), the previous equation leads to uk(ξ)= ∫ Γ [pi(x)u ∗ ik(x;ξ)−ui(x)p ∗ ik(x;ξ)] dΓx+ ∫ Ω bi(x)u ∗ ik(x;ξ) dΩx (3.2) where pi(x) and p ∗ ik(x;ξ) are the traction vectors at the boundary Γ corre- sponding to the displacement fields ui andu ∗ ik, respectively. In equation (3.2), the collocation point belongs to the interior of the domain, i.e., ξ ∈ Ω but ξ 6∈ Γ . To obtain the displacement Boundary Integral Equation a limiting process must be carried out, which finally leads to cki(ξ)ui(ξ)=− ∫ Γ [pi(x)u ∗ ik(x;ξ)−ui(x)p ∗ ik(x;ξ)] dΓx (3.3) where the free term cik depends on the position of the collocation point: cik = δik if ξ belongs to the interior of Ω; cik = 0 if ξ 6∈ Ω; if ξ ∈ Γ , the value of cik depends on the local geometry of the boundary at ξ; if the normal to the boundary is continuous at ξ∈Γ then cik = δik/2. − ∫ stands for the Cauchy Principal Value (CPV) of the integral. 3.2. Displacement boundary integral representation for transient elasto- dynamics Consider an elastodynamic problem in the domain Ω subject to transient loads. The displacement field ui satisfies Navier’s differential equations in the domain (c21− c 2 2)ui,ij (x, t)+ c 2 2uj,ii (x, t)− üj(x, t)=− bj(x, t) ρ (3.4) where c1 and c2 are the dilatational and shearwave velocities, ρ is the density and bj are the components of the body force per unit volume. The initial and boundary conditions have to be satisfied in the domain at t = 0, and on the boundary Γ , respectively. The stresses at a point in the domain can be obtained from the displacement field using Hooke’s law σim = ρ[δim(c 2 1−2c 2 2)uj,j+c 2 2(ui,m+um,i )] (3.5) Neural network assisted crack... 633 and the traction at a point on Γ , whose outward normal has components nk, can be computed from the stresses pi =σiknk = ρ[ni(c 2 1−2c 2 2)uj,j+nkc 2 2(ui,k+uk,i )] (3.6) where pi are the components of the traction vector. The displacement at a point ξ and time t can be represented in terms of the displacements and tractions on the boundary by cij(ξ)uj(ξ, t)+− ∫ Γ t+∫ 0 p̄∗ij(x, t− τ;ξ)uj(x,τ) dτ dΓ = (3.7) = ∫ Γ t+∫ 0 (u∗ij(x, t− τ;ξ)pj(x,τ) dτ dΓ where u∗ij(x, t− τ;ξ) is the displacement field in an infinite medium due to a unit impulse in the direction xi located at the point ξ and acting at the time τ; p̄∗ij(x, t− τ;ξ) is the corresponding traction field obtained from the displacements by Hooke’s law (3.6); cij is equal to the free term in equation (3.3); ∫ t+ 0 = limε→0 ∫ t+ε 0 . Zero initial conditions and zero body forces bj have been assumed. The traction tensor p̄∗ij contains Dirac’s delta functions which preclude a direct numerical integration of eqaution (3.7) (seeMansur (1983), Domı́nguez (1993) for technical details). To eliminateDirac’s delta terms, the spatial deri- vative of theHeaviside function is related to its timederivative, and integration by parts is performed. This leads to the integral equation cijuj + ∫ Γ t+∫ 0 (p∗ijuj +p v∗ ij u̇j) dτ dΓ = ∫ Γ t+∫ 0 u∗ijpj dτ dΓ (3.8) where u̇j is the velocity field, and p ∗ ij and p v∗ ij are new kernels where Dirac’s delta terms have been removed. 3.3. Traction boundary integral representation for transient elasto- dynamics From the integral representation of the displacements at an interior point (3.8), the corresponding integral representation for the stress tensor at this 634 G.E.Stavroulakis et al. point can be obtained by Hooke’s law σim(ξ, t)+ ∫ Γ t+∫ 0 d̄∗imjpj dτ dΓ = ∫ Γ t+∫ 0 (s̄∗ijuj + s̄ ∗v ij u̇j) dτ dΓ (3.9) where d̄∗imj = ρ[δim(c 2 1−2c 2 2)p ∗ kj,k+c 2 2(p ∗ ij,m+p ∗ mj,i )] (3.10) and similar equations for the rest of the kernels. Again, Dirac’s delta function terms appear in the kernels of equation (3.9), due to the differentation of the Heaviside functions in the kernels of (3.8). Integration by parts leads to the equation σim+ ∫ Γ t+∫ 0 (d∗imjpj +d v∗ imjṗj) dτ dΓ = (3.11) = ∫ Γ t+∫ 0 (s∗imjuj +s v∗ imju̇j +s a∗ imjüj) dτ dΓ where üj is the acceleration field; ṗi is the time derivative of the traction vector; the kernels, d∗imj, d v∗ imj, s ∗ imj, s v∗ imj and s a∗ imj donot contain any strongly singular term (Dirac’s delta functions). Before carrying the former equation to the boundary it is necessary to asses the order and location of the singularities involved in the time and space integrations. It can be shown that the kernels have the same singularities as the corresponding ones in the dynamic antiplane formulation (Gallego and Domı́nguez, 1995, 1996), and therefore a similar regularization approach can beperformed to find the traction boundary integral representation (Guiggiani, 1992). The final boundary integral representation for the traction at a smooth boundary, whose outward normal is nm, can be written as cijpj+− ∫ Γ t+∫ 0 (d∗ijpj+d v∗ ij ṗj) dτ dΓ == ∫ Γ t+∫ 0 (s∗ijuj+s v∗ ij u̇j+s a∗ ij üj) dτ dΓ (3.12) where cij = δij/2; d ∗ ij = d ∗ imjnm, d v∗ ij = d v∗ imjnm, s ∗ ij = s ∗ imjnm, sv∗ij = s v∗ imjnm and s a∗ ij = s a∗ imjnm; = ∫ stands for the Hadamard Finite Part of the integral. Equation (3.12) can be written at interior points and points outside Ω with cij =1 and cij =0, respectively. Neural network assisted crack... 635 3.4. Application of the traction integral equation to crack problems A mixed Boundary Element formulation has been proposed to solve frac- ture mechanics problems in static (Portela et al., 1992; Sáez et al., 1995) and dynamic loadings (Fedelinski et al., 1994). A more refined approach, taken from Gallego and Domı́nguez (1997), which is essential for problems with a nonzero traction on the crack faces, is used here. Let us call Γ+ and Γ− the upper and lower face of the crack, respecti- vely, and Γc the rest of the boundary. The superscripts ‘+’ and ‘−’ denote any function evaluated on the upper and lower face of the crack, respecti- vely. It could be checked that d∗ij + = d∗ij −, dv∗ij + = dv∗ij −, s∗ij + = −s∗ij −, sv∗ij + =−sv∗ij − andsa∗ij + =−sa∗ij −.Therefore, assumingthat∆pj= p + j +p − j =0, equation (3.12) for an interior point can be written as pi+ ∫ Γc t+∫ 0 (d∗ijpj +d v∗ ij ṗj) dτ dΓ = ∫ Γc t+∫ 0 (s∗ijuj +s v∗ ij u̇j +s a∗ ij üj) dτ dΓ + (3.13) + ∫ Γ+ t+∫ 0 (s∗ij∆uj +s v∗ ij∆u̇j +s a∗ ij∆üj) dτ dΓ where ∆uj = u + j − u − j is the crack displacement jump. If this equation is carried to a point on the boundary Γ+, the following expression is obtained p+i (ξ, t)+ ∫ Γc t+∫ 0 (d∗ijpj +d v∗ ij ṗj) dτ dΓ = ∫ Γc t+∫ 0 (s∗ijuj +s v∗ ij u̇j +s a∗ ij üj) dτ dΓ + (3.14) += ∫ Γ+ t+∫ 0 (s∗ij∆uj +s v∗ ij∆u̇j +s a∗ ij∆üj) dτ dΓ The difference between this equation and the general traction representa- tion, Eq (3.12), is the free term, which is 1 for the crack point instead of 1/2. Hypersingular boundary integral equation (3.14) in the crack face Γ+ and the standard displacement boundary integral representation on the rest of the boundary Γc provide a complete set of equations to compute the tractions and/or displacement or displacement jump on the boundary and the crack faces. 636 G.E.Stavroulakis et al. 3.5. Boundary element discretization The existence of the hypersingular boundary integral equation demands C1 continuity on the displacement at the collocation point. On the other hand, the collocation point should be placed at a smooth boundary point, since otherwise the value of the traction is not unique. The boundary element discretization must fulfill both conditions, as it has been discussed in Gallego andDomı́nguez (1997). Taking this into account andusing standardBoundary ElementTechniques otherwise for spatial and timediscretization, the resulting set of algebraic equations is written in a compact matricial notation as G nnpn =Hnnun+ n−1∑ m=1 (Hnmum−Gnmpm) (3.15) where the arrays um and pm contain the displacements and tractions of the nodes on Γc and the displacement jumps and tractions on Γ +, for the time step m. At this point, the boundary conditions of the elastomechanical problem at the time m are taken into account, known and unknown quantities are separated and a system of linear equations is formulated and solved for all unknown boundary displacements with respect to tractions y G nnpn =Hnnun+Rn ⇒ Annyn =fn (3.16) This system of equations is solved step by step as in the standard displa- cement formulation in the time domain. 3.6. Unilateral contact at cracks (partial closure): a linear complemen- tarity problem Unilateral contact problemscanbeconsideredbyusingaLCP-BEM(linear complementarity – boundary element) method, analogous to the one develo- ped in the case of the two-region BEM in Antes and Panagiotopoulos (1992), Stavroulakis (1997, 2000). Without going into details, let us mention that, with a suitable sign assumption, a unilateral contact mechanism implying no- penetration and no-tension between the boundary displacements u and boun- dary tractions t is describedby the set of inequalities and the complementarity condition u¬ 0 t¬ 0 u>t=0 (3.17) TheLCP is composedof the latter set of relations for all unilateralmechanisms (here, on all crack nodes relating crack opening and contact traction) together with theunderdetermined systemof equations shown in thefirst part of (3.16). For other approaches see, in addition, Guz and Zozulya (2002). Neural network assisted crack... 637 4. Neural networks for crack and flaw identification 4.1. The direct-inverse neural modelling technique Let a given structurewhich contains an unknown crack be considered.The crack is characterized by a set of parameters z = [z1, . . . ,zm] >. For example, in the crack identification, the coordinates of the crack center and the length of the crack are the parameters of interest. Let, moreover, the response of the structural system for a given loading bl, l=1, . . . , l1 and for a given crack z be given by the vector x̃(z,bl) obtained as the solution to the corresponding static or dynamicmechanical problem.Here l1 is the total number of different loading cases. Obviously, the response of the consideredmechanical system is parametrizedby theunknowncrackparameters z. Let,moreover, the response of the examined structurewith the known crack subjected to the same loading bl be denoted by x̃0(z,b l). Here, a feedforward neural network is used to learn the inverse mapping x̃(z,bl)→ z (4.1) for a given value of the loading vector bl using an appropriate dataset of the example cases. The network takes the vector x̃ as the input and provides the corresponding vector z of crack parameters as the output. The data pairs composed of the vectors x̃(z,bl) and the corresponding parameter vectors z are used as training examples. In the prediction mode, thenonlinearnetwork reproduces themapping x→ z, i.e., for agivenvector of measurements x̃ (different fromtheonesused for training) it gives aprediction for the variables characterising the internal crack. The previously outlined method for the direct inverse modelling can be extended to treat problems in elastodynamics by enlarging the input vector such that it takes into account thewhole time series. Thismethodwas used in our previous investigations (see Stavroulakis, 2000). The problemswhich arise with the above simple treatment of the inverse problems in elastodynamics is that the dimension of the input vectors is dramatically increased and that a lot of this huge information is actually redundant. The redundancy is easily explained from the fact that, for example, all measurements at the external boundary before the appearance of the first wave reflected from the unknown defect do not convey any information about this defect and therefore do not help at all for the solution to the inverse problem. In thiswork,we have chosen a few suitably selected time instances along each waveform which seemed characteristic for the problems under study. These time instances correspond to the local maxima and minima, as well as turning points of the waveforms. 638 G.E.Stavroulakis et al. This way, only 4-5 time steps are kept for every measurement point, thus the increase in the number of input dimension is not large compared to the static case. 4.2. The neural network model and training algorithm The neural network model that we used to implement the inverse map- ping is the well-known Multilayer Perceptron (MLP). It is the most widely used neural network model for function approximation with numerous suc- cessful applications in almost every scientific and engineering domain. The most attractive feature of the MLP is that it exhibits excellent interpolation capabilities (even when trained with sparse datasets) which make it an ideal solution for data-driven inverse modelling problems. The MLP model (also known as backpropagation neural network) is a feedforward neural network with one or more hidden layer containing units (called hidden units) with a nonlinear activation function (usually of sigmoid type). In our experiments, to implement a mapping from a d-dimensional input space to an m-dimensional output space, we have used the MLP with d inputs, m outputs and one hidden layer with H hidden units with the hyperbolic tangent sigmoid activation function tanhx= ex− e−x ex+e−x More specifically, if [W ] = [wij] denotes the weight matrix from the input units to the hiddenunits, V= [vij] denotes theweightmatrix from the hidden units to the output units and bi denotes the bias of the hidden unit i then for a given input vector x = [x1, . . . ,xd], the corresponding output vector y= [y1, . . . ,ym] is computed as follows: • First the outputs of the hidden units are computed zi =tanh ( d∑ j=1 wijxj +bi ) i=1, . . . ,H • Next, the network outputs yk are computed using the zi values yk = H∑ j=1 vkjzj k=1, . . . ,m Neural network assisted crack... 639 The weights (wij,vij) and biases (bi) constitute the neural network para- meters to be adjusted during training in order to learn the network to imple- ment the desired mapping. TheMLPmodel can be trained to implement the desired inversemapping from a d-dimensional to an m-dimensional domain by using a training set that contains N examples of themapping, ie. pairs of the form (xi,ti) where xi = [xi1, . . . ,xid] is an input vector and ti = [ti1, . . . , tim] the desired output for the input xi.Once the training set is available, the trainingprocess is actu- ally an optimization procedure that adjusts the network parameters (weights and biases) to minimize the error function E = 1 2 N∑ i=1 m∑ k=1 (yk(xi)− tik) 2 To achieve the errorminimization, any numerical optimizationmethod can be applied, from simple gradient-descent (also called backpropagation training algorithm) to more sophisticated quasi-Newton methods or even global opti- mization methods. In this work, the Levenberg-Marquadt method has been used for theminimization of the error function that is available in theMatlab Neural Network toolbox. This trainingmethod has been found to be themost effective among several tested local optimization techniques and achieved to provide near zero minima of the error function even in the case of networks with a small number H of hidden units. An important issue for the construction of an effective neural networkmo- del is the specification of the number of hidden units H. It is well-known that for large values of H the network tends to overfit the training set. Thismeans that although the network learns the training set very accurately (the training error becomes very small), the prediction performance of the network on new examples (not used for training) is poor. On the other hand, if the number of hidden units H is very small the network does not manage to learn the training set with acceptable accuracy. Therefore, a procedure called the com- plexity control is needed to find a reasonable value for the number of hidden units H. The objective of the complexity control is to identify the smallest neural architecture that is able to learn the training set with acceptable accu- racy. In this work, we applied the complexity control by starting with a small network having H =2 hidden units and gradually increasing the value of H byone, until a sufficiently trainednetwork (with a low error value) is obtained. Since the training algorithm (Levemberg-Marquadt) is local and depends on the initial values of the weights, for each value of H we applied the training 640 G.E.Stavroulakis et al. algorithm at most 20 times, starting each time from random weight initial values. Finally, for every problem examined, after the completion of training, the prediction accuracy of the constructed network was assessed by using a sepa- rate test set of cases (different from the training set). 5. Mechanical modelling Consider an academic two-dimensional problem that is concernedwith de- fect idenfication within a rectangular elastic plate from measurements along its external boundaries, see Fig.1. It should be pointed out that we are consi- dering relatively small defects, with length or diameter equal to 1, in a plate with the external side equal to l=100. Fig. 1. Geometry and loading of the considered plate with a crack For the assumed defect, the coordinates of its center are denoted by xc and yc. The lower boundary of the plate is fixed and the loading is ap- plied at the upper boundary. For the elastic material an elasticity modulus E = 1000000 and Poisson’s ratio ν = 0,3 are used. For the discretization of the whole external boundary of the plate 10 quadratic elements are used. For each hole, 20 quadratic elements in statics and 10 in dynamics, and for every crack 7 quadratic crack elements are used. Neural network assisted crack... 641 We have considered three cases of defects. First, a circular hole is used for the modelling of the defect. Further, a horizontal rectilinear crack is used. Finally, unilateral contact phenomena are considered along the crack sides. The dynamic loading has either a form of theHeaviside function or of a dyna- mical pulse of one wave duration. The pulse loading has a form of a sinusoidal excitation or an impact-like one. The displacement records at specific points of the boundary are used as the input for the neural networks. For the model problem presented in Fig.1, this information is shown, for one point at the boundary of the plate, in Fig.2. The various lines correspond to various po- sitions of the defect. Clearly, one can use these lines in order to identify the defect. Fig. 2. Horizontal (x) and vertical (y) displacements at boundary point A: Dependence on the crack position Thepreparation of data for theneural networkpostprocessing and the sub- sequent solution to the inverse problem has been done in the following way. A learning set, for the training of the neural network, and a test set, for the demonstration of its ability, are produced. The position of the assumed defect changes in each element of these two sets of examples (paradigms). The tra- ining set includes displacements of the upper side for positions of the holewith x=10 to x=90 and from y=10 to y=90 with all combinations produced with the step ∆x = ∆y = 10. Therefore, the training set has 81 examples (i.e. different positions of the defect, differentmechanical problems), each one including 21 measurements. For the training set, the coordinates of the hole center are considered from xc =15 and yc =15with steps ∆x=10,∆y=10 up to the defect with coordinates at xc = 85 and yc = 85 (all intermediate combinations). Thus, the training set has 64 examples with different positions of the defect.We have chosenmeasurement points at time instants where the 642 G.E.Stavroulakis et al. displacement values comming from the considered examples, i.e. with different crack positions, differ as much as possible. These are, practically, the turning points of the waveforms shown in Fig.2. In addition, the time instants where thewave reflections seem to play a significant role in the data are chosen (local minima andmaxima of the waveform). This way one tries to have aminimum size of the input data, so that significant features of themeasurements are still represented. In a future step of the investigation this point could be done au- tomatically. For the hole identification 81 examples with variousmeasurement points on the boundary of the plate have been used. The crack identification follows a similar procedure. Nevertheless, a smal- ler area within the plate is considered for the placement of potential cracks and the calculation of training and test data. This is due to the fact that the crack-type defectswith their stress singularity at the endof the crackmake the dynamic boundary element method we used unstable for crack positions near the external boundaries.Afiner discretizationwould resolve this numerical in- stability andmake the computational effort significantly higher. Nevertheless, we did not use a finer discretization, in order to be able to directly compare the effectiveness of the procedurewith the previously solved hole identification problem. The training set consisted of displacement values at the external bo- undary of the plate from 49 simulations. These data were produced by taking into account a crack with the center coordinates from xc =20 and yc =20 to xc =80 and yc =80with the step ∆x=10and ∆y=10andall intermediate combinations. The test set had, analogously, 36 simulations. 5.1. Numerical results 5.1.1. Example 1: hole identification in elastostatics with two unknowns A circular hole of diameter equal to 4.0 is considered to be the unknown defect. The coordinates (x,y) of its center are the unknown parameters of the inverse problem. The defect is hidden within a rectangular plate with dimensions equal to 100.0×100.0.One static loading case (pure traction on the upper side of theplate, fixedboundaryon the opposite side) is considered.The boundary displacements (10 nodes per boundary) are used as measurements for the solution to the inverse problem. Thus, the dataset contains input- output pairs with 20-dimensional inputs and two-dimensional outputs with values normalized in the range (−0.9,0.9). These are the displacements of the boundary nodes, in the x and y direction with respect to the reference otrhogonal coordinate system, as they are calculated by the boundary element method. For the construction of the training set we considered all possible positions of the center. In particular, for the training set all values of x and y Neural network assisted crack... 643 coordinates in the interval [10−90] with a step equal to 10 have been used. For the test setwe used values in the interval [15−85] with a step equal to 10. The training dataset contains 81 cases, while the test set includes 64 cases. We used a neural networkmodel with 20 inputs, 2 outputs and H =5hidden units. The training error achieved was less than 0.001, and the results for the training set are shown in Fig.3a. The performance on the test set of cases is illustrated in Fig.3b. Fig. 3. Example 1. Performance of the neural network on the training and the test set; ◦ given position, + predicted position Avery interesting issue to note was that in order to achieve successful tra- ining, it was not necessary to use the complete 20-dimensional input vector. Instead, if only a small part of this vector was used (for example eight compo- nents) the same training and test performancewas achieved.This fact suggests that it is possible to solve the inverse problem using sparse boundary displa- cements (eg. three nodes instead of ten) or a lower number of measurements during an experiment, and needs further investigation. Similar results can be obtained with different sizes of holes and with classical (bilateral) cracks. 5.1.2. Example 2: hole identification in elastostatics with three unknowns In this case, both the center of the circular defect and its diameter are considered to be unknown.Holeswith a diameter between 8 and 12 have been considered. All sites in between these values, with a step equal to 0.2 have been used for the construction of the training set. In the test set, diameters between 8.5 and 11.5 with a step equal to 0.2 have been considered. The dataset contains input-output pairs with 20-dimensional inputs and three-dimensional outputs (two outputs for the position coordinates and one 644 G.E.Stavroulakis et al. output for the diameter) normalized in the range (−0.9,0.9). The training dataset contains 1701 cases, while the test set includes 1024 cases. In the first experiment we tried to use the available training set to train a single network with three outputs that simultaneously provide both the lo- cation and diameter of the defect for a given input vector of measurements. Nevertheless, it was impossible to successfully train such a network and obtain results of reasonable accuracy on the test set, especially in terms of the diame- ter of the defect. For this reason, we followed a different (two-stage) approach that involved a cascade of two neural networks. In the first stage, the neural network with two outputs is trained. It takes as the input the vector ofmeasurements andprovides as the outputs the (x,y) coordinates of the defect. It must be noted that the diameter measurements are not taken into account for the construction of this network. Fig. 4. Example 2. Performance of the neural network in predicting location and size for the case of the test set Then, the computed location coordinates (outputs of the first network) along with the measurements are used as the inputs to another network (se- cond stage) that has one output providing the diameter of the defect. Using this two-stage approach we were able to obtain sufficiently accurate results. The number of hidden units was H =10 and H =5 for the first and second network, respectively. Theprediction accuracy of thedefect location (accuracy of the first network) for the test cases is shown in Fig.4a. In this figure, the multiple predictions ’+’ shown for each defect location correspond to cases (experiments) with the same defect location but different diameter. In what Neural network assisted crack... 645 concerns the accuracy of the second network, the average test set error for the estimation of the diameter was found equal to 0.25 with standard devia- tion 0.15, thus indicating the estimation performance of acceptable quality (see Fig.4b). Further investigation showed that the crack and hole identification pro- blems using elastostatic measurements had similar performance. The detailed documentation of this investigation does not provide additional information in this paper (see Engelhardt [4]). 5.1.3. Example 3: Hole and classical crack identification in elastodynamics The transient dynamic problem is modelled with the previously outlined theory. The calculated waveforms have been preprocessed by the authors so that only essential measurement points and time instances were used for the neural network. Only classical cracks without contact were considered in this example. The remaining of theworkwas similar to the static case. Predictions of the neural network for the set of experiments used in the unknown (testing) data are shown in Fig.5. Fig. 5. Example 3. Performance of the neural network in predicting hole and classical (bilateral) location for the test set 5.1.4. Example 4: Unilateral crack identification in elastodynamics An identification of cracks with contact using transient dynamicmeasure- ments was done in the last example. The identification task was more com- plicated, since one tries to use complicate waves, which include reflection and transmission from nonlinear interfaces. For the used limited time interval the 646 G.E.Stavroulakis et al. method worked quite satisfactory, as it is shown in Fig.6. A generalization to more complicated problems should be done with care, since nonlinear dyna- mical phenomena may have chaotic characteristics. Further research in this direction is certainly needed. Fig. 6. Example 4. Unilateral crack identification. Performance of the neural network in predicting location for the cases of the training set Acknowledgments The work has been partially supported by the German Research Foundation (DFG) and by the Greek-German Scientific Cooperation Programm (IKYDA2001). References 1. Alessandri C.,MallardoV., 1999,Crack identification in two-dimensional unilateral contactmechanics with the boundary element method,Computatio- nal Mechanics, 24, 100-109 2. Antes H., Panagiotopoulos P.D., 1992,The Boundary Integral Approach to Static and Dynamic Contact Problems. Equality and Inequality Methods, Birkhäuser, Basel-Boston-Berlin 3. Doḿınguez J., 1993, Boundary Elements in Dynamics, Computational Me- chanics Publications, Southampton and Elsevier Applied Science, London 4. Engelhardt M., 2004, PHD Thesis, Technical University of Braunschweig, Germany (in preparation) Neural network assisted crack... 647 5. FedelinskiP.,AliabadiM.H.,RookeD.P., 1994,Dynamic stress intensity factors inmixedmode, inBoundary Elements XVI, Brebbia C.A. edit., Comp. Mech. Publications, Southampton, 513-520 6. Gallego R., Doḿınguez J., 1995, HBEM applied to Transient Dynamic Fracture,Proc. ICES’95, Hawaii, USA, Edit. S.N. Atluri 7. Gallego R., Doḿınguez J., 1996, Hypersingular BEM for transient elasto- dynamics, International Journal for NumericalMethods in Engineering,39, 10, 1681-1705 8. GallegoR., Doḿınguez J., 1997, Solving transient dynamic crack problems by the hypersingular boundary elementmethod,Fatigue and Fracture of Engi- neering Materials and Structures, 20, 5, 799-812 9. Guiggiani M., 1992, Direct evaluation of hypersingular integrals in 2DBEM, Notes onNumerical FluidMechanics, 33,W.Hackbusch, edit., Vieweg,Braun- schweig, 23-34 10. Guz A.N., Zozulya V.V., 2002, Elastodynamic unilateral contact problems with friction for bodies with cracks, International Applied Mechanics, 38, 8, 895-932 11. LiangY.C.,HwuC., 2001,On-line identificationof holes/cracks in composite structures, Smart Materials and Structures, 10, 4, 599-609 12. Likas A., Karras D., Lagaris I.E., 1998, Neural network training and simulation using a multidimensional optimization system, Int. J. of Computer Mathematics, 67, 33-46 13. Mansur W.J., 1983, A time-stepping technique to solve wave propagation problems using the Boundary Element Method, Ph.D. Thesis, Universiy of So- uthampton, U.K. 14. Oishi A.,YamadaK.,YoshimuraA.,YagawaG., 1995,Quantitative non- destructive evaluation with ultrasonic method using neural networks and com- putational mechanics,Computational Mechanics, 15, 521-533 15. Portela A., Aliabadi M.H., Rooke D.P., 1992, The dual Boundary Ele- ment Method: effective implementation for crack problems, Int. J. Numer. Meth. Engineering, 33, 6, 1269-1287 16. Rus G., Carlborg, 2001, Numerical methods for nondestructive identifica- tion of defects, Doctoral Thesis, Departamento de Mecanica de Estructuras, Universidad de Granada, Spain 17. Rus G., Gallego R., 2002, Optimization algorithms for identification of in- verse problems with the boundary element method, Engineering Analysis with Boundary Elements, 26, 4, 315-327 648 G.E.Stavroulakis et al. 18. Sáez A., Gallego R., Doḿınguez J., 1995, Hypersingular quarter point boundary elements for crack problems, Int. J. Numer. Methods Engineering, 38, 1681-1701 19. Stavroulakis G.E., 1999, Impact-echo from a unilateral interlayer crack. LCP-BEM modelling and neural identification, Engineering Fracture Mecha- nics, 62, 2-3, 165-184 20. StavroulakisG.E., 2000, Inverse andCrack Identification Problems in Engi- neering Mechanics, Kluwer Academic Publishers, Dordrecht, and Habilitation Thesis, Technical University of Braunschweig, Germany 21. Stavroulakis G.E., Antes H., 1997, Nondestructive elastostatic identifica- tion of unilateral cracks through BEM and neural networks, Computational Mechanics, 20, 5, 439-451 22. Stavroulakis G.E., Antes H., 1998, Neural crack identification in steady state elastodynamics, Computer Methods in Applied Mechanics and Engine- ering, 165, 1/4, 129-146 23. Yagawa G., Okuda H., 1996, Neural networks in computational mechanics, Archives of Computational Methods in Engineering, 3, 4, 435-512 24. Yusa N., Cheng W., Chen Z., Miya K., 2002, Generalized neural network approach to eddy current inversion,NCT and E. International, 35, 609-614 25. Zeng P., 1998, Neural computing in mechanics, ASME Applied Mechanics Reviews, 51, 2, 173-197 26. Zgonc K., Achenbach J.D., 1996, A neural network for crack sizing trained by finite element calculations,NDT and E. International, 29, 3, 147-155 27. Ziemianski L., Piatkowski G., 2000, Use of neural networks for damage detection in structural elements using wave propagation, In: Computational Engineering using Metaphors from Nature, Edit. B.H.V. Topping, Civil-Comp Press, Edinburgh, U.K. Identyfikacja pęknięć i wad strukturalnych w stanach nieustalonych za pomocą sieci neuronowych Streszczenie W pracy zajęto się problemem identyfikacji pęknięć i innych uszkodzeń struk- turalnych w dwuwymiarowym stanie odkształceń sprężystych za pomocą metod nu- merycznych. Modelowanie mechaniczne oparto na metodzie elementów brzegowych ze szczególnym uwzględnieniem kwestii osobliwości pęknięć. Możliwość powstawania częściowo lub całkowicie zamkniętych pęknięć wprowadzono poprzez wykorzystanie Neural network assisted crack... 649 liniowej metody komplementarności. Dla rozwiązania zagadnień odwrotnych użyto sieci neuronowych ze wsteczną propagacją.W zagadnieniach dynamicznych efektyw- ność zaproponowanej procedury zwiększono poprzez odpowiednią wstępną obróbkę danych wejściowych. Na przykładzie dwuwymiarowychmodeli opisywanych w pracy stwierdzono podobną skutecznośćmetody, jak w przypadku klasycznego zagadnienia identyfikacji wad strukturalnych.Wykazano, że identyfikacja jednostronnychpęknięć, która jest znacznie trudniejszym zadaniem, jest możliwa za pomocą zaprezentowanej metody, jeśli do analizy modelu przyjąć odpowiednio dobrane obciążenie testowe. Manuscript received December 2, 2003; accepted for print January 21, 2004