JOURNAL OF THEORETICAL

AND APPLIED MECHANICS

42, 3, pp. 539-563, Warsaw 2004

HYBRID, FINITE ELEMENT-ARTIFICIAL NEURAL

NETWORK MODEL FOR COMPOSITE MATERIALS

Marek Lefik

Chair of Geotechnical Engineering and Engineering Structures, Technical University of Łódź

e-mail: emlefik@p.lodz.pl

An application of Artificial Neural Networks for a definition of the ef-
fective constitutive law for a composite is described in the paper. First,
a classical homogenisation procedure is directly interpreted with a use
of this numerical tool. Next, a self-learning Finite Element code (FE
with ANN inside) is used in the case when the effective constitutive
law is deduced from a numerical experiment (substituting here a purely
phenomenological approach). The new contribution to the classical self-
learningprocedure consistsof its adaptationtoacaseof anon-monotonic
loading (non one-to-one load-deformation curve).This new ability of the
method isprincipallydue to the incremental formof the constitutive equ-
ation and the respective scheme of the neural network structure. Also
an organisation of a constitutive data-base containing learning patterns
is suitably modified. It is shown by examples that the training process
is very quick. The error of this method is smaller, comparing to other
schemes of data acquisition.

Keywords:ArtificialNeuralNetworks, compositematerials, self-learning
FEmethod

1. Introduction

The paper is concerned with the presentation of some applications of Ar-
tificial Neural Networks (ANN) in the analysis of composites. Not only a for-
mulation of the constitutive law for a composite but also its representation
inside a finite element code is described in the paper. Themethods presented
here are intended only for composites with a very finemicrostructure.
In the introductory part we explain why the representation of the consti-

tutive law by ANN is useful, and we suggest what kind of problems can be
reasonably treated with it.


540 M.Lefik

The main problem in the analysis of composites is the estimation of the
effective mechanical stiffness, effective thermal conductivity, effective thermal
expansion, effective electrical resistance and so on. Effective – means here –
significant when the structure is considered as a fictious, homogeneous body,
disregarding details of its internal composition. These properties characterise
global (or macro) behaviour, i.e. the behaviour observed at the macrosca-
le. The perturbation of fields of variables of the problem due to the micro-
heterogeneity is insignificant from this perspective. It is obvious that the glo-
bal behaviour is determined at themicrolevel, by geometry of components and
their individual physical properties. Despite this, we want to carry out the es-
timation of global effects without pushing a finite element mesh down, into
the geometry of the microstructure, while the global boundary value problem
under global loading conditions is to be solved. Because of the scale separation
between structural levels a mesh fine enough for the microlevel would result
in a huge number of elements at the macro-structural level, thus, with a huge
number of unknowns in the model. This can be avoided using theory of ho-
mogenisation. The theory of homogenisation (or rather a variety of theories)
can be considered as a fundamental tool of a reasonable analysis of materials
with a microstructure.

Alternatively, a set of experimental observations of the behaviour of a
macro-sample of the heterogeneous material must be performed to define the
effective properties of the composite.Recently thisphenomenological approach
hasbeen frequently substitutedwith a so-called ”numerical experiment” (Ban-
deira et al., 2001; Wriggers and Lohnert, 2002). In the paper we use mainly
such substitution as a source of constitutive data.

We note first that in the frame of the classical homogenisation theory some
regularity of themicrogeometry of the composition is assumed. It is necessary
to simplify the description of a heterogeneous body. There are two principal
idealisations of real structures. Asymptotic homogenisation of periodic media
is applicable to the analysis of structures obtained by translation of an ele-
mentary cell (called the cell of periodicity) while in a self-consistent approach
one assumes that the considered composite is made by multiple, irregular re-
petitions of a scaled pattern. The third possibility is a stochastic description,
which is out of the scope of this paper.

In the case of periodic composites themathematical formalism allows us to
build a qualitative theorywithout any a priori restriction concerning the form
of the resulting effective constitutive law (see, for example Sanchez-Palencia,
1980). However, in the practice of physically non-linear composites the natu-
re of non-linearity must be taken into account at an early stage of the de-


Hybrid, finite element-artificial neural network model... 541

velopment. For example, exponential non-linearity of components requires a
theoretical formulation quite different than that for an elastic-plastic physical
framework (Suquet, 1985). Because of this, many individual, particular the-
ories have been built for particular classes of materials sometimes combined
with particular geometries of the cell of periodicity.

Using asymptotic homogenisation, a solution to the problem of periodic
media typically splits into two boundary value problems. The first one is
formulated at the microlevel, for the repetitive microstructural domain. Its
solution determines effective material characteristics and allows for the con-
struction of stress distribution at the level of microstructure in the phase of
”unsmearing”. The second boundary value problem (the global one) is stated
for the homogeneous, fictious body. The constitutive description of this ma-
terial is obtained at the first stage. The principal difficulties result from the
fact that in the non-linear material or geometrical frame the superposition of
micro- and macro-behaviours is not allowed. Moreover, the coupling between
these two levels is explained by the fact that the solution to themicroBVpro-
blemdependson themicro-geometrywhichchanges during the global solution,
which, in turn, depends on the results of the micro BV problem through the
values of constitutive parameters in the global constitutive law. In the case of
a complicated physical and geometrical composition the mentioned problems
are rather difficult to overcome. This situation is even more difficult from the
point of view of a designer who needs amethod that works well formany geo-
metric andphysical arrangements suitable for various schemes of the structure
that should be considered in the design process. This analysis is simpler if can
be done without revision of the mathematical formulation at each stage of
design and at each level of the structure.

Wriggers and others (Wriggers and Lohnert, 2002; Bandeira et al., 2001)
have made an important step toward such a solution. The proposed method
consists of some numerical experiments using FEmethod at the meso- or mi-
crolevel. The proposed approach is consistentwith fundamental findings of the
homogenisation theory: as in the theory of homogenisation the global behavio-
ur is deduced froma small sample of a compositematerial but now themethod
is fully numerical. It assumes the application of pre-existing numerical proce-
dures in the modelling of contact, friction and other effects at the microlevel
(inter-particularmechanics is considered for granular assemblies for example).
Another important direction in the numerical homogenisation is represented
by Kouznetzova, Brekelmans and co-workers (see Kouznetzova et al., 2001).
Also in this approach an expected form of the resulting, effective constitutive
law should be imposed a priori.


542 M.Lefik

A self-consistent approach can be considered as themost suitable theoreti-
cal frame resulting with a closed-form expression giving the effective material
characteristics as a function of the volume fraction of the forcing material di-
spersed in thematrix. This approach, (see Zaoui, 1997; Kroner, 1978; Boutin,
2000; Hashin, 1968) is less sensitive on the morphology of the sample of the
composite. Boutin, Zaoui andmany other authors have developed expressions
for different effective properties in the case of three separate levels of porosity:
micro, meso andmacro (Boutin et al., 1998; Zaoui, 1997). Unfortunately, also
thismethod assumes some qualitative a priori decisions concerning the global
constitutive law. Finally, it seems to be very difficult to take into account all
possible non-linear, unilateral microfeatures defining ”generic heterogeneity”
for this method. Also the geometry of the scalable cell should be very simple
in this approach (concentric ellipsoids).

Our method of solving the problem of the effective constitutive law for a
materialwithamicrostructure is entirely numerical. It is basedon the assump-
tions that the Cauchy stress (or its increment) in the fictitious, homogenised
body is an unknown, non-linear function of mean strains, strain increment,
physical parameters defining themicrophysics of composition, and the geome-
try of the representative volume. The history of loading-unloading can affect
all this data. We assume a form of this function representing it by an Artifi-
cial Neural Network with hidden layers. At the input of this network we have
all independent variables, mentioned in the above sentence. At the output
we obtain a state of stress in the point of the body at the macrolevel. Since
the components of the strain tensor are present among the input variables,
the Artificial Neural Network acts as a numerical representation of the effec-
tive constitutive law. It is well known that the ANN can be considered as a
universal approximation of a function, functional or operator (a formal proof
of this statement can be found in Chen and Chen (1995)). Because of this,
this representation does not constrain the generality of the global, effective
constitutive law we are looking for. To define the ANN we must determine
all weights of the links between neurones belonging to various layers of the
network. The coefficients in this very rich and flexible representation are defi-
ned during an iterative process of ”training”. The source of knowledge for the
learning can be a real or numerical experiment. If a numerical experiment is
considered, an auxiliary FE modelling process is similar to that proposed by
Wriggers and Lohnert (2002) with the exception that in our case the form of
the constitutive equation is defined by the ANN thus, it does not restrict the
possible effective behaviour. In this case, like in the periodic homogenisation,
we discover rather than impose the global constitutive law, but now – also


Hybrid, finite element-artificial neural network model... 543

for a possibly non-linear effective behaviour. The FE analysis of a sample of
the composite is a source of ”examples” for the training process. These exam-
ples form a knowledge base containing the constitutive data. The considered
sample of the composite that defines the microstructure consists of only one
repetitive cell or a representative volume. In examples presented in this pa-
per the representative volume will be replaced, for simplicity, by an array of
cells. The equivalence between the composite structure and its homogeneous
counterpart is defined in the paper in an original manner, consistent with the
proposed numerical procedure (different from that, described in the frame of
the theory of homogenisation).

The use of the ANN in the role of the constitutive operator is not new.
Classical application of Artificial Neural Networks for constitutive modelling
of concrete was originally proposed by Ghaboussi et al. (1991). An improved
technique of ANN approximation for a problem of mechanical behaviour of
drained and undrained sand is presented in Ghaboussi and Sidarta (1998). In
the latest surveys Waszczyszyn (1997, 1998), Yagawa and Okuda (1996) the
role of neural computing in the constitutive modelling is clearly pointed out.
A similar approach is employed in Garcia et al. (2000), Gawin et al. (2001),
Kortesis and Panagiotopoulos (1993), Mucha (1997), Penumadu and Zhao
(1999) andmany other papers.

The essence of the ANN technique is to construct an application that
attributes a given set of output vectors to a given set of input vectors. When
applied to the constitutive description, the physical nature of these input-
output data is clearly determined by measured quantities: strains-stresses or
displacements-forces. The neural ”black box” operator, replacing the existing
symbolic description, cannot be directly used in the symbolic development of
the formulae and is useless for a closed-form solution. It can be very efficient
as an element of a FE code, as it is shown in Gawin et al. (2001), Mucha
(1997). A hybrid FE-ANN code is described also in Shin and Pande (2000,
2001). The authors show that the insertion of the constitutive law presented
in the formof a neural operator leads to some qualitative improvements in the
application ofFE in engineeringpractice.Namely, theANNrepresentation can
bemodified to reduce the error of a FE numerical experiment with respect to
the real experiment. Our representation of the constitutive lawwith the ANN
is slightly different. It is incremental while in Shin andPande (2000, 2001) the
ε−σ functions are directly approximated.

Obviously, there is no need to use the presented method in the case of a
simple micro-geometry of the composite and when the behaviour of compo-
nents is linear. In this case the classical description of the effective behaviour


544 M.Lefik

is much better. The ANN representation can be useful in a case, when the
physical or geometrical non-linearity dominates themicro-description and the
FE analysis at the level of the representative volume (or a direct experiment)
is the only possible tool for its effective analysis.

The paper contains academic examples illustrating the assumed strategy
of the modelling.We refer the interested reader to our previous papers (Lefik
and Schrefler, 2002a,b) in which some aspects of practical applications of the
presentedmethodto theanalysis ofmechanical behaviourofa superconducting
cable is shown.

2. The effective constitutive law for acomposite and its

representation by an Artificial Neural Network with hidden

layers

2.1. Effective properties of the composite

For usual analysis of a two scale composite we define two sets of co-
ordinates: local y related to the single, repetitive cell of periodicity Y and
the global x for the entire body Ω (composed of a finite number of the cells,
Fig.1).

Fig. 1. The cell of periodicity and two systems of coordinates


Hybrid, finite element-artificial neural network model... 545

Thedimensionless characteristic length ε of the cell Y is treated as a small
parameter tending to zero (the ”length” of the cell Y related to the ”length”
of Ω). Let the displacement field uε(x,y) be an ε-depending solution to the
problem

find uεi ∈V and σ
ε
ij ∈L

2 such that:
(2.1)

∀vi(x)∈V

∫

Ω

σεkl(u)vk,l dΩ=

∫

Sf

Fivi dSf

where the stress tensor σ is computed via (2.2)1 and remains inside the ad-
missible set P (2.2)2 defined by any usual yield criterion (an associated flow
rule completes the constitutive description)

σεij(u
ε)(x,y)= aijkl(y)ekl(u

ε(x,y)) σεij ∈P(y) (2.2)

V is the usual space of kinematically admissible displacements, e(v) is the
linearized strain tensor computed at v, F denotes a vector of external load.
Components of the fourth order elasticity tensor a are piece-wise constant
functions of ywith discontinuities along regular surfaces, and satisfy the clas-
sical conditions of symmetry, ellipticity and positivity.

Let u0, σ0 be the solution to the ”homogenised” problem, i.e. problem
(2.1) in which the variable material coefficients a(y) are replaced with some
unknown constant values ah. We suppose that the periodicity of material
characteristics imposes an analogous periodical perturbation on the quantities
describing themechanical behaviour of the body.Hence, for the displacements
we have

uε(x)≡u0(x)+εu1(x,y) (2.3)

u1 Y -periodic.

The same can be obtained for strains and the stress tensor via a simple
total differential formula with respect to x

eε(x)= e0(x,y)+εe1(x,y)+ ...+0(ε2)
(2.4)

σε(x)=σ0(x,y)+εσ1(x,y)+ ...+0(ε2)

Thanks to (2.4), problem (2.1) splits into a sequence of problems of the order
ε0, ε1 and ε2.
The problemof the order ε0 leads to equation (2.5) inwhich the perturba-

tion of the displacement is computed according to (2.6) (this last expression


546 M.Lefik

is not an assumption, it follows from the detailed analysis of the problem, see
for example Sanchez-Palencia (1980) and references given there)

find χ
pq
i ∈VY such that: (2.5)

∀vi ∈VY

∫

Y

aijkl(y)(δipδjq+χ
pq
i,j(y)
(y)vk,l(y)(y) dΩ=0

u1i(x,y)= epq(x)(u
0(x))χ

pq
i (y)+Ci(x) (2.6)

We call χpq(y) the homogenisation functions for displacements.
For ah uniquely defined via the classical homogenisation procedure uε

converges weakly to u0 as ε tends to zero. In this case, the tensor ah defines
an effective constitutive relationship between e(u0) and σ0 (a new admissible
set in the stress space Phmakes apartof this effective constitutivedescription)

ahijpq = |Y |
−1
∫

Y

aijkl(y)(δkpδlq+χ
pq
k,l(y)
) dY (2.7)

σ0ij(x)= a
h
ijklekl(u

0) (2.8)

The heterogeneous structure can now be studied as a homogeneous one with
effective material coefficients given by (2.7). The global displacements, strains
and average stresses can be computed at that moment.
In this paper, instead of looking for ah and Ph we are going to use a

numerical representation of the effective constitutive law by a suitably trained
ANN N. The input-output vector of N contains components of the strains
e(u0) and stress tensor σ0 (in the vector-like notation)

σ0 =N@e(u0) σi =
∑

q

w
(2
qi)f(q)(epw

(1)
p(q)
+θ
(1)
(q)
)+θ

(2)
i (2.9)

Symbol@denotes a result of an action of an operator at the specified operand.
In this case it is thevalue of theneural operator at the inputvector.Repetition
of the subscript indicates summation over all its range (unless it is enclosed
by paranthesis). Superscripts denote the number of a layer of neurones.
The parameters w, θ and f in (2.9)2 will be interpreted as weights, biases

and transfer functions of the Artificial Neural Network defined and discussed
in the next section. The finite element code with N included as a material
description subroutine can be used to solve numerically problem (2.10) for u0

find u0i ∈V and σ
0
ij ∈L

2 such that:
(2.10)

∀vi(x)∈V

∫

Ωh

σ0kl(u)vk,l dΩ=

∫

Sf

Fivi dSf σ
0 =Nh@e(u0)


Hybrid, finite element-artificial neural network model... 547

The correct, effective constitutive law in (2.10), representedby theANNcalled
Nh assures the following requirement

∀F ‖uε−u0‖L2(Ω) ¬ τ τ ¬ ε‖u
1‖L2(Y ) as ε→ 0 (2.11)

Estimation (2.11)2 follows from a direct application of Cauchy’s inequality.
Expression (2.11) defines a correspondence between the heterogeneous the ho-
mogeneous (homogenised) fictitious body inamanner suitable for ourpurpose.
In the numerical practice, comparison (2.11) of the effective solution with the
exact one will be checked only in a few strategic points. Condition (2.11)
formulated for any F is verifiable in practice only for a finite number of pre-
scribed loads. Searching for the Nh, we employ an iterative procedure called
the ”training of Artificial Neural Network” proposed in all textbooks devoted
to ANNs, as for example Hertz et al. (1991), Osowski (1996), Tadeusiewicz
(1993) and shortly described in the next section.

3. Artificial Neural Network for constitutive description and a

hybrid ANN-FE code

An inspiration for the application of Artificial Neural Networks (ANNs) to
different branches of engineering sciences comes from the analysis of transmis-
sion and transformation of signals in human or animal neural systems. The
importance of themethod significantly increased during last years. Nowadays,
theANNs are successfully used in the computer-aidedmanagement,modelling
of different physical dynamic processes depending onmany fuzzy variables. In
this application, the ANN will be included into the classical Finite Element
procedure as a constitutive subroutine, as it is described in this section.

3.1. Artificial neural network with hidden layer for incremental repre-

sentation of the constitutive law

TheANN can be considered as a universal, non-linear operator that trans-
forms a set of suitably interpreted discrete values into another set of nume-
rical data. Considering the structure of this operator, the ANN appears as a
collection of some simple processing units (called neurones or nodes) that are
mutually connected by linkswith adjustable strengths (seeFig.1). This logical
system, with nodes organized in layers, transforms the input signal presented
at its ”input” nodes into the output signal produced at the ”output layer”.


548 M.Lefik

The sequence of signals presented at the input and that expected to be obta-
ined at the output layer, are called the input and target patterns, respectively.
The activity of the network consists of transformation of each input pattern
into the output signal, and is defined by expression (3.1). After each cycle
(forward transmission of the input signal and back propagation of the output
error), the weights of connection aremodified to reduce the error between the
network response and the given output pattern

oi = f
(

f
(

w(2)qr f(ipw
(1)
pq )+ b

(1)
r

)

w
(3)
ri + b

(2)
i

)

σi = oi (3.1)

In this representation, f is a given sigmoid function of one variable x, Eq.
(3.2),while w and bareweights andbiasesbeingconstant duringtransmission
of the signal

fi(x)=
pi[1− exp(−λix)]

1+exp(−λix)
(3.2)

All independent variables are in i. It can be proved that this form appro-
ximates all continuous functions of several variables. A graph related to this
formula is commonly known as a neural-like network. In Fig.2 a special form
of such a network, useful for our purpose, is illustrated.

In Fig.2 we show a canonical scheme of the Artificial Neural Network for
incremental approximation of the constitutive law.

Fig. 2. A scheme of the ANN that is used throughout the paper. The light arrows
show spontaneous activity of the net along a given path in the space of
deformations. ρ is an internal parameter that allows for many suitable

interpretations, depending on the field of application


Hybrid, finite element-artificial neural network model... 549

For such a operational scheme there exist some well developed methods
for finding the coefficients: weights and biases.

Onecanobserve that the formof (2.9) for theapproximationof the effective
constitutive law is identical with expression (3.1), suitably truncated.

3.2. An incremental form of the constitutive equation

Inour formerpapers (Lefik, 1997, 2001; LefikandSchrefler, 2002, 2003),we
showed that the incremental formof the constitutive equation ismore suitable
for approximation by anArtificial NeuralNetwork.We shortly repeat here the
arguments that support this idea.

We consider only a special form of the incremental constitutive law, na-
mely (3.3)

σ̇= g(σ,F, Ḟ ,ρ) (3.3)

In this representation F denotes the gradient of deformation, g is a function.
It is well known that g in the representation (3.3) can be chosen to fulfill all
conditions defining the admissible subspace P in the space of stresses. In this
way, the solution σ to differential equation (3.3) can be statically admissible.
This formulation, on one hand involves only unknown function g that can be
well approximated by the ANN, on the other hand does not introduce any
”geometric” object in the stress space like a yield surface or other potential
usually defined in the case of non associated plastic flow.These are fundamen-
tal arguments supporting this representation when anANN approximation is
used. The third important argument is following. The approximation of the
incremental form is simpler since the ANN learns only local and short seg-
ments of the stress-strain graph. In the hidden layers some neurons specialize
in switching betweendifferent ”current states of stress”.Thus, thenetwork can
be small and learns quickly. This is shown by examples by Lefik and Schrefler
(2003). Some graphs taken from that paper are presented in Fig.3. In this
Figure we see an approximation of the experimental graph of the mechanical
histeresis of a superconducting cable, described byNijuhuis et al. (1998). The
graph in this Fig.3a shows an autonomous activity of the trained ANN along
the given ”path”, as it is expressed by equation (3.4), below. The approxi-
mation is realistic and the ANN – very small. Such a result is impossible to
obtain by a non-incremental network. In that case a typical result is shown in
Fig.3b.

The incremental approximation of the constitutive law by the ANN can
be defined as follows.


550 M.Lefik

Fig. 3. Experimental graph (dashed line) taken fromNijuhuis et al. (1998),
approximated by the incremental and non-incremental ANN (solid line).

Autonomous generation of the graph by the networks for the points never shown in
the training, according to Lefik and Schrefler (2002b): (a) incremental approach,

(b) non-incremental one

The trainedANN is an operator that performs the the following operation

(σt,F t,ρt,∆F t)→ (σt+1,F t+1,ρt+1) ∆F t = Ḟ
t
∆t (3.4)

3.3. Insertion of the ANN into a Finite Element code

We show next that representations (2.9), (3.1) work well inside a fini-
te element code. Let us suppose that the basic equation for the standard;
displacement-based finite element method can be written in the form

∫

V

B
>

N : τ dV
0 =

∫

S

N
>

Nt dS+

∫

V

N
>

Nf dV

(3.5)

ε(du)=BNdu

where V is the volume of the body in the reference configuration, BN is the
matrix that operating on the vector of admissible variation of independent


Hybrid, finite element-artificial neural network model... 551

variables to give the strainmeasure ε(du) coupledwith thematerial stress τ .
In what follows in the paper, we will consider small transformations, thus an
infinitesimal strain tensor. The index N means that B is constructed on the
basis of the approximation of u by a set of interpolation functions N(x) on
appropriate finite elements. On the right-hand side of (2.8) t is a stress vector
given on the part S of the boundary, while f are body forces acting on the
elementary volume dV .

Since the considered material behaviour is non-linear, the Newton algori-
thm will be applied to solve the system of equations (2.8). The Jacobian of
the left-hand side of (2.8) can be written as follows

J =

∫

V

[dτ : ε(du)+τ : dε(du)] dV 0 (3.6)

The first term in integral (2.9) can be computed using a usual constitutive
assumption

dτ =D : dε (3.7)

where Dij = ∂τi/∂εj.

We can rewrite the above equations, taking variation with respect to the
independent variables of the problem u and obtaining (by definition) the
stiffness matrix K

KmainMN =

∫

V 0

BM :D :BN dV
0 (3.8)

The second term in integral (3.6) represents the initial stress matrix.

Using the assumed representation of the constitutive law by the ANN,
instead of (3.7) we obtain

dτ =Nd,σ@dε (3.9)

The index d denotes that the network quality is best for some given value of
the increment d, σmeans that the stress increment is computed at the current
value of τ =σ. It is clear thatwemust replace the neural operator in (3.1) by
the matrix D or simply, construct this matrix using the given representation
of the constitutive law. This will be done by trial incrementation of ε. Let us
suppose that both tensors dτ and dε are represented by column vectors

dσ= [dτ1,dτ2,dτ1] = [Nd,σ@dε1,Nd,σ@dε2,Nd,σ@dε1]

[dεt] = [dε1,dε2,dε3] (3.10)

D= [dσ][dεt]−1


552 M.Lefik

Thematrix of trial vectors dεt is always proportional to the strains in the last
equilibratedpoint (σ,ε) during theNewton iteration process (preceding step).
The trial vectors cannot be arbitrary because Nd,σ@dε 6=−Nd,σ@(−dε), and
in fact two different tangent stiffnessmatrices can be defined in any point: one
for loading and the other for unloading. It is supposed then that the loading
(unloading) is continued during current increment in the Newton iterations.
Formulae (3.10)1,2 are used here instead of computing the derivatives of the
neural network with respect to input values.

The stress in the second term in integral (3.6) is computedusing the neural
network in the recall mode for a given constant step dε, until the strain ε
at the trial solution in the current step is reached. The ANN acts here in
the autonomous activity mode as it is defined in Section 3. This process cor-
responds to classical integration of the incremental constitutive equation for
updating σ. It always starts in the last equilibrated point, and the increment
dε is proportional to the one defined for this step (loading or unloading).

4. Classical homogenisation procedure directly interpreted by the

Artificial Neural Network

It can be shown that the vector of homogenisation functions, Eq. (2.5),
appears as a perturbationfield in the numerical experiment inwhich the cell of
periodicity is loadedwith a uniformdisplacement of points on theborder.This
imposes a given average strain state equal to the homogeneous one, computed
from formula (2.9) in which u is to be applied on the border

ui = e
ave
ij xj (4.1)

The corresponding stress can be obtained according to expression (4.2) with
t used for the stress vector on the boundary of the cell

σaveij =

∫

∂Y

xi⊗ tj ds (4.2)

Since Eq. (2.9) is a universal approximator, the introduced assumption does
not restrict the possible effective behaviour. Like in the periodic homogeni-
sation, we discover rather than impose the global constitutive law. All the
constitutive information is in the data set furnished to the network.


Hybrid, finite element-artificial neural network model... 553

4.1. Constitutive data-base for the training

Aconstitutive data-base contains pairs of input vectors andoutput vectors
for the training of the network. The input and output is accorded with the
structure of the ANN (Fig.2) and is always of the form:

{(input vector), (output vector)}

For the case of the incremental law, for the k-th pattern, we have

{[εi,σ(εi),ηi,∆εi],∆σαβi}k (4.3)

Additionally, in the below shown example, we must satisfy the following
condition imposedby isotropyof the approximated constitutive law (@denotes
the action of the neural network operator on the input data)

∀i if N@



















εi
σi
η
∆εi



















= {∆σi} then ∀Θ Θ
>
Θ=1 we have

(4.4)

N@



















Θ>εiΘ

Θ>σiΘ

η

Θ>∆εiΘ



















= {Θ>∆σiΘ}

To satisfy condition (4.4), wemust train thenetworkwith some supplementary
data: the new subset of patterns is of the form

{

[Θ>εiΘ,Θ
>σiΘ,ηi,Θ

>∆εiΘ],Θ
>∆σαβiΘ

}

(4.5)

The subscript k refers to the pattern obtained from the ith experimental
pointby transformationvia the kth rotationmatrixΘ.The total number K of
these additional terms in thematrix of patternsdependson thenumberof trial
rotation needed to train the network up to a satisfactory level of tolerance. An
artificial construction of the experimental data has been necessary to perform
a 2D numerical experiment.
In this case, the source of constitutive data is a numerical experiment de-

fined with expressions (4.1) and (4.2) executed on a single cell of periodicity
of the composite. The results of FE computations for different (assumed and
imposed over the cell) values of a small deformation tensor or tensor of defor-
mation gradient must be completed by the manipulation prescribed by (4.4).


554 M.Lefik

4.2. Example

As an example, we consider a hyperelastic composite, the repetitive cell
of which consists of a neohookean material with a circular hole of the radius
equal to the distance between the neighbouring, circular pore.

In Fig.4 this situation is shown. In Fig.5 some of deformed configurations
are presented. The number of such numerical experimentswas 1260, including
incrementation.The full constitutive data base containing all rotated datawas
much bigger. As far as the increments with the negative sign were concerned
it was about 15000 patterns. The training was organized by epochs that con-
tained about 1250 patterns. After the first decisive reduction of the error of
the training, most of the patterns used as the test data revealed very good
coherence with the training results.

Fig. 4. A single cell of periodicity of the composite

5. The self-learning Finite Element code for deducing effective

constitutive relationships from a numerical experiment

The self-learning finite element procedure is based on the standard FE
code including the Newton-Raphson type iterative method. The neural con-
stitutivemodel, Eq. (3.1), can be applied for the solution to general structural
problems in the samemanner as the conventional one.All details can be found


Hybrid, finite element-artificial neural network model... 555

Fig. 5. Examples of trial deformations of the cell of periodicity – a source of the
constitutive data

in Shin and Pande (2000, 2001). The main advantage of this substitution is
the possibility tomodify the constitutive model simply by retraining the neu-
ral network. This feature results in a self-learning code, which evaluates the
suppliedANNmodel andadjusts itself to a desirable response of the structure.
This strategy permits the constitutive relationship to be built directly from
experimental data or, alternatively, from the exact FE solution. In our case,
it is solution of (2.1) for a structure composed of many repetitive cells of the
given internal structure (may be very complicated). If the considered number
of cells is large enoughwe are able to capture its effective (global) properties.


556 M.Lefik

The procedure starts with calculation of the trial FE solution using any
initial constitutive law (e.g. linear constitutive matrix). Pairs (eI,σI) at each
Gauss point on any considered load level are saved for future training of the
ANN. The displacements at some strategic points of the structure are moni-
tored δI and compared with the known data describing the behaviour of the
compositematerial δD. Those vectors constitute, respectively, the output and
the target of the whole procedure.
The current adjustment ismeasuredbymeans of thenormalized root-mean

square error calculated as follows (instead of (2.6)1)

Err=
1

δmax
D
− δmin
D

√

√

√

√

1

MN

N
∑

n=1

M
∑

m=1

(δm
II,n
)2 <τ (5.1)

In (5.1), N is the number of load increments, M – number of monitored
displacements, the vector δII = δI − δD represents the errors made by the
ANNat themonitored points and δmaxD , δ

min
D are themaximumandminimum

values of |δD|, respectively.
In the following, the displacement differences δII are treated as the load

for the new solution. The obtained strain-stress pairs (eII,σII) are used to
correct the current guess (eI,σI)

etrain = eI σ
train =σI +σII (5.2)

Suchpreparedtrainingdata (5.2) areused for retraining theneuralnetwork
that forms the constitutivemodel. The newweights of theANNare saved and
the next step of the self-learning procedure starts with the corrected ANN
constitutivemodel. If the value of E becomes lower than the tolerance level τ
? or the admissible number of self-learning steps is reached, the process is
finished.According to Shin andPande (2000, 2001), this procedure converges.

5.1. Example

Weconsider the same example as inSection 4.3.This time, however, a sam-
ple containing several cells is considered. The non-deformed cell of periodicity
formsa squarewith ahole of diameter equal to half of the edge. For acquisition
of the constitutive data we perform three different numerical FE experiments
(because of (2.6)). All these experiments are qualitatively different than those
proposedbyShinandPande.We imposeuniformdisplacements of borders of a
portion of the composite. The nature of these kinematic loadings is easily seen
inFig.8: two different tests of tension (free andconstrained tension) anda she-
ar test. The controlled quantities are now values of reactions in the boundary


Hybrid, finite element-artificial neural network model... 557

nodes. A test like this was never proposed by the authors of the self-learning
method since its original application was acquisition of the constitutive data
from real and in situ observations. It is obvious that the point-wise reaction
is not an observable quantity. In the context of the numerical experiment this
strategyworks quitewell, even in the analysis of compositematerials forwhich
the stress jumps in the small scale of the body. In the approximation of the
constitutive law an incremental constitutive description and the corresponding
architecture of the ANN has been used (described by Lefik (2001)).

Fig. 6. Three numerical tests on a sample of the composite used simultaneously for
the preparation of the data-base for ANN training

5.2. Discussion of the results

In Fig.7 two testing finite element computations are shown. These com-
putations have been executed for a loading never used in the training and
with the FE code including the ANN inside. A relatively small ANN with 10
and 7 neurones in hidden layers have been trained with the constitutive data
collected from each first Gauss point of the homogeneous problem (the black,
rectangularmesh in Fig.7). The performance of the trainedANNwas checked
with the loading scheme never used in the training. This scheme is shown in
Fig.7. The coarse, rectangular mesh (dark line) represents the displacements
of the ”effective”, homogeneous body under action of the horizontal shearing


558 M.Lefik

Fig. 7. Two numerical tests on a sample of the composite that have never been used
in the training. The coarsmesh – results obtained for the effective model, the light

deformedmesh – ”exact” solution


Hybrid, finite element-artificial neural network model... 559

stress vector applied to the upper edge of the rectangular sample. This compu-
tationwas carried out byFE-ANN code (the FE code publishedbyBonet and
Wood (1997) wasmodified and used). The triangularmesh (light lines) repre-
sents the ”true” deformed configuration. The coarse mesh is defined for the
homogenised body. The qualitative accordance is good, themaximum error in
displacements does not exceed 5%.

In Fig.8, the difference of the return curve and the one obtained with the
loading is used as the measure of the error. It is known that for hyperela-
stic composites these two curves should coincide. What is shown in Fig.8 is
obtained for the example trained within the direct approach. In Fig.8b the
curves coincide much better that in Fig.8a. The self-learning method of data
acquisition is more efficient. In this case the ANN is simply ”tailored” for the
Finite Element code in which will be included as a constitutive procedure.

Fig. 8. Comparison of loading and unloading stress-strain curves used as themeasure
of the error of the ANN representation of the constitutive law inside the FE code

In the example, only three cases of load are reported. As far as the co-
nvergence to zero is concerned, see Bandeira et al. (2001), a relatively small
trial portion of the composite is sufficient to predict the overall behaviour of
the non-homogeneous material. In our example we consider only 20 cells of
periodicity but even in this case we notice that the perturbation of the mean
displacements is negligible.

6. Conclusions

We conclude that the effective constitutive law for a composite can by
approximated by the Artificial Neural Network with hidden layers. This ap-
proximation does not constrain the form of unknown effective relationships.


560 M.Lefik

The following condition must be fulfilled to assure successful approximation:

• The constitutive law must be formulated in the incremental form

• The ANN must be included as a subroutine in the Finite Element pro-
cedure.

Moreover:

• The self-learning procedure seems to be a very effective tool for the
identification of the global behaviour of composites. Themicrostructure
of thematerial (position of themonitoredpoint on the cell of periodicity)
does not influence the identification abilities of the program.

• The ANN representation of any constitutive law is a flexible tool for
the representation of the effective (global) behaviour of materials with a
complex internal microstructure.

• This representation is very suitable for analysis of composites since it is
”automatic” in the sense that it does not require any a priori choice or
adaptation of the existing constitutive theory for the description of the
observed material behaviour.

• The convergence is surprisingly fast. Four steps are enough to obtain a
qualitatively goodmodel.

• A representation of the effective constitutive law can be very simple (a
network of the architecture 3-6-3 is sufficient in the first example).

• The usefulness of the hybrid FE-ANN code has been confirmed since it
opens up new possibilities in comparison with the standard FE codes
in the sense that the constitutive models can be easily modified and its
simplicity accelerates work of the program.

These conclusions have been supported by examples, not by a theoretical
proof.

References

1. Bandeira A.A., Wriggers P., Pimenta P.M., 2001, Homogenizationme-
thod leading to interface laws of contactmechanics – a finite element approach
for large 3D deformation using ALM, IJMN

2. Bonet J., Wood R.D., 1997, Nonlinear Continuum Mechanics for Finite
Element Analysis, Cambridge University Press

3. Boutin C., 2000, Study of permeability by periodic and self-consistent homo-
genization,Eur. J. Mech., A, 19, 603-632


Hybrid, finite element-artificial neural network model... 561

4. Boutin C., Royer P., Auriault J.L., 1998, Acoustic absorption of porous
surfacing with double porosity, Int. J. Solid Structures, 35, (34-35), 4709-4737

5. Chen T., Chen H., 1995, Universal approximation to non-linear operators
by neural networks with arbitrary activation functions and its application to
dynamical systems, IEEE Trans. on Neural Networks, 6, 4, 911-917

6. Garcia S., Romo M.P., Taboada-Urtuzuastegui V., 2000, Knowledge-
basedmodellingof sandbehaviour,Proceedings of ECCOMAS2000, Barcelona,
11-14

7. GawinD., LefikM., SchreflerB.A., 2001,ANNapproach to sorption hy-
steresiswithin a coupled hygro- thermo-mechanicalFEanalysis, Int. J. Numer.
Meth. Engng., 50, 299-323

8. Ghaboussi J., Garrett J.H., Wu X., 1991, Knowledge-basedmodelling of
material behaviour with neural networks, Journal of Engineering Mechanics,
117, 132-151

9. Ghaboussi J., Sidarta D.E., 1998, New nested adaptive neural networks
(NANN) for constitutive modelling,Computers and Geotechnics, 22, 1, 29-52

10. Hashin Z., 1968,Assessment of self-consistent scheme approximation: conduc-
tivity of particulate composites, J. Comp. Mater., 2, 284-304

11. Hertz J., Krogh A., Palmer G.R., 1991, Introduction to the Theory of
Neural Computation, Lecture Notes Vol. I, Santa Fe Institute Studies in the
Sciences of Complexity, Addison-Wesley,

12. Hu Y.H., Hwang J-N., Eds., 2002,Handbook of Neural Network Signal Pro-
cessing, CRCPRESS

13. Kortesis S., Panagiotopoulos P.D., 1993,Neural networks for computing
in structural analysis: Methods and prospects of applications, Int. J. Num.
Meth. Eng., 36, 2305-2318

14. Kouznetsova V., Brekelmans W., Baaijens F., 2001, An approach to
micro-macromedelling of heterogeneousmaterials,Comp. Mech., 27, 37-48

15. Kroner E., 1978, Self-consistent scheme and graded disorder in poly crystal
elasticity, J. Phys., 8, 22-61

16. Lefik M., 1997,Use of artificial neural network to define a non-linear effective
constitutive law for a composite,Proc. 13th Polish Conf. ComputerMethods in
Mechanics PCCMM’97, Poznań, 725-732

17. Lefik M., 2001,Modified BP artificial neural network as an incremental non-
linear constitutivemodel,Proceedings of European Conference on Computatio-
nal Mechanics, ECCM-2001, on CD

18. Lefik M., Schrefler B.A., 2002a, Artificial neural network for parameter
identifications foran elasto-plasticmodel of super-conductingcableunder cyclic
loading,Computers and Structures, 80, 22, 1699-1713


562 M.Lefik

19. Lefik M., Schrefler B.A., 2002b, One-dimensional model of cable-in-
conduit superconductors under cyclic loading using artificial neural networks,
Fusion Engineering and Design, 60, 2, 105-117

20. LefikM., SchreflerB.A., 2003,Artificial neural network as an incremental
non-linear constitutive model for a finite element code, Computer Methods in
Applied Mechanics and Engineering, Elsevier, Volume/Issue 192/28-30, 3265-
3283

21. Mucha G., Waszczyszyn Z., 1997, Hybrid neural-network/computational
program for bending analysis of elastoplastic beams, Proc. of the XIII Polish
Conf. on Computer Methods in Mechanics, 949-956

22. Nijuhuis A., Noordman N.H.W., Ten Kate H.H.J., 1998, Mechanical
andElectrical testing of an ITERCS1Model Coil Conductor underTransverse
Loading in a Cryogenic Press, PreliminaryReport, University of Twente

23. Osowski S., 1996, Sieci neuronowe w ujęciu algorytmicznym, Wydawnictwo
NaukowoTechniczne,Warszawa

24. Penumadu D., Zhao R., 1999, Triaxial compression behaviour of sand and
gravel using artificial neural networks (ANN),Computers and Geotechnics, 24,
207-230

25. Pichler B.,MangH., 2001,Parameter identification for sophisticatedmate-
rialmodels bymeans of iterativebackanalyseson soft computing,ECCM-2001,
Cracow, Poland

26. Shin H.S., Pande G.N., 2000, On self-learning finite element codes based on
monitored response of structues,Computers and Geotechnics, 27, 161-178

27. Shin H.S., Pande G.N., 2001, ”Intelligent” Finite Elements, in:Computatio-
nalMechanics –NewFrontiers forNewMillenium, S.ValliappanandN.Khalili
(Edit.), Elsevier Science

28. Sanchez-PalenciaE., 1980,Non-HomogeneousMedia andVibrationTheory,
Springer V.

29. Sikora Z., Ossowski R., Ichikawa Y., Tkacz K., 1998, Neural networks
as a tool for constitutive modelling, in: Oka and Yashima, Edit., Localization
and Bifurcation Theory for Soils and Rocks, Balkema, Rotterdam

30. Suquet P.M., 1985, Elements of Homogenisation for Inelastic Solid Mecha-
nics, in: Lecture Notes in Physics, 272, Springer Verlag

31. Tadeusiewicz R., 1993, Sieci Neuronowe, Akademicka OficynaWydawnicza

32. Waszczyszyn Z., 1998, Some new results in applications of backpropagation
neural networks in structural and civil engineering, in:Advances in Engineering
Computational Technology, Civil-Comp Press, Edinburgh, 173-187


Hybrid, finite element-artificial neural network model... 563

33. Waszczyszyn Z., 2000, Neural networks in plasticity: some new results and
prospects of applications, European Congress on Computational Methods in
Applied Sciences and Engineering ECCOMAS 2000, on CD

34. Wriggers P., Lohnert S., 2002, Virtual testing of composite materials,
WCCM V, Vienna, Austria

35. Yagawa G., Okuda H., 1996, Neural networks in computational mechanics,
Archives of Computational Methods in Engineering, 3, 4, 435-512

36. Zaoui A., 1997, Structural morphology and constitutive behaviour of hetero-
geneousmaterials, in:ContinuumMicromechanics, P. Suquet (Edit.), Springer,
291-347

Zastosowanie sztucznych sieci neuronowych w modelowaniu

numerycznym kompozytów przy pomocy metody elementów skończonych

Streszczenie

Wartykuleopisanozastosowaniesztucznych sieci neuronowychdookreśleniaefek-
tywnego związku konstytutywnego dla kompozytów. To narzędzie numeryczne użyte
zostało dwojako: do bezpośredniego zapisu wyników otrzymanychw ramach klasycz-
nejmetody homogenizacji oraz downioskowania owłasnościach efektywnych na pod-
stawie eksperymentu numerycznego (zastępującego eksperyment rzeczywisty) wyko-
nanegonamałej, lecz reprezentatywnej próbce kompozytu.Wtymdrugimprzypadku
zastosowano schemat „samouczącego się” programumetody elementów skończonych,
wktórymzwiązek konstytutywnyopisany jest siecią neuronową.Schemat ten zaadap-
towano tak, żemoże być użytywprzypadku obciążeń niemonotonicznych orazwtedy,
gdy zależność: miara odkształcenia–miara naprężenia nie jest wzajemnie jednoznacz-
na. Te nowe możliwości uzyskane zostały dzięki przedstawieniu związku konstytu-
tywnego w formie przyrostowej oraz opracowania odpowiedniej do tego budowy sieci
neuronowej. Schemat „samouczącego się” programuMES charakteryzuje się tym, że
proces formułowania nieznanego związku konstytutywnego jest szybki, a zgodność
modelu numerycznego z eksperymentemwiększa niż dla innychmetod.

Manuscript received April 13, 2004; accepted for print April 26, 2004