Arg516.dvi


CUBO A Mathematical Journal
Vol.12, No¯ 01, (149–159). March 2010

An Improved Convergence and Complexity Analysis

for the Interpolatory Newton Method

Ioannis K. Argyros

Cameron University, Department of Mathematical Sciences,

Lawton, OK 73505, USA

email : iargyros@cameron.edu

ABSTRACT

We provide an improved compared to [5]–[7] local convergence analysis and complexity for
the interpolatory Newton method for solving equations in a Banach space setting. The re-
sults are obtained using more precise error bounds than before [5]–[7] and the same hypothe-
ses/computational cost.

RESUMEN

Nosotros entregamos aqúı un análisis de convergencia local y complejidad para el método
de interpolación de Newton para resolver ecuaciones en espacios de Banach. Los resultados
mejoran los de [5]–[7] e son obtenidos usando mas precisas cotas de error y las mismas hipotesis
y costo computacional.

Key words and phrases: Newton’s method, local convergence, Banach space, interpolatory New-

ton method, complexity, radius of convergence.

Math. Subj. Class.: 65G99, 65H10, 65B05, 47H17, 49M15.


150 Ioannis K. Argyros CUBO
12, 1 (2010)

1 Introduction

In this study we are concerned with the problem of approximating a simple solution α of the

equation

F (x) = 0, (1.1)

where F is an operator defined on a convex subset D of a Banach space X with values in a Banach

space Y over the real or complex fields of dimension N ,

dim(X) = dim(Y ) = N, 1 ≤ N ≤ +∞.

We consider interpolatory iteration In for approximating x
∗ defined as follows: Let xi be an

approximation to α and let wi be the interpolatory polynomial of degree ≤ n − 1 such that

w
(j)
i (xi) = F

(j)(xi), j = 0, 1, . . . , n − 1 (n ≥ 2). (1.2)

The next approximation x∗i+1 is a zero of wi. For n = 2 we obtain Newton’s method:

x∗i+1 = xi − F
′(xi)

−1F (xi) (i ≥ 0). (1.3)

We approximate xi+1 by applying a number of Newton iterations to wi(x) = 0. Let {xi} be the

interpolatory Newton iteration INn given by:

z0 = xi

zj+1 = zj − w
′

i(zj )
−1wi(zj ), j = 0, 1, . . . , k − 1 (1.4)

xi+1 = zk, k = [log 2n].

A local convergence analysis and the corresponding complexity of method (1.4) was studied

in the elegant paper by Traub and Wozniakowski [7]. Relevant works can be found in [1]–[7], and

the references there.

Here we are motivated by paper [7] and optimization considerations. In particular using more

precise estimates on the distances ‖xi − α‖ (i ≥ 0) we show that under the same hypotheses and

computational cost as in [5]–[7], we can provide a larger convergence radius, sharper error bounds

on the distances and consequently a finer complexity for method (1.4).

Numerical examples are introduced which compare favorably with results to the corresponding

ones in [5]–[7].

2 Local Convergence Analysis of Method (1.4)

Let Γ ≥ 0. We introduce the closed ball U = U (α, Γ) = {x ∈ X | ‖x−α‖ ≤ Γ}, and the parameters

Aj = Aj (Γ) = sup
x∈U

∥

∥

∥

∥

F ′(α)−1
F (j)(x)

j!

∥

∥

∥

∥

, (j ≥ 2) (2.1)


CUBO
12, 1 (2010)

An Improved Convergence And Complexity Analysis ... 151

provided that F (j) exists.

Moreover we introduce the parameter A by

A = A(Γ) = sup
x∈U

‖F ′(α)−1[F ′(x) − F ′(α)]‖

2‖x − α‖
. (2.2)

The foundation of our approach and what makes it more precise than the corresponding one in

[7] is the fact that we use (2.2) instead of (2.1) (for j = 2) to obtain upper bounds on the crucial

quantity ‖w′j (x)
−1F ′(α)‖.

Indeed, on the one hand note that

A ≤ A2 (2.3)

holds in general and A2
A

can be arbitrarily large [1], [2]. On the other hand see (2.28), (2.46), and

Remark 2.4.

Let us set

a =
A

A2
, A2 6= 0. (2.4)

Note that a ∈ [0, 1].

We showed in [3] the following improvement of Theorem 2.1 in [6] and Theorem 2.1 in [5]

respectively:

Theorem 2.1. If F is twice differentiable in U , (2.2) holds and

A2Γ ≤
1

2(1 + a)
, (2.5)

xi ∈ U, (2.6)

then the next approximation x∗i+1 generated by Newton method (1.3) is well defined, and satisfies

for all i ≥ 0:

‖x∗i+1 − α‖ ≤
A2

1 − 2aA2
‖xi − α‖

2 ≤
1

2
‖xi − α‖ (2.7)

and

x∗i+1 − α =
1

2
F ′(α)−1F ′(α)(xi − α)

2 + O(‖xi − α‖
2). (2.8)

Theorem 2.2. If F is n-times differentiable, n ≥ 3 in U , (2.2) holds, and

nAnΓ
n−1

1 − aA2Γ
<

(

2

3

)n−1

(2.9)

xi ∈ U,

then the polynomial wi has a unique zero in U
∗ = U ∗

(

α, Γ
2

)

and defining x∗i+1 as the zero of wi in

U ∗ the following estimates hold for all i ≥ 0

‖x∗i+1 − α‖ ≤
An(1 + ‖x

∗

i+1 − α‖/‖xi − α‖)
n

1 − aA2‖x
∗

i+1 − α‖
‖xi − α‖

n ≤
1

2
‖xi − α‖, (2.10)


152 Ioannis K. Argyros CUBO
12, 1 (2010)

and

x∗i+1 − α =
(−1)n

n!
F ′(α)−1F (n)(α)(xi − α)

n + O(‖xi − α‖
n). (2.11)

We can show the main local convergence theorem for method (1.4):

Theorem 2.3. If F is n-times differentiable, n ≥ 3 in U , (2.2) holds, and

0 ≤ Ã2Γ ≤
1

3 + 2a
(2.12)

where,

Ã2 =
A2 +

n(n−1)
2

An(2Γ)
n−2

1 − aA2Γ − nAn
(

3
2

)n−1
Γn−1

(2.13)

x0 ∈ U, (2.14)

then sequence {xi} (i ≥ 0) generated by interpolary-Newton iteration INn is well defined, remains

in U for all i ≥ 0, converges to α so that the following estimates hold for all i ≥ 0:

ei+1 = ‖xi+1 − α‖ ≤

{

1

2
+

3

2

(

1

2

)k
}

ei, (2.15)

ei+1 ≤ ci,ne
n
i (2.16)

where,

ci,n =

(

1 +
e∗i+1
ei

)

[

An
1 − aA2e

∗

i+1

+ (Ã2(1 + Hi))
2k−1

]

((

1 +
e∗i+1
ei

)

ei

)2k−n

, (2.17)

for

e∗i+1 = ‖x
∗

i+1 − α‖, Hi = O(ei), 0 ≤ Hi ≤
3 + 2a

2
, k = [log 2n], (2.18)

lim
i→∞

ci,n = An + δÃ
n−1
2 where δ = 0

if 2k > n and δ = 1, if 2k = n, (2.19)

xi+1 − α = Fn(xi − α)
n + bi,k + O(‖xi − α‖

n), (2.20)

where

bi,1 = F2(xi − α)
2, (2.21)

bi,j+1 = F2b
2
i,j , j = 1, 2, . . . , k − 1, (2.22)

and

Fj =
(−1)j

j!
F ′(α)−1F (j)(α) for j = 2 and n. (2.23)


CUBO
12, 1 (2010)

An Improved Convergence And Complexity Analysis ... 153

The proof is similar to Theorem 3.1 in [7], but there are differences where we use (2.2) instead

of (2.1) (for i = 2).

Proof. We shall first show using induction on j ≥ 0 that w′j (zj) is invertible and zj ∈ U .

Set

F (j)(x) − w
(j)
i (x) = R

(j)
n (x; xi), x ∈ U, j = 0, 1, 2, (2.24)

where,

‖F ′(α)−1R(j)n (x; xi)‖ ≤ j!

(

n

j

)

An‖x − xi‖
n−1. (2.25)

We can write

w′i(x) = F
′(x) − R′n(x; xi)

= F ′(α)[I + F ′(α)−1{F ′(x) − F ′(α)} − F ′(α)−1R′n(x; xi)] (2.26)

and in view of (2.2), (2.12) and (2.24) for x ∈ U we get in turn

‖F ′(α)−1[w′j (x) − F
′(α)]‖ ≤ 2aA2‖x − α‖ + nAn‖x − xi‖

n−1 (2.27)

≤ 2aA2Γ + nAn(2Γ)
n−1 ≤

2

3 + 2a
< 1. (2.28)

It follows from (2.28) and the Banach Lemma on invertible operators [4] that w′i(x) is invertible

for all x ∈ U , and

‖w′i(x)
−1F ′(α)‖ ≤

1

1 − 2aA2‖x − α‖ − nAn‖x − xi‖n−1
. (2.29)

Since the denominator in (2.13) is positive we get

nAnΓ
n−1

1 − aA2Γ
<

(

2

3

)n−1

(2.30)

and from Theorem 2.2 wi has a unique zero x
∗

i+1 in U
∗ and (2.10) holds.

Using (2.24) and (2.29) we get for x ∈ U
∥

∥

∥

∥

w′j (x
∗

i+1)
−1 w

′′

i (x)

2

∥

∥

∥

∥

≤ ‖w′i(x
∗

i+1)
−1F ′(α)‖

∥

∥

∥

∥

F ′(α)−1
w′′i (x)

2

∥

∥

∥

∥

≤
A2 +

n(n−1)
2

An‖x − xi‖
n−2

1 − 2aA2‖x
∗

i+1 − α‖ − nAn‖x
∗

i+1 − xi‖
n−1

≤
A2 +

n(n−1)
2

An(2Γ)
n−2

1 − aA2Γ − nAn
(

3
2
Γ
)n−1

= Ã2. (2.31)

It follows from Theorem 3.1 and (2.12) that for z1 = xi − F
′(xi)

−1F (xi)

‖z1 − α‖ ≤
1

2
‖xi − α‖. (2.32)


154 Ioannis K. Argyros CUBO
12, 1 (2010)

Since x∗i+1 ∈ U
∗, ‖z1 − x

∗

i+1‖ ≤ Γ, we shall show

zj+1 ∈ Dj =

{

x : ‖x − x∗i+1‖ ≤
1

2
‖zj − x

∗

i+1‖

}

∩ U. (2.33)

Set

wi(x) = wi(zj ) + w
′

i(zj )(x − zj) + R2(x; zj ), (2.34)

where,

R2(x; y) =

∫ 1

0

w′′i (y + t(x − y))(x − y)
2(1 − t)dt. (2.35)

Note that zj+1 is the solution of equation

x = H(x) = x∗i+1 + w
′(xi+1)

−1
{

R2(x; zj ) − R2(x; x
∗

i+1)
}

. (2.36)

We shall show H is contractive on Dj .

It follows from (2.12), (2.31) and (2.36):

‖H(x) − x∗i+1‖ ≤ Ã2(‖x − zj‖
2 + ‖x − x∗i+1‖

2)

≤
2 + 3a

2
Ã2‖zj − x

∗

i+1‖ ≤
1

2
‖zj − x

∗

i+1‖. (2.37)

Moreover we have

‖H(x) − α‖ ≤ ‖x∗i+1 − α‖ + ‖H(x) − x
∗

i+1‖ ≤

(

1

2
+

1

2

)

Γ = Γ. (2.38)

It follows by the contraction mapping principle [4], (2.37) and (2.38) that zj+1 is the unique zero

of H in Dj . It follows that xi+1 = zk ∈ U , and

‖xi+1 − α‖ ≤ ‖xi+1 − x
∗

i+1‖ + ‖x
∗

i+1 − α‖

≤

(

1

2

)k

‖z0 − x
∗

i+1‖ +
1

2
‖xi − α‖

≤

[

3

2

(

1

2

)k

+
1

2

]

‖xi − α‖ ≤
7

8
‖xi − α‖, (2.39)

which shows xi ∈ U and (2.15) hold true.

Set ej = ‖zj − x
∗

i+1‖ and x = zj+1 in (2.36). Then we get

ej+1 ≤
Ã2
(

1 +
ej+1
ej

)2

1 − Ã2ej+1
e2j ≤ Ã2(1 + Hi)ej , (2.40)

where Hi = O(ej ) and 0 ≤ Hi ≤
2+3a

2
compare to (2.7). In view of ej = O(ej ) we can set


CUBO
12, 1 (2010)

An Improved Convergence And Complexity Analysis ... 155

Hi = O(ei). It follows from (2.10) and (2.40)

ei+1 ≤ ‖xi+1 − x
∗

i+1‖ + ‖x
∗

i+1 − α‖ = ek + ‖x
∗

i+1 − α‖

≤
[

Ã2(1 + Hi)
]2k−1

‖xi − x
∗

i+1‖
2k

+
An

1 − aA2e
∗

i+1

(

1 +
e∗i+1
ei

)n

eni

≤

(

1 +
e∗i+1
ei

)n
(

An
1 − aA2e

∗

i+1

+
[

Ã2(1 + Hi)
]2k−1

[(

1 +
e∗j+1
ei

)

ei

]2k−n
)

eni = ci,ne
n
i . (2.41)

In view of
e
∗

i+1

ei
and Hi tending to zero we get

lim
i→∞

ci,n = An + δÃ
n−1
2 , (2.42)

where δ = 0 if 2k > n and δ = 1 otherwise. Hence, (2.16) holds.

Furthermore, we have

zj+1 − x
∗

i+1 = w
′

i(x
∗

i+1)
−1 w

′′

i (x
∗

i+1)

2
(zj − x

∗

i+1)
2 + O(ẽ3j )

= F ′(α)−1
F ′′(α)

2
(zj − x

∗

i+1)
2 + O(e∗i+1ẽ

2
j + ẽ

3
j )

= F2(zj − x
∗

i+1)
2 + O(ẽ2j ). (2.43)

Therefore, we get

zk − x
∗

i+1 = F2
(

F2 · · · · (F2(xi − x
∗

i+1)
2)2 · · ·

)2
+ O(e2ki )

= F2
(

F2 · · · · (F2(xi − α)
2)2 · · ·

)2
+ O(e2ki ). (2.44)

In view of (2.21), (2.22), and (2.44) we have

zk − x
∗

i+1 = bi,k + O(e
2k
i ). (2.45)

In view of (2.11) and (2.45) we deduce

xi+1 − α = zk − x
∗

i+1 + x
∗

i+1 − α = bi,k + Fn(xi − α)
n + O(eni ), (2.46)

which shows (2.20).

That completes the proof of the theorem.

Remark 2.4. The less precise estimate (using (2.1) for j = 2 instead of sharper (2.2) that is

actually needed)

‖F ′(α)−1[w′j (x) − F
′(α)]‖ ≤ 2A2‖x − α‖ + nAn‖x − xi‖

n−1 (2.47)


156 Ioannis K. Argyros CUBO
12, 1 (2010)

was used in [7] instead of (2.28), together with

0 ≤ Ã2Γ ≤
1

5
(2.48)

instead of weaker (2.12).

If A = A2 our results Theorem 2.1, Theorem 2.2 and Theorem 2.3 reduce to the corresponding

Theorem 2.1 in [6], Theorem 2.1 in [5] and Theorem 3.1 in [7] respectively. Otherwise our results

constitute improvements with advantages already stated in the Introduction.

We now give conditions under which INn enjoys a “type of global convergence”.

Let

F (x) =

∞
∑

i=1

1

ι!
F (i)(xi − α)

i (2.49)

be analytic in D = U 0(α, R), and

‖F ′(α)−1F (i)(α)‖

ι!
≤ Ki−1 (2.50)

for i ≥ 2 and R ≥ 1
K

.

As in [7], one way to find K is to use Cauchy’s formula

‖F ′(α)−1F (i)(α)‖

ι!
≤

M

Ri
, (2.51)

where,

M = sup
x∈D

‖F ′(α)−1F (x)‖. (2.52)

Let K = max
[

1
R

, M
R2

]

. Then
M

R
≤ KR ≤ (KR)i−1 (2.53)

and
M

Ri
≤ Ki−1. (2.54)

We can show:

Theorem 2.5. If (2.2) and (2.50) hold then the interpolary Newton method (1.4) converges

provided that x0 ∈ U (α, Γn), where

Γn =
xn
K

(2.55)

and xn, 0 < xn < x∞, satisfies the equation

(3 + 2a)

[

x

(1 − x)3
+

n(n − 1)

4(1 − x)2

(

2x

1 − x

)n−1
]

= 1 −
ax

(1 − x)3
−

n

(1 − x)2

[

3x

2(1 − x)

]n−1

(2.56)


CUBO
12, 1 (2010)

An Improved Convergence And Complexity Analysis ... 157

and xn → x∞, where

x∞ ≥ .12 (2.57)

is the positive solution of equation
x

(1 − x)3
=

1

4 + 2a
. (2.58)

Proof. In view of (2.50) we have for

f (x) =
x

1 − Kx
, (2.59)

that

‖F ′(α)−1F (i)(x)‖ ≤ f (i)(‖x − α‖). (2.60)

Using

f (i)(x) =
i!Ki−1

(1 − Kx)i+1
(i ≥ 2), (2.61)

we get

Ai(Γ) ≤
Ki−1

(1 − KΓ)i+1
(i ≥ 2). (2.62)

It follows from (2.13) and (2.62) that

Ã2Γ ≤

[

KΓ
(1−KΓ)3

+
n(n−1)

4(1−KΓ)2

(

2KΓ
1−KΓ

)n−1
]

1 − aKΓ
(1−KΓ)3

− n
(1−KΓ)2

(

3KΓ
2(1−KΓ)

)n−1
=

1

3 + 2a
. (2.63)

Letting KΓ = x we see that x satisfies equation (2.56). It is simple calculus to show that x = x(n)

is an increasing function of n and x∞ = lim
n→∞

x(n) satisfies equaiton (2.58).

Remark 2.6. If A = A2 (i.e. a = 1) our Theorem 2.5 reduces to Theorem 3.2 in [7]. Otherwise it

is an improvement, since the limit of sequence x(n) in [7] is .12 which is smaller than ours implying

by (2.55) that we provide a larger radius of convergence.

In particular if R is related to 1
K

, say R = c1
K

, then

Γn =
xn
K

=
xn
c1R

≤
x∞
c1R

. (2.64)

The rest of the results introduced in [7] are improved. In particular with the notation introduced

in [7] we have for

I: ei = Gie
n
i−1, Gi ≤ G,

G = G(n) =



























A2
1 − 2aA2Γ

, n = 2

(1 + q)n
[

An

1 − aA2
Γ
2

+

(

7

2
Ã2

)2k−1

[(1 + q)Γ]2
k
−n, n > 2

(2.65)


158 Ioannis K. Argyros CUBO
12, 1 (2010)

where Ã2 is given by (2.13), q =
1
2

+ 3
2

(

1
2

)k
, and K = [log 2n].

II: If the total number of arithmetic operations necessary to solve a system of N linear equa-

tions is O(nβ ), β ≤ 3, then

d(INn) =























O

(

N β[log 2n] + N 2
(

N + n − 2

n − 2

)

([log 2n] − 1

)

for N ≥ 2,

(3 + 2a)[log 2n] + O(1), for n = 1.

(2.66)

Remark 2.7. If A = A2 our results reduce to the ones in [7]. Otherwise they constitute an

improvement.

We complete this study with an example to show that strict inequality can hold in (2.3):

Example 2.8. Let X = Y = R, x∗ = 0 and define function F on U = U (0, 1) by

F (x) = ex − 1. (2.67)

Using (2.1), (2.2), (2.4) and (2.66) we obtain

A =
e − 1

2
<

e

2
= A2 (2.68)

and

a = .632120588. (2.69)

It follows from (2.5) that our radius of convergence is given by

ΓA = .112699836. (2.70)

The corresponding radius ΓT W given in Theorem 2.1 in [6] or [7] is:

ΓT W =
1

4A2
= .09196986. (2.71)

That is

ΓT W < ΓA. (2.72)

Received: October, 2008. Revised: January, 2009.

References

[1] Argyros, I.K., A unifying local-semilocal convergence analysis and applications for two-

point Newton-like methods in Banach space, J. Math. Anal. Applic., 298 (2004), 374–397.


CUBO
12, 1 (2010)

An Improved Convergence And Complexity Analysis ... 159

[2] Argyros, I.K., Approximate Solution of Operator Equations with Applications, World

Scientific Publ. Comp., River Edge, New Jersey, 2005.

[3] Argyros, I.K., An improved convergence and complexity analysis of Newton’s method for

solving equations, (to appear).

[4] Kantorovich, L.V. and Akilov, G.P., Functional Analysis in Normed Spaces, Moscow,

1959.

[5] Ortega, J.M. and Rheinboldt, W.C., Iterative Solution of Nonlinear Equations in Sev-

eral Variables, Academic Press, New York, 1970.

[6] Traub, J.F. and Wozniakowski, H., Strict lower and upper bounds on iterative compu-

tational complexity . In Analytic Computational Complexity, J.F. Traube, Ed., Academic

Press, New York, 1976, pp. 15–34.

[7] Traub, J.F. and Wozniakowski, H., Convergence and complexity of Newton iteration

for operator equations, J. Assoc. Comput. Machinery, 26, No. 2 (1979), 250–258.