Model Calibration in Option Pricing


SQU Journal for Science, 17 (1) (2012) 84-102 

© 2012 Sultan Qaboos University 

84 

Model Calibration in Option Pricing 

Andre Loerx* and Ekkehard W. Sachs**  

Department of Mathematics, University of Trier, Trier, Germany. *Email: loerx@uni-trier.de 
and **Email: sachs@uni-trier.de 
 

ABSTRACT: We consider calibration problems for models of pricing derivatives which occur in 

mathematical finance. We discuss various approaches such as using stochastic differential equations 

or partial differential equations for the modeling process. We discuss the development in the past 

literature and give an outlook into modern approaches of modelling. Furthermore, we address 

important numerical issues in the valuation of options and likewise the calibration of these models. 

This leads to interesting problems in optimization, where, e.g., the use of adjoint equations or the 

choice of the parametrization for the model parameters play an important role.  

  
KEYWORDS: Adjoints, Calibration, Jump models, Local volatility models, Mixed models, Partial 

differential equation (PDE), Stochastic differential equation (SDE), Stochastic volatility models. 

 معايرة النماذج في تسعير الخيارات

 أندريه لوركس و إكهارد زاكس

فً التموٌل الرٌاضً. نناقش طرق متنوعة كاستخدام  التسعٌر التً تظهر اتنفترض مسائل المعاٌرة لنماذج مشتق :خصمل
التطورات المنشورة مسبقا ونعطً  عرضأو المعادالت التفاضلٌة الجزئٌة لعملٌة النمذجة. نالعشوائٌة المعادالت التفاضلٌة 

تلك النماذج.  ةنعالج مواضٌع عددٌة هامة فً تقٌٌم الخٌارات ومعاٌر ،آفاق حول الطرق الحدٌثه للمعاٌرة. إضافة إلى ذلك
ا فً ماها دور تلعب تم معادالت مساعدة أو اختٌار معامالااستخدأن حٌث، مثآل،  ٌاتمثلاألهذا ٌؤدي إلى مسائل مهمة فً 

 .نموذج المعامالت

1. Introduction 

inancial derivatives, like options and futures, have gained considerable importance since the Chicago Board 

Options Exchange (CBOE), the first exchange to list standardized exchange-traded stock options, was 

founded in 1973. Starting with 911 contracts on 16 underlying stocks on the first trading day on April 26, 1973, 

the CBOE reported a total number of over 1.1 billion traded contracts in 2009, which corresponds to an average 

volume of more than 4.5 million contracts a day.
1
 The rapid growth over the last 40 years of financial derivative 

markets, certainly owes its success to the publication of Black and Scholes (1973) and its extension by Merton 

(1973), since they laid the foundation of preference-free valuation of contingent claims. Particularly, they 

developed a simple, but powerful model that governs the price of European-style call and put options over time.  

The main achievement, however, was not only the derivation of a valuation formula in closed form but also 

the idea of building a (hedge) portfolio by buying and selling the underlying asset and a risk-free bond in such a 

                                                      
1
Source: http://www.cboe.com/aboutcboe/History.aspx & http://www.cboe.com/data/marketstats-2009.pdf 

F 


MODEL CALIBRATION IN OPTION PRICING 

85 

self-financing way that it perfectly matches the payoff (at maturity) of the option to be priced. Consequently, the 

amount of initial capital needed for building up the hedge portfolio coincides with the price of the considered 

European-style option. These publications form the cornerstone of today's financial industry. 

But not only the total number of contracts, also the variety have grown in a remarkable way. Nowadays, in 

addition to standard European-style plain vanilla call and put options, exotic derivatives like digital or barrier 

options of American- or Bermudan-type, Asian-style options like lookbacks, or chooser options, cliquets or any 

reasonable combination, are frequently traded on financial derivative markets. However, it has been shown (in a 

variety of publications and text books) that due to its simplicity the classical Black-Scholes model cannot 

properly capture the real market dynamics. The Black-Scholes model is, unfortunately, not suitable to adequately 

price and hedge exotic options. In the literature, a multiple of models can be found subsequently relaxing the 

assumptions of the classical Black-Scholes model, for instance, by adding another degree of freedom to the 

process of the underlying asset. 

In order to extract accurate market dynamic information to price and hedge exotic options, practitioners, 

e.g., traders and risk managers, need to adapt their models to the current market situation, i.e. the models have to 

be calibrated to a set of liquidly traded standard instruments like plain vanilla options. 

The pricing of options as well as model calibration are interesting mathematical problems from various 

points of view. They pose challenges in several areas like mathematical modeling, stochastic processes, partial 

differential equations, optimization and numerical analysis. In Section 2, we briefly review the fundamentals of 

smile-consistent option pricing and its numerical pricing techniques like bi- and trinomial trees, Monte Carlo 

methods, and PDE pricing, for the case where no closed form solution is available. We will focus mostly on 

European-style call options and briefly discuss some pros and cons of the main classes of smile-consistent 

volatility models proposed in the literature. More precisely, we consider stochastic volatility models, local 

volatility models, jump models, as well as mixed volatility models and emphasize their relevance in practice. 

Section 3 gives an exhaustive survey of publications on the calibration of financial market models. Although 

several references on Monte Carlo calibration are given, we focus mostly on literature concerning the 

reconstruction of the local volatility function. We distinguish parametric and non-parametric approaches and 

briefly illustrate three categories of calibration procedures proposed in the literature. In doing so, we closely 

follow the distinction of Bouchouev and Isakov (1999), i.e. optimization-based algorithms, extra- and 

interpolation schemes, and iterative methods. 

2. Option pricing 

Starting with the Black-Scholes model, today's price of a European-style call (put) option with maturity T 

and strike K under some risk-neutral measure Q is defined as  

                        0Call: ( , 0) = (max( , 0)),
rT Q

TC S e S K


  (1) 

                        0Put: ( , 0) = (max( , 0)),
rT Q

TP S e K S


  (2) 

where TS denotes the asset price at maturity T given the asset price process 0( )t t TS    as a solution of the  

Black-Scholes stochastic differential equation (SDE)  

 = ( )t t t tdS r d S dt S dW   (3) 

with 0 (0, )S    and [0, ].t T  The (constant) instantaneous drift term consists of the risk-less interest rate r and 

the dividend yield d of the underlying.
2
 Furthermore,   denotes the (constant) instantaneous volatility function 

and 0( )t t TW    represents a Brownian motion (or Wiener process) defined on a probability space ( , , )Q  

with  -algebra  over the set =   and Q the unique risk-neutral measure (or martingale measure). The 

Brownian motion 0( )t t TW    is adapted to an adequate filtration 0( ) ,t t T   where the filtration 0( )t t T   

                                                      
2
For simplicity, we assume constant interest rates and dividend yields and further omit equity premiums. 


ANDRE LOERX and EKKEHARD W. SACHS 

86 

satisfies some 'technical' conditions (see (Karatzas and Shreve, 2008) for details). As already mentioned, one of 

the key achievements of Black and Scholes (1973) was to provide an explicit valuation formula (known as the 

Black-Scholes formula
3
) for European-style call and put options, namely  

    
( ) ( )

1 2( , ; , ; , , ) = ( ) ( ) ,
BS d T t r T t

t tC S t K T r d e S e K    
   

  (4) 

    
( ) ( )

2 1( , ; , ; , , ) = ( ) ( ) ,
BS r T t d T t

t tP S t K T r d e K e S    
   

    (5) 

where  

2

1 2 1

1
ln( / ) ( )( )

2= , =
tS K r d T t

T t
T t


   



   

 


 
with ( )x  the cumulative distribution function of the standard normal distribution, i.e.  

 
2

2
1

( ) = .
2

y

x
x e dy





  

Hence, the Black-Scholes option price at time [0, ]t T  depends on the current value of the underlying ,tS  i.e. 

the spot price, the time the option expires T, i.e. the maturity, the exercise price or strike price K, the interest rate 

r, dividend yield d and finally the (constant) volatility .  A well-known model-free relationship between calls 

and puts on the same underlying asset, with equal strike and maturity, is the put-call parity  

 
( ) ( )

( , ; ) ( , ; ) = .
BS BS d T t r T t

t t tC S t P S t e S e K
   

   

It is easy to show that given a fixed Black-Scholes price C  satisfying reasonable non-arbitrage conditions, i.e. 
( ) ( ) ( )

max( , 0) ,
d T t r T t d T t

t te S Ke C e S
     

    the mapping  

 ( )C C    

has a unique root ,
impl

  called  implied volatility. Conversely, in the classical Black-Scholes model the option 

price ( , ; , ; , , )
BS impl

tC S t K T r d   depends uniquely on its implied volatility ,
impl

  where impl  is assumed 

to be constant in time t, stock price ,tS  strike K and maturity T. This assumption, however, cannot be observed 

on the market. If one plots the observed market implied volatility against the strike K, the resulting graph will  

usually be downward sloping in equity markets, while it is typically valley-shaped in currency markets or for 

equity index options. This behavior is referred to as 'volatility skew' or 'volatility smile', respectively. 

Furthermore, it can be observed that the volatility skew or smile usually flattens for long term maturities. The 

change of implied volatilities with respect to different maturities is called 'term structure' of the implied 

volatility surface. Finally, the implied volatility surface also changes dynamically over time. A more detailed 

introduction to this topic is given, e.g., in (Hull, 2011). 

Although, the classical Black-Scholes model lacks on realism, implied volatility serves as a standardized 

(or normalized) value (usually quoted in %) of market volatility. In sticky-moneyness markets
4
, the implied 

volatility provides more stability than the Black-Scholes option price. Practitioners use implied volatility as a 

language, rather than as a model. 

A lot of research has been done over the last 40 years, trying to explain this strike deviation from the 

Black-Scholes constant volatility assumption. Many factors have been investigated as being possibly responsible 

for the smile and term structure of the implied volatility surface. They range from the existence of transaction 

costs or liquidity constraints, to stochastic volatility and jump processes for the underlying asset price process. In 

the following we focus on the latter ones. 

                                                      
3
In fact, Black and Scholes (1973) derived their valuation formula by solving the Black-Scholes partial differential equation, which will be 

introduced later on. 
4
Moneyness is defined as the quotient between stock price tS  and strike price K. 


MODEL CALIBRATION IN OPTION PRICING 

87 

2.1  Smile-consistent pricing models 

The idea behind the development of new pricing models, so called 'smile-consistent pricing models', is to 

directly extract information about the asset price and volatility dynamics from frequently traded standardized 

plain vanilla options in order to price and hedge exotic options. This is done by assuming the coefficients of (3) 

to be some deterministic function of the spot price tS  and the time t, by adding new sources of randomness or 

by adding all of it. Since the sources of randomness are usually added to the volatility [cf. Gatheral (2006)] the 

generalized framework (or extension) of the Black-Scholes model is given by replacing (3) by  

 = ( , ) ( , )t t t t t tdS a S t S dt b S t S dW  (6) 

with 0 (0, ).S    The asset price 0( )t t TS    is, therefore, modeled by a 0( )t t T   adapted stochastic process, 

driven by the SDE (6), where ( , )ta S t  and ( , )tb S t  are the instantaneous drift and the volatility, respectively. 

Fengler (2005) assumed that the instantaneous volatility ( , , )t tb S t   follows some 0( )t t T   adapted arbitrarily 

depending stochastic process, where the ' t -dependence' simply emphasizes that ( , , )t tb S t   may also depend 

either on the history of ,tS  i.e. 1
= , , ,t t tN

S S  for 0 <nt t  for all n, or some other sources of randomness. 

Furthermore, the absence of arbitrage, i.e. the existence of some risk-neutral measure under which a discounted 

asset price 0( )t t TS    is a martingale, is assumed. However, the martingale measure needs not to be unique (see 

(Björk, 2004, p. 150)). To the knowledge of the authors, at least two main lines can be identified. 'Stochastic 

volatility models' impose a new source of randomness to the volatility, while 'local volatility models' treat the 

volatility as a deterministic function of tS  and t. Adding jumps as a new source of randomness leads to 'jump 

diffusion models'. Most recently, 'stochastic local volatility models' have become very attractive, since they 

allow a perfect fit while preserving the advantages of stochastic volatility models.  

2.1.1 Stochastic volatility models 

The most prominent stochastic volatility (SV) model was introduced by Heston (1993),  

 
= ( ) ,

= ( )

s
t t t t t

v
t t t t

dS r d S dt v S dW

dv v v dt v dW 

 

 
 (7) 

with 0 0, (0, )S v    and  , = ,
s v

t tdW dW   

where r d  denotes the (deterministic) instantaneous drift of stock price returns,   the volatility of volatility, 

  the speed of reversion of tv  to its long-term mean v  and   the correlation between 
s

tW  and ,
v
tW  the 

Brownian motions driving the stock price process and variance process, respectively. The latter one is a special 

case of the square root process proposed by Cox et al. (1985) to model interest rates, known as the CIR model. 

Other stochastic volatility models have been proposed in the literature, e.g., (Hagan et al., 2002, SABR; 

Bollerslev, 1986, GARCH; Stein and Stein, 1991), just to name a few of them. However, it is proven, for 

instance, in (Duffie et al., 2000) that the Heston model has a quasi-closed form solution for European-style 

options. Although other, possibly more realistic, stochastic volatility models are available, the existence of a 

quasi-closed form solution contributed substantially to the outstanding success of Heston's model in practice. 

This is due to the fact that computational efficiency becomes essential when calibrating the model to observed 

market data. Furthermore, according to Gatheral (2006), all stochastic volatility models generate roughly the 

same shape of implied volatilities and thus the same implications for the valuation of non-vanilla derivatives. On 

the other hand, he shows that, although the Heston model fits observed implied volatility for longer expirations, 

this is, unfortunately, not true for shorter-dated options. This motivates the use of 'jump diffusion models', since, 

loosely speaking, introducing jumps has very little impact for long term maturity options, while jumps are 

strongly noticeable in terms of implied volatility for short-expiration options. 


ANDRE LOERX and EKKEHARD W. SACHS 

88 

2.1.2 Adding jumps  

Jump diffusion models were first considered by Merton (1973) and later by Kou (2002) as an extension of 

one-dimensional processes. Duffie et al. (2000) proved that 'affine jump diffusion' (AJD) processes, which 

consist, roughly speaking, of a jump diffusion process for which the drift vector, the 'instantaneous' covariance 

matrix, and the jump intensities all have affine dependence on the state vector, are in general analytically 

tractable. In case of a two-dimensional process we have  

 
=1

=1

= ( ) 1 ,

= ( )

N st Zs n
t t t t t n

n

N tv v
t t t t n

n

dS r d k S dt v S dW d S e

dv v v dt v dW d Z



 



  
      

  

 
    

 

 (8) 

with 0 0, (0, )S v    and  , = ,
s v

t tdW dW   

where r, d, ,  ,  ,v  and   are as in (7). Further, tN  denotes a Poisson process with jump intensity > 0  

and drift compensation k. The jump sizes in stock price and volatility, i.e. 
s
nZ  and ,

v
nZ  respectively, for 

= 1, , ,tn N  are assumed to be i.i.d. (independent and identically distributed). Finally, := lim t tnn
S S 

 
denotes the stock price at n  right before the jump occurs. 

Thus, (8) denotes a stochastic volatility model with simultaneous jumps in stock price and volatility 

(SVJJ). Note that Heston's stochastic volatility model is a jump-free special case, i.e. = = 0
s v
n nZ Z  for all n, of 

an AJD process and as such has at least a quasi-closed form solution as mentioned before.
5
 The Merton and 

Heston approaches were combined by Bates (1996), who proposed a model with stochastic volatility and jumps 

(SVJ). Bates' model is also incorporated in (8) as a special case, where = 0
v
nZ  for all n. Gatheral (2006) shows 

that SVJ models perform empirically as well as SVJJ, but they have less parameters. Therefore, SVJ models like 

Heston's model are frequently used in practice. An extensive discussion about jump diffusion models can be 

found in (Cont and Tankov, 2004). 

We now turn our focus onto 'local volatility (LV) models'. They became quite popular in the past due to 

their simplicity, however, they have also gained a lot of criticism in financial literature (cf., e.g., (Ayache et al., 

2004; Hagan et al., 2002)).   

2.1.3 Local volatility models  

Following Fengler (2005), 'local variance' may be defined as the risk-neutral expectation of the 

instantaneous variance conditional on =TS K  and time filtration ,t  i.e.  

 
2 2

,ˆ ( , ) := ( ( , , )| = , ) ,
Q

K T t T T T tS t b S T S K   

where ( , , )t tb S t   is as before. Then, 'local volatility' (also called 'forward volatility') is given as the square root 

of local variance. The main advantage of this definition of local volatilities is that it naturally implies the purely 

deterministic case, but also offers some insights into the concept of stochastic volatility. Within this framework 

of local volatilities, for some market level = tK S  at = ,T t  the instantaneous volatility is given by  

 ,ˆ( , ) = ( , ) ,t S t tt
S t S t   

such that (with ( , ) := ,ta S t r d  for simplicity)  

 ,ˆ= ( ) ( , )t t S t t t tt
dS r d S dt S t S dW   (9) 

                                                      
5
In fact, the Heston's formula is given as a linear combination of two integrals of real-valued functions. 


MODEL CALIBRATION IN OPTION PRICING 

89 

defines the stock price process, which generalizes the classical Black-Scholes theory as desired. The intrinsic 

stochasticity is integrated out and we are left with a one-factor diffusion process. However, if by assumptions the 

instantaneous volatility is deterministic in spot and time, i.e. ( , , ) = ( , ),t t tb S t S t   both concepts of 

instantaneous and local variance coincide, since  

           
2 2

,ˆ ( , ) := ( ( , , )| = , )
Q

K T t T T T tS t b S T S K   

                               
2 2

= ( ( , )| = , ) = ( , ) .
Q

T T tS T S K K T   

The local volatility assumption is the easiest way of relaxing the constant volatility case and it introduces 

much more flexibility. In contrast to stochastic volatility models, the concept of local volatility preserves the 

assumption of market completeness.
6
 Originally, Dupire (1994) and Derman and Kani (1994a)

7
 have shown that 

given the distribution of the final stock price TS  for each time T conditional on some starting price 0 ,S  there 

exists a unique risk-neutral diffusion process (9) consistent with these distributions. The reason is that there 

exists a 'dual' or 'adjoint' PDE to the classical Black-Scholes PDE (cf. Section 2). The remarkable observation 

that local volatility can be seen as the market expectation of future volatility, known as 'Markovian projection', 

was independently derived by Dupire (1996) and by Derman and Kani (1998). 

Different assumptions on the special shape of the local volatility function have been made in the literature. 

They are either motivated by model calibration in order to reduce the number of unknowns (see Section 3.3.2), 

or by empirical observations (see, e.g., (Dumas et al., 1998; Coleman et al., 2001)) in order to properly capture 

the dynamics of the underlying asset. A prominent example is the 'constant elasticity of variance model' (CEV) 

introduced by Cox and Ross (1976), where 
1

( , ) =t tS t S


 


 with , > 0.   The CEV model attempts to 

heuristically capture the stochastic volatility, where   controls the relationship between volatility and price. 

When < 1,  commonly observed in equity markets, the volatility of the underlying increases as its price falls. 

Conversely, in commodity markets, the volatility of the underlying tends to increase as its price increases. Note 

that for = 1  we obtain the Black and Scholes case. 

Ingersoll (1997) and Rady (1997) introduced the class of bounded quadratic diffusion models, i.e. ( , )tS t  

which is considered to be a bounded quadratic function in asset price and/or time. Zühlsdorff (2001) has proven 

the existence and uniqueness of the solution of the underlying SDE and provided explicit formulas for call 

options assuming that the deterministic local volatility function can be split in a strictly positive and bounded 

function   and a quadratic polynomial p such that ( , ) = ( ) ( ).t tS t p S t   Option pricing in the quadratic 

volatility model is a rather delicate issue, since it touches the limits of no-arbitrage theory. Andersen (2011) 

clarified some confusion in literature and further extended the range of existing pricing formulas. Coleman et al. 

(2001) published empirical evidence that a spline representation can provide a more accurate representation in 

terms of hedging compared to the quadratic model considered by Dumas et al. (1998). 

Although the deterministic local volatility function may look very complicated, considering local 

volatilities can be a questionable model simplification. Ayache et al. (2004) and Hagan et al. (2002) doubt that a 

one-factor diffusion model delivers an adequate description of the asset price behavior. Hagan et al. (2002) 

illustrated that the model delta of deterministic local volatilities is wrong or at best very misleading. This, 

however, is a crucial issue in terms of the dynamic hedging performance of the model. Furthermore, another 

undesirable feature of the local volatility model is that it predicts flat future smiles, such that forward-start 

options or cliquets are likely to be mispriced. Beside these pricing and hedging problems, Ayache et al. (2004) 

criticized that local volatility models reveal no reasonable explanation for the existing smile phenomenon. 

Despite all criticism, local volatility models are widely used in practice. Common problems arising from 

using complex models like 'jump diffusion models' or even 'mixed volatility models' are the additional 

                                                      
6
Note that volatility is not a tradeable asset, which implies that the completeness of the market, i.e. the ability to hedge options with the 

underlying asset only, is lost.  
7
While Dupire (1994) developed a continuous time theory, Derman and Kani (1994a) used a discrete binomial tree approach. 


ANDRE LOERX and EKKEHARD W. SACHS 

90 

computational effort, the high implementation costs, the loss of intuition, and a potential decrease in calibration 

stability. Hence, 'practitioners may, and in fact often do, favor a simple and intuitive model', see (Coleman et al., 

2011). Furthermore, it is most likely that Dupire (1994) and Derman and Kani (1994a) did not introduce the 

local volatilities as a model of its own, but instead they intended to propose an intuitive way to price exotic 

derivatives under certain market circumstances. 

2.1.4 Mixed or hybrid volatility models  

As jump processes have been added to stochastic volatility models to provide a better fit of model implied 

volatilities to market implied volatilities (especially for short term maturities), the local volatility framework has 

been applied to stochastic volatility models. So-called 'stochastic local volatility (SLV) models' were proposed by 

Blacher (2001) and Lipton (2002) and were studied further, e.g., in (Ren et al., 2007; Piterbarg, 2007; Alexander 

and Nogueira, 2008; Henry-Labordère, 2009). As an example, the governing SDEs for a 'Heston-type stochastic 

local volatility model' are:  

 
= ( ) ( , )

= ( )

s
t t LV t t t t

v
t t t t

dS r d S dt S t v S dW

dv v v dt v dW



 

 

 
 (10) 

with 0 0, (0, )S v    and  , = ,
s v

t tdW dW   

where again r, d, ,  ,  ,v  and   are as in (7) and LV  denotes the local volatility function. Then, in the 

framework of local variance, the instantaneous 'hybrid' variance takes the form:  

 
2 2

,ˆ ( , ) = ( , ) ( | = , ) .
Q

K T t LV T T tS t K T v S K    (11) 

Because of this particular form of (11), it is not possible to separate the influence of the stochastic component 

from the local component in an intuitive manner. Thus, Tavella et al. (2005) prefer to define the instantaneous 

hybrid volatility as a weighted sum of a stochastic component and a local component. It is worth mentioning that 

Lipton (2002) and Lipton and McGhee (2002) further extended (10) by adding jumps to the stock price process 

.tS  Among others, this extension of (10), called the 'universal model', was strongly criticized by Ayache et al. 

(2004). It is argued that, roughly speaking, there is no chance to reveal the true market smile dynamics, since the 

freedom of the local volatility function can nearly compensate every dynamics introduced by the stochastic or 

jump component. In practice, this problem is usually addressed by separately calibrating the model parameters to 

extract plausible dynamics from the market. 

2.2  Numerical evaluation of smile-consistent pricing models 

Fast model evaluation is a crucial issue in practice. In order to be competitive with other market 

participants, very complex derivatives need to be priced nearly on-the-fly. Additionally, a fast and stable pricing 

scheme is essential when calibrating a financial market model to a large number of market data. Therefore, it is 

not surprising that models, which provide a closed or quasi-closed form solution, have become popular in 

practice. A survey of most of the existing market models with closed (or quasi-closed) form solutions has been 

given , e.g., in (Kolb and Overdahl, 2010, Chap. 27; Hull, 2011; Andersen, 2011), - especially for unbounded 

quadratic local volatility models. In the early years, bi- or trinomial trees have been a typical approach to price 

path-independent and path-dependent options in consistence with the prevailing volatility smile. This valuation 

method, which can be seen as a discrete version of Black-Scholes pricing PDE, was pioneered by Cox et al. 

(1979) (CRR).
8
 

A very natural way to price complex derivatives are Monte Carlo or quasi-Monte Carlo methods. They are 

based on the continuous time models, i.e. the fundamental pricing formula (1) or (2) and the considered market 

model:  

                                                      
8
More precisely, it can be easily shown that, for instance, the trinomial method is an example of an explicit finite difference scheme of Black-

Scholes pricing PDE and therefore it inherits certain stability properties of finite difference methods, cf. (Duffy, 2006, Chap. 13) 


MODEL CALIBRATION IN OPTION PRICING 

91 

 
 0

0

( , 0) = ( , )

s.t. = ( , ) ( , , ) , (0, ) ,

rT Q
T T

t t t t t t t

C S e S

dS a S t S dt b S t S dW S

 






  
 (12) 

where ( , )T TS   denotes the payoff function, depending on the asset price TS  at maturity T and possibly on 

some history 
1

= , ,T t tN
S S  (0 <nt T  for all n). As discussed, the general market model in (12) may be 

replaced by a specific one like (7), (8), (9), or (10). Thus, in case of a European-style call option under local 

volatility we obtain  

 
 0

0

( , 0) = max( , 0)

s.t. = ( ) ( , ) , (0, ) .

rT Q
T

t t t t t

C S e S K

dS r d S dt S t S dW S


 

   
 (13) 

In order to generate Monte Carlo samples, the stochastic differential equation in (14) is usually discretized by a 

Euler-Maruyama or Milstein scheme (see (Kloeden and Platen, 1999)) which is then used to approximate the 

expected value functional. Following the well-known law of large numbers, the pricing problem (14) can be 

formulated as  

 
0
=1

1

0 0

1
( , 0) max( , 0)

s.t. = ( ) ( , ) ,

= , = 0, , 1, = 1, , ,

M
rT m

N
m

m m m m m m
n n n n n n n n

m

C S e s K
M

s s r d s t s t s W

s S n N m M







 

    



 (14) 

where M, sufficiently large, denotes the number of random samples. Further, 
m
ns  is the m-th realization of the 

solution of (9) at time nt  given a time discretization 0 10 = < < <t t T  with step size 1:=n n nt t t   for 

= 0, , 1.n N   Accordingly, 1:= ( )n n nW W W   are the discrete increments of the Brownian motion tW  at 

time .nt  

Due to their flexibility to changes in either the payoff function or the considered market model, (quasi)-

Monte Carlo methods are widely used in practice. They further allow the computation of path-dependent 

derivatives with relatively small extra costs. Moreover, Monte Carlo methods are very easy to parallelize, such 

that the use of modern graphics processors (GPUs) allows a tremendous speed-up in computation time. 

Consequently, high dimensional problems can be solved in a reasonable time. An exhaustive discussion of 

(quasi)-Monte Carlo methods can be found in (Glasserman, 2003). 

While (quasi)-Monte Carlo methods are quite meaningful in high dimensions, they can be very slow for 

low dimensional problems. Representation theorems like Feynman-Kac's theorem (cf. (Karatzas and Shreve, 

2008)) show that, for instance, in the case of European-style call options, the price process of a plain vanilla call 

option follows some parabolic partial differential equation, i.e.  

 
2 21
( , ) = ( , ) ( , ) ( ) ( , ) ( , ) ,

2

( , ) (0, ) [0, ) ,

( , ) = max( , 0),    (0, ) .

C S t S S t C S t r d SC S t rC S t

S t T

C S T S K S

   

  

  

 (15) 

To put it differently, under reasonable assumptions of the existence of unique solutions of the local volatility 

model, i.e. SDE (9) and Black-Scholes pricing PDE (15), the unique solution ( , )tC S t  of (15) admits for all 

(0, )tS    and [0, ]t T  the stochastic representation  

 
 ( )( , ) = max( , 0)

s.t. = ( ) ( , ) , (0, )

r T t Q
t T

t

C S t e S K

dS r d S d S S dW S      

 
 

   
 (16) 

with ( , ].t T   Similar results can be obtained for, among others, barrier-, digital-, or plain vanilla call and put 

options with local or stochastic volatility. While basket options and models with stochastic volatility and/or 


ANDRE LOERX and EKKEHARD W. SACHS 

92 

stochastic interest rate lead to multidimensional PDEs (see (Wilmott, 2006)), an integral term is added in the 

PDE when considering jump diffusion models (see (Cont and Tankov, 2004)). Thus, the latter one requires the 

numerical solution of a partial integro-differential equation (PIDE). Pricing American-style options yields the 

challenge of solving free-boundary value problems (see (Wilmott, 2006)). Asian-style options can be modeled 

using one- or two-dimensional PDEs, depending on special payoff characteristics (see (Zvan et al., 1998)). In 

either case, numerical methods like finite difference or finite element methods need to be applied, when no 

analytic solution is available. Among multiple standard textbooks on numerical methods for PDEs, an exhaustive 

discussion on robust, accurate and efficient finite difference methods particularly for pricing various derivative 

products can be found in (Duffy, 2006;Tavella and Randall, 2000). Topper (2005) and Achdou and Pironneau 

(2005) focus on finite element methods used in quantitative finance. A current survey of efficient numerical 

methods for solving those types of PIDEs is given in (Feng and Linetsky, 2008; Sachs and Strauss, 2008). 

Beside the fact that under risk-neutrality there is a unique diffusion process consistent with the prevailing 

market smiles, Dupire (1994) and Derman and Kani (1994a) discovered that European-style option prices in the 

local volatility model satisfy a certain forward PDE, in which the independent variables are the options' strike K 

and maturity T, i.e.  

 
2 2

max

0

1
( , ) = ( , ) ( , ) ( ) ( , ) ( , ) ,

2

( , ) (0, ) (0, ] ,

( , 0) = max( , 0),    (0, ) .

D K T K K T D K T r d KD K T d D K T

K T T

D K S K K

   

  

  

 (17) 

This forward evolution equation, called 'Dupire's equation', is of twofold importance: (i) Dupire's equation in a 

local volatility framework is used to explicitly determine the underlyings' instantaneous volatility function, see 

Section 3. (ii) Once volatility function is known, the forward PDE can be solved numerically to efficiently price 

a collection of European-style options of different strikes and maturities all written on the same underlying asset. 

Due to its significance in option pricing and model calibration, several extensions, also called 'forward 

equations', have been proposed in financial literature. Andersen and Andreasen (1999) derived and Andreasen 

and Carr (2002) further extended, forward equations for European-style options in jump diffusion models. It is 

straightforward to develop the relevant forward equation for barrier or digital options (see, e.g., (Pironneau, 

2006; Pironneau, 2007; Carr and Hirsa, 2007)). Buraschi and Dumas (2001) give a forward representation of 

compound option prices for general diffusion processes with deterministic volatility. Forward equations for 

American-style put options with jump diffusion processes are derived in (Carr and Hirsa, 2003; Amster et al., 

2009) and are considered Dupire-like equations for multi-asset options, while Bentata and Cont (2010) further 

generalized Dupire's forward equation to a large class of non-Markovian models with jumps. 

Basically two lines of techniques to derive Dupire-like forward equations can be found in the literature. 

Either a generalization of Itô's formula, called 'Tanaka-Meyer formula' (see (Karatzas and Shreve, 1998, p. 

220)), is applied to the underlying SDE, or classical adjoint calculus is used to formulate the 'dual' problem of 

Black-Scholes pricing PDE. The first approach is explained, for instance, in (Bentata and Cont, 2010), while 

adjoint calculus (see, e.g., (Friedman, 1964)) is applied in (Pironneau, 2006; Pironneau, 2007; Pironneau, 2009). 

A comparison of both approaches for the original Dupire's equation can be found in (Fengler, 2005). 

3. Model calibration 

One of the key issues in quantitative finance is model calibration. Practitioners, like traders or risk 

managers, need to extract accurate market dynamic information, in order to correctly price and hedge 

derivatives. Usually, this is done by calibrating the relevant financial market model to a set of frequently traded 

standard instruments. In this section, we shortly review the common techniques used in practice. 

Due to its considerable importance, we start with restricting our attention to the calibration of local 

volatility models. In the literature, mainly three groups of approaches are proposed: extra- and interpolation 


MODEL CALIBRATION IN OPTION PRICING 

93 

schemes, iterative procedures on analytic approximations, and optimization-based methods.
9
 In the last part, 

however, we extend our view to the calibration of other models as well, since optimization-based methods 

naturally allow more flexibility in terms of changing the considered model. 

3.1  Extra- and interpolation techniques 

The concept of 'implied bi- and trinomial trees' is aligned with the option pricing scheme introduced by 

Cox et al. (1979). Instead of setting up a smile-consistent pricing tree in advance using, e.g., parameters inferred 

from a separate calibration routine, implied trees are directly recovered from observed market data 

approximating all necessary risk-neutral transition probabilities. Binomial trees, proposed by Derman and Kani 

(1994b) and Barle and Cakici (1998), however, suffer from a number of fundamental problems. According to 

Boyle and Lau (1994), binomial trees may encounter unpredictable convergence behavior when pricing options 

with discontinuous payoffs (like barriers or digitals). Secondly, and more importantly, negative transition 

probabilities may occur, from which arbitrage opportunities ensue. In contrast to Derman and Kani (1994b) and 

Barle and Cakici (1998), who construct the tree using a forward recursion formula, Rubinstein (1994) and 

Jackwerth (1997) propose to inductively build up the tree beginning from a risk-neutral distribution at the 

terminal node. This, by construction, prevents the probabilities becoming negative. Derman and Kani (1994b) 

suggest the use of trinomial trees, which are somehow equivalent to a simple explicit finite difference 

approximation of Dupire's equation (17). Trinomial trees provide a more flexible approximation to the state 

space than a binomial tree, but suffer from many of the same problems as the binomial tree and are prone to 

instability. Andersen and Brotherton-Ratcliffe (1997/1998), therefore, use the well-known (semi-implicit) Crank-

Nicolson approximation, since it exhibits much better stability and convergence properties. 

In any tree or tree-related approach, it is assumed that plain vanilla options are available for every strike 

and time to maturity. Therefore, option prices need to be inter- and extrapolated into regions where no market 

data are observable. This becomes even more relevant when considering the time continuous theory. 

Having built the bridge to Dupire's equation (17) earlier, practitioners sometimes prefer a slightly different 

view on (17), i.e.  

                                          
2

2

( , ) ( ) ( , ) ( , )
( , ) = 2 .

( , )

D K T r d KD K T d D K T
K T

K D K T


  
                                       (18) 

In this approach, calibration is meant to find a continuous and sufficiently smooth option price function 

( , )D K T  with respect to strike K and maturity T, such that the local volatility function can be recovered from 

(18), known as 'Dupire's formula'. Therefore, again some inter- and extrapolation techniques are required, in 

order to obtain a continuous and smooth price function for market data information. Aside from the smoothness 

requirements, the challenge is to guarantee that no standard arbitrage bounds are violated and that the local 

variance remains positive and finite. 

Due to the fact that, under no-arbitrage conditions, there is a unique implied volatility given an option price 

and vice versa (see Section 2), the necessary inter- and extrapolation is usually done on the implied volatility 

side. Kahalé's interpolation procedure (see (Kahalé, 2004)), for instance, is based on piecewise convex 

polynomials, which mimics the Black-Scholes formula. If the input data are free of arbitrage, so will be the 

resulting implied volatility surface. Fengler (2009) used natural cubic splines in space and finite differences in 

time to parametrize the local volatility function. The idea is to solve a sequence of small quadratic programs 

(QP) arising from a least-squares formulation (LSQ) of the spline representation under additional no-arbitrage 

constraints. Calender arbitrage is avoided by imposing transfer conditions through the iterates of the sequence of 

QPs. In contrast to Kahalé (2004), the main advantage of Fengler (2009) is that the input data need not to be free 

of arbitrage. Benko et al. (2007) suggest estimating the implied volatility surface with local quadratic 

polynomials. Arbitrage is ruled out by forcing the state-price density to be non-negative. Hanke and Rösler 

(2005) solve a normal equation to get an equivalent minimal norm solution of a LSQ problem using the 

                                                      
9
See also (Bouchouev and Isakov, 1999). 


ANDRE LOERX and EKKEHARD W. SACHS 

94 

discretized Dupire's formula and cubic splines in space and finite differences in time to parametrize the local 

volatility function. An extensive review about useful smoothing techniques is given in (Fengler, 2005, Chap. 4). 

Calibration using Dupire's formula (18) requires an interpolation method for either the implied volatility 

surface or the option price function. However, practitioners have stated that the resulting local volatility surface 

is very unstable and that the option prices are very sensitive to the interpolation method (see, e.g., (Lipton, 

2001)). Furthermore, a difficulty is the extra- and interpolation into areas where no market data are observed. For 

instance, it is not clear how to extrapolate prices for options that mature before the closest expiration date. 

3.2  Analytic approximations (and iterative methods) 

Bouchouev and Isakov (1999) apply the classical parametrix technique for PDEs to derive an analytic 

approximation of the option premium as a sum of its Black-Scholes price and an integral correction for the non-

constant volatility, which explicitly shows the nonlinear relationship between the option price and volatility. The 

resulting integral equation is then discretized and iteratively solved at the points where observed option prices 

are available. A second iterative algorithm is applied directly to the fundamental solution of the underlying PDE. 

Both algorithms proposed are straightforward to implement. However, they should only be used for short term 

maturities, since the algorithm is applied for a single time period and then repeated for all consecutive maturities. 

Therefore, it might not be able to capture the term structure of the local volatility surface. Analytic 

approximation formulae for one-dimensional local volatility models using the parametrix methods are also 

derived in (Corielli et al., 2010), but not considered in the context of calibration. 

3.3  Optimization-based calibration 

While usually very fast, the methods described above are only applicable to a quite small class of financial 

market models. Optimization-based calibration methods offer more flexibility and are applicable to almost any 

financial market model. Trying to minimize the distance between model data and observed market data, for 

instance, in a least-squares formulation (LSQ), i.e.  

 
2

0
=1

1
( ( ; , 0) ) ,min

2

M
mod obs
m m

a m

C a S C


  (19) 

the model prices can either be given in closed form or as a solution of the underlying SDE or PDE model. 

Nonlinear constrained and unconstrained optimization problems arise in many sciences, and numerous methods 

exploiting the special structure have been proposed to efficiently solve them. An introduction to state-of-the-art 

numerical methods for constrained and unconstrained optimization problems can be found in (Nocedal and 

Wright, 1999). 

As an example, to calibrate the time-dependent Heston call option model, Gerlich et al. (2010) designed a 

special sequential quadratic programming (SQP) optimization algorithm for the constrained least-squares 

problem (19), where 
n

a R  denotes the vector of Heston parameters, 0( ; , 0)
mod
mC a S  and ,

obs
mC  for 

= 1, , ,m M  and the model, respectively, market data with maturity T and strike K given an initial stock price 

0S  of the option's underlying at 0 = 0.t  The feasible set  arises from additional constraints on the Heston 

parameters a, for example, by assuming that the variance of the underlying stochastic process remains strictly 

positive. The model prices 0( ; ,0),
mod
mC a S  for = 1, , ,m M  are evaluated using the closed form solution. 

As mentioned before, closed form solutions are only available for a small class of models. In the following, 

we consider the least-squares calibration of financial market models in a more general setting. 

3.3.1  SDE constrained optimization (Monte Carlo calibration) 

Since financial market models are often characterized by the SDE of their underlying asset, calibration 

problem (19) may be written (for example, for a local volatility model) as  


MODEL CALIBRATION IN OPTION PRICING 

95 

  

2
0

=1

0

0

1
( ) := ( ( ; , 0) )min

2

where ( ; , 0) = max( , 0) ,

s.t. = ( ) ( , ) , (0, ) ,

M
mod obs
m m

m

mod rT Q
m T mm

f C S C

C S e S K

dS r d S d S S dW S



    

 



  







 

   

 (20) 

where  is an appropriate set of volatility functions. SDE constrained optimization problems have not been 

much considered in financial market literature, since Monte Carlo methods are known to be very slow. Giese et 

al. (2007), for instance, use the Euler-Maruyama scheme to simulate the underlying stock price process in the 

Heston framework and combine it with a multi-layer method related to multigrid methods from the solution of 

PDEs as proposed, e.g, in (Kelley and Sachs, 1994). Further acceleration is achieved by parallelization of the 

price evaluations. 

Recently, adjoint techniques have proven to be quite successful to accelerate Monte Carlo pricing. Adjoint 

techniques have their origin in the field of optimal control (see (Giles and Pierce, 2000)) and were recently 

introduced into finance in the context of automatic differentiation to compute derivatives (greeks) (see (Giles and 

Glasserman, 2006)). Kaebe et al. (2009a) applied adjoint calculus combined with a multi-layer approach to gain 

a tremendous speed-up in the calibration of a very general setting of systems of SDEs. Furthermore, Kaebe et al. 

(2009b, unpublished data) extended their consideration of Monte Carlo calibration of SDE models by adding 

jump components and proposed the use of a semi-analytical adjoint framework, which is based on a 

decomposition of the sensitivity computation into a sufficiently smoothed diffusion and jump part. Groß and 

Sachs (2011, unpublished data) derived an adjoint approach for the Milstein scheme (a second order 

discretization scheme) in connection with predictor-corrector methods. The feasibility and efficiency of Monte 

Carlo calibration for financial market models is discussed in (Käbe, 2010). 

Monte Carlo calibration using adjoints provides at least two main considerable advantages. Beside its 

flexibility, it allows a strong parallelization and thus the use of GPUs (see Section 2) to gain remarkable speed 

ups. Secondly, second order information can easily be obtained due to pathwise use of adjoints (cf. (Kaebe et al., 

2009a)). Hence, the number of necessary optimization iterations can be reduced tremendously. 

3.3.2  PDE constrained optimization 

We again consider the nonlinear LSQ problem (19) as an example for the calibration problem using the 

PDE framework (15), i.e.  

  
2

0
=1

1
 ( ) := ( , 0)min

2

M
mod obs
m m

m

f C S C





  (21) 

 
2 21

s.t. = ( , ) ( ) , ( , ) ,
2

m m m mC S S t C r d SC rC S t       

 ( , ) = max( , 0), (0, ), for 1, , .m m mC S T S K S m M      

 
with max= (0, ) [0, ).T    A drawback of (21) is the fact that M PDEs need to be solved in order to obtain one 

function evaluation of f. Since market data are given with different strikes K and maturities T, one can 

substantially reduce the computational effort replacing the M Black-Scholes equations (15) in (21) by, whenever 

available, one single Dupire's equation (17). Hence, the optimization problem (21) becomes  

  
2

=1

1
 ( ) := ( , )min

2

M
obs

m m m
m

f D K T C





  (22) 

 
2 21

s.t. = ( , ) ( ) , ( , ) ,
2

D K T K D r d KD d D K T       

 ( , 0) = max( , 0), (0, ) ,D K S K K     


ANDRE LOERX and EKKEHARD W. SACHS 

96 

with max= (0, ) (0, ].T    The main tasks, when solving PDE constrained optimization problems like (21) or 

(22), are: (i) the discretization of the model PDE, (ii) the parametrization of the parameter functions, for instance, 

the local volatility function ( , )    and (iii) the type of regularization to overcome the potential ill-posedness of 

the problem. Finally, (iv) the efficient computation of derivative information is a crucial issue to apply standard 

optimization routines. A gradient evaluation at low cost accelerates the calibration procedure tremendously and 

makes the calibration problem amenable for the application in practice. 

Since (i) is more a pricing issue, references on proper discretization schemes of the model PDE are already 

given in Section 2. As mentioned earlier (cf. Section 2.1), the parametrization of model functions can either be 

motivated by desired model properties in terms of hedging performance and implied volatility behavior or by 

reducing the number of unknowns in the calibration procedure. While in the time-dependent Heston model the 

corresponding parameter functions are usually parametrized via piecewise constant or piecewise linear functions, 

the type of parametrization of the local volatility function varies strongly in the financial market literature. 

Beaglehole and Chebanier (2002), for example, used piecewise quadratic functions, whereas Brown and Randall 

(1999) applied hyperbolic trigonometric functions. McIntyre (2001) considered Hermite polynomials and B-

splines were used by Hamida and Cont (2005). Bicubic splines were applied to parametrize the local volatility 

function, for instance, in (Coleman et al., 1999; Jackson et al., 1999; Pironneau, 2009). Spline representations, 

however, require rectangular parametrization grids. Orosi (2010), Glover and Ali (2011) and Coleman et al. 

(2011) use radial basis functions like Gaussian, multi-quadratic and thin-plate splines, a popular choice for 

reconstruction surfaces from sparse data. The main advantage is the arbitrary placement of parametrization 

knots, which allows the number of parameters to be kept to a minimum. According to Glover and Ali (2011), 

thin-plate splines seem to perform best in terms of showing accurate and robust solutions to parametrize the local 

volatility function. 

A clever parametrization strategy can be very helpful to stabilize the calibration procedure, since it reduces 

the number of unknowns. However, the right choice can be a difficult task. As an example, parametrizing the 

local volatility function with cubic splines (as in (Coleman et al., 1999)) is very difficult to automate, since one 

is faced with the trade-off between a lack of stability and unrealistic oscillations. Thus, in case of a high number 

of unknowns, usually some regularization is needed to overcome the ill-posedness of the optimization problem. 

The best-known stabilization method for ill-posed nonlinear inverse problems is the Tikhonov regularization 

introduced by Tikhonov (1963). Bodurtha (2000) and Bodurtha and Jermakyan (1999), for instance, minimize 

the sum of squared deviations from the Black-Scholes constant variance 
2
0 ,  i.e. 

2
2min ( , ) ,

L
f x t  where 

2 2
0( , ) = ( , ).x t f x t  

10
  Lagnado and Osher (1997a, 1997b), Jackson et al. (1999) and Coleman et al. (2001) 

suggest regularizing the ill-posed inverse problem by additionally minimizing the 
2

L -norm of the gradient of 

the volatility function, i.e. 
2

2min ( , ) ,
L

x t  subject to a finite number of constraints. Theoretical stability and 

convergence results for the calibration problem of local volatility models can be found in (Jiang and Tao, 2001; 

Crépey, 2003; Egger and Engl, 2005). Egger and Engl (2005) prove convergence rates for the case of time-

independent volatilities, i.e. = ( ),S   under simple and interpretable smoothness assumptions. The results can 

be generalized to a class of local volatility functions, where the term structure is assumed to be known, more 

precisely, where ( , ) = ( ) ( )S t S t    and the function ( )t  is known. Purely time-dependent volatilities are 

extensively studied in (Hein and Hofmann, 2003) and the ill-posedness of the inverse problem is proven. 

Uniqueness for state-dependent volatilities and the relation of optimal control problems corresponding to time-

discrete and time-continuous observations are investigated in (Jiang and Tao, 2001). 

As mentioned before, the efficient derivative evaluation is crucial when applying existing optimization 

routines. Coleman et al. (1999) use a trust region / interior point method and formulate the box-constrained 

                                                      
10

Note that Bodurtha (2000) and Bodurtha and Jemarkyan (1999) used a trinomial tree model (cf. Section 3.1) to compute the option prices. 


MODEL CALIBRATION IN OPTION PRICING 

97 

nonlinear least-squares problem,  

                         
2
2

1
( ) := ( )min

2p
f R



 
R

 (23) 

                           s.t. ,l u   

where 1= ( , , )
T

p    is the vector of a cubic spline parameterization of the local volatility function ( , ),x t  

l and u the vectors of lower and upper bounds on ,  respectively, and ( )R   the residual function defined as  

 1 1 1( ) := ( ( ; , ) , , ( ; , ) ) ,
obs obs T

M M MR D K T C D K T C     

where ( ; , )D     is the solution of Dupire's equation (21).
11

 In (Coleman et al., 1999) two possibilities to 

efficiently compute the Jacobian of R are explored, i.e. the use of automatic differentiation (AD) (see (Griewank 

and Walther, 2008; Coleman and Verma, 1996)) and an approximation of the Jacobian via a secant update 

formula. He et al. (2006) extended the calibration method of Coleman et al. (1999) to a jump diffusion model 

coupled with local volatilities. When applying a classical steepest decent algorithm or quasi-Newton approach, 

the use of adjoint equations to compute the gradient of f at low costs is proposed in (Achdou and Pironneau, 

2005; Egger and Engl, 2005; Loerx et al., 2010, unpublished data). An optimal control framework also using 

adjoints is applied by Jiang et al. (2003) to recover the local volatility surface. Turinici (2008) chose an SQP 

method to solve optimization problem (21). Note that the SQP method already needs some second order 

information of f which comes at a cost of solving 1M   Black-Scholes PDEs. According to Coleman et al. 
(1999), optimization approaches, which do not require the calculation of second order information, typically 

converge very slowly, such that the additional computational effort to obtain at least some second order 

information can be profitable in the overall computation time of the optimization routine (see also (Loerx, 

2011)). Schulze (2002) developed an inexact Gauss-Newton method to recover a non-parametric local volatility 

function. Since, in general, the Jacobian of the residual function cannot be stored in the non-parametric setting, 

the Gauss-Newton subproblems are solved with an iterative method (CG method). The matrix-vector products, 

which are needed within the CG framework, can be provided via sensitivity and adjoint equations. This approach 

was further improved in terms of computational efficiency by Loerx et al. (2011, unpublished data). A reduced 

order model technique, known from fluid dynamics, is used in (Pironneau, 2009) and applied to the local 

volatility framework. In (Sachs and Schu, 2008, 2010) reduced order models using proper orthogonal 

decomposition (POD) are used to solve the PIDE problem of jump diffusion models including local volatility.  

Based on Berestycki et al. (2002), Turinici (2008, 2009a, 2009b) discovered other forms of cost 

functionals, for instance, including implied volatilities and proved convergence and stability properties. 

Originally, Berestycki et al. (2002) derived a new cost functional based on results of asymptotic relations 

between implied and local volatility. The new functional is close to a convex functional at least for short term 

maturities and therefore exhibits a more stable minimizer. More precisely, Berestycki et al. (2002) proved that 

near expiry, the implied volatility can be represented as the spatial harmonic mean of the local volatility. 

Furthermore, they showed that, for deep in- and out-of-the-money options under certain assumptions, the 

squared implied volatility can be expressed as a time weighted average of squared local volatilities. These results 

can either be exploited to regularize the ill-posed inverse problem in a least-squares framework (as in (Turinici, 

2008, 2009a, 2009b)), or it can be used to continuously extrapolate the implied volatility surface into regions, 

where no observed implied volatilities are available. Therefore, one is able to overcome one of the main 

drawbacks of extra- and interpolation approaches using Dupire's formula. 

Another interesting approach, known from the field of 'dynamic programming', is to consider the 

calibration of financial market models in the framework of a 'stochastic control problem'. The method was 

proposed by Avellaneda et al. (1997) and studied in a deeper way by Samperi (2002). In contrast to the previous 

methods, this approach does not rely on any parametrization of the volatility. It leads to an unconstrained 

                                                      
11

Note the equivalence of calibration problem (22) and (23). 


ANDRE LOERX and EKKEHARD W. SACHS 

98 

optimization problem at the cost of solving nonlinear Hamilton-Jacobi-Bellman equations. As a regularizer 

Avellaneda et al. (1997) minimize the relative-entropy distance to a prior given distribution. Some further details 

can also be found in (Achdou and Pironneau, 2005, Chap. 8). 

4. Conclusion 

To capture realistic asset price behavior and volatility dynamics, a variety of financial market models has 

been developed over the last 40 years. In practice smile-consistent models are used to extract this market 

information from frequently traded standardized options in order to price and hedge exotic derivative products. 

We introduced the most common models and briefly discussed their characteristics from different perspectives. 

From the modeling side, for instance, we saw that despite all criticism simple models are widely used in practice, 

due to smaller computation and implementation costs and a better intuitiveness for risk-takers. 

Numerical methods are needed whenever no closed form solution is available. We discussed pros and cons 

of common numerical methods, like the Monte Carlo method or PDE methods. Dupire or Dupire-like equations 

have proven to be particularly useful for pricing and calibration purposes, since they provide option prices for 

different strikes and maturities with a computational effort of one PDE evaluation. 

The main objective of this paper, however, was to illustrate one of the key issues in quantitative finance 

and that is model calibration. Thus, we reviewed the existing literature on extra- and interpolation techniques as 

well as iterative methods applied to analytic approximation. Whereas the previous methods are mostly restricted 

to the calibration of local volatility models, optimization-based calibration methods offer more flexibility and are 

applicable to almost any financial market model. Hence, we introduced SDE and PDE constrained optimization 

and addressed issues like parameter parameterization and problem regularization. The efficiency of optimization-

based methods strongly depends on the computational effort necessary to compute derivative information. We 

emphasized that adjoint techniques for derivative computations, recently introduced in financial market 

literature, have the potential to substantially speed-up optimization-based calibration procedures. 

5. References 

ACHDOU, Y. and PIRONNEAU, O. 2005. Computational Methods for Option Pricing. SIAM, Philadelphia. 

ALEXANDER, C. and NOGUEIRA, L.M. 2008. Stochastic Local Volatility. Available at SSRN: 

http://ssrn.com/abstract=1107685. 

AMSTER, P., DE NAPOLI, P. and ZUBELLI, J.P. 2009. Towards a generalization of Dupire's equation for 

several assets. Journal on Mathematical Analysis and its Applications, 355(1): 170-179. 

ANDERSEN, L. 2011. Option pricing with quadratic volatility: A revisit. Fin. and Stochastics, 15(2): 191-219. 

ANDERSEN, L. and ANDREASEN, J. 1999. Jumping smiles. RISK, 12(11): 65-68. 

ANDERSEN, L. and BROTHERTON-RATCLIFFE, R. 1997/1998. The equity option volatility smile: an 

implicit finite-difference approach. Journal of Computational Finance, 1(2): 5-38. 

ANDREASEN, J. and CARR, P. 2002. Put Call Reversal. Manuscript, New York University, New York, USA. 

AVELLANEDA, M., FRIEDMAN, C., HOLMES, R. and SAMPERI, D. 1997. Calibrating volatility surfaces 

via relative-entropy minimization. Applied Mathematical Finance, 4(1): 37-64. 

AYACHE, E., HENROTTE, P., NASSAR, S. and WANG, X. 2004. Can anyone solve the smile problem?   

WILMOTT, 3: 78-96. 

BARLE, S. and CAKICI, N. 1998. How to grow a smiling tree. Journal of Financial Eng., 7(2): 127-146. 

BATES, D.S. 1996. Jump and stochastic volatility: Exchange rate processes implicit in Deutsche Mark options. 

Reviews of Financial Studies, 9(1): 69-107. 

BEAGLEHOLE, D. and CHEBANIER, A. 2002. Mean-reverting smiles. RISK, 15(4): 95-98. 

BENKO, M., FENGLER, M.R., HÄRDLE, W. and KOPA, M. 2007. On extracting information implied in 

options. Computational Statistics, 22: 543-553. 

BENTATA, A. and CONT, R. 2010.  Forward Equations for Option Prices in Semimartingale Models.  


MODEL CALIBRATION IN OPTION PRICING 

99 

Available at arXiv: http://arxiv.org/abs/1001.1380v3. 

BERESTYCKI, H., BUSCA, J. and FLORENT, I. 2002. Aysmptotics and calibration of local volatility models. 

Quantitative Finance, 2: 61-69. 

BJÖRK, T. 2004. Arbitrage Theory in Continuous Time. 2nd edn., Oxford University Press, Oxford, UK. 

BLACHER, G. 2001. A new approach for designing and calibrating stochastic volatility models for optimal 

delta-vega hedging of exotic options. Conference presentation at Global Derivatives, Juan-les-Pins. 

BLACK, F. and SCHOLES, M. 1973. The pricing of options and corporate liabilities. Journal of Political 

Economy, 81(3): 637-654. 

BODURTHA, J.N. 2000. A linearization-based solution to the ill-posed local volatility estimation problem.  

Techical Report, Georgetown University, USA. 

BODURTHA, J. N. and JERMAKYAN, M. 1999.  Non-parametric estimation of an implied volatilitiy surface.   

Journal of Computational Finance, 2(4): 29-61. 

BOLLERSLEV, T. 1986. Generalized autoregressive conditional heteroskedasticity. Journal of Econometrics,  

31(3): 307-327. 

BOUCHOUEV, I. and ISAKOV, V. 1999. Uniqueness, stability and numerical methods for the inverse problem 

that arises in financial markets. Inverse Problems, 15(3): 95-116. 

BOYLE, P.P. and LAU, S.H. 1994. Bumping up against the barrier with the Binomial method. Journal of 

Derivatives, 1(4): 6-14. 

BROWN, G. and RANDALL, C. 1999. If the skew fits. RISK, 12(4): 62-65. 

BURASCHI, A. and DUMAS, B. 2001. The forward valuation of compound options. Journal of Derivatives,  

9(1): 8-17. 

CARR, P. and HIRSA, A. 2003. Why be backward? RISK, 16(1): 103-107. 

CARR, P. and HIRSA, A. 2007.  Forward evolution equations for knock-out options. Pages 195-217 of:  FU, M. 

C., JARROW, R. A., YEN, J.-Y. and ELLIOTT, R. J. (eds), Advances in Mathematical Finance.  

Birkhäuser, Boston, MA, USA. 

COLEMAN, T.F. and VERMA, A. 1996. Structure and efficient Jacobian calculation. Pages 149-159 of:  

BERZ, M., BISCHOF, C., CORLISS, G. and GRIEWANK, A. (eds), Computational Differentiation: 

Techniques, Applications, and Tools.  SIAM, Philadelphia, USA. 

COLEMAN, T.F., LI, Y. and VERMA, A. 1999. Reconstructing the unknown local volatility function. Journal 

of Computational Finance, 2(3): 77-102. 

COLEMAN, T.F., KIM, Y., LI, Y. and VERMA, A. 2001. Dynamic hedging with a deterministic local volatility 

function model. Journal of Risk, 4(1): 63-89. 

COLEMAN, T.F., LI, Y. and WANG, C. 2011. Stable Local Volatility Function Calibration Using Spline 

Kernel (to appear). 

CONT, R. and TANKOV, P. 2004. Financial Modelling with Jump Processes. Chapman and Hall/CRC, Boca 

Raton, Florida, USA. 

CORIELLI, F., FOSCHI, P. and PASCUCCI, A. 2010. Parametrix approximation of diffusion transition 

densities. SIAM Journal on Financial Mathematics, 1: 833-867. 

COX, J.C. and ROSS, S.A. 1976. The valuation of options for alternative stochastic processes. Journal of 

Financial Economics, 3(1-2): 145-166. 

COX, J.C., ROSS, S.A. and RUBINSTEIN, M. 1979. Option pricing: A simplified approach. Journal of 

Econometrics, 7(3): 229-263. 

COX, J.C., INGERSOLL, J.E. and ROSS, S.A. 1985. A theory of the term structure of interest rates. 

Econometrica, 53(2): 385-407. 

CRÉPEY, S. 2003. Calibration of the local volatility in a generalized Black-Scholes model using Tikhonov 

regularization. SIAM Journal of Mathematical Analysis, 34(5): 1183-1206. 

DERMAN, E. and KANI, I. 1994a. Riding on a smile. RISK, 7(2): 32-39. 

DERMAN, E. and KANI, I. 1994b.  The Volatility Smile and Its Implied Tree. Quantitative Strategies Research 

Notes, Goldman Sachs. 


ANDRE LOERX and EKKEHARD W. SACHS 

100 

DERMAN, E. and KANI, I. 1998. Stochastic implied trees: Arbitrage pricing with stochastic term and strike 

structure of volatility. International Journal of Theoretical and Applied Finance, 1(1): 61-110. 

DUFFIE, D., PAN, J. and SINGLETON, K. 2000. Transform analysis and asset pricing for affine jump-

diffusions. Econometrica, 68(6): 1343-1376. 

DUFFY, D.J. 2006. Finite Difference Methods in Financial Engineering: A Partial Differential Equation 

Approach. John Wiley & Sons, Chichester, UK. 

DUMAS, B., FLEMING, J. and WHALEY, R.E. 1998. Implied volatility functions: Empirical tests. Journal of 

Finance, 53(6): 2059-2106. 

DUPIRE, B. 1994. Pricing with a smile. RISK, 7(1): 18-20. 

DUPIRE, B. 1996. A unified theory of volatility. Discussion paper Paribas Captital Markets. Preprint in 

"Derivatives Pricing", edited by P. Carr, 2004 (Risk Books, London, UK). 

EGGER, H. and ENGL, H.W. 2005. Tikhonov regularization applied to the inverse problem of option pricing: 

Convergence analysis and rates. Inverse Problems, 21(3): 1027-1045. 

FENG, L. and LINETSKY, V. 2008.  Pricing options in jump-diffusion models: An extrapolation approach.  

Operations Research, 56(2): 304-325. 

FENGLER, M.R. 2005. Semiparametric Modeling of Implied Volatility. Springer-Verlag, Berlin, Germany. 

FENGLER, M.R. 2009. Arbitrage-free smoothing of the implied volatility surface. Quant. Fin.,  9(4): 417-428. 

FRIEDMAN, A. 1964. Partial Differential Equations of Parabolic Type.  Prentice-Hall, Englewood Cliffs, New 

Jersey, USA. 

GATHERAL, J. 2006. The Volatility Surface - A Practitioner's Guide. John Wiley & Sons, Hoboken, New 

Jersey, USA. 

GERLICH, F., GIESE, A.M., MARUHN, J.H. and SACHS, E.W. 2010. Parameter identification in stochastic 

volatility models with a feasible point SQP algorithm. Computational Optimization and Applications, 1-25. 

GIESE, A.M., KAEBE, C., MARUHN, J.H. and SACHS, E.W. 2007. Efficient calibration for problems in 

option pricing. PAMM, 7(1): 1062601-1062602. 

GILES, M.B. and GLASSERMAN, P. 2006. Smoking adjoints: Fast Monte Carlo greeks. RISK, 19: 88-92. 

GILES, M.B. and PIERCE, N.A. 2000. An introduction to the adjoint approach to design. Flow, Turbulence and 

Combustion, 65(3-4): 393-415. 

GLASSERMAN, P. 2003. Monte Carlo Methods in Financial Engineering. Springer-Verlag, New York, USA. 

GLOVER, J. and ALI, M.M. 2011. Using radial basis functions to construct local volatility surfaces. Applied 

Mathematics and Computation, 217(9): 4834-4839. 

GRIEWANK, A. and WALTHER, A. 2008.  Evaluating Derivatives: Principles and Techniques of Algorithmic 

Differentiation. 2nd edn. SIAM, Philadelphia, USA. 

GROß, B.P. and SACHS, E.W. 2011. Fast calibration of financial models under SDE constraints using adjoint 

technique (preprint). 

HAGAN, P.S., KUMAR, D., LESNIEWSKI, A.S. and WOODWARD, D.E. 2002. Managing smile risk.   

WILMOTT, 1: 84-108. 

HAMIDA, S.B. and CONT, R. 2005. Recovering volatility from option prices by evolutionary optimization.   

Journal of Computational Finance, 8(3): 43-76. 

HANKE, M. and RÖSLER, E. 2005. Computation of local volatilities from regularized Dupire equations.   

International Journal of Theoretical and Applied Finance, 8(2): 207-221. 

HE, C., KENNEDY, J.S., COLEMAN, T.F., FORSYTH, P.A., LI, Y. and VETZAL, K.R. 2006. Calibration and 

hedging under jump diffusion. Review of Derivatives Research, 9(1): 1-35. 

HEIN, T. and HOFMANN, B. 2003. On the nature of ill-posedness of an inverse problem arising in option 

pricing. Inverse Problems, 19: 1319-1338. 

HENRY-LABORDÈRE, P. 2009. Calibration of local stochastic volatility models to market smiles: A Monte-

Carlo approach. RISK, Sep. 2009: 112-117. 

HESTON, S.L. 1993. A closed-form solution for options with stochastic volatility, with application to bond and 

currency options. Reviews of Financial Studies, 6(2): 327-343. 


MODEL CALIBRATION IN OPTION PRICING 

101 

HULL, J.C. 2011. Options, Futures and Other Derivatives. 8th edn. Prentice-Hall, Englewood Cliffs, New 

Jersey, USA. 

INGERSOLL, J.E. 1997. Valuing foreign exchange rate derivatives with a bounded exchange process.  Review 

of Derivatives Research, 1(2): 159-181. 

JACKSON, N., SÜLI, E. and HOWISON, S. 1999. Computation of deterministic volatility surfaces. Journal of 

Computational Finance, 2(2): 5-32. 

JACKWERTH, J.C. 1997. Generalized Binomial trees. Journal of Derivatives, 5(2): 7-17. 

JIANG, L. and TAO, Y. 2001. Identifying the volatility of the underlying assets from option prices. Inverse 

Problems, 17(1): 137-155. 

JIANG, L., CHEN, Q., WANG, L. and ZHANG, J.E. 2003. A new well-posed algorithm to recover implied local 

volatility. Quantitative Finance, 3(6): 451-457. 

KÄBE, C. 2010. Feasibility and Efficiency of Monte Carlo Based Calibration of Financial Market Models.  

Ph.D. thesis, University of Trier, Trier, Germany. 

KAEBE, C., MARUHN, J.H. and SACHS, E.W. 2009a. Adjoint-based Monte Carlo calibration of financial 

market models. Finance and Stochastics, 13(3): 351-379. 

KAEBE, C., MARUHN, J.H. and SACHS, E.W. 2009b. Speeding up Monte Carlo calibrations of jump diffusion 

models with adjoint calculus (submitted). 

KAHALÉ, N. 2004. An arbitrage-free interpolation of volatilities. RISK, 17(5): 102-106. 

KARATZAS, I. and SHREVE, S.E. 1998. Methods of Mathematical Finance. 1st edition. Springer-Verlag, New 

York, USA. 

KARATZAS, I. and SHREVE, S.E. 2008. Brownian Motion and Stochastic Calculus. 2nd edition. Springer-

Verlag, New York, USA. 

KELLEY, C.T. and SACHS, E.W. 1994. Multilevel algorithms for constrained compact fixed point problems. 

SIAM Journal on Scientific Computing, 15(3): 645-667. 

KLOEDEN, P.E. and PLATEN, E. 1999. Numerical Solution of Stochastic Differential Equations. 1st edn. 

Springer-Verlag, Berlin, Germany. 

KOLB, R.W. and OVERDAHL, J.A. 2010. Financial Derivatives: Pricing and Risk Management. John Wiley & 

Sons, Hoboken, New Jersey, USA. 

KOU, S.G. 2002. A jump-diffusion model for option pricing. Management Science, 48(8): 1086-1101. 

LAGNADO, R. and OSHER, S. 1997a. Reconciling differences. RISK, 10(4): 79-83. 

LAGNADO, R. and OSHER, S. 1997b. A technique for calibration derivative security pricing models: 

Numerical solution of the inverse problem. Journal of Computational Finance, 1(1): 13-25. 

LIPTON, A. 2001. Mathematical Methods For Foreign Exchange: A Financial Engineer's Approach. World 

Scientific, Singapore. 

LIPTON, A. 2002. The vol smile problem. RISK, 15(2): 61-65. 

LIPTON, A. and MCGHEE, W. 2002. Universal barriers. RISK, 15(5): 81-85. 

LOERX, A. 2011.  Adjoint Based Calibration of Local Volatility Models. Ph.D. Thesis, University of Trier, 

Trier, Germany. 

LOERX, A., MARUHN, J.H. and SACHS, E.W. 2010. The Role of Adjoints in the Calibration of Local olatility 

Models (submitted). 

LOERX, A., SCHULZE, M. and SACHS, E.W. 2011. The calibration of local volatility models using an inexact 

Gauss-Newton approach (forthcoming). 

MCINTYRE, M.L. 2001.  Performance of Dupire's implied diffusion approach under sparse and incomplete 

data. Journal of Computational Finance, 4(4): 33-84. 

MERTON, R.C. 1973. Theory of rational option pricing. The Bell Journal of Economics and Management 

Science, 4(1): 141-183. 

NOCEDAL, J. and WRIGHT, S.J. 1999. Numerical Optimization. 2nd Edition. Springer-Verlag, New York, 

USA. 

OROSI, G. 2010. Improved implementation of local volatility and its application to S&P 500 index options.   


ANDRE LOERX and EKKEHARD W. SACHS 

102 

Journal of Derivatives, 17(3): 53-64. 

PIRONNEAU, O. 2006. Calibration of barrier options. In: Fitzgibbon, W.E., Hoppe, R., Periaux, J., 

PIRONNEAU, O. and VASSILEVSKI, Yu. (eds),  Advances in Numerical Mathematics: Proc. Int. Conf. 

60th jubilee Y. Kuznetsov. Institute of Numerical Mathematics RAS. 

PIRONNEAU, O. 2007. Dupire-like identities for complex options. Comptes Rendus Mathematique, 344(2): 

127-133. 

PIRONNEAU, O. 2009. Calibration of options on a reduced basis. Journal of Computational and Applied 

Mathematics, 232(1): 139-147. 

PITERBARG, V. 2007. Markovian projection for volatility calibration. RISK, 20(4): 84-89. 

RADY, S. 1997. Option pricing in the presence of natural boundaries and a quadratic diffusion term. Finance 

and Stochastics, 1(4): 331-344. 

REN, Y., MADAN, D. and QIAN, M. 2007. Calibrating and pricing with embedded local volatility models.   

RISK, 20(9): 138-143. 

RUBINSTEIN, M. 1994. Implied Binomial trees. Journal of Finance, 49(3): 771-818. 

SACHS, E.W. and SCHU, M. 2008. Reduced order models (POD) for calibration problems in finance. Pages 

735-742 of:  Kunisch, Karl, Of, Günter and Steinbach, Olaf (eds), Numerical Mathematics and Advanced 

Applications, ENUMATH 2007. 

SACHS, E.W. and SCHU, M. 2010. Reduced order models in PIDE constrained optimization. Control and 

Cybernetics, 39(3): 661-675. 

SACHS, E.W. and STRAUSS, A.K. 2008. Efficient solution of a partial integro-differential equation in finance. 

Applied Numerical Mathematics, 58(11): 1687-1703. 

SAMPERI, D. 2002. Calibrating a diffusion pricing model with uncertain volatility: Regularization and stability. 

Mathematical Finance, 12(1): 71-87. 

SCHULZE, M. 2002. Parameter Identification for Underdetermined Systems Arising in Option Pricing Models 

and Neural Networks. Ph.D. thesis, University of Trier, Trier, Germany. 

STEIN, E.M. and STEIN, J.C. 1991. Stock price distributions with stochastic volatility: An analytic approach.  

The Review of Financial Studies, 4(4): 727-752. 

TAVELLA, D. and RANDALL, C. 2000. Pricing Financial Instruments: The Finite Difference Method. (1st 

Edition) John Wiley & Sons, New York, USA. 

TAVELLA, D., GIESE, A. and VERMEIREN, D. 2005.  Hybrid stochastic volatility calibration.  Pages 221-228 

of:  WILMOTT, P. (ed),  The Best of Wilmott 2. John Wiley & Sons, Chichester, UK. 

TIKHONOV, M. 1963. Regularization of incorrectly posed problems. Soviet Math. Doklady, 4: 1624-1627. 

TOPPER, J. 2005. Financial Engineering with Finite Elements. (1st Ed.) John Wiley & Sons, Chichester, UK. 

TURINICI, G. 2008. Local volatility calibration using an adjoint proxy. Review of Economic and Business 

Studies, 2: 93-106. 

TURINICI, G. 2009a. Calibration of local volatility using the local and implied instantaneous variance. Journal 

of Computational Finance, 13(2): 1-18. 

TURINICI, G. 2009b. Control-theoretic framework for a quasi-Newton local volatility surface inversion. Pages 

254-257 of: MAROULIS, G. and SIMOS, T.E. (eds), Computational Methods in Science and Engineering: 

Advances in Computational Science, vol. 1148. American Institute of Physics Conference Series. 

WILMOTT, P. 2006. Paul Wilmott on Quantitative Finance. (2nd Edition) John Wiley & Sons, Chichester, UK. 

ZÜHLSDORFF, C. 2001. The pricing of derivatives on assets with quadratic volatility. Applied Mathematical 

Finance, 8(4): 235-262. 

ZVAN, R., FORSYTH, P.A. and VETZAL, K.R. 1998. Robust numerical methods for PDE models of Asian 

options. Journal of Computational Finance, 1(2): 39-78. 

 
Received   15 January 2012                

Accepted   7 February 2012