Stochastic Differential Equations
G1PMD
MSc Dissertation in
Statistics and Applied Probability
Summer 2015/16
School of Mathematical Sciences
University of Nottingham
Panayiotis Charalampous
Supervisor: Dr. Andrew Wood
I have read and understood the School and University guidelines on plagiarism. I confirm
that this work is my own, apart from the acknowledged references.
Acknowledgments
First of all, I would like to express my sincere thanks to my supervisor Professor Andrew
Wood for his patience, perseverance and valuable guidance, which led me to complete my
postgraduate thesis. In addition, huge thanks go to my personal tutor Dr. Simon Preston
for his continuous support. From the first meeting, I felt his willingness to guide me
towards a positive academic year. Moreover, I would like to thank all my colleagues
for the strong relationship that we created, and I would also like to express my gratitude
to Ms Christina Gregoriou for her valuable help with the English language. Finally, special
thanks to my fiancée Eirini Tsiarli for her great patience and systematic encouragement,
and to my family for their support and understanding.
Abstract
Stochastic differential equations play a dominant role in various scientific fields such as
Finance, Chemistry and Physics. Unfortunately, most of them cannot be solved analytically
and, consequently, various numerical methods have arisen which provide approximate
solutions. The main purpose of the current thesis is to study the theory behind stochastic
differential equations, to analyse the implementation of numerical methods and to give
numerical examples of them. This thesis was conducted with three main objectives in mind:
the analysis of the theory of stochastic integration, the detailed explanation and
illustration of two well-known numerical methods, Euler-Maruyama and Milstein, and the
provision of applications of SDEs in Finance. Firstly, this thesis points out important details
regarding the theory of SDEs and analyses the structure of their solutions. Moreover,
it provides significant conclusions through the simulated approximations of the two
numerical methods and it demonstrates the crucial role of SDEs in the scientific field of
Finance. Finally, the numerical approximations have been simulated using the statistical
software R.
KEYWORDS: Stochastic differential equations, Stochastic processes, Brownian mo-
tion, Stochastic integral, Numerical methods, Black-Scholes model.
Contents
1 Introduction 3
1.1 Background . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3
1.2 Aim . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5
1.3 Objectives . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5
1.4 Structure of the thesis . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5
2 Review of Probability, Brownian Motion and Stochastic Integration 6
2.1 Stochastic Processes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6
2.2 Brownian Motion (or Wiener process) . . . . . . . . . . . . . . . . . . . . . 11
2.3 Brief mention to Stieltjes integral . . . . . . . . . . . . . . . . . . . . . . . 20
2.4 Itô's integral . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 21
2.4.1 The Itô integral for a step function . . . . . . . . . . . . . . . . . . 22
2.4.2 Itô's stochastic integral . . . . . . . . . . . . . . . . . . . . . . . . . 29
2.5 Itô's Lemma . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 35
3 Stochastic Differential Equations 37
3.1 Basic concepts . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 38
3.2 Itô's Lemma as a solving method of SDEs . . . . . . . . . . . . . . . . . . 39
3.3 Strong and Weak Solutions . . . . . . . . . . . . . . . . . . . . . . . . . . . 47
3.4 Numerical methods of SDEs . . . . . . . . . . . . . . . . . . . . . . . . . . 52
3.4.1 The Euler-Maruyama method . . . . . . . . . . . . . . . . . . . . . 53
3.4.2 The Milstein method . . . . . . . . . . . . . . . . . . . . . . . . . . 59
4 Applications in Mathematical Finance 65
4.1 Portfolios and assets . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 65
4.2 Normalizing Assets . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 70
4.3 Arbitrage . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 71
4.4 European options in a general complete market . . . . . . . . . . . . . . . 75
4.4.1 The Black-Scholes formula . . . . . . . . . . . . . . . . . . . . . . . 80
5 Conclusions and Further propositions 83
5.1 Conclusion . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 83
5.2 Further propositions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 84
A Appendix 85
A.1 Dominated convergence theorem . . . . . . . . . . . . . . . . . . . . . . . . 85
A.2 Bounded convergence theorem . . . . . . . . . . . . . . . . . . . . . . . . . 85
A.3 The Chebyshev inequality: . . . . . . . . . . . . . . . . . . . . . . . . . . . 86
A.4 Banach fixed point theorem . . . . . . . . . . . . . . . . . . . . . . . . . . 86
A.5 Levy’s theorem . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 87
A.6 The Markov inequality . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 87
A.7 Itô's representation theorem . . . . . . . . . . . . . . . . . . . . . . . . . . 88
A.8 The Novikov condition . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 88
B Coding in the statistical software R 89
B.1 R-coding for Chapter 2 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 89
B.2 R-coding for Chapter 3 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 89
1 Introduction
1.1 Background
There is no doubt that Stochastic Analysis is one of the most widespread disciplines
in modern Mathematics, both for its applications in the majority of scientific fields and
for the many mathematical areas which are required for its foundation and development.
Nowadays, one of its significant applications is the explanation of various phenomena which
involve variation over time. Specifically, this has led to the concept of stochastic models
which, in contrast with deterministic models, are defined as a tool to assess the interactions
between variables of a particular phenomenon evolving with time in the presence of uncertainty.
In addition, stochastic modelling concerns the utilization of probability theory to
model physical phenomena. Consequently, these kinds of models are used to obtain a
better approximation of the real conditions in which a physical phenomenon evolves. Moreover,
stochastic models make possible the modelling of random factors which may influence the
phenomenon. However, the attempt to represent a physical phenomenon in terms of
a stochastic model is not an easy task at all. Over the years, it has been demonstrated
that modelling a physical phenomenon, that is, translating it into equations, requires
significant experience. The procedure is to convert the phenomenon into a differential
equation, hoping to end up with a stochastic model which has experimental verification.
In Stochastic Analysis, the differential equations which are used to explain different
kinds of phenomena are called stochastic differential equations (SDEs). A stochastic
differential equation is a differential equation which contains stochastic processes in one
or more of its terms; this fact indicates that its solution is a stochastic process as well.
According to Mao (2007, p.2), a stochastic differential equation includes a variable which
represents random white noise. This particular white noise is considered to be the derivative
of Brownian motion, although several studies have shown that Brownian motion is nowhere
differentiable. Considerable excitement has been generated by the fact that this kind of
differential equation has applications outside mathematics.
For instance, in the field of Chemistry, chemical reactions which occur between molecules
can be modelled by the stochastic differential equation
\[
dX(t) = f(X_n)\,dt + C(X_n)\,dB_m(t),
\]
where X_n = (X_1, ..., X_n)^T are the numbers of molecules of the corresponding species
S_n = (S_1, ..., S_n)^T, B_m(t) = (B_1, ..., B_m)^T is a vector of m independent Brownian
motions and C is an n × m matrix (Allen, 2007, p.168). The second term originates from
the stochastic modelling of the real conditions of the phenomenon. Another relevant
example of stochastic modelling is provided in the field of Biology, where a stochastic
differential equation can model the dynamics of random populations of the same or different
species (Allen, 2007, p.147); it takes into consideration the random variations which occur
in the population structure over time. The differential equation for the dynamics of two
interacting populations is
\[
dX(t) = \mu(t, X_1, X_2)\,dt + D(t, X_1, X_2)\,dB_2(t),
\]
where X_1, X_2 are the sizes of the two populations, B_2(t) = (B_1, B_2)^T is the
two-dimensional Brownian motion and t denotes time. Furthermore, in the domain of Finance,
we encounter the stochastic differential equation for geometric Brownian motion,
\[
dX_t = \mu X_t\,dt + \sigma X_t\,dB_t,
\]
which describes the stock price in the Black-Scholes option pricing model (Mao, 2007,
p.302), (Øksendal, 2003, p.275), (Karatzas and Shreve, 1991, p.371).
Finally, it is important to emphasize that there are stochastic differential equations
which can be solved analytically, such as those for geometric Brownian motion, the
Ornstein-Uhlenbeck process, etc. Unfortunately, many stochastic differential equations
which appear in applications are impossible to solve analytically. Therefore, several
numerical methods have been invented over time in order to enable the approximate solution
of almost all stochastic differential equations.
1.2 Aim
The main purpose of this thesis is to study the theory and the particularities of stochastic
differential equations and to provide computational aspects associated with them.
1.3 Objectives
• To analyse the theory behind stochastic integration.
• To explain and illustrate two specific numerical methods of simulating SDEs.
• To present applications of SDEs in the scientific field of Finance.
1.4 Structure of the thesis
The current thesis is divided into three main chapters. Initially, in Chapter 2 we provide a
review of Probability, Brownian Motion and Stochastic Integration. Specifically, we give
the mathematical definition of a stochastic process and then we explain analytically the
main characteristics of Brownian motion. The above mathematical notions lead to the
development of both the Itô integral and Itô's Lemma.
In addition, in Chapter 3 we present Itô's Lemma as a solving method for SDEs
by providing some examples. Then, we illustrate numerical and graphical analysis of two
well-known numerical approximations, the Euler-Maruyama and Milstein methods.
Furthermore, in Chapter 4 we utilize SDEs in several domains of Finance. In particular,
the use of SDEs in the Black-Scholes model of option pricing is interpreted.
Finally, Chapter 5 presents the overall conclusions of this thesis accompanied by possible
directions for future research.
2 Review of Probability, Brownian Motion and Stochastic Integration
2.1 Stochastic Processes
In this subsection we present some fundamental definitions and properties which are
needed to define stochastic processes. This mathematical notion is directly related to
the general theme of this dissertation and it is presented here so that the thesis has a
proper flow.
Definition 2.1.1. (Mikosch, 1998, p.62). Let Ω be a sample space. A collection F of
subsets of Ω is called a σ-field if it satisfies the following properties:
(i) Ω ∈ F and ∅ ∈ F;
(ii) if A ∈ F, then A^c ∈ F, where A^c is the complement of A;
(iii) if A_1, A_2, A_3, ... ∈ F, then
\[
\bigcup_{n=1}^{\infty} A_n \in F \quad \text{and} \quad \bigcap_{n=1}^{\infty} A_n \in F.
\]
Example 2.1.2. Some examples to illustrate the above definition and its properties are
now given.
1. Let Ω = {1, 2, 3}. Then the collection
\[
F = \{\Omega, \emptyset, \{2\}, \{1, 3\}\} \tag{2.1}
\]
is a σ-field, since it satisfies Definition 2.1.1 as follows:
(i) Ω, ∅ ∈ F;
(ii) the complement of {2} belongs to F; indeed, {1, 3} is the complement of {2},
and vice versa;
(iii) {2} ∪ {1, 3} = Ω ∈ F and {2} ∩ {1, 3} = ∅ ∈ F.
2. Let us assume Ω = {1, 2, 3, 4}. Then the collection
\[
F = \{\Omega, \emptyset, \{1\}, \{2\}, \{1, 3, 4\}\} \tag{2.2}
\]
is not a σ-field, since it does not satisfy the properties of Definition 2.1.1, as we can
see below:
(i) Ω, ∅ ∈ F;
(ii) the complement of {2} belongs to F; indeed, {1, 3, 4} is the complement
of {2}. However, the complement of {1}, which is {2, 3, 4}, does not belong to F;
(iii) {1} ∪ {2} = {1, 2}, which is not in the collection F.
3. Let us assume that A_1, A_2, A_3 are subsets of Ω such that
\[
A_i \cap A_j = \emptyset, \ i \neq j, \quad \text{and} \quad A_1 \cup A_2 \cup A_3 = \Omega.
\]
The collection
\[
F = \{\emptyset, A_1, A_2, A_3, \underbrace{A_1 \cup A_2}_{A_3^c}, \underbrace{A_2 \cup A_3}_{A_1^c}, \underbrace{A_1 \cup A_3}_{A_2^c}, \Omega\}
\]
is a σ-field because it satisfies the three properties of Definition 2.1.1. Namely, the
first property is satisfied because ∅ ∈ F and Ω ∈ F. Moreover, as we can see from the
elements of F, it contains A_1^c, A_2^c and A_3^c, and therefore property (ii) of the
definition is fulfilled. Finally, property (iii) is also satisfied because Ω = A_1 ∪ A_2 ∪ A_3
and A_i ∩ A_j = ∅ for i ≠ j. This particular σ-field is the smallest σ-field that includes
the set A = {A_1, A_2, A_3}, i.e. it is σ(A).
Definition 2.1.3. (Mao, 2007, p.3). Let Ω be a set and F a σ-field consisting of subsets
of Ω. The pair (Ω, F) is called a measurable space.
Example 2.1.4. The following examples are helpful for comprehending Definition 2.1.3:
1. Assume the set Ω = {1, 2, 3} and the σ-field F_1 = {Ω, ∅, {2}, {1, 3}}. The pair
(Ω, F_1) is a measurable space.
2. Let us consider the same set Ω and the σ-field F_2 = {Ω, ∅, {1}, {2, 3}}. The pair
(Ω, F_2) is also a measurable space.
Definition 2.1.5. (Mao, 2007, p.4). Let Ω be a set, F a σ-field consisting of subsets of Ω
and P a probability measure on (Ω, F). The triple (Ω, F, P) is called a probability space.
Example 2.1.6. (i) Let Ω = {1, 2, 3} and let F_1 = {Ω, ∅, {1}, {2}, {3}, {1, 2}, {1, 3}, {2, 3}}
be a σ-field. The pair (Ω, F_1) is a measurable space. (ii) On this measurable space define
the probability measure P by
\[
P(\emptyset) = 0, \quad P(\Omega) = 1, \quad P(\{1\}) = P(\{2\}) = P(\{3\}) = \tfrac{1}{3},
\quad P(\{1,2\}) = P(\{1,3\}) = P(\{2,3\}) = \tfrac{2}{3}.
\]
The triple (Ω, F_1, P) is a probability space.
We are now ready to give the definition of a stochastic process.
Definition 2.1.7. A stochastic process is defined as a parametrized collection of random
variables {X_t}_{t∈T} defined on a probability space (Ω, F, P). It can be written as
\[
(X_t, t \in T) = (X_t(\omega), \ t \in T, \ \omega \in \Omega).
\]
Therefore, it is notable that a stochastic process has two parameters, t and ω. If the time
parameter t is fixed, then we obtain the random variable X_t(ω), ω ∈ Ω. On the other hand,
if the outcome ω is fixed, then we have a function X(·, ω) of t ∈ T. This latter function
is called a path of X_t.
The studies of Mikosch (1998, p.23), Karatzas and Shreve (1991, p.1) and Friedman (1975,
p.3) enable us to describe clearly what a stochastic process is. In essence, a stochastic
process models a phenomenon that changes in relation to time or space and can be
investigated with the aid of Probability Theory. For example, such a process can describe
the mortality of the population in a particular area, taking into consideration some
historical events for this group of people. Furthermore, through stochastic processes we
can study and describe a customer service rate depending on the number of employees and
the time it takes each of them to serve a customer.
Definition 2.1.8. (Mikosch, 1998, p.77). Let (F_t) be a family of σ-fields on Ω. We call it
a filtration if
\[
F_s \subset F_t \quad \text{for all } 0 \le s \le t.
\]
From the above relation it follows that the σ-field F_t can be considered as the information
that is available up to time t. Generally, a filtration is an increasing sequence of
information as time goes on.
Definition 2.1.9. (Mikosch, 1998, p.77). Let X = {X_t, t ∈ T} be a stochastic process. It
is said to be adapted to the filtration (F_t, t ≥ 0) if σ(X_t) ⊂ F_t for all t ≥ 0.
Generally, this means that the information which refers to the random variable X_t, up to
time t, is included in the σ-field F_t.
Subsequently, it is important to make a brief reference to a very significant example of a
stochastic process, the martingale.
Definition 2.1.10. (Ikeda and Watanabe, 1989, p.25), (Mikosch, 1998, p.80). A stochastic
process X = (X_t, t ≥ 0) is called a martingale with respect to the filtration (F_t, t ≥ 0)
if:
(i) E|X_t| < ∞ for each t ≥ 0;
(ii) X is adapted to the filtration (F_t)_{t≥0};
(iii) E(X_t | F_s) = X_s for every t, s ∈ T such that s < t. Hence, it is notable that X_s
is the best prediction of X_t given F_s. If we replace (iii) by E(X_t | F_s) ≥ X_s, then
the process is called a submartingale, and if E(X_t | F_s) ≤ X_s, then it is called a
supermartingale.
Example 2.1.11. In Mikosch (1998, p.81, Example 1.5.3) a brief answer to this example is
given. Here a more detailed solution is provided in order to assimilate Definition 2.1.10
in discrete time. Let us check whether the random walk S = (S_n)_{n≥0} built from
independent random variables with finite expectations and zero means is a martingale with
respect to the filtration (F_n, n ≥ 0). We define
\[
S_0 = 0, \qquad S_n = X_0 + X_1 + \ldots + X_n
\]
and
\[
F_n = \sigma(X_0, X_1, X_2, \ldots, X_n) \quad \text{for } n \ge 0.
\]
Property (i) of Definition 2.1.10 is satisfied since S_n is a sum of variables with finite
expectations, hence E|S_n| < ∞. Furthermore, it follows from Definition 2.1.9 that the
process S_n is adapted to F_n, since F_n = σ(X_0, X_1, X_2, ..., X_n) for n ≥ 0 implies
σ(S_n) ⊂ F_n, and so property (ii) is also fulfilled. In addition, regarding property (iii)
we have
\[
E(S_{n+1} \mid F_n) = E(S_n + X_{n+1} \mid F_n) = E(S_n \mid F_n) + \underbrace{E(X_{n+1} \mid F_n)}_{\text{independence}} = S_n + E(X_{n+1}) = S_n.
\]
In the first equality we use the definition of (S_n)_{n≥0} and the second equality follows
from the linearity of conditional expectation. For the third equality, in accordance with
Mikosch (1998, p.71), we apply the rule that if the σ-field σ(S_n) is contained in F_n,
then E(S_n | F_n) = S_n; also, because X_{n+1} is independent of F_n, the conditional
expectation E(X_{n+1} | F_n) reduces to E(X_{n+1}). Finally, in the fourth equality we use
the assumption that E(X_n) = 0, so E(X_{n+1}) = 0. Hence, the process S is a martingale.
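As an informal numerical illustration of the martingale property above, the following R
sketch (not part of the thesis code in Appendix B; the variable names and sample sizes are
chosen here purely for illustration) simulates many paths of the random walk S_n with
zero-mean increments and checks that the best prediction of S_{n+1} given S_n is S_n itself:
a regression of S_{n+1} on S_n should return intercept approximately 0 and slope
approximately 1.

## Monte Carlo illustration of E(S_{n+1} | F_n) = S_n for a zero-mean random walk
set.seed(1)
n.paths <- 10000                   # number of simulated paths
n.steps <- 20                      # time index n
X <- matrix(rnorm(n.paths * (n.steps + 1)), nrow = n.paths)   # zero-mean increments
S <- t(apply(X, 1, cumsum))        # partial sums along each path
Sn   <- S[, n.steps]               # S_n
Snp1 <- S[, n.steps + 1]           # S_{n+1}
## If S is a martingale, regressing S_{n+1} on S_n gives intercept ~ 0 and slope ~ 1
coef(lm(Snp1 ~ Sn))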
2.2 Brownian Motion (or Wiener process)
This subsection presents the concept of Brownian Motion (or, equivalently, the Wiener
process) and its properties, together with their proofs. Nowadays, it is known that Brownian
motion is one of the most significant stochastic processes in Mathematics. Its name derives
from the botanist Robert Brown, who in 1827 observed through a microscope the continuous
and irregular movement of particles suspended in water. The Brownian process plays an
important role in the theory of stochastic differential equations and constitutes one of
the cornerstones of mathematical finance and beyond.
Figure 1: Two sample paths of standard Brownian motion on the time interval [0, 1] [see
code B.1.1].
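A minimal R sketch of how such sample paths can be generated is shown below (the thesis's
own code is listed in Appendix B.1.1; the version here is an illustrative assumption, not
that code). It uses the independent Gaussian increments of condition (iii) of the definition
that follows.

## Simulate two standard Brownian motion paths on [0, 1] from independent N(0, dt) increments
set.seed(123)
n  <- 1000                         # number of time steps
dt <- 1 / n                        # step size on [0, 1]
t  <- seq(0, 1, by = dt)
B1 <- c(0, cumsum(rnorm(n, mean = 0, sd = sqrt(dt))))   # first path, B_0 = 0
B2 <- c(0, cumsum(rnorm(n, mean = 0, sd = sqrt(dt))))   # second path
plot(t, B1, type = "l", ylim = range(c(B1, B2)),
     xlab = "t", ylab = expression(B[t]))
lines(t, B2, lty = 2)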
Definition 2.2.1. Brownian Motion (or Wiener Process) (Friedman, 1975, p.36),
(Mikosch, 1998, p.33). Assume a stochastic process B = (Bt, t ∈ [0, ∞)). It is called
Brownian motion (or a Wiener process) if it satisfies the following conditions:
(i) B0 = x - this means that the process starts from the point x.
(ii) it has independent increments; that is, for every sequence 0 ≤ t_1 < t_2 < ... < t_n,
the increments
\[
B_{t_n} - B_{t_{n-1}}, \ \ldots, \ B_{t_2} - B_{t_1}
\]
are independent random variables;
(iii) the increments B_{t+h} − B_t are normally distributed with expectation zero and
variance h, i.e.
\[
B_{t+h} - B_t \sim N(0, h);
\]
(iv) Its sample paths are continuous.
Remark 2.2.2. (i) From Embrechts and Maejima(2000 p.6) and Mikosch(1998, p.33)
it can be seen that Brownian motion is closely linked to normal distribution.
(ii) Through the review of literature it is remarkable the fact that many definitions for
the Brownian motion assume that the process has 0 as a starting point. Specifically,
the condition (i) of the definition 2.2.1 is changed to B0 = 0, i.e. x = 0 and this
case is defined as a standard Brownian motion. Hence, the figure 1 (p. 7) shows
sample paths of standard Brownian motion. In this study it is determined any point
x to be a possible starting point of Brownian motion. (Mikosch, 1998, p.33), (Mao,
2007, p.15 ), (Friedman, 1975, p.36).
(iii) The condition (iv) of definition 2.2.1 can be found in many sources either as part of
the definition of Brownian motion, or as a property which follows from conditions
(i) − (iii). For instance on Mikosch(1998, p.33) it is written as a condition of the
definition. However, it is noticeable in the lecture presentation slides (Chapter 5,
p. 8/35) of Professor Philip O’Neill for the module of Stochastic Models that this
condition presents as a property of the Brownian motion. Furthermore, in Karatzas
and Shreve(1991, p.53-55) by using at first the Kolmogorov-Centsov Theorem and
then by considering the conditions (i)−(iii) of definition 2.2.1 to be the prerequisites
of the continuity, the desideratum is proved.
Corollary 2.2.3. Properties of Brownian Motion:
(i) 1. E[B_t | B_0 = x] = x
    2. Var[B_t | B_0 = x] = t
(ii) Sample paths of the Brownian motion are not differentiable.
Proof. (i) From condition (iii) we know that, with B_0 = 0, B_t has probability density
function f(y) = (1/\sqrt{2\pi t})\, e^{-y^2/(2t)}, and the expectation of a general function
g(B_t), where g is a given function, is
\[
E[g(B_t)] = \frac{1}{\sqrt{2\pi t}} \int_{-\infty}^{\infty} g(y)\, e^{-\frac{y^2}{2t}}\, dy.
\]
Writing the process started at x as x + B_t, it follows that
\[
E[B_t \mid B_0 = x] = E[x + B_t] = \frac{1}{\sqrt{2\pi t}} \int_{-\infty}^{\infty} (x + y)\, e^{-\frac{y^2}{2t}}\, dy.
\]
Therefore,
\[
E[x + B_t]
= \frac{x}{\sqrt{2\pi t}} \int_{-\infty}^{\infty} e^{-\frac{y^2}{2t}}\, dy
+ \frac{1}{\sqrt{2\pi t}} \int_{-\infty}^{\infty} y\, e^{-\frac{y^2}{2t}}\, dy
= \frac{x}{\sqrt{2\pi t}} \int_{-\infty}^{\infty} e^{-\frac{y^2}{2t}}\, dy
+ \frac{1}{\sqrt{2\pi t}} \Big[ -t\, e^{-\frac{y^2}{2t}} \Big]_{-\infty}^{\infty},
\]
and the boundary term vanishes, since \lim_{y \to \pm\infty} \big(-t\, e^{-y^2/(2t)}\big) = 0.
Setting z = y/\sqrt{2t}, so that dy = \sqrt{2t}\, dz, we obtain
\[
E[x + B_t] = \frac{x\sqrt{2t}}{\sqrt{2\pi t}} \underbrace{\int_{-\infty}^{\infty} e^{-z^2}\, dz}_{\sqrt{\pi}}
= \frac{x\sqrt{2t}\,\sqrt{\pi}}{\sqrt{2\pi t}} = x, \quad \text{as required.}
\]
For the variance, note that Var[B_t | B_0 = x] = E[(x + B_t - E[x + B_t])^2] = E[B_t^2],
so that
\[
\operatorname{Var}[B_t \mid B_0 = x] = \frac{1}{\sqrt{2\pi t}} \int_{-\infty}^{\infty} y^2\, e^{-\frac{y^2}{2t}}\, dy
= \frac{1}{\sqrt{2\pi t}} \Big[ y\,\big(-t\, e^{-\frac{y^2}{2t}}\big) \Big]_{-\infty}^{\infty}
- \frac{1}{\sqrt{2\pi t}} \int_{-\infty}^{\infty} (-t)\, e^{-\frac{y^2}{2t}}\, dy
\]
by integration by parts. The boundary term is 0, and with the substitution z = y/\sqrt{2t},
dy = \sqrt{2t}\, dz,
\[
\operatorname{Var}[B_t \mid B_0 = x] = \frac{t\sqrt{2t}}{\sqrt{2\pi t}} \underbrace{\int_{-\infty}^{\infty} e^{-z^2}\, dz}_{\sqrt{\pi}} = t, \quad \text{as required.}
\]
(ii) The basic idea for this property is to show that Brownian motion is an H-self-similar
process, as defined in Definition 2.2.5 below. This is the most important point of the
whole proof, because we will subsequently show that any H-self-similar process with
stationary increments is nowhere differentiable with probability 1. This line of reasoning
leads to our initial purpose, namely to show that Brownian motion is nowhere differentiable.
First of all, it is necessary to state some helpful definitions.
Definition 2.2.4. (Mikosch, 1998, p.30). Let X = (X_t, t ∈ T) be a stochastic process with
T a subset of R. The process X has stationary increments if
\[
X_t - X_s \stackrel{d}{=} X_{t+h} - X_{s+h} \quad \forall\, t, s \in T \text{ and } t + h, s + h \in T.
\]
Definition 2.2.5. (Embrechts and Maejima, 2000, p.3). Assume a stochastic process
X = (X_t, t = 0, 1, 2, 3, ...). It is said to be H-self-similar, for some fixed H > 0, if
for every a > 0
\[
\{X_{at}\} \stackrel{d}{=} \{a^H X_t\}.
\]
In plain words, "self-similarity" can be described as the property of a graph whereby, if
you choose any part of it, over any time interval, you obtain an image similar to the
initial one. It is important to note that you cannot get exactly the same image as the
original.
The following proposition is stated without proof in Mikosch (1998, p.36). A proof of this
result is given below.
Proposition 2.2.6. (Mikosch, 1998, p.36). Let {B_t, t ≥ 0} be a standard Brownian motion.
Then {B_t} is a 1/2-self-similar process.
Proof. The proof given by Embrechts and Maejima (2000, p.5) is only a brief description.
As a result, this thesis provides a detailed proof that {B_t} is 1/2-self-similar.
From Definition 2.2.5, we have to establish the relation
\[
\{B_{at}\} \stackrel{d}{=} \{a^{\frac{1}{2}} B_t\} \quad \forall\, a > 0,
\]
or, equivalently,
\[
\{a^{-\frac{1}{2}} B_{at}\} \stackrel{d}{=} \{B_t\} \quad \forall\, a > 0.
\]
Hence, we have to examine whether the process a^{-1/2} B_{at} satisfies the conditions of
Definition 2.2.1. Condition (i) is shown as follows: for t = 0,
\[
a^{-\frac{1}{2}} B_0 = 0, \quad \text{since } B_0 = 0.
\]
Moreover, to prove condition (ii) it is enough to show that, for all 0 ≤ t_1 ≤ t_2 ≤ ... ≤ t_k,
the (jointly Gaussian) increments
\[
a^{-\frac{1}{2}} B_{at_1} - a^{-\frac{1}{2}} B_{at_0}, \ a^{-\frac{1}{2}} B_{at_2} - a^{-\frac{1}{2}} B_{at_1}, \ \ldots, \ a^{-\frac{1}{2}} B_{at_k} - a^{-\frac{1}{2}} B_{at_{k-1}}
\]
have zero correlation. Hence, when t_i < t_j,
\[
E\big[(a^{-\frac{1}{2}} B_{at_i} - a^{-\frac{1}{2}} B_{at_{i-1}})(a^{-\frac{1}{2}} B_{at_j} - a^{-\frac{1}{2}} B_{at_{j-1}})\big]
= a^{-1} E\big[ B_{at_i} B_{at_j} - B_{at_i} B_{at_{j-1}} - B_{at_{i-1}} B_{at_j} + B_{at_{i-1}} B_{at_{j-1}} \big]
= a^{-1}\big[ at_i - at_i - at_{i-1} + at_{i-1} \big] = 0,
\]
and we conclude that a^{-1/2} B_{at} has independent increments. Regarding condition (iii)
we proceed as follows:
\[
E[a^{-\frac{1}{2}} B_{at}] = a^{-\frac{1}{2}} E[B_{at}] = 0, \qquad
E[(a^{-\frac{1}{2}} B_{at})^2] = a^{-1}\, at = t.
\]
Finally, condition (iv) follows from conditions (i)-(iii). Consequently,
\{a^{-\frac{1}{2}} B_{at}\} \stackrel{d}{=} \{B_t\}, as required.
Figure 2: Self-similarity of Brownian motion (Mikosch, 1998, p.37).
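A small R sketch (an illustrative assumption, not the thesis code) can make the
1/2-self-similarity of Proposition 2.2.6 concrete for a single time point: for fixed a > 0
and t, simulated values of B_{at} and of a^{1/2} B_t should share the same distribution,
which can be compared through their sample standard deviations and a quantile-quantile plot.

## Compare the distributions of B_{at} and a^(1/2) B_t by simulation (a = 4, t = 1)
set.seed(42)
a <- 4; t <- 1; n.sim <- 100000
B_at     <- rnorm(n.sim, mean = 0, sd = sqrt(a * t))        # B_{at} ~ N(0, a t)
scaled_B <- sqrt(a) * rnorm(n.sim, mean = 0, sd = sqrt(t))  # a^(1/2) B_t
c(sd(B_at), sd(scaled_B))               # both close to sqrt(a * t) = 2
qqplot(B_at, scaled_B); abline(0, 1)    # points close to the 45-degree line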
Proposition 2.2.7. (Mikosch, 1998, p.188). Suppose (X_t) is H-self-similar with stationary
increments for some H ∈ (0, 1). Then, for every fixed t_0,
\[
\limsup_{t \to t_0} \frac{|X_t - X_{t_0}|}{t - t_0} = \infty;
\]
namely, the sample paths of an H-self-similar process with stationary increments are
nowhere differentiable with probability 1.
Proof. This proof is sketched by Mikosch (1998, p.189); this thesis provides a more detailed
explanation of it. At first, by stationarity of the increments and self-similarity,
\[
\frac{|X_t - X_{t_0}|}{t - t_0} \stackrel{d}{=} \frac{(t - t_0)^H |X_1|}{t - t_0} = (t - t_0)^{H-1} |X_1|.
\]
Now, taking t_0 = 0, the above quantity becomes
\[
t^{H-1} |X_1| \stackrel{d}{=} \Big| \frac{X_t}{t} \Big|.
\]
Let us define a sequence t_n such that \lim_{n\to\infty} t_n = 0, and the events
\[
A(t_n) = \Big\{ \sup_{0 \le s \le t_n} \Big| \frac{X_s}{s} \Big| > x \Big\}.
\]
Since t_n decreases to 0, we have A(t_{n+1}) ⊂ A(t_n), and hence
\[
P\Big\{ \lim_{n \to \infty} A(t_n) \Big\} = \lim_{n \to \infty} P\{A(t_n)\}
\ge \limsup_{n \to \infty} P\Big\{ \Big| \frac{X_{t_n}}{t_n} \Big| > x \Big\}
= \limsup_{n \to \infty} P\big\{ t_n^{H-1} |X_1| > x \big\}
= \limsup_{n \to \infty} P\big\{ |X_1| > t_n^{1-H} x \big\} = 1,
\]
since t_n^{1-H} \to 0 as n \to \infty.
Thus, from Propositions 2.2.6 and 2.2.7 it follows that a Brownian sample path is nowhere
differentiable. Therefore, we have just completed the proofs of the properties stated in
Corollary 2.2.3.
Remark 2.2.8. As can be seen from Figure 1, which illustrates two random sample paths of
Brownian motion, there is no regularity in the trend of the paths. The non-differentiability
of Brownian motion is therefore already suggested by its graphical representation.
Theorem 2.2.9. (Embrechts and Maejima, 2000, p.4). Let the stochastic process {X_t} be
H-self-similar with stationary increments, and assume E[X_1^2] < ∞. Then
\[
E[X_t X_s] = \frac{1}{2}\big\{ t^{2H} + s^{2H} - |t - s|^{2H} \big\}\, E[X_1^2].
\]
Proof.
\[
E[X_t^2] = \operatorname{Var}[X_t] = \operatorname{Var}[t^H X_1] = t^{2H} \operatorname{Var}[X_1] = t^{2H} E[X_1^2],
\]
\[
E[X_s^2] = \operatorname{Var}[X_s] = \operatorname{Var}[s^H X_1] = s^{2H} \operatorname{Var}[X_1] = s^{2H} E[X_1^2],
\]
\[
E[(X_t - X_s)^2] = \operatorname{Var}[X_t - X_s] = \operatorname{Var}\big[ |t - s|^H X_1 \big] = |t - s|^{2H} E[X_1^2].
\]
Therefore,
\[
E[X_t X_s] = \frac{1}{2}\big\{ E[X_t^2] + E[X_s^2] - E[(X_t - X_s)^2] \big\}
= \frac{1}{2}\big\{ t^{2H} + s^{2H} - |t - s|^{2H} \big\}\, E[X_1^2].
\]
Theorem 2.2.10. E[B_t B_s] = min{t, s}.
Proof. As we can see in Embrechts and Maejima (2000, p.5), the proof is based on the fact
that standard Brownian motion is a 1/2-self-similar process, so the formula of Theorem
2.2.9 can be used with H = 1/2 (and E[B_1^2] = 1):
\[
E[B_t B_s] = \frac{1}{2}\big\{ t^{2H} + s^{2H} - |t - s|^{2H} \big\}
= \frac{1}{2}\big\{ t + s - |t - s| \big\} = \min\{t, s\},
\]
which gives the desired result.
However, a second method of proving E[B_t B_s] = min{t, s} can be provided. Let us assume
0 < t < s. Then
\[
B_s = B_t + (B_s - B_t), \qquad B_t B_s = B_t^2 + B_t (B_s - B_t),
\]
and, by the linearity of expectation and conditions (ii) and (iii) of Definition 2.2.1,
\[
E[B_t B_s] = E[B_t^2] + E\big[ B_t (B_s - B_t) \big] = t + 0 = t.
\]
Now let us assume 0 < s < t. By the same argument with the roles of s and t interchanged,
\[
E[B_t B_s] = E[B_s^2] + E\big[ B_s (B_t - B_s) \big] = s + 0 = s.
\]
Therefore, it is proved that E[B_t B_s] = min{t, s}.
Example 2.2.11. Find E[(B_{t+w} - B_t)^2] for t, w > 0.
\[
E[(B_{t+w} - B_t)^2] = E[B_{t+w}^2 - 2 B_t B_{t+w} + B_t^2]
= E[B_{t+w}^2] - 2\, E[B_t B_{t+w}] + E[B_t^2]
= (t + w) - 2t + t = w,
\]
using the linearity of expectation, conditions (ii)-(iii) of Definition 2.2.1 and Theorem
2.2.10.
The above example shows that E[(dB_t)^2] = dt, where dB_t = B_{t+w} - B_t and dt = w;
the expectation of (dB_t)^2 equals the difference between t + dt and t. Obviously,
Var[B_{t+w} - B_t] = E[(B_{t+w} - B_t)^2], since E[B_{t+w} - B_t] = 0.
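A quick Monte Carlo check of these two facts in R (an illustrative sketch under the
assumption of standard Brownian motion; not part of the thesis code) is the following:

## Check E[B_t B_s] = min(t, s) and E[(B_{t+w} - B_t)^2] = w by simulation
set.seed(7)
n.sim <- 200000
s <- 0.4; t <- 1.0; w <- 0.5
Bs  <- rnorm(n.sim, 0, sqrt(s))               # B_s
Bt  <- Bs + rnorm(n.sim, 0, sqrt(t - s))      # B_t = B_s + independent increment
Btw <- Bt + rnorm(n.sim, 0, sqrt(w))          # B_{t+w}
mean(Bt * Bs)          # close to min(t, s) = 0.4
mean((Btw - Bt)^2)     # close to w = 0.5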
2.3 Brief mention to Stieltjes integral
In this subsection we give a brief account of the Stieltjes integral and explain why this
concept cannot be used to define a stochastic integral. We start with the definition of the
Stieltjes integral of a step function.
Definition 2.3.1. (Dragomir, 2001, p.42). Consider a step function
\[
h = h_0 \mathbf{1}_{\{0\}} + \sum_{i=1}^{n} h_i \mathbf{1}_{(t_i, t_{i+1}]},
\]
where {t_i} is a partition of the interval [0, t]. Let m be a real-valued function on R_+.
The Stieltjes integral of h with respect to m is defined as
\[
\int_0^t h\, dm := \sum_{i=1}^{n} h_i \big[ m(t_{i+1}) - m(t_i) \big].
\]
A continuous function h can be approximated by a sequence h_n of step functions, in the
sense that h_n → h. From this sequence we can create a new sequence consisting of the
Stieltjes integrals \int_0^t h_n\, dm, each of them defined by Definition 2.3.1. The limit,
as n → ∞, exists and, in fact, if h_n and \tilde{h}_n are two different sequences of step
functions which approximate the same function h, then the limits \int_0^t h_n\, dm and
\int_0^t \tilde{h}_n\, dm coincide. Consequently:
Definition 2.3.2. Let h be a continuous function and h_n a sequence of step functions which
approximates h. The Stieltjes integral of h with respect to a function m is defined to be
\[
\int_0^t h\, dm := \lim_{n \to \infty} \int_0^t h_n\, dm.
\]
The function m is called the integrator.
The following theorem shows the significance of the total variation of the integrator in
the Stieltjes integral. The proof is omitted since it is provided in many references, e.g.
Dragomir (2001).
Theorem 2.3.3. Suppose m has locally bounded variation and h is a continuous function.
Then the Stieltjes integral \int_0^t h\, dm exists and satisfies the condition
\[
\Big| \int_0^t h\, dm \Big| \le \int_0^t |h|\, |dm| \le \sup_{0 \le s \le t} |h(s)| \int_0^t |dm|,
\]
where
\[
\int_0^t |dm| := \sup \Big\{ \sum_{i=1}^{n} |m(t_{i+1}) - m(t_i)| \Big\}
\]
and {t_i} is a partition of [0, t]; the supremum is taken over all partitions.
The above theorem shows that if an integrator does not have finite variation, then the
Stieltjes integral is difficult to define.
Consider now that the functions h and m are stochastic processes, i.e. h = h(t, ω) and
m = m(t, ω), and define m(t, ω) = B_t, so that the integrator is a Brownian motion. Theorem
2.3.3 presents the main reason why a Stieltjes integral cannot exist with a Brownian motion
as integrator: a Brownian motion is a function of unbounded variation. Consequently, this
specific characteristic means that the Stieltjes integral cannot be defined with respect to
a Brownian motion, since the integrator must be a function of bounded variation. Therefore,
the Stieltjes integral cannot define stochastic integrals. The significant step towards
solving this problem is to take the limit as n → ∞ in L^2. This change in the sense of the
limit is the vital difference between the Itô integral and the Stieltjes integral.
2.4 Itô's integral
As we saw in a previous subsection, Brownian motion is a function which is nowhere
differentiable. However, integration can still be applied to it. Hence, in this subsection
we turn to the integration of a stochastic process with Brownian motion as the integrator.
During the Second World War, the Japanese mathematician Kiyoshi Itô indicated the way to
define this kind of integral, and thus it is called the Itô stochastic integral. Today,
this kind of integration has applications in many scientific fields, such as Finance,
Physics, etc. Furthermore, in this subsection we will see that the variation of Brownian
motion is the main idea behind the definition of the Itô integral. Initially, we define the
stochastic integral for stochastic step functions and then for "suitable" stochastic
processes.
Definition 2.4.1. A stochastic process f(t), t ∈ [a, b], is called a step function if there
is a finite sequence of numbers a = t_0 < t_1 < ... < t_n = b and a finite sequence of
random variables f_0, f_1, f_2, ..., f_{n-1} such that
\[
f(t) = f_j \quad \text{if } t \in (t_j, t_{j+1}], \ j = 1, 2, \ldots, n-1, \qquad \text{and } f(t) = f_0 \text{ for } t \in [t_0, t_1].
\]
Moreover, this kind of stochastic process f(t) can be written in the form
\[
f(t) = \sum_{j=0}^{n-1} f_j \mathbf{1}_{(t_j, t_{j+1}]}(t) = f_0 \mathbf{1}_{[t_0, t_1]}(t) + \sum_{j=1}^{n-1} f_j \mathbf{1}_{(t_j, t_{j+1}]}(t), \tag{2.3}
\]
where
\[
\mathbf{1}_{(t_j, t_{j+1}]}(t) =
\begin{cases}
0, & \text{if } t \notin (t_j, t_{j+1}], \\
1, & \text{if } t \in (t_j, t_{j+1}].
\end{cases}
\]
From now on, we will denote the set of step functions on [a, b] by M_{step}([a, b]).
From the above definition it is worth mentioning that the indicator function is the reason
for the appearance of steps in the graph of a step function.
2.4.1 The Itô integral for a step function
We now define the Itô stochastic integral for a step function.
Definition 2.4.2. The Itô integral of a step function. Let f be a stochastic step function
of the form (2.3). The Itô integral of f on [a, b] with respect to Brownian motion is
defined as
\[
I(f(t)) = \int_a^b f(t)\, dB_t = \sum_{j=0}^{n-1} f_j \big( B_{t_{j+1}} - B_{t_j} \big).
\]
Theorem 2.4.3. The Itô stochastic integral of a step function has the following properties:
(i) The Itô integral is linear, i.e. if f(t) and g(t) are two step functions, then
\[
I(\lambda_1 f(t) + \lambda_2 g(t)) = \lambda_1 I(f(t)) + \lambda_2 I(g(t));
\]
(ii)
\[
E[I(f(t))] = E\Big[ \int_a^b f(t)\, dB_t \Big] = 0;
\]
(iii) the "Itô isometry":
\[
E\big[ I(f(t))^2 \big] = E\Big[ \Big( \int_a^b f(t)\, dB_t \Big)^2 \Big] = \int_a^b E\big[ f(t)^2 \big]\, dt.
\]
Proof. (i) In Mao (2007, p.20), Allen (2007, p.70) and Mikosch (1998, p.107) this proof is
left to the reader. The current thesis includes it, as follows:
\[
I(\lambda_1 f(t) + \lambda_2 g(t)) = \int_a^b \big( \lambda_1 f(t) + \lambda_2 g(t) \big)\, dB_t
\stackrel{\text{Def. 2.4.2}}{=} \sum_{j=0}^{n-1} (\lambda_1 f_j + \lambda_2 g_j)\big( B_{t_{j+1}} - B_{t_j} \big)
= \lambda_1 \sum_{j=0}^{n-1} f_j \big( B_{t_{j+1}} - B_{t_j} \big) + \lambda_2 \sum_{j=0}^{n-1} g_j \big( B_{t_{j+1}} - B_{t_j} \big)
\]
\[
= \lambda_1 \int_a^b f(t)\, dB_t + \lambda_2 \int_a^b g(t)\, dB_t = \lambda_1 I(f(t)) + \lambda_2 I(g(t)).
\]
(ii) By Definition 2.4.2,
\[
E[I(f(t))] = E\Big[ \int_a^b f(t)\, dB_t \Big] = E\Big[ \sum_{i=0}^{n-1} f_i \big( B_{t_{i+1}} - B_{t_i} \big) \Big]
= \sum_{i=0}^{n-1} E\big[ f_i (B_{t_{i+1}} - B_{t_i}) \big]
\stackrel{(*)}{=} \sum_{i=0}^{n-1} E[f_i]\, E[B_{t_{i+1}} - B_{t_i}]
\stackrel{\text{Def. 2.2.1}}{=} 0.
\]
Explanation of (*): as we know, the increment B_{t_{i+1}} - B_{t_i} of Brownian motion is
independent of F_{t_i}. Also, since the random variable f_i is F_{t_i}-measurable, we
conclude that f_i and B_{t_{i+1}} - B_{t_i} are independent.
(iii) In some references this proof is presented without detailed explanation. Therefore,
this thesis provides a detailed step-by-step proof in order to obtain a better understanding
of the methods used. Let f(t) be a step function of the form (2.3). By Definition 2.4.2 its
stochastic integral is
\[
I(f(t)) = \sum_{j=0}^{n-1} f_j \big( B_{t_{j+1}} - B_{t_j} \big).
\]
We can then calculate the expectation of the square of this quantity. Indeed, separating
the diagonal and off-diagonal terms of the sum,
\[
|I(f(t))|^2 = \sum_{j=0}^{n-1} f_j^2 \big( B_{t_{j+1}} - B_{t_j} \big)^2
+ 2 \sum_{k < j} f_j \big( B_{t_{j+1}} - B_{t_j} \big)\, f_k \big( B_{t_{k+1}} - B_{t_k} \big).
\]
The increment B_{t_{j+1}} - B_{t_j} of Brownian motion is independent of anything that
happened before time t_j. Since the random variable f_j is F_{t_j}-measurable, the random
variables B_{t_{j+1}} - B_{t_j} and f_j are independent, and the same holds for any function
of these variables. Hence
\[
\sum_{j=0}^{n-1} E\big[ f_j^2 (B_{t_{j+1}} - B_{t_j})^2 \big]
= \sum_{j=0}^{n-1} E[f_j^2]\, E\big[ (B_{t_{j+1}} - B_{t_j})^2 \big]
= \sum_{j=0}^{n-1} E[f_j^2]\, (t_{j+1} - t_j),
\]
since E[(B_{t_{j+1}} - B_{t_j})^2] = t_{j+1} - t_j (the same argument as in Example 2.2.11).
Moreover, for k < j, the increment B_{t_{j+1}} - B_{t_j} is independent of f_j, f_k and
B_{t_{k+1}} - B_{t_k}, and therefore
\[
E\big[ f_j (B_{t_{j+1}} - B_{t_j})\, f_k (B_{t_{k+1}} - B_{t_k}) \big]
= E\big[ f_j f_k (B_{t_{k+1}} - B_{t_k}) \big]\, E\big[ B_{t_{j+1}} - B_{t_j} \big] = 0,
\]
since E[B_{t_{j+1}} - B_{t_j}] = 0 by the conditions on Brownian motion. Applying the above
steps, it follows that
\[
E\big[ I(f(t))^2 \big] = \sum_{j=0}^{n-1} E[f_j^2]\, (t_{j+1} - t_j).
\]
Afterwards, we proceed to the following step:
\[
|f(t)|^2 = \sum_{j=0}^{n-1} f_j^2 \mathbf{1}_{[t_j, t_{j+1})}(t)
+ \sum_{k < j} f_k f_j \mathbf{1}_{[t_j, t_{j+1})}(t)\, \mathbf{1}_{[t_k, t_{k+1})}(t)
= \sum_{j=0}^{n-1} f_j^2 \mathbf{1}_{[t_j, t_{j+1})}(t),
\]
since \mathbf{1}_{[t_j, t_{j+1})}(t)\, \mathbf{1}_{[t_k, t_{k+1})}(t) = 0 for k < j. Then,
following the previous steps, we have
\[
E\Big[ \int_a^b |f(t)|^2\, dt \Big]
\stackrel{(2.3)}{=} E\Big[ \int_a^b \Big| \sum_{j=0}^{n-1} f_j \mathbf{1}_{[t_j, t_{j+1})}(t) \Big|^2\, dt \Big]
= \sum_{j=0}^{n-1} E[f_j^2] \int_a^b \mathbf{1}_{[t_j, t_{j+1})}(t)\, dt
= \sum_{j=0}^{n-1} E[f_j^2]\, (t_{j+1} - t_j).
\]
Consequently, the Itô isometry is proved (Mao, 2007, p.19), (Øksendal, 2003, p.29).
Example 2.4.4. Let {B_t : t ≥ 0} be a standard Brownian motion, i.e. B_0 = 0. Calculate the
expectation and the standard deviation of \int_0^{\infty} e^{-t}\, dB_t.
Firstly, for the expectation we have, by property (ii) of Theorem 2.4.3,
\[
E\Big[ \int_0^{\infty} e^{-t}\, dB_t \Big] = 0.
\]
For the variance we know that Var[X] = E[X^2] - E[X]^2. Since the expectation of the Itô
integral is 0,
\[
\operatorname{Var}\Big[ \int_0^{\infty} e^{-t}\, dB_t \Big]
= E\Big[ \Big( \int_0^{\infty} e^{-t}\, dB_t \Big)^2 \Big]
\stackrel{\text{Itô isometry}}{=} E\Big[ \int_0^{\infty} e^{-2t}\, dt \Big]
= \Big[ -\tfrac{1}{2} e^{-2t} \Big]_0^{\infty} = \frac{1}{2}.
\]
Finally, the standard deviation equals 1/\sqrt{2}.
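The result of Example 2.4.4 can be checked numerically. The R sketch below (an illustrative
assumption rather than thesis code) approximates the Itô integral by the finite sum of
Definition 2.4.2 on a fine grid, truncating the infinite horizon at a large time:

## Monte Carlo check: the integral of exp(-t) dB_t has mean 0 and sd 1/sqrt(2) ~ 0.707
set.seed(11)
T.max <- 10; n <- 5000; dt <- T.max / n      # truncate [0, Inf) at T.max
t.grid <- seq(0, T.max - dt, by = dt)        # left endpoints t_j
n.sim <- 5000
I <- replicate(n.sim, {
  dB <- rnorm(n, 0, sqrt(dt))                # Brownian increments
  sum(exp(-t.grid) * dB)                     # step-function approximation of the Ito integral
})
c(mean(I), sd(I))                            # approximately 0 and 0.707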
Subsequently, we define a more general class of stochastic processes (not only step
functions) for which the Itô stochastic integral can be defined.
Definition 2.4.5. A stochastic process f(t) belongs to the set M^2([a, b]) if it is adapted
to the filtration F_t = σ(B_s, s ≤ t) and it satisfies the condition
\[
E\Big[ \int_a^b |f(t)|^2\, dt \Big] < \infty.
\]
The stochastic processes that belong to M^2([a, b]) can be approximated by step functions,
as is demonstrated by the theorem below.
Theorem 2.4.6. For any f ∈ M^2([a, b]) there is a sequence of step functions f_{step,n}(t)
such that
\[
\lim_{n \to \infty} E\Big[ \int_a^b |f(t) - f_{step,n}(t)|^2\, dt \Big] = 0,
\]
i.e. the stochastic process f is the limit of f_{step,n}(t).
Proof. In this proof we follow the three steps of Mao (2007, p.20), but with more detailed
explanations. Before starting, it is useful to give a short outline of the three steps and
the main idea behind them; this will help the reader to understand the reasoning of the
proof and to adapt to its particularities.
The first step shows that any stochastic process which belongs to M^2([a, b]) can be
approximated by a sequence of bounded processes in M^2([a, b]). The second step shows that
any bounded stochastic process in M^2([a, b]) can be approximated by a sequence of bounded
and continuous processes in M^2([a, b]). Consequently, any stochastic process in
M^2([a, b]) can be approximated by a sequence of bounded and continuous processes in
M^2([a, b]). Finally, the third step demonstrates that any bounded and continuous process
in M^2([a, b]) can be approximated by a sequence of step functions. The composition of
these three steps shows that any stochastic process which belongs to M^2([a, b]) can be
approximated by a sequence of step functions. We are now ready to present the three steps
analytically.
1st step: Let f(t) ∈ M^2([a, b]). We define the sequence of stochastic processes
φ_n(t) = [-n ∨ f(t)] ∧ n. Each φ_n(t) is bounded (by n) and dominated by |f(t)|, so it
belongs to M^2([a, b]) for every n. Furthermore,
\[
f(t) = \lim_{n \to \infty} \varphi_n(t),
\]
and by the dominated convergence theorem [see A.1] it follows that
\[
\lim_{n \to \infty} E\Big[ \int_a^b |f(t) - \varphi_n(t)|^2\, dt \Big] = 0.
\]
2nd step: Assume φ(t) ∈ M^2([a, b]) is bounded. We create a sequence ψ_n(t) as follows: for
every n there is a continuous function ρ_n : R → R_+ which vanishes outside [0, 1/n] and
satisfies \int_{-\infty}^{\infty} \rho_n(t)\, dt = 1. We then set
\[
\psi_n(t) = \int_a^b \rho_n(s - t)\, \varphi(s)\, ds. \tag{2.4}
\]
It is worth noting that ψ_n(t) is a sequence of stochastic processes because φ(s) is a
stochastic process too. The integral (2.4) is a Riemann-Stieltjes integral and, by its
properties and the fact that φ(t) is bounded, we conclude that the sequence ψ_n(t) consists
of continuous, bounded processes. Since ψ_n(t) ∈ M^2([a, b]), the bounded convergence
theorem [see A.2] gives
\[
\lim_{n \to \infty} E\Big[ \int_a^b |\varphi(t) - \psi_n(t)|^2\, dt \Big] = 0.
\]
3rd step: Finally, if ψ(t) ∈ M^2([a, b]) is bounded and continuous, we can create the
step-function sequence f_{step,n} as
\[
f_{step,n}(t) = \psi(a)\, \mathbf{1}_{[a,\, a + \frac{b-a}{n}]}(t)
+ \sum_{i=1}^{n-1} \psi\Big( a + i\,\frac{b-a}{n} \Big)\, \mathbf{1}_{(a + i\frac{b-a}{n},\, a + (i+1)\frac{b-a}{n}]}(t).
\]
The above sequence is bounded, and by applying the bounded convergence theorem [see A.2]
again we conclude that
\[
\lim_{n \to \infty} E\Big[ \int_a^b |\psi(t) - f_{step,n}(t)|^2\, dt \Big] = 0.
\]
Combining the above three steps and using the triangle inequality, we obtain the final
result:
\[
\lim_{n \to \infty} E\Big[ \int_a^b |f(t) - f_{step,n}(t)|^2\, dt \Big] = 0.
\]
Example 2.4.7. Let us ask how the partition {t_j} may be chosen. One example of such a
choice is given below. Assume the interval [0, T]. One possible partition of this interval
is 0 = t_0^n < t_1^n < t_2^n < ... < t_j^n < ... < t_n^n = T, where j = 0, ..., n and
t_j^n = jT/n. Using this partition it is possible to choose a sequence of step functions as
an approximation of the stochastic process f(t, ω) on the interval [0, T]. This sequence is
\[
f_{step,n}(t) = \sum_{j=0}^{n-1} f(t_j, \omega)\, \mathbf{1}_{[t_j, t_{j+1})}(t).
\]
If the stochastic process f(t, ω) is continuous with respect to the variable t, it follows
that
\[
\lim_{n \to \infty} E\Big[ \int_0^T |f(t) - f_{step,n}(t)|^2\, dt \Big] = 0,
\]
namely the sequence of step functions f_{step,n} approximates the stochastic process f.
For instance, let f(t, ω) = B_t^2. An approximation of the stochastic process f is
\[
f_{step,n}(t) = \sum_{j=0}^{n-1} B_{\frac{jT}{n}}^2\, \mathbf{1}_{[\frac{jT}{n}, \frac{(j+1)T}{n})}(t).
\]
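The quality of this approximation can be illustrated numerically. The R sketch below (a
hedged illustration, not the thesis code) simulates one Brownian path on a fine grid,
builds the step-function approximation of f(t, ω) = B_t^2 for a coarser partition with n
pieces, and reports the sample value of the integral of |f - f_{step,n}|^2, which shrinks
as n grows:

## Step-function approximation error for f(t) = B_t^2 on [0, T]
set.seed(3)
T.end <- 1; n.fine <- 10000; dt <- T.end / n.fine
t <- seq(0, T.end - dt, by = dt)
B <- c(0, cumsum(rnorm(n.fine - 1, 0, sqrt(dt))))   # Brownian path at the fine grid points
f <- B^2                                            # f(t) = B_t^2
approx_error <- function(n) {
  k <- n.fine / n                                   # fine steps per partition interval
  f_step <- rep(f[seq(1, n.fine, by = k)], each = k)  # B_{t_j}^2 held constant on [t_j, t_{j+1})
  sum((f - f_step)^2) * dt                          # approximates the integral of |f - f_step|^2
}
sapply(c(10, 100, 1000), approx_error)              # decreases towards 0 as n grows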
2.4.2 Itô's stochastic integral
As we have seen, any stochastic process f ∈ M^2([a, b]) can be approximated by a sequence
of step functions f_{step,n}. Moreover, since the stochastic integral has already been
defined for stochastic step functions, we are ready to define the Itô stochastic integral
as follows.
Definition 2.4.8. Let f ∈ M^2([a, b]). The Itô integral of the stochastic process f with
respect to Brownian motion is defined by the limit
\[
I(f(t)) = \int_a^b f(t)\, dB_t := \lim_{n \to \infty} I(f_{step,n}(t)) = \lim_{n \to \infty} \int_a^b f_{step,n}(t)\, dB_t,
\]
where
\[
\lim_{n \to \infty} E\Big[ \Big| \int_a^b \big( f(t) - f_{step,n}(t) \big)\, dB_t \Big|^2 \Big] = 0.
\]
As we can see, I(f(t)) = \int_a^b f(t)\, dB_t is a random variable and it is the L^2 limit
of the sequence of random variables I(f_{step,n}(t)) = \int_a^b f_{step,n}(t)\, dB_t, i.e.
\[
\lim_{n \to \infty} E\big[ |I(f(t)) - I(f_{step,n}(t))|^2 \big] = 0.
\]
The existence of the limit in the above definition is ensured by the convergence of
I(f_{step,n}(t)) = \int_a^b f_{step,n}(t)\, dB_t in L^2(Ω, F, P). Hence we arrive at the
following theorem.
Theorem 2.4.9. Let f ∈ M^2([a, b]) and let f_{step,n}(t) be a step-function sequence which
approximates the process f(t). Then the sequence of stochastic integrals
I(f_{step,n}(t)) = \int_a^b f_{step,n}(t)\, dB_t converges to a square-integrable random
variable, i.e. it converges in L^2(Ω, F, P).
Proof. Because of the completeness of L^2, it is enough to show that the sequence
s_n := I(f_{step,n}(t)) is a Cauchy sequence in L^2, i.e. \|s_n - s_m\|_{L^2} \to 0 as
n, m \to \infty. Indeed,
\[
\|s_n - s_m\|_{L^2} := \big( E|s_n - s_m|^2 \big)^{\frac{1}{2}}
= \Big( E\Big| \int_a^b f_{step,n}(t)\, dB_t - \int_a^b f_{step,m}(t)\, dB_t \Big|^2 \Big)^{\frac{1}{2}}
\stackrel{\text{Thm 2.4.3 (i)}}{=} \Big( E\Big| \int_a^b \big( f_{step,n}(t) - f_{step,m}(t) \big)\, dB_t \Big|^2 \Big)^{\frac{1}{2}}
\]
\[
\stackrel{\text{Itô isometry}}{=} \Big( E \int_a^b \big| f_{step,n}(t) - f_{step,m}(t) \big|^2\, dt \Big)^{\frac{1}{2}}
= \big\| f_{step,n}(t) - f(t) + f(t) - f_{step,m}(t) \big\|_{M^2}
\le \big\| f_{step,n}(t) - f(t) \big\|_{M^2} + \big\| f(t) - f_{step,m}(t) \big\|_{M^2} \to 0
\]
as n, m → ∞, because f_{step,n}(t) → f(t) in M^2. For that reason, the limit of
I(f_{step,n}(t)) exists in L^2(Ω, F, P) and is denoted I(f(t)); this follows from the
completeness of L^2(Ω, F, P).
Hence, through the definition of the stochastic integral I(f(t)) we manage to overcome the
main issue, the infinite total variation of Brownian motion, by exploiting the fact that
its quadratic variation is finite.
Corollary 2.4.10. For all f, g ∈ M^2([0, T]) it holds that
\[
E[I(f(t))\, I(g(t))] = E\Big[ \int_0^T f(t)\, dB_t \cdot \int_0^T g(t)\, dB_t \Big] = E\Big[ \int_0^T f(t)\, g(t)\, dt \Big].
\]
Proof. We use the polarization formula
\[
ab = \frac{1}{4}\big( |a + b|^2 - |a - b|^2 \big) \tag{2.5}
\]
with a = I(f) and b = I(g). Taking expectations and substituting a and b into (2.5), it
follows that
\[
E[I(f)\, I(g)] = \frac{1}{4}\Big( E\big[ |I(f) + I(g)|^2 \big] - E\big[ |I(f) - I(g)|^2 \big] \Big)
= \frac{1}{4}\Big( E\big[ |I(f + g)|^2 \big] - E\big[ |I(f - g)|^2 \big] \Big), \tag{2.6}
\]
where the linearity of the Itô stochastic integral has been used. However, from the Itô
isometry,
\[
E\big[ |I(f + g)|^2 \big] = E\Big[ \int_0^T |f + g|^2\, dt \Big], \qquad
E\big[ |I(f - g)|^2 \big] = E\Big[ \int_0^T |f - g|^2\, dt \Big].
\]
Thus,
\[
E[I(f(t))\, I(g(t))] = \frac{1}{4}\Big( E\Big[ \int_0^T |f + g|^2\, dt \Big] - E\Big[ \int_0^T |f - g|^2\, dt \Big] \Big)
= \frac{1}{4}\, E\Big[ \int_0^T \big( |f + g|^2 - |f - g|^2 \big)\, dt \Big]
= \frac{1}{4}\, E\Big[ \int_0^T 4 f g\, dt \Big] = E\Big[ \int_0^T f g\, dt \Big].
\]
The above proof is presented in different versions by Karatzas and Shreve (1991, p.180)
and Mao (2007, p.27).
An example showing how a stochastic integral can be computed directly from its definition
is now given. This particular example is solved by Øksendal (2003, p.29); here a more
detailed derivation is provided.
Example 2.4.11. Using the definition of the Itô stochastic integral, prove that
\[
I(B_t) = \int_0^T B_t\, dB_t = \frac{1}{2} B_T^2 - \frac{T}{2}.
\]
The integrand is the stochastic process f(t, ω) = B_t. We choose the partition
0 = t_0^n < t_1^n < t_2^n < ... < t_j^n < ... < t_n^n = T, where j = 0, ..., n and
t_j^n = jT/n, and its approximation
\[
f_{step,n}(t) = \sum_{j=0}^{n-1} B_{t_j^n}\, \mathbf{1}_{[t_j^n, t_{j+1}^n)}(t).
\]
Therefore, we obtain the sequence of random variables
\[
I(f_{step,n}) = \sum_{j=0}^{n-1} B_{t_j^n} \big( B_{t_{j+1}^n} - B_{t_j^n} \big),
\]
whose limit is the stochastic integral we want to find. Subsequently, we use the identity
\[
a(b - a) = \frac{1}{2}(b^2 - a^2) - \frac{1}{2}(b - a)^2,
\]
substituting a = B_{t_j^n}, b = B_{t_{j+1}^n}. Then it follows that
\[
B_{t_j^n} \big( B_{t_{j+1}^n} - B_{t_j^n} \big) = \frac{1}{2}\big( B_{t_{j+1}^n}^2 - B_{t_j^n}^2 \big) - \frac{1}{2}\big( B_{t_{j+1}^n} - B_{t_j^n} \big)^2.
\]
Taking sums from j = 0 to j = n - 1, the first sum telescopes:
\[
I(f_{step,n}) = \frac{1}{2} \sum_{j=0}^{n-1} \big( B_{t_{j+1}^n}^2 - B_{t_j^n}^2 \big) - \frac{1}{2} \sum_{j=0}^{n-1} \big( B_{t_{j+1}^n} - B_{t_j^n} \big)^2
= \frac{1}{2}\big( B_{t_n^n}^2 - B_{t_0^n}^2 \big) - \frac{1}{2} \sum_{j=0}^{n-1} \big( B_{t_{j+1}^n} - B_{t_j^n} \big)^2
= \frac{1}{2} B_T^2 - \frac{1}{2} \sum_{j=0}^{n-1} \big( B_{t_{j+1}^n} - B_{t_j^n} \big)^2,
\]
using B_{t_0^n} = B_0 = 0. It is therefore sufficient to estimate the L^2 limit of the
random variable \frac{1}{2}\sum_{j=0}^{n-1} \big( B_{t_{j+1}^n} - B_{t_j^n} \big)^2. We
claim that this limit is \frac{1}{2} T. Indeed,
\[
E\Big| \frac{1}{2} \sum_{j=0}^{n-1} \big( B_{t_{j+1}^n} - B_{t_j^n} \big)^2 - \frac{1}{2} T \Big|^2
= E\Big[ \frac{1}{4} \sum_{j=0}^{n-1} \big( B_{t_{j+1}^n} - B_{t_j^n} \big)^4
+ \frac{1}{2} \sum_{k < j} \big( B_{t_{j+1}^n} - B_{t_j^n} \big)^2 \big( B_{t_{k+1}^n} - B_{t_k^n} \big)^2
- \frac{T}{2} \sum_{j=0}^{n-1} \big( B_{t_{j+1}^n} - B_{t_j^n} \big)^2 + \frac{T^2}{4} \Big]
\]
\[
= \frac{1}{4} \sum_{j=0}^{n-1} E\big[ \big( B_{t_{j+1}^n} - B_{t_j^n} \big)^4 \big]
+ \frac{1}{2} \sum_{k < j} E\big[ \big( B_{t_{j+1}^n} - B_{t_j^n} \big)^2 \big]\, E\big[ \big( B_{t_{k+1}^n} - B_{t_k^n} \big)^2 \big]
- \frac{T}{2} \sum_{j=0}^{n-1} E\big[ \big( B_{t_{j+1}^n} - B_{t_j^n} \big)^2 \big] + \frac{T^2}{4}
\]
\[
= \frac{3 n T^2}{4 n^2} + \Big( \frac{T^2}{4} - \frac{T^2}{4n} \Big) - \frac{n T^2}{2n} + \frac{T^2}{4}
= \frac{T^2}{2n} \to 0
\]
as n → ∞, where we used the independence of the increments together with
E[(B_{t_{j+1}^n} - B_{t_j^n})^2] = T/n and E[(B_{t_{j+1}^n} - B_{t_j^n})^4] = 3(T/n)^2.
Therefore,
\[
\int_0^T B_t\, dB_t = \frac{1}{2} B_T^2 - \frac{T}{2}.
\]
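The identity of Example 2.4.11 can also be verified numerically. The following R sketch (an
illustrative assumption, not the thesis code) forms the left-point Itô sums along simulated
paths and compares them with the closed-form expression (1/2)B_T^2 - T/2:

## Numerical check that sum B_{t_j} (B_{t_{j+1}} - B_{t_j}) ~ 0.5 * B_T^2 - T/2
set.seed(5)
T.end <- 1; n <- 5000; dt <- T.end / n
n.sim <- 2000
err <- replicate(n.sim, {
  dB <- rnorm(n, 0, sqrt(dt))
  B  <- c(0, cumsum(dB))                       # B_{t_0}, ..., B_{t_n}
  ito_sum <- sum(B[1:n] * dB)                  # left-point (Ito) sum
  ito_sum - (0.5 * B[n + 1]^2 - T.end / 2)     # difference from the closed form
})
c(mean(err), sd(err))                          # both close to 0 (sd is of order 1/sqrt(n))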
The above example indicates that the Itô stochastic integral has different properties from
the Riemann-Stieltjes integral studied in real analysis. This can be seen from
\[
\int_0^T B_t\, dB_t = \frac{1}{2} B_T^2 \underbrace{- \frac{T}{2}}_{\text{unexpected term}}.
\]
The indicated term is unexpected from the viewpoint of Riemann-Stieltjes integration. Its
appearance is a consequence of the definition and properties of the Itô stochastic integral
and of the fact that the integration is carried out with respect to Brownian motion.
Specifically, the unbounded variation of Brownian motion does not allow the definition of
the Riemann-Stieltjes integral to be applied to it. Consequently, the definition of the Itô
stochastic integral is imposed by this particularity of Brownian motion, as was seen in the
above example. Obviously, the above method of computing a stochastic integral directly from
the definition is complicated. As a result, the next subsection presents another method
which makes the calculation of stochastic integrals much easier.
According to the properties of the stochastic integral, we are able to state the following
theorem for the stochastic process I_t.
Theorem 2.4.12. Let M^2([0, T]) be the set of stochastic processes f which satisfy the
condition
\[
E\Big[ \int_0^T |f(t)|^2\, dt \Big] < \infty
\]
and are adapted to the filtration F_t = σ(B_s, s ≤ t). Suppose f ∈ M^2([0, T]),
0 ≤ t ≤ T, and set
\[
I_t = \int_0^t f(s)\, dB_s.
\]
The following properties hold:
(i) the stochastic process I_t is a square-integrable continuous martingale;
(ii) the quadratic variation of I_t is given by
\[
\langle I \rangle_t = \int_0^t |f(s)|^2\, ds.
\]
Proof. (i) In order to show that the stochastic process I_t is a square-integrable
continuous martingale we have to prove:
(1) E[|I_t|] < ∞;
(2) E[I_t | F_s] = I_s;
(3) E[\sup_{t \in [0,T]} |I_t|^2] < ∞.
For point (1) we have
\[
E[|I_t|] = E\Big[ \Big| \int_0^t f(s)\, dB_s \Big| \Big]
\le \Big( E\Big[ \Big| \int_0^t f(s)\, dB_s \Big|^2 \Big] \Big)^{\frac{1}{2}}
\stackrel{\text{Itô isometry}}{=} \Big( E\Big[ \int_0^t |f(s)|^2\, ds \Big] \Big)^{\frac{1}{2}} < \infty,
\]
where the inequality follows from the Cauchy-Schwarz (Jensen) inequality, since
f ∈ M^2([0, T]) and 0 ≤ t ≤ T.
For point (2), we first note from the definition of the stochastic integral that I_t is
adapted to F_t. Furthermore,
\[
E[I_t \mid F_s] = E\Big[ I_s + \int_s^t f(u)\, dB_u \,\Big|\, F_s \Big]
= E[I_s \mid F_s] + E\Big[ \int_s^t f(u)\, dB_u \,\Big|\, F_s \Big] = I_s,
\]
since the increment I_t - I_s = \int_s^t f(u)\, dB_u has zero conditional expectation given
F_s.
For point (3), we know that for all t ∈ [0, T]
\[
E[|I_t|^2] = E\Big[ \Big| \int_0^t f(s)\, dB_s \Big|^2 \Big] = E\Big[ \int_0^t |f(s)|^2\, ds \Big] < \infty.
\]
Moreover, because [0, T] is a closed and bounded interval, E[\sup_{t \in [0,T]} |I_t|^2] < ∞
as well. Therefore, the stochastic process I_t is a square-integrable martingale with
respect to the filtration F_t.
(ii) In order to prove the second property it is sufficient to show that
M_t = I_t^2 - \langle I \rangle_t is a continuous martingale which vanishes at t = 0.
Indeed,
\[
E[M_t \mid F_s] = E\Big[ I_t^2 - \int_0^t |f(u)|^2\, du \,\Big|\, F_s \Big]
= E\Big[ \Big( I_s + \int_s^t f(u)\, dB_u \Big)^2 - \int_0^s |f(u)|^2\, du - \int_s^t |f(u)|^2\, du \,\Big|\, F_s \Big]
\]
\[
= I_s^2 + 2 I_s\, E\Big[ \int_s^t f(u)\, dB_u \,\Big|\, F_s \Big]
+ E\Big[ \Big| \int_s^t f(u)\, dB_u \Big|^2 \,\Big|\, F_s \Big]
- \int_0^s |f(u)|^2\, du - E\Big[ \int_s^t |f(u)|^2\, du \,\Big|\, F_s \Big]
\]
\[
\stackrel{\text{Itô isometry}}{=} I_s^2 + 2 I_s\, \underbrace{E\Big[ \int_s^t f(u)\, dB_u \,\Big|\, F_s \Big]}_{=0} - \int_0^s |f(u)|^2\, du
= I_s^2 - \int_0^s |f(u)|^2\, du = M_s,
\]
where the (conditional) Itô isometry cancels the third and fifth terms. Therefore, M_t is a
martingale, and by the uniqueness of the quadratic variation we conclude the desired result.
2.5 Itô's Lemma
As we saw in the previous subsection, the Itô stochastic integral can be calculated from
its definition. However, this method is extremely complicated and is used mainly in
particular cases. Therefore, it was necessary to find another method which makes the
calculation of this specific type of integral easier. This method is known as Itô's Lemma.
Lemma 2.5.1. (Itô's Lemma). Assume that the function f(x, t) is twice continuously
differentiable. Then
\[
df(B(t), t) = \frac{\partial f(B(t), t)}{\partial B(t)}\, dB(t) + \frac{\partial f(B(t), t)}{\partial t}\, dt + \frac{1}{2}\, \frac{\partial^2 f(B(t), t)}{\partial B(t)^2}\, dt. \tag{2.7}
\]
Proof. Recall that (dB(t))^2 = dt, dt\, dB(t) = dB(t)\, dt = 0, (dt)^2 = 0 and
\[
dB(t) = B(t + h) - B(t). \tag{2.8}
\]
Furthermore, let t → t + h and B(t) → B(t + h). According to the formal Taylor expansion
we have
\[
f[B(t + h), t + h] \stackrel{(2.8)}{=} f[B(t) + dB(t), t + h]
= f(B(t), t) + \frac{\partial f(B(t), t)}{\partial B(t)}\, dB(t) + \frac{\partial f(B(t), t)}{\partial t}\, dt
+ \frac{1}{2}\, \frac{\partial^2 f(B(t), t)}{\partial B(t)^2}\, (dB(t))^2
+ \underbrace{\frac{1}{2}\, \frac{\partial^2 f(B(t), t)}{\partial t^2}\, (dt)^2}_{0}
\]
(the mixed second-order term is also omitted, since dB(t)\, dt = 0), so that
\[
f[B(t + h), t + h] = f(B(t), t) + \frac{\partial f(B(t), t)}{\partial B(t)}\, dB(t) + \frac{\partial f(B(t), t)}{\partial t}\, dt
+ \frac{1}{2}\, \frac{\partial^2 f(B(t), t)}{\partial B(t)^2}\, dt.
\]
Hence it follows that
\[
df(B(t), t) \stackrel{(2.8)}{=} f(B(t + h), t + h) - f(B(t), t)
= \frac{\partial f(B(t), t)}{\partial B(t)}\, dB(t) + \frac{\partial f(B(t), t)}{\partial t}\, dt
+ \frac{1}{2}\, \frac{\partial^2 f(B(t), t)}{\partial B(t)^2}\, dt,
\]
and thus the Lemma is proved.
An example is given below in order to illustrate the application of Itô's Lemma.
Example 2.5.2. Let {B(t) : t ≥ 0} be a (standard) Brownian motion. Find E[B(t)^2].
Assume that f(x, t) = x^2. Then X(t) = f(B(t), t) and by Itô's Lemma we have
\[
dX(t) = df(B(t), t) = \frac{\partial f(B(t), t)}{\partial B(t)}\, dB(t) + \frac{\partial f(B(t), t)}{\partial t}\, dt + \frac{1}{2}\, \frac{\partial^2 f(B(t), t)}{\partial B(t)^2}\, dt.
\]
Therefore, we have to compute the partial derivatives:
\[
\frac{\partial f(x, t)}{\partial x} = 2x, \qquad \frac{\partial f(x, t)}{\partial t} = 0, \qquad \frac{\partial^2 f(x, t)}{\partial x^2} = 2.
\]
Substituting,
\[
\frac{\partial f(B(t), t)}{\partial B(t)} = 2 B(t), \qquad \frac{\partial f(B(t), t)}{\partial t} = 0, \qquad \frac{\partial^2 f(B(t), t)}{\partial B(t)^2} = 2,
\]
and from (2.7) it follows that
\[
dB^2(t) = 2 B(t)\, dB(t) + \Big[ 0 + \frac{1}{2} \cdot 2 \Big] dt = 2 B(t)\, dB(t) + dt.
\]
Integrating both sides and using B^2(0) = 0,
\[
B^2(t) = \int_0^t 2 B(s)\, dB(s) + \int_0^t 1\, ds = \int_0^t 2 B(s)\, dB(s) + t.
\]
It is noticeable that the integral is an Itô integral. Taking expectations on both sides
and using Theorem 2.4.3, property (ii),
\[
E[B^2(t)] = E\Big[ \int_0^t 2 B(s)\, dB(s) \Big] + t = 0 + t = t.
\]
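Itô's Lemma in this example can also be checked pathwise in R: along a simulated Brownian
path, B(t)^2 should agree, up to discretization error, with the discretized version of
the right-hand side. The sketch below is an illustrative assumption, not part of the thesis
code.

## Pathwise check of B(T)^2 = integral of 2 B dB + T (Ito's Lemma for f(x) = x^2)
set.seed(9)
T.end <- 1; n <- 100000; dt <- T.end / n
dB <- rnorm(n, 0, sqrt(dt))
B  <- c(0, cumsum(dB))
lhs <- B[n + 1]^2                      # B(T)^2
rhs <- sum(2 * B[1:n] * dB) + T.end    # discretized Ito integral plus t
c(lhs, rhs)                            # nearly equal; the gap is discretization error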
3 Stochastic Differential Equations
This chapter gives an introduction to the theory of stochastic differential equations and
their solutions. Stochastic differential equations arise in various disciplines of science
and technology, whenever the relationship between continuously fluctuating quantities and
their rate of change over time and space is known, or when such a relationship can be
assumed in order to model and describe natural phenomena, technical or physical processes,
or dynamical systems in biology, the economy and elsewhere. One of the most important
applications of stochastic differential equations is in mathematical finance.
3.1 Basic concepts
We start with the definition of a stochastic differential equation.
Definition 3.1.1. A stochastic differential equation is an equation of the form
\[
dX_t = b(t, X_t)\, dt + \sigma(t, X_t)\, dB_t, \tag{3.1}
\]
or, equivalently, in integral form
\[
X_t = X_0 + \int_0^t b(s, X_s)\, ds + \int_0^t \sigma(s, X_s)\, dB_s,
\]
where B_t is an m-dimensional Brownian motion, σ : R^n → R^{n×m} and b : R^n → R^n are
measurable real functions, and X_t ∈ R^n.
We say that a stochastic differential equation has a solution if there is an Itô stochastic
process X_t which satisfies it. Solutions of this kind are called pathwise solutions. A
very closely related concept is that of a strong solution; in the current thesis we follow
the terminology of strong solutions instead of pathwise solutions. At time t = 0, the
stochastic process X_0 is known.
Remark 3.1.2. (i) A stochastic differential equation is a differential equation in which
one or more components are stochastic processes. We have to identify (calculate) X_t,
which is also a stochastic process.
(ii) The integral \int_0^t b(s, X_s)\, ds is a Riemann-Stieltjes integral and the integral
\int_0^t \sigma(s, X_s)\, dB_s is an Itô stochastic integral.
(iii) Definition 3.1.1 gives the general form of a vector stochastic differential equation.
Equation (3.1) can be written in component form as
\[
\begin{pmatrix} dX_{1,t} \\ dX_{2,t} \\ \vdots \\ dX_{n,t} \end{pmatrix}
=
\begin{pmatrix} b_1(t, X_t) \\ b_2(t, X_t) \\ \vdots \\ b_n(t, X_t) \end{pmatrix} dt
+
\begin{pmatrix}
\sigma_{11}(t, X_t) & \sigma_{12}(t, X_t) & \ldots & \sigma_{1m}(t, X_t) \\
\sigma_{21}(t, X_t) & \sigma_{22}(t, X_t) & \ldots & \sigma_{2m}(t, X_t) \\
\vdots & \vdots & & \vdots \\
\sigma_{n1}(t, X_t) & \sigma_{n2}(t, X_t) & \ldots & \sigma_{nm}(t, X_t)
\end{pmatrix}
\begin{pmatrix} dB_{1,t} \\ dB_{2,t} \\ \vdots \\ dB_{m,t} \end{pmatrix},
\]
where m is the number of Brownian motions which contribute to equation (3.1) and n is the
dimension of the stochastic process X_t = (X_{1,t}, ..., X_{n,t})^T ∈ R^n.
In the special case n = 1 we obtain a scalar stochastic differential equation for the
stochastic process X_t ∈ R, of the form
\[
dX_t = b(t, X_t)\, dt + \sigma(t, X_t)\, dB_t,
\]
where b is a real-valued function, σ = (σ_{11}, ..., σ_{1m}) and
dB_t = (dB_{1,t}, ..., dB_{m,t})^T. Finally, the simplest form is the one with n = m = 1:
a scalar stochastic differential equation driven by a single Brownian motion.
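Since most SDEs of the form (3.1) cannot be solved in closed form, in practice a path of
X_t is usually simulated on a time grid. The R sketch below is an illustrative assumption
(the Euler-Maruyama scheme itself is treated in detail in Section 3.4.1); it shows a
minimal fixed-step discretization of a scalar SDE dX_t = b(t, X_t) dt + σ(t, X_t) dB_t.

## Minimal fixed-step simulation of a scalar SDE dX = b(t, X) dt + sigma(t, X) dB
simulate_sde <- function(X0, b, sigma, T.end = 1, n = 1000) {
  dt <- T.end / n
  X  <- numeric(n + 1); X[1] <- X0
  for (i in 1:n) {
    t  <- (i - 1) * dt
    dB <- rnorm(1, 0, sqrt(dt))                 # Brownian increment
    X[i + 1] <- X[i] + b(t, X[i]) * dt + sigma(t, X[i]) * dB
  }
  X
}

## Example: geometric Brownian motion dX = mu X dt + sig X dB (cf. Example 3.2.1)
set.seed(2)
path <- simulate_sde(X0 = 1,
                     b     = function(t, x) 0.1 * x,
                     sigma = function(t, x) 0.3 * x)
plot(seq(0, 1, length.out = length(path)), path, type = "l",
     xlab = "t", ylab = expression(X[t]))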
3.2 Itô's Lemma as a solving method of SDEs
Many stochastic differential equations can be solved by using Itô's Lemma.
Example 3.2.1. Geometric Brownian motion. Consider the linear model
\[
\frac{dX_t}{dt} = \alpha_t X_t
\]
and let us assume that the coefficient α_t is random, of the form
\[
\alpha_t = \mu + \sigma \dot{B}_t,
\]
where \dot{B}_t denotes the derivative of Brownian motion. This seems paradoxical since, as
was shown in Subsection 2.2, Brownian motion is nowhere differentiable. However, according
to Mao (2007, p.2), a white noise \dot{B}_t can be regarded formally as the derivative of a
Brownian motion B_t, written \dot{B}_t = dB_t / dt, i.e. \dot{B}_t\, dt = dB_t. Therefore,
\[
\frac{dX_t}{dt} = (\mu + \sigma \dot{B}_t) X_t \iff dX_t = \mu X_t\, dt + \sigma X_t\, dB_t, \tag{3.2}
\]
and it is obvious that
\[
\frac{dX_t}{X_t} = \mu\, dt + \sigma\, dB_t \ \Rightarrow\ \int_0^t \frac{dX_s}{X_s} = \int_0^t (\mu\, ds + \sigma\, dB_s) = \mu t + \sigma B_t. \tag{3.3}
\]
Then we apply Itô's Lemma to the stochastic process $Z_t = \ln(X_t)$ in order to find a solution of the above stochastic differential equation. First, set $f(x, t) = \ln x$. Thus
\[ \frac{\partial f(x,t)}{\partial x} = \frac{1}{x}, \qquad \frac{\partial f(x,t)}{\partial t} = 0, \qquad \frac{\partial^2 f(x,t)}{\partial x^2} = -\frac{1}{x^2}, \]
and, substituting $x = X_t$,
\[ \frac{\partial f(X_t,t)}{\partial X_t} = \frac{1}{X_t}, \qquad \frac{\partial f(X_t,t)}{\partial t} = 0, \qquad \frac{\partial^2 f(X_t,t)}{\partial X_t^2} = -\frac{1}{X_t^2}. \]
From Itô's formula (2.7) we have that
\[
\begin{aligned}
d(\ln X_t) &= \frac{1}{X_t}\,dX_t + \frac{1}{2}\left(-\frac{1}{X_t^2}\right)(dX_t)^2 \\
&= \frac{1}{X_t}\,dX_t - \frac{1}{2}\,\frac{1}{X_t^2}\,\sigma^2 X_t^2\,dt \qquad \text{(from (3.2), since } (dX_t)^2 = \sigma^2 X_t^2\,dt\text{)} \\
&= \frac{dX_t}{X_t} - \frac{1}{2}\sigma^2\,dt.
\end{aligned}
\]
By integrating from $0$ to $t$:
\[ Z_t - Z_0 = \ln X_t - \ln X_0 = \int_0^t \frac{dX_s}{X_s} - \int_0^t \frac{1}{2}\sigma^2\,ds \;\Longrightarrow\; \int_0^t \frac{dX_s}{X_s} = \ln\!\left(\frac{X_t}{X_0}\right) + \frac{1}{2}\sigma^2 t. \]
Subsequently, we substitute the above result into (3.3) and conclude that
\[ X_t = X_0\,e^{(\mu - \frac{1}{2}\sigma^2)t + \sigma B_t}. \]
The terms $\frac{1}{2}\sigma^2$ and $\sigma B_t$ are due to the stochasticity. The stochastic process $X_t$ is called geometric Brownian motion and is used as a model for stock prices in the field of finance. For that reason, it is customary for the stochastic process $X_t$ to be denoted by $S_t$.
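The appendix code B.2.1 is not reproduced in this section; the following minimal R sketch (an illustration only, with assumed parameter values, not the thesis code) shows how a sample path of geometric Brownian motion can be generated directly from the exact solution above:

# Sample path of geometric Brownian motion from the exact solution X_t = X0*exp((mu - sigma^2/2)*t + sigma*B_t)
set.seed(1)                                  # reproducibility (assumed seed)
Tend <- 1; n <- 2^8; dt <- Tend / n          # horizon and uniform step (illustrative values)
mu <- 0.05; sigma <- 0.2; X0 <- 1            # illustrative drift, volatility, initial value
times <- seq(0, Tend, by = dt)
B <- c(0, cumsum(sqrt(dt) * rnorm(n)))       # Brownian path built from N(0, dt) increments
X <- X0 * exp((mu - 0.5 * sigma^2) * times + sigma * B)
plot(times, X, type = "l", xlab = "t", ylab = "X_t")

Here the path is produced from the closed-form solution rather than by discretising the SDE; the numerical schemes of Subsection 3.4 do the latter.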
Corollary 3.2.2. From the above result it is clear that $\log(X_t/X_0)$ follows a normal distribution with expectation
\[ E\!\left[\log\!\left(\frac{X_t}{X_0}\right)\right] = \left(\mu - \frac{1}{2}\sigma^2\right)t + \sigma \underbrace{E[B_t]}_{=0} = \left(\mu - \frac{1}{2}\sigma^2\right)t \]
and variance
\[ \mathrm{Var}\!\left[\log\!\left(\frac{X_t}{X_0}\right)\right] = \underbrace{\mathrm{Var}\!\left[\left(\mu - \tfrac{1}{2}\sigma^2\right)t\right]}_{=0} + \sigma^2\,\mathrm{Var}[B_t] = \sigma^2 t. \]
Specifically, the logarithm of the stock price ratio is modelled as a normal random variable. Furthermore, the model is based on the observation that the return of a stock (i.e. the quantity $dX_t/X_t$) consists of a constant drift $\mu$ plus fluctuations whose variance is $\mathrm{Var}(\sigma B_t) = \sigma^2 t$, where $B_t$ is a Brownian motion and $\sigma$ governs the width of those fluctuations.
Figure 3: Sample path of geometric Brownian motion [see code B.2.1].
Example 3.2.3. The Ornstein-Uhlenbeck process. Consider the stochastic differential equation
\[ dX_t = -\alpha X_t\,dt + \sigma\,dB_t \]
with initial condition $X_0 = x$. The solution of the above SDE (i.e. a stochastic process) is called the Ornstein-Uhlenbeck process and is used in many applications in Physics and Finance. Applying Itô's lemma to $f(x,t) = x e^{\alpha t}$, with $\sigma \in \mathbb{R}$, we get
\[ df(x,t) = e^{\alpha t}\,dX_t + \alpha x e^{\alpha t}\,dt, \]
because
\[ \frac{\partial f(x,t)}{\partial x} = e^{\alpha t}, \qquad \frac{\partial^2 f(x,t)}{\partial x^2} = 0, \qquad \frac{\partial f(x,t)}{\partial t} = x\alpha e^{\alpha t}. \]
Then,
\[ df(X_t, t) = e^{\alpha t}(-\alpha X_t\,dt + \sigma\,dB_t) + \alpha X_t e^{\alpha t}\,dt = e^{\alpha t}\sigma\,dB_t. \]
By integrating on both sides:
\[ f(X_t, t) - f(X_0, 0) = \sigma \int_0^t e^{\alpha s}\,dB_s \;\Longrightarrow\; e^{\alpha t}X_t - x = \sigma \int_0^t e^{\alpha s}\,dB_s, \]
where $f(X_0, 0) = X_0 = x$. Hence, we conclude that
\[ X_t = x e^{-\alpha t} + e^{-\alpha t}\sigma \int_0^t e^{\alpha s}\,dB_s. \]
Corollary 3.2.4. By using the properties of the stochastic integral it is feasible to compute the mean value and the variance of the Ornstein-Uhlenbeck process. For the mean value:
\[
E[X_t] = E\!\left[x e^{-\alpha t} + e^{-\alpha t}\sigma \int_0^t e^{\alpha s}\,dB_s\right]
= x e^{-\alpha t} + e^{-\alpha t}\sigma\, \underbrace{E\!\left[\int_0^t e^{\alpha s}\,dB_s\right]}_{=0}
= x e^{-\alpha t},
\]
since the expectation of an Itô integral is zero. For the variance, $\mathrm{Var}[X_t] = E\big[(X_t - E[X_t])^2\big]$. But
\[ X_t - E[X_t] = x e^{-\alpha t} + e^{-\alpha t}\sigma \int_0^t e^{\alpha s}\,dB_s - x e^{-\alpha t} = e^{-\alpha t}\sigma \int_0^t e^{\alpha s}\,dB_s. \]
Hence, using Itô's isometry:
\[
\mathrm{Var}[X_t] = E\!\left[\left(e^{-\alpha t}\sigma \int_0^t e^{\alpha s}\,dB_s\right)^{\!2}\right]
= e^{-2\alpha t}\sigma^2 \int_0^t e^{2\alpha s}\,ds
= e^{-2\alpha t}\sigma^2\,\frac{1}{2\alpha}\left(e^{2\alpha t} - 1\right)
= \frac{\sigma^2}{2\alpha}\left(1 - e^{-2\alpha t}\right).
\]
This stochastic differential equation can be used as a model for interest rates. If we regard the interest rate $r$ as a function of time, then it is natural to model it as a stochastic process $r_t$, since interest rates are not constant and are random over time. According to Chan, Karolyi, Longstaff and Sanders (1992, p.1211), the Vasicek model, which is a stochastic differential equation of the form
\[ dX_t = (\alpha + \beta X_t)\,dt + \sigma\,dB_t, \]
is a generalization of the Ornstein-Uhlenbeck process. Its solution can be found by the same method that we used above for the Ornstein-Uhlenbeck equation, as sketched below.
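As a brief sketch (this derivation is an addition here, repeating the integrating-factor argument above; it is not taken from the cited reference), applying Itô's lemma to $f(x, t) = x e^{-\beta t}$, for $\beta \ne 0$, gives
\[
d\!\left(e^{-\beta t}X_t\right) = e^{-\beta t}\left(\alpha\,dt + \sigma\,dB_t\right)
\;\Longrightarrow\;
X_t = X_0 e^{\beta t} + \frac{\alpha}{\beta}\left(e^{\beta t} - 1\right) + \sigma e^{\beta t}\int_0^t e^{-\beta s}\,dB_s,
\]
which reduces to the Ornstein-Uhlenbeck solution above when $\alpha = 0$ and $\beta < 0$.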
Figure 4: Sample path of Ornstein - Uhlenbeck process. Left: X0 = 5, α = 2, σ = 1.
Right: X0 = 5, α = −2, σ = −1 [see code B.2.2].
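The appendix code B.2.2 is not shown here; a minimal R sketch (illustrative only, with assumed parameters) that generates an Ornstein-Uhlenbeck path by sampling from the Gaussian transition implied by the mean and variance derived above could look as follows:

# Ornstein-Uhlenbeck path via the exact Gaussian transition
# X_{t+dt} | X_t ~ N( X_t * exp(-alpha*dt), sigma^2 * (1 - exp(-2*alpha*dt)) / (2*alpha) )
set.seed(2)
Tend <- 1; n <- 2^8; dt <- Tend / n
alpha <- 2; sigma <- 1; X0 <- 5              # illustrative values (cf. left panel of Figure 4)
X <- numeric(n + 1); X[1] <- X0
sd_step <- sigma * sqrt((1 - exp(-2 * alpha * dt)) / (2 * alpha))
for (i in 1:n) X[i + 1] <- X[i] * exp(-alpha * dt) + sd_step * rnorm(1)
plot(seq(0, Tend, by = dt), X, type = "l", xlab = "t", ylab = "X_t")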
Example 3.2.5. Brownian Bridge. A Brownian bridge is a continuous-time stochastic process which is obtained by considering a standard Brownian motion process on a closed time interval, e.g. on $[0, 1]$ as we can see in Figure 5. Specifically, it has initial condition $X_0 = 0$ and terminal condition $X_1 = 0$. Hence its graph resembles a bridge, which justifies the name of the process.
We now give another stochastic differential equation which is often encountered in probability theory:
\[ dX_t = -\frac{X_t}{1 - t}\,dt + dB_t. \qquad (3.4) \]
The solution of the above SDE is called the Brownian bridge. To solve it we apply Itô's Lemma to the function $f(x, t) = g(t)x$, where $g(t)$ is a $C^1$ function (i.e. its derivative exists and is continuous). Moreover, we take $x = X_t$. Thus:
\[ df(X_t, t) = X_t\,g'(t)\,dt + g(t)\,dX_t \qquad (3.5) \]
because
\[ \frac{\partial f(x,t)}{\partial x} = g(t), \qquad \frac{\partial^2 f(x,t)}{\partial x^2} = 0, \qquad \frac{\partial f(x,t)}{\partial t} = g'(t)x. \]
Subsequently we substitute (3.4) into (3.5) with respect to $X_t$. Therefore we arrive at the following equation:
\[ df(X_t, t) = \left(g'(t) - \frac{g(t)}{1-t}\right)X_t\,dt + g(t)\,dB_t. \]
We choose $g$ such that
\[ g'(t) - \frac{g(t)}{1-t} = 0 \;\Longrightarrow\; g'(t) = \frac{g(t)}{1-t}, \]
and by integrating we get
\[ \ln(g(t)) = -\ln(1-t) \;\Longrightarrow\; g(t) = \frac{1}{1-t}. \]
From all the above it follows that
\[ df(X_t, t) = \frac{1}{1-t}\,dB_t, \]
and by integrating (with $f(X_0, 0) = g(0)X_0 = 0$):
\[ g(t)X_t - 0 = \int_0^t \frac{dB_s}{1-s} \;\Longrightarrow\; \frac{X_t}{1-t} = \int_0^t \frac{dB_s}{1-s}, \]
or
\[ X_t = (1-t)\int_0^t \frac{dB_s}{1-s}. \]
Corollary 3.2.6. Using again the properties of stochastic integration we can compute the mean value and the variance of the Brownian bridge. Indeed:
\[ E[X_t] = E\!\left[(1-t)\int_0^t \frac{dB_s}{1-s}\right] = (1-t)\,E\!\left[\int_0^t \frac{dB_s}{1-s}\right] = 0, \]
since the expectation of an Itô integral is zero, and therefore $\mathrm{Var}[X_t] = E[X_t^2]$. So, by Itô's isometry:
\[
E[X_t^2] = (1-t)^2\,E\!\left[\left(\int_0^t \frac{dB_s}{1-s}\right)^{\!2}\right]
= (1-t)^2 \int_0^t \frac{1}{(1-s)^2}\,ds
= (1-t)^2\left[\frac{1}{1-s}\right]_0^t
= (1-t)^2\left(\frac{1}{1-t} - 1\right) = t - t^2.
\]
Corollary 3.2.7. From the above mean value and variance of $X_t$, it holds that $X_t \xrightarrow{\;\Pr\;} 0$ as $t \to 1$.
Proof. $X_t$ is said to converge in probability to $0$ if, for every $\epsilon > 0$,
\[ \lim_{t \to 1} P(|X_t - 0| > \epsilon) = 0. \]
Indeed, $P(|X_t - 0| > \epsilon) = P(|X_t - E[X_t]| > \epsilon)$, and by Chebyshev's inequality [see A.3]:
\[ P(|X_t - E[X_t]| > \epsilon) \le \frac{\mathrm{Var}[X_t]}{\epsilon^2} = \frac{t - t^2}{\epsilon^2} = \frac{t(1-t)}{\epsilon^2} \longrightarrow 0 \]
as $t \to 1$. Therefore we have the desideratum.
The Brownian bridge has been used in financial mathematics for the modelling of zero-coupon bonds. Moreover, according to Horne, Garton, Krone and Lewis (2007, p.2354), the Brownian bridge can be used as a method to estimate movement paths (e.g. animal movements) from discrete location data over short time intervals.
Figure 5: Sample path of Brownian Bridge [see code B.2.3].
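The appendix code B.2.3 is not reproduced here; a minimal R sketch (illustrative only) that approximates a path of (3.4) by a simple discretisation of the SDE, pinning the final value to zero, might be:

# Brownian bridge path by discretising dX = -X/(1-t) dt + dB on [0, 1]
set.seed(3)
n <- 2^10; dt <- 1 / n
times <- seq(0, 1, by = dt)
X <- numeric(n + 1)                          # X[1] = X_0 = 0
for (i in 1:(n - 1)) {                       # stop one step early to avoid dividing by 1 - t = 0
  X[i + 1] <- X[i] - X[i] / (1 - times[i]) * dt + sqrt(dt) * rnorm(1)
}
X[n + 1] <- 0                                # terminal condition X_1 = 0
plot(times, X, type = "l", xlab = "t", ylab = "X_t")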
3.3 Strong and Weak Solutions
We will now define the concept of uniqueness for solutions of stochastic differential equations.
Definition 3.3.1. The solution of the stochastic differential equation (3.1) is said to be strongly unique if, for any two stochastic processes $X_t$ and $\tilde{X}_t$ which satisfy equation (3.1),
\[ P(X_0 = \tilde{X}_0) = 1 \;\Longrightarrow\; P\big(X_t = \tilde{X}_t,\ \forall t \ge 0\big) = 1. \]
Under certain circumstances it is possible to show that the stochastic differential
equations have unique strong solutions. Such conditions are the Lipschitz conditions
on the coefficients of the equation.
Theorem 3.3.2. Consider the stochastic differential equation (3.1) with coefficients $\sigma$ and $b$. Suppose these two functions are bounded and satisfy the following conditions:
Lipschitz condition:
\[ |b(t,x) - b(t,y)| + |\sigma(t,x) - \sigma(t,y)| \le K|x - y|, \qquad \forall t, x, y; \]
Linear growth condition:
\[ |b(t,x)|^2 + |\sigma(t,x)|^2 \le K\big(1 + |x|^2\big), \qquad \forall t, x. \]
Then the stochastic differential equation has a strong solution, and this solution is also unique.
The proof of this theorem is based on the Banach fixed point theorem [see A.4]. For the sake of simplicity, the proof is given for stochastic differential equations in the case $m = n = 1$. Finally, it is important to note that there are several variations of this proof, such as in Øksendal (2003, p.66), Allen (2007, p.91) and Mao (2007, p.51).
Proof. As mentioned before, we will utilize the Banach fixed point theorem in order to show the existence of a unique strong solution of an SDE. Initially, we define the following iterative formula:
\[ X_t^{(i+1)} = X_0 + \int_0^t b\big(s, X_s^{(i)}\big)\,ds + \int_0^t \sigma\big(s, X_s^{(i)}\big)\,dB_s. \]
If there is a stochastic process for which the relation $X_t^{(i+1)} = X_t^{(i)}$ is satisfied (for all $i$), then this particular process is the solution of the stochastic differential equation.
Let us consider the function space $M^2$ consisting of the stochastic processes $X_t$ such that $E\big[\int_0^T |X_t|^2\,dt\big] < \infty$. In addition, we suppose that this space is equipped with the norm $\|\cdot\|_\lambda$ defined by
\[ \|X\|_\lambda^2 = E\!\left[\int_0^T e^{-\lambda t}|X_t|^2\,dt\right]. \]
Hence, this space is a complete normed space, i.e. a Banach space. Therefore, we can define the operator $\mathcal{Y} : M^2 \to M^2$ by the relation
\[ (\mathcal{Y}X)_t = X_0 + \int_0^t b(s, X_s)\,ds + \int_0^t \sigma(s, X_s)\,dB_s. \]
A point $\chi = X$ of the space $M^2$ which satisfies the equality $\mathcal{Y}\chi = \chi$ (or equivalently $X = \mathcal{Y}X$) is called a fixed point of the operator $\mathcal{Y}$. In our case, this point is a stochastic process $X_t$ and will be the solution of the stochastic differential equation. Thus, in order to prove that the equation can be solved, it is sufficient to use the fixed point theorem for a contraction operator (Jost, 2005, p.46). In our case, $\mathcal{Y}$ is a contraction operator if for any $\chi_1, \chi_2 \in M^2$ it holds that
\[ \|\mathcal{Y}\chi_1 - \mathcal{Y}\chi_2\| \le \theta\,\|\chi_1 - \chi_2\|, \]
where $0 < \theta < 1$. Here we use the norm $\|\cdot\|_\lambda$.
Hence, it is sufficient to show that $\mathcal{Y}$ is a contraction operator. Let us take two points $\chi_1$ and $\chi_2$ of $M^2$, which correspond to the stochastic processes $X_{1,s}$ and $X_{2,s}$. By the definition of the operator $\mathcal{Y}$ we have
\[ (\mathcal{Y}X_1)_t - (\mathcal{Y}X_2)_t = \int_0^t \big(b(s, X_{1,s}) - b(s, X_{2,s})\big)\,ds + \int_0^t \big(\sigma(s, X_{1,s}) - \sigma(s, X_{2,s})\big)\,dB_s \equiv I_1 + I_2. \]
We estimate each term separately. By applying the norm $\|\cdot\|_\lambda^2$ we have
\[ \|\mathcal{Y}X_1 - \mathcal{Y}X_2\|_\lambda^2 = \|I_1 + I_2\|_\lambda^2 \le \big(\|I_1\|_\lambda + \|I_2\|_\lambda\big)^2 = \|I_1\|_\lambda^2 + \|I_2\|_\lambda^2, \]
by the Cauchy–Schwarz inequality and since the cross term vanishes ($dt\,dB_t = 0$).
For $I_1$:
\[
\begin{aligned}
\|I_1\|_\lambda^2 &= E\!\left[\int_0^T e^{-\lambda t}\left|\int_0^t \big(b(s,X_{1,s}) - b(s,X_{2,s})\big)\,ds\right|^2 dt\right] \\
&\le E\!\left[\int_0^T e^{-\lambda t}\int_0^t \big|b(s,X_{1,s}) - b(s,X_{2,s})\big|^2\,ds\,dt\right] \quad \text{(Cauchy–Schwarz; the bounded factor } t \le T \text{ is absorbed into the constant below)} \\
&\overset{(*)}{\le} C^2\,E\!\left[\int_0^T e^{-\lambda t}\int_0^t |X_{1,s} - X_{2,s}|^2\,ds\,dt\right] \\
&\overset{(**)}{=} C^2\,E\!\left[\int_0^T \left(\int_s^T e^{-\lambda t}e^{\lambda s}\,dt\right) e^{-\lambda s}|X_{1,s} - X_{2,s}|^2\,ds\right] \\
&\overset{(***)}{\le} \frac{C^2}{\lambda}\,E\!\left[\int_0^T e^{-\lambda s}|X_{1,s} - X_{2,s}|^2\,ds\right] = \frac{C^2}{\lambda}\,\|X_1 - X_2\|_\lambda^2.
\end{aligned}
\]
$(*)$: We use the Lipschitz property of the function $b$, i.e. $|b(t,x_1) - b(t,x_2)| \le C|x_1 - x_2|$.
$(**)$: The double integral is rewritten by interchanging the order of integration:
\[
\begin{aligned}
\int_0^T e^{-\lambda t}\int_0^t f(s)\,ds\,dt &= \int_0^T e^{-\lambda t}\int_0^T f(s)H(t-s)\,ds\,dt \\
&= \int_0^T \int_0^T e^{-\lambda t}e^{\lambda s}e^{-\lambda s}f(s)H(t-s)\,ds\,dt \\
&= \int_0^T \left(\int_0^T e^{-\lambda t}e^{\lambda s}H(t-s)\,dt\right) e^{-\lambda s}f(s)\,ds \\
&= \int_0^T \left(\int_s^T e^{-\lambda t}e^{\lambda s}\,dt\right) e^{-\lambda s}f(s)\,ds,
\end{aligned}
\]
where $f(s)$ could be any function of $s$; in our case $f(s) = |X_{1,s} - X_{2,s}|^2$, and $H$ is the Heaviside function,
\[ H(p) = \begin{cases} 0, & \text{if } p < 0, \\ 1, & \text{if } p \ge 0. \end{cases} \]
$(***)$:
\[ \int_s^T e^{-\lambda t}e^{\lambda s}\,dt = e^{\lambda s}\left[\frac{e^{-\lambda t}}{-\lambda}\right]_s^T = \frac{1}{\lambda}\left(1 - e^{-\lambda(T-s)}\right) \le \frac{1}{\lambda}. \]
For $I_2$: we repeat the above steps, now using Itô's isometry for the stochastic integral. Namely,
\[
\begin{aligned}
\|I_2\|_\lambda^2 &= E\!\left[\int_0^T e^{-\lambda t}\left|\int_0^t \big(\sigma(s,X_{1,s}) - \sigma(s,X_{2,s})\big)\,dB_s\right|^2 dt\right] \\
&= E\!\left[\int_0^T e^{-\lambda t}\int_0^t \big|\sigma(s,X_{1,s}) - \sigma(s,X_{2,s})\big|^2\,ds\,dt\right] \qquad \text{(Itô's isometry)} \\
&\le C^2\,E\!\left[\int_0^T e^{-\lambda t}\int_0^t |X_{1,s} - X_{2,s}|^2\,ds\,dt\right] \qquad \text{(as above)} \\
&\le \frac{C^2}{\lambda}\,E\!\left[\int_0^T e^{-\lambda s}|X_{1,s} - X_{2,s}|^2\,ds\right] = \frac{C^2}{\lambda}\,\|X_1 - X_2\|_\lambda^2 \qquad \text{(as above)}.
\end{aligned}
\]
Adding the two results we conclude that $\|\mathcal{Y}X_1 - \mathcal{Y}X_2\|_\lambda^2 \le \frac{2C^2}{\lambda}\|X_1 - X_2\|_\lambda^2$. By choosing $\lambda$ such that $\frac{2C^2}{\lambda} < 1$, the operator $\mathcal{Y}$ is a contraction on $M^2$. Consequently, $\mathcal{Y}$ has a unique fixed point in $M^2$, which is the unique strong solution of the stochastic differential equation.
Definition 3.3.3. Suppose the functions $f, g$ are known. We must find a filtration $\mathcal{F}_t$, a Brownian motion $\tilde{B}_t$ and an Itô stochastic process $X_t$ such that the following equation holds:
\[ X_t = X_0 + \int_0^t f(s, X_s)\,ds + \int_0^t g(s, X_s)\,d\tilde{B}_s. \]
Hence the desideratum is the triple $(X_t, \tilde{B}_t, \mathcal{F}_t)$. This particular set is called a weak solution of the stochastic differential equation.
Proposition 3.3.4. A strong solution of a stochastic differential equation is also a weak solution, but the converse is not valid. The concept of uniqueness can also be generalized to non-strong solutions.
An example is now given in order to illustrate a possible form of a weak solution of a stochastic differential equation. This example is stated without solution in Mao (2007, p.78); Karatzas and Shreve (1991, p.301) and Berestycki (2010, p.75) give solutions of this problem, and a more detailed version of them is given below.
Example 3.3.5. We construct a weak solution of a stochastic differential equation. Consider the stochastic differential equation (known as Tanaka's equation)
\[ dX_t = \mathrm{sign}(X_t)\,dB_t, \qquad (3.6) \]
where
\[ \mathrm{sign}(x) = \begin{cases} +1, & \text{if } x \ge 0, \\ -1, & \text{if } x < 0. \end{cases} \]
Let $X_t$ be any Brownian motion $\tilde{B}_t$ and let $\mathcal{F}_t$ be a filtration with respect to which $\tilde{B}_t$ is a Brownian motion. Define $\hat{B}_t$ to be the Itô stochastic integral
\[ \hat{B}_t := \int_0^t \mathrm{sign}(\tilde{B}_s)\,d\tilde{B}_s. \qquad (3.7) \]
Since $X_t := \tilde{B}_t$, we can write (3.7) in the following differential form (using $\mathrm{sign}(X_t)^2 = 1$):
\[ d\hat{B}_t = \mathrm{sign}(X_t)\,dX_t \;\Longrightarrow\; dX_t = \mathrm{sign}(X_t)\,d\hat{B}_t. \]
Consequently, $X_t = \tilde{B}_t$ will be a weak solution of the stochastic differential equation (3.6) provided the stochastic process $\hat{B}_t$ is a Brownian motion (with respect to the filtration $\mathcal{F}_t$). This can be shown by applying Lévy's theorem [see A.5] and the properties of the Itô stochastic integral. Indeed, $\hat{B}_t$ is adapted to the filtration $\mathcal{F}_t$ by its definition. Moreover, from Theorem 2.3.12 it is clear that $\hat{B}_t$ is also a martingale, because it can be expressed as an Itô stochastic integral. Regarding the application of Lévy's theorem, it is
sufficient to prove that the stochastic process $Z_t := \hat{B}_t^2 - t$ is also a martingale. We have that:
\[
\begin{aligned}
E[Z_t \mid \mathcal{F}_s] &= E\big[\hat{B}_t^2 - t \mid \mathcal{F}_s\big]
= E\!\left[\left(\int_0^t \mathrm{sign}(\tilde{B}_r)\,d\tilde{B}_r\right)^{\!2} - t \,\Big|\, \mathcal{F}_s\right] \\
&= E\!\left[\left(\int_0^s \mathrm{sign}(\tilde{B}_r)\,d\tilde{B}_r + \int_s^t \mathrm{sign}(\tilde{B}_r)\,d\tilde{B}_r\right)^{\!2} - t \,\Big|\, \mathcal{F}_s\right] \\
&= \left(\int_0^s \mathrm{sign}(\tilde{B}_r)\,d\tilde{B}_r\right)^{\!2}
+ E\!\left[\left(\int_s^t \mathrm{sign}(\tilde{B}_r)\,d\tilde{B}_r\right)^{\!2} \,\Big|\, \mathcal{F}_s\right]
+ 2\left(\int_0^s \mathrm{sign}(\tilde{B}_r)\,d\tilde{B}_r\right) E\!\left[\int_s^t \mathrm{sign}(\tilde{B}_r)\,d\tilde{B}_r \,\Big|\, \mathcal{F}_s\right] - t \\
&= \hat{B}_s^2 + (t - s) + 0 - t = \hat{B}_s^2 - s = Z_s,
\end{aligned}
\]
where the middle conditional expectation equals $t - s$ by Itô's isometry (since $\mathrm{sign}(\tilde{B}_r)^2 = 1$) and the last conditional expectation is zero because the Itô integral is a martingale. For that reason, $Z_t$ is also a martingale. Consequently, in accordance with Lévy's theorem, $\hat{B}_t$ is a Brownian motion.
3.4 Numerical methods of SDEs
One might hope that analytical methods exist for all stochastic differential equations and give their solutions. However, this is not the case, as most stochastic differential equations do not have an analytical solution. Therefore, the most appropriate tools for the solution of stochastic differential equations are numerical methods. In the current subsection we will focus on two of these methods, the Euler and Milstein approximations.
Consider the scalar stochastic differential equation of the form:
dXt = b(t, Xt)dt + σ(t, Xt)dBt (3.8)
in the time interval $[\tau_0, T]$ with initial condition $X_{\tau_0} = X_0$. The term $b(t, X_t)$ is called the displacement coefficient; it is the deterministic part of the stochastic differential equation and characterizes the local trend. The term $\sigma(t, X_t)$ is called the diffusion coefficient and influences the average size of the fluctuations of $X$. These fluctuations originate from the Brownian motion $B_t$.
3.4.1 The Euler-Maruyama method
The simplest stochastic numerical approximation (Schaffter, 2010, p.4) and the most commonly used method is the Euler-Maruyama approximation (or simply the Euler method), named after Leonhard Euler and Gisiro Maruyama.
For a given partition
\[ \tau_0 = t_0 < t_1 < t_2 < \dots < t_j < \dots < t_n = T \]
of the time interval $[\tau_0, T]$, the Euler approximation is a continuous-time stochastic process $Y = \{Y_t,\ \tau_0 \le t \le T\}$ which satisfies the iterative formula
\[ Y_{i+1} = Y_i + b(t_i, Y_i)(t_{i+1} - t_i) + \sigma(t_i, Y_i)\big(B_{t_{i+1}} - B_{t_i}\big) \qquad (3.9) \]
for $i = 0, 1, 2, \dots, n-1$, where $Y_i = Y_{t_i}$ and the initial value is $Y_0 = X_0$.
We will denote by
\[ \Delta t_i = t_{i+1} - t_i \]
the $i$-th increment and we will call
\[ \delta = \max_i \Delta t_i \]
the greatest of them. Usually, we will consider a uniform partition (i.e. equidistant points in time),
\[ t_i = \tau_0 + i\delta, \]
where $\delta = \Delta t_i \equiv \Delta t = \frac{T - \tau_0}{n}$ (for a large enough number $n \in \mathbb{Z}$ such that $\delta \in (0, 1)$).
The key question is how to generate the random increments
\[ \Delta B_i = B_{t_{i+1}} - B_{t_i} \]
for $i = 0, 1, \dots, n-1$. According to Example 2.2.11 we have that $B_t - B_s \sim N(0, t-s)$ for $t > s$. Consequently, it holds that
\[ B_t - B_s = \sqrt{t - s}\,Z_{s,t}, \]
where $Z_{s,t} \sim N(0, 1)$. By applying this in each subinterval $[t_i, t_{i+1}]$ we have that
\[ Y_{i+1} = Y_i + b(t_i, Y_i)(t_{i+1} - t_i) + \sigma(t_i, Y_i)\sqrt{\Delta t_i}\,Z_i, \]
where the $Z_i \sim N(0, 1)$ are independent random variables, since the increments of the Brownian motion over non-overlapping intervals are independent.
Definition 3.4.1. With the above notation, the Euler approximation can be written more simply as
\[ Y_{i+1} = Y_i + b(t_i, Y_i)\Delta t_i + \sigma(t_i, Y_i)\Delta B_i \qquad (3.10) \]
for $i = 0, 1, \dots, n-1$.
The successful implementation of the Euler method in a statistical software package is based on the recursive structure of the equation, which calculates approximate values of the Itô process only at discrete points in time. The term "form" will be used to describe a recursive algorithm which gives the values of the discrete-time approximation at the given time points. Recall that, although the values at discrete time points are of most interest to us, we regard a discrete-time approximation as a continuous-time stochastic process defined on the entire interval $[\tau_0, T]$. Moreover, each discrete-time approximation corresponds to a specific "form" which describes, as said above, a recursive algorithm. A minimal sketch of such an algorithm in R is given below.
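The following generic R sketch (an illustration, not the appendix code; function names, defaults and parameter values are assumptions) implements the recursion (3.10) for user-supplied coefficients b and sigma:

# Generic Euler-Maruyama form (3.10): Y_{i+1} = Y_i + b(t_i, Y_i)*dt + s(t_i, Y_i)*dB_i
euler_maruyama <- function(b, s, Y0, Tend = 1, n = 2^8) {
  dt <- Tend / n
  times <- seq(0, Tend, by = dt)
  Y <- numeric(n + 1); Y[1] <- Y0
  for (i in 1:n) {
    dB <- sqrt(dt) * rnorm(1)                # Brownian increment ~ N(0, dt)
    Y[i + 1] <- Y[i] + b(times[i], Y[i]) * dt + s(times[i], Y[i]) * dB
  }
  list(t = times, Y = Y)
}
# Example: the geometric Brownian motion dX = mu*X dt + sigma*X dB of Example 3.2.1
set.seed(4)
path <- euler_maruyama(b = function(t, x) 0.05 * x, s = function(t, x) 0.2 * x, Y0 = 1)
plot(path$t, path$Y, type = "l", xlab = "t", ylab = "Y_t")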
Remark 3.4.2. Let $X_t$ be a geometric Brownian motion with $b(t, X) = bX$ and $\sigma(t, X) = \sigma X$. Then the Euler form becomes
\[ X_{t+\Delta t} = X_t + bX_t\Delta t + \sigma X_t(B_{t+\Delta t} - B_t), \]
whilst the exact solution is
\[ X_{t+\Delta t} = X_t\,e^{(b - \frac{1}{2}\sigma^2)\Delta t + \sigma(B_{t+\Delta t} - B_t)}. \]
We know that if $b \in \mathbb{R}$ with $|b|$ sufficiently small, then $1 + b \approx e^b$ is a good approximation. Consequently, if $\Delta t$ is sufficiently small, then
\[ X_t + bX_t\Delta t + \sigma X_t(B_{t+\Delta t} - B_t) = X_t\big[1 + b\Delta t + \sigma(B_{t+\Delta t} - B_t)\big] \approx X_t\,e^{b\Delta t + \sigma(B_{t+\Delta t} - B_t)} = X_{t+\Delta t}\,e^{\frac{1}{2}\sigma^2\Delta t}, \]
and for small $\Delta t$ we have $\frac{1}{2}\sigma^2\Delta t \approx 0$.
Next, we define the convergence of the method. Before doing so, it is necessary to define the criterion of absolute error:
\[ \varepsilon = E\big[\,|X_T - Y_T|\,\big]. \qquad (3.11) \]
Specifically, it is the mean of the absolute value of the difference between the Itô stochastic process and the approximation at a finite terminal time $T$. Usually, we will use the notation $Y^\delta$, where $\delta$ is a given maximum step size in $(0, \delta_0)$ ($\delta_0$ a finite positive number), to show the dependence of the approximation $Y$ on this size.
Definition 3.4.3. A general discrete-time approximation $Y^\delta$, with maximum step $\delta$, converges strongly with order $\gamma > 0$ to a stochastic process $X$ at time $T$ if there exists a positive constant $C$ such that
\[ \varepsilon_{\mathrm{strong}}(\delta) = E\big(|X_T - Y_T^\delta|\big) \le C\delta^\gamma \quad \text{for each } \delta \in (0, \delta_0),\ \delta_0 > 0. \qquad (3.12) \]
The above definition appears in many variations in the literature, but the inequality (3.12) is a natural generalization of the deterministic case. (Schaffter, 2010, p.6), (Higham, 2001, p.534), (Mikosch, 1998, p.161)
Definition 3.4.4. On the other hand, a method has weak order of convergence $\gamma > 0$ if there exists a positive constant $C$ such that, for all smooth functions $f$ in some class,
\[ \varepsilon_{\mathrm{weak}}(\delta) = \big|\,Ef(X_T) - Ef(Y_T^\delta)\,\big| \le C\delta^\gamma \quad \text{for each } \delta \in (0, \delta_0),\ \delta_0 > 0. \qquad (3.13) \]
Remark 3.4.5. According to the literature, if the conditions of Theorem 3.3.2 are satisfied (i.e. the coefficients $b$ and $\sigma$ satisfy the Lipschitz condition and the linear growth condition), then the Euler form converges with strong order of convergence $\gamma = \frac{1}{2}$. Specifically,
\[ \varepsilon_{\mathrm{strong}}(\delta) = E\big(|X_T - Y_T^\delta|\big) \le C\delta^{\frac{1}{2}} \quad \text{for each } \delta \in (0, \delta_0),\ \delta_0 > 0, \]
and with weak order of convergence $\gamma = 1$. Specifically,
\[ \varepsilon_{\mathrm{weak}}(\delta) = \big|\,Ef(X_T) - Ef(Y_T^\delta)\,\big| \le C\delta^{1} \quad \text{for each } \delta \in (0, \delta_0),\ \delta_0 > 0. \]
(Onskog, 2007, p.76), (Higham, 2001, p.537), (Shores, 2006, p.77).
Corollary 3.4.6. As mentioned, the Euler method converges strongly with order $\gamma = \frac{1}{2}$. Then, by Markov's inequality [see A.6] we have that
\[ P\big(|X_{t_n} - Y_n^\delta| \ge \Delta t^{\frac{1}{4}}\big) \le \frac{1}{\Delta t^{\frac{1}{4}}}\,E\big(|X_{t_n} - Y_n^\delta|\big) \le C\,\Delta t^{\frac{1}{4}}, \]
or equivalently
\[ P\big(|X_{t_n} - Y_n^\delta| < \Delta t^{\frac{1}{4}}\big) \ge 1 - C\,\Delta t^{\frac{1}{4}}, \]
i.e. the error is small with high probability along any path.
Remark 3.4.7. Strong convergence concerns the pathwise approximation of $X$ and measures the rate at which the mean of the error decays as $C\Delta^\gamma$ becomes sufficiently small (3.12). Weak convergence, on the other hand, concerns the approximation of the moments of $X$ and the rate at which the error of the means decays as $C\Delta^\gamma$ becomes very small (3.13).
An example is now given in order to illustrate the different aspects of simulating a discrete-time approximation of an Itô stochastic process. Moreover, through its detailed solution it can be ascertained whether the Euler method can be applied in a practical way.
Example 3.4.8. Initially, consider the Itô stochastic process $X = \{X_t,\ t \ge 0\}$ (geometric Brownian motion) which satisfies the linear stochastic differential equation
\[ dX_t = \mu X_t\,dt + \sigma X_t\,dB_t, \qquad (3.14) \]
where $t \in [0, T]$ and the initial value is $X_0 \in \mathbb{R}$.
Obviously, $\mu(t, x) = \mu x$ is the displacement coefficient and $\sigma(t, x) = \sigma x$ is the diffusion coefficient. From Example 3.2.1 we know that the analytical solution of (3.14) is
\[ X_t = X_0\,e^{(\mu - \frac{1}{2}\sigma^2)t + \sigma B_t} \qquad (3.15) \]
for $t \in [0, T]$ and the given Brownian motion $B = \{B_t,\ t \ge 0\}$. Because we know the solution analytically, we have the chance to compare the Euler approximation with the exact solution and to estimate the error. In order to simulate a path of the Euler approximation for a given time partition, we start from the initial condition $Y_0 = X_0$ and proceed recursively to produce the next term. The Euler form is
\[ Y_{i+1} = Y_i + \mu Y_i\Delta t + \sigma Y_i\Delta B_i \qquad (3.16) \]
with $i = 0, 1, 2, \dots$ and coefficients $\mu(t, x) = \mu x$ and $\sigma(t, x) = \sigma x$ as above; $\Delta B_i$ is the increment of the Brownian motion over the interval $t_i \le t \le t_{i+1}$. We can determine the values of the exact solution from (3.15) for the same path of the Brownian motion:
\[ X_{\tau_n} = X_0\,e^{(\mu - \frac{1}{2}\sigma^2)\tau_n + \sigma B_{\tau_n}}, \]
where $\{\tau_n : n = 1, 2, \dots, n_1\}$ is a uniform partition of $[0, T]$.
It is worth noting that the statistical software simulates the paths of the Brownian motion which provide the increments $B_{t_{i+1}} - B_{t_i}$ needed in (3.16). For simplicity, we choose the step $\Delta t$ of the numerical method to be an integer multiple $R \ge 1$ of the increment $\delta t$ of the Brownian path. Therefore, we are sure that the set of points $\{\tau_j\}$ contains the points $\{t_j\}$ which define the Euler approximation. Hence, we obtain the following graphs:
Figure 6: Euler’s approximation (red) and the exact solution of geometric Brownian
Motion (light blue) with X0 = 1, µ = 0.05 and σ = 0.2 [see code B.2.4 and remark B.2.5].
Figure 6 illustrates the Euler approximation with equidistant points on the time interval $[0, 1]$ for the Itô stochastic process which satisfies (3.14). The top-left panel has step $\delta = 2^{-2}$, the top-right $\delta = 2^{-4}$, the bottom-left $\delta = 2^{-6}$ and the bottom-right $\delta = 2^{-8}$. Consequently, it is clear that the smaller the step, the closer the approximation is to the Itô stochastic process.
Subsequently, we use the above Example 3.4.8 to check the criterion of absolute error with more accuracy and to verify numerically the conclusions drawn from Figure 6. Recall that we deal with the geometric Brownian motion which satisfies (3.14) and has the analytical solution (3.15).
Hence, we simulate $N$ paths of Brownian motion and use them to create $N$ approximate solutions for $Y$ and $N$ exact solutions for $X$. We denote by $Y_{T,\kappa}$ and $X_{T,\kappa}$ the final value of the $\kappa$-th approximate solution and exact solution respectively. From (3.11) the
absolute error can be estimated by the following quantity:
\[ \hat{\varepsilon} = \frac{1}{N}\sum_{\kappa=1}^{N} |X_{T,\kappa} - Y_{T,\kappa}|. \qquad (3.17) \]
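A minimal R sketch of this error estimate (illustrative only, not the appendix code; it drives the Euler recursion and the exact solution with the same Brownian increments, under assumed parameters) is:

# Monte Carlo estimate of the absolute error (3.17) at T for the Euler scheme applied to (3.14)
set.seed(5)
mu <- 0.05; sigma <- 0.2; X0 <- 1; Tend <- 1
N <- 25; n <- 2^6; dt <- Tend / n            # N paths, step delta = 2^-6 (illustrative)
err <- numeric(N)
for (k in 1:N) {
  dB <- sqrt(dt) * rnorm(n)                  # one Brownian path, shared by both solutions
  Y <- X0
  for (i in 1:n) Y <- Y + mu * Y * dt + sigma * Y * dB[i]         # Euler endpoint Y_T
  X_T <- X0 * exp((mu - 0.5 * sigma^2) * Tend + sigma * sum(dB))  # exact endpoint X_T
  err[k] <- abs(X_T - Y)
}
mean(err)                                    # estimate of the absolute error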
Therefore, by simulating $N = 25$ paths of the Itô process (which satisfies (3.14)) we can estimate the absolute error $\hat{\varepsilon}$ as defined by (3.17). The following results are obtained:
$\delta$:             $2^{-2}$        $2^{-4}$        $2^{-6}$        $2^{-8}$
$\hat{\varepsilon}$:   0.01267835      0.006312543     0.002848442     0.001257862
Table 1: Absolute errors $\hat{\varepsilon}$ for different steps [see code B.2.6 and remark B.2.7].
Finally, by comparing the results of Table 1, it can be observed that the estimate of the absolute error $\hat{\varepsilon}$ decreases as $\delta$ becomes increasingly small. Hence the predictions made from the above graphs are verified.
3.4.2 The Milstein method
Initially, we describe the Taylor-Itô expansion, which is the stochastic counterpart of the Taylor expansion in the deterministic case. Generally, the Taylor-Itô expansion is an iterative application of Itô's Lemma.
Consider the following scalar stochastic differential equation:
\[ dX_t = b(X_t)\,dt + \sigma(X_t)\,dB_t. \qquad (3.18) \]
For the sake of simplicity, the functions $b$ and $\sigma$ do not depend on the time variable $t$. By applying Itô's Lemma to $f(X_t)$ it follows that
\[ df(X_t) = \sigma(X_t)\frac{\partial f(X_t)}{\partial x}\,dB_t + \left(b(X_t)\frac{\partial f(X_t)}{\partial x} + \frac{1}{2}\sigma^2(X_t)\frac{\partial^2 f(X_t)}{\partial x^2}\right) dt. \qquad (3.19) \]
If we define the operators
\[ K_0 \equiv b(X_t)\frac{\partial}{\partial x} + \frac{1}{2}\sigma^2(X_t)\frac{\partial^2}{\partial x^2}, \qquad K_1 \equiv \sigma(X_t)\frac{\partial}{\partial x}, \]
the equation (3.19) becomes
\[ df(X_t) = K_0 f(X_t)\,dt + K_1 f(X_t)\,dB_t, \]
or equivalently in integral form
\[ f(X_t) = f(X_{t_0}) + \int_{t_0}^t K_0 f(X_s)\,ds + \int_{t_0}^t K_1 f(X_s)\,dB_s. \qquad (3.20) \]
By choosing $f(X) = X$, the above equation (3.20) gives
\[ X_t = X_{t_0} + \int_{t_0}^t b(X_s)\,ds + \int_{t_0}^t \sigma(X_s)\,dB_s. \qquad (3.21) \]
Therefore, the given stochastic differential equation is obtained in integral form. Subsequently, we apply Itô's Lemma to the functions appearing inside the integrals. Hence, for $f(X) = b(X)$, (3.20) gives
\[ b(X_t) = b(X_{t_0}) + \int_{t_0}^t K_0 b(X_s)\,ds + \int_{t_0}^t K_1 b(X_s)\,dB_s. \qquad (3.22) \]
Similarly, for $f(X) = \sigma(X)$ we have that
\[ \sigma(X_t) = \sigma(X_{t_0}) + \int_{t_0}^t K_0 \sigma(X_s)\,ds + \int_{t_0}^t K_1 \sigma(X_s)\,dB_s. \qquad (3.23) \]
Now, substituting (3.22) and (3.23) into (3.21) gives
\[
X_t = X_{t_0} + \int_{t_0}^t \left( b(X_{t_0}) + \int_{t_0}^s K_0 b(X_r)\,dr + \int_{t_0}^s K_1 b(X_r)\,dB_r \right) ds
+ \int_{t_0}^t \left( \sigma(X_{t_0}) + \int_{t_0}^s K_0 \sigma(X_r)\,dr + \int_{t_0}^s K_1 \sigma(X_r)\,dB_r \right) dB_s, \qquad (3.24)
\]
where
\[
\begin{aligned}
K_0 b &= b(X_t)\frac{\partial b}{\partial x} + \frac{1}{2}\sigma^2(X_t)\frac{\partial^2 b}{\partial x^2} = b\,b' + \frac{1}{2}\sigma^2 b'', \\
K_0 \sigma &= b(X_t)\frac{\partial \sigma}{\partial x} + \frac{1}{2}\sigma^2(X_t)\frac{\partial^2 \sigma}{\partial x^2} = b\,\sigma' + \frac{1}{2}\sigma^2 \sigma'', \\
K_1 b &\equiv \sigma(X_t)\frac{\partial b}{\partial x} = \sigma\,b', \\
K_1 \sigma &\equiv \sigma(X_t)\frac{\partial \sigma}{\partial x} = \sigma\,\sigma'.
\end{aligned}
\]
We then collect the four double integrals together and denote them by $R$ (for "residual"). This gives
\[ X_t = X_{t_0} + b(X_{t_0})\int_{t_0}^t ds + \sigma(X_{t_0})\int_{t_0}^t dB_s + R, \qquad (3.25) \]
where
\[
R = \int_{t_0}^t\!\!\int_{t_0}^s K_0 b(X_r)\,dr\,ds + \int_{t_0}^t\!\!\int_{t_0}^s K_1 b(X_r)\,dB_r\,ds + \int_{t_0}^t\!\!\int_{t_0}^s K_0 \sigma(X_r)\,dr\,dB_s + \int_{t_0}^t\!\!\int_{t_0}^s K_1 \sigma(X_r)\,dB_r\,dB_s.
\]
Essentially, the method applies this substitution iteratively in order to obtain constant integrands in terms of increasingly higher order. For instance, for the last term of the residual $R$ we have
\[
\int_{t_0}^t\!\!\int_{t_0}^s K_1\sigma(X_r)\,dB_r\,dB_s
= \int_{t_0}^t\!\!\int_{t_0}^s \left( K_1\sigma(X_{t_0}) + \int_{t_0}^r K_0 K_1\sigma(X_q)\,dq + \int_{t_0}^r K_1 K_1\sigma(X_q)\,dB_q \right) dB_r\,dB_s,
\]
where $f = K_1\sigma$ has been used in equation (3.20). The first term on the right-hand side of the above equation can be written as
\[ K_1\sigma(X_{t_0}) \int_{t_0}^t\!\!\int_{t_0}^s dB_r\,dB_s = \sigma(X_{t_0})\sigma'(X_{t_0}) \int_{t_0}^t\!\!\int_{t_0}^s dB_r\,dB_s. \]
Consequently, equation (3.25) becomes
\[ X_t = X_{t_0} + b(X_{t_0})\int_{t_0}^t ds + \sigma(X_{t_0})\int_{t_0}^t dB_s + \sigma(X_{t_0})\sigma'(X_{t_0})\int_{t_0}^t\!\!\int_{t_0}^s dB_r\,dB_s + \tilde{R}, \qquad (3.26) \]
where $\tilde{R}$ is the new residual. The next step is to formulate numerical algorithms according to the above equations. For equation (3.26) we evaluate the double integral:
\[
\begin{aligned}
\int_{t_0}^t\!\!\int_{t_0}^s dB_r\,dB_s &= \int_{t_0}^t (B_s - B_{t_0})\,dB_s = \int_{t_0}^t B_s\,dB_s - \int_{t_0}^t B_{t_0}\,dB_s \\
&= \frac{1}{2}\big(B_t^2 - t - B_{t_0}^2 + t_0\big) - B_{t_0}(B_t - B_{t_0}) \\
&= \frac{1}{2}B_t^2 - B_{t_0}B_t + \frac{1}{2}B_{t_0}^2 - \frac{1}{2}(t - t_0) \\
&= \frac{1}{2}(B_t - B_{t_0})^2 - \frac{1}{2}(t - t_0). \qquad (3.27)
\end{aligned}
\]
Setting $\Delta t := t - t_0$ and $\Delta B_t := B_t - B_{t_0}$ in (3.27), it follows that
\[ \int_{t_0}^t\!\!\int_{t_0}^s dB_r\,dB_s = \frac{1}{2}(\Delta B_t)^2 - \frac{1}{2}\Delta t. \]
Generally, for an interval of the form $[t_i, t_{i+1}]$ we substitute
\[ t_0 = t_i, \qquad t = t_{i+1}, \qquad \Delta t := t_{i+1} - t_i, \]
and combining (3.26) and (3.27) we have
\[ X_{t_{i+1}} = X_{t_i} + b(X_{t_i})\Delta t + \sigma(X_{t_i})\Delta B_i + \frac{1}{2}\sigma(X_{t_i})\sigma'(X_{t_i})\big((\Delta B_i)^2 - \Delta t\big) + \tilde{R}.
\]
It is clear that the first three terms of the above recursive equation,
\[ Y_{i+1} = Y_i + b(Y_i)\Delta t + \sigma(Y_i)\Delta B_i, \]
constitute the Euler-Maruyama method. However, by keeping all the terms of the equation,
\[ Y_{i+1} = Y_i + b(Y_i)\Delta t + \sigma(Y_i)\Delta B_i + \frac{1}{2}\sigma(Y_i)\sigma'(Y_i)\big((\Delta B_i)^2 - \Delta t\big), \]
we arrive at the Milstein method.
(Higham, 2001, p.539), (Schaffter, 2010, p.5), (Mikosch, 1998, p.164), (Onskog, 2007, p.96), (Allen, 2007, p.102).
Furthermore, note that the last term of the Milstein form requires the derivative $\sigma'$ of the diffusion coefficient to be available; a minimal implementation sketch is given below.
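The following R sketch (illustrative only; the function names and defaults are assumptions, not the appendix code) implements one path of the Milstein form for user-supplied b, sigma and its derivative:

# Milstein form: Y_{i+1} = Y_i + b(Y_i)*dt + s(Y_i)*dB + 0.5*s(Y_i)*s1(Y_i)*(dB^2 - dt), with s1 = derivative of s
milstein <- function(b, s, s1, Y0, Tend = 1, n = 2^6) {
  dt <- Tend / n
  Y <- numeric(n + 1); Y[1] <- Y0
  for (i in 1:n) {
    dB <- sqrt(dt) * rnorm(1)
    Y[i + 1] <- Y[i] + b(Y[i]) * dt + s(Y[i]) * dB + 0.5 * s(Y[i]) * s1(Y[i]) * (dB^2 - dt)
  }
  Y
}
# Example: geometric Brownian motion with b(x) = mu*x, s(x) = sigma*x, s'(x) = sigma (illustrative parameters)
set.seed(6)
Y <- milstein(b = function(x) 0.05 * x, s = function(x) 0.2 * x, s1 = function(x) 0.2, Y0 = 1)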
Subsequently, an example is presented below in order to show the Milstein and Euler approximations for a stochastic differential equation and to examine their accuracy.
Example 3.4.9. Consider again the Itô stochastic process $X = \{X_t,\ t \ge 0\}$ which satisfies the linear stochastic differential equation
\[ dX_t = \mu X_t\,dt + \sigma X_t\,dB_t, \qquad (3.28) \]
where $t \in [0, T]$ and the initial value is $X_0 \in \mathbb{R}$. As we have seen in Example 3.4.8, the Euler form of this stochastic differential equation is
\[ Y_{i+1} = Y_i(1 + \mu\Delta t) + \sigma Y_i\Delta B_i. \]
The Milstein form will be
\[
Y_{i+1} = Y_i + \mu Y_i\Delta t + \sigma Y_i\Delta B_i + \frac{1}{2}\sigma^2 Y_i\big((\Delta B_i)^2 - \Delta t\big)
= Y_i\left[1 + \left(\mu - \frac{1}{2}\sigma^2\right)\Delta t\right] + \sigma Y_i\Delta B_i + \frac{1}{2}\sigma^2 Y_i(\Delta B_i)^2.
\]
Therefore, we obtain the following graph:
Figure 7: Euler's approximation (red), Milstein's approximation (black) and the exact solution of geometric Brownian motion (light blue) with $\delta = 2^{-6}$, $X_0 = 1$, $\mu = 0.05$ and $\sigma = 0.2$ [see code B.2.8].
A closer look at Figure 7 indicates that the black line (Milstein's approximation) is closer to the exact solution than the red line (Euler's approximation). Hence, it is clear that the Milstein form is more accurate than the Euler form. In addition, the following graph is illustrated:
Figure 8: Euler's approximation (red), Milstein's approximation (black) and the exact solution of geometric Brownian motion (light blue) with $\delta = 2^{-8}$, $X_0 = 1$, $\mu = 0.05$ and $\sigma = 0.2$ [see remark B.2.9].
Looking at Figure 8, we can notice that both approximations are more accurate. The two forms are very close to the exact solution but, as shown numerically below, the Milstein approximation remains closer to the analytical solution of geometric Brownian motion than the Euler approximation.
Indeed, for the terminal point $T = 1$ we find, for $\delta = 2^{-6}$:
\[ \varepsilon(\text{Euler}) = 0.0006773981 \quad \text{and} \quad \varepsilon(\text{Milstein}) = 0.00005194361964777. \]
Similarly, for $\delta = 2^{-8}$:
\[ \varepsilon(\text{Euler}) = 0.000303467591 \quad \text{and} \quad \varepsilon(\text{Milstein}) = 0.00002640726142. \]
Consequently, it is clear that the error terms for the Milstein approximations are smaller than those for the Euler approximations in these examples, as the theory predicts.
4 Applications in Mathematical Finance
This chapter provides applications of Itô's stochastic calculus to the scientific field of Financial Mathematics. Stochastic processes are a fundamental tool which can give rigorous mathematical models and interpretations of option pricing, arbitrage, portfolio behaviour, etc. For instance, the Black-Scholes formula and European option pricing are two significant mathematical models which are discussed in this thesis.
4.1 Portfolios and assets
In this subsection we will set out some of the basic ideas of finance in continuous time. Specifically, we generalize these fundamental concepts to real time and we consider that financial events can happen over a continuous period of time. The values of the assets exhibit fluctuations which can be modelled as diffusions by Itô stochastic processes.
Consider $n + 1$ assets. Their values at time $t$ will be denoted by the vector
\[ S(t, w) = \big(S_0(t, w), S_1(t, w), \dots, S_n(t, w)\big). \]
This vector consists of a collection of stochastic processes with time parameter $t \in I \subset \mathbb{R}^+$ (i.e. $t$ is the time and $I$ denotes an interval which is a subset of $\mathbb{R}$).
Following Karatzas and Shreve (1991, p.371), we consider the values of the assets to be Itô stochastic processes of the form
\[ dS_0 = r(t, w)S_0\,dt, \qquad (4.1) \]
\[ dS_i = \mu_i(t, w, S_i)\,dt + \sum_{j=1}^m \sigma_{ij}(t, w, S_i)\,dB_j(t), \qquad (4.2) \]
where Si is the value of share i and the Bj are independent Brownian motions.
It is important to note that the above equation has more than one Brownian motion. This models the existence of more than one source of randomness. For instance, if $S_1$ represents the value of the share of a computer company and $S_2$ represents the value of the share of a food company, then a natural disaster may influence the share $S_2$ more than the share $S_1$. On the other hand, the discovery of a new processor may influence $S_1$ while having no effect on $S_2$.
Subsequently, some examples are given below to illustrate applications of the stochastic
processes (4.1) and (4.2).
Example 4.1.1. Consider a market which consists of two assets, one riskless asset $S_0(t)$ and one risky asset $S_1(t)$. The evolution of the riskless asset is given by the equation
\[ dS_0(t) = rS_0\,dt, \qquad (4.3) \]
where $r$ is the return of the riskless asset (e.g. a bank account, bond etc.). If we take $r$ to be a constant, then we have a differential equation which gives us the value of the riskless asset at any given time. Solving (4.3):
\[ \int_0^t \frac{dS_0(s)}{S_0(s)} = \int_0^t r\,ds \;\Longrightarrow\; \ln S_0(t) - \ln S_0(0) = rt \;\Longrightarrow\; \frac{S_0(t)}{S_0(0)} = e^{rt} \;\Longrightarrow\; S_0(t) = S_0(0)e^{rt}. \]
The evolution of the risky asset is given by the Itô process
\[ dS_1(t) = \mu S_1(t)\,dt + \sigma S_1(t)\,dB_t. \qquad (4.4) \]
The above equations are also given by Mao (2007, p.302-303) and Øksendal (2003, p.251). Briefly, (4.4) says that the return and the fluctuations of the risky asset are proportional to its value; its fluctuations, however, are multiplied by the increments of the Brownian motion. The coefficient $\sigma$ is called the volatility of the share. We consider that the uncertainty in the share value is introduced by a single Brownian motion $B_t$. The model (4.4) is identified as the geometric Brownian motion. As we showed in Example 3.2.1, by applying Itô's lemma to the function $f(x, t) = \ln x$ with $x = S_1(t)$ we conclude that
\[ S_1(t) = S_1(0)\,e^{(\mu - \frac{1}{2}\sigma^2)t + \sigma B_t}. \]
The above example introduced a typical model in finance, the Black-Scholes model.
(Davis, 2010, p.1)
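A minimal R sketch of this two-asset market (illustrative only; the values of r, mu and sigma are assumptions) simulates the deterministic riskless asset alongside one path of the risky asset:

# Two-asset Black-Scholes market: riskless S0(t) = S0(0)*exp(r*t), risky S1 a geometric Brownian motion
set.seed(7)
Tend <- 1; n <- 2^8; dt <- Tend / n; times <- seq(0, Tend, by = dt)
r <- 0.03; mu <- 0.05; sigma <- 0.2                   # illustrative rate, drift and volatility
S0 <- exp(r * times)                                  # riskless asset with S0(0) = 1
B  <- c(0, cumsum(sqrt(dt) * rnorm(n)))               # Brownian path
S1 <- exp((mu - 0.5 * sigma^2) * times + sigma * B)   # risky asset with S1(0) = 1
matplot(times, cbind(S0, S1), type = "l", lty = 1, xlab = "t", ylab = "asset value")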
Example 4.1.2. Assume again two assets, one riskless and one risky, denoted $S_0(t)$ and $S_1(t)$ respectively. The evolution of the riskless asset is given, as in the above example, by the equation
\[ dS_0(t) = rS_0\,dt, \]
while the risky asset is the Itô stochastic process
\[ dS_1(t) = \delta\big(S_1(t) - \nu\big)\,dt + \sigma\,dB_t. \]
The stochastic process of the risky asset is an Ornstein-Uhlenbeck process and is used as a model for real assets (e.g. goods).
Definition 4.1.3. Another common model is the following. Consider one riskless asset $S_0(t)$ and one risky asset $S_1(t)$. The evolution of the riskless asset is given by
\[ dS_0(t) = rS_0\,dt. \]
Assuming that the return of the riskless asset also depends on time (i.e. the bank rate changes over time), the value of the riskless asset is given by
\[ S_0(t) = S_0(0)\,e^{\int_0^t r(t')\,dt'}. \qquad (4.5) \]
The price of the risky asset is given by the Itô stochastic process
\[ dS_1(t) = \mu S_1(t)\,dt + \sigma S_1(t)^{1-\alpha}\,dB_t, \]
where $0 \le \alpha \le 1$.
This model is called the constant elasticity of variance model and was suggested by Cox and Ross in 1976 for the interpretation of a phenomenon related to the volatility smile (Cox and Ross, 1976, p.145-165). In this model the share's volatility equals $\sigma S_1^{-\alpha}$. If $\alpha = 0$ we return to the initial model of geometric Brownian motion.
Now some significant definitions are given below and we provide brief interpretations
for each of them.
Definition 4.1.4. A portfolio is an $\mathcal{F}_t$-adapted process of the form
\[ \theta(t, w) = \big(\theta_0(t, w), \dots, \theta_n(t, w)\big) \in \mathbb{R}^{n+1}. \]
The element $\theta_i(t, w)$ represents the amount of asset $i$ held by the investor. The fact that the portfolio is adapted means that the investor knows the financial conditions up to time $t$ in order to make a decision about the portfolio's readjustment. More simply, the investor is not allowed to look into the future, because the portfolio is adapted to the financial conditions up to time $t$.
Definition 4.1.5. The value of a portfolio $\theta(t, w)$ at time $t$ is given by the relation
\[ V^\theta(t, w) := \theta(t, w)\cdot S(t, w) = \sum_{i=0}^n \theta_i(t, w)S_i(t, w). \qquad (4.6) \]
The value of a portfolio is also a stochastic process which is adapted to $\mathcal{F}_t$. The above relation states that $V^\theta$ is the total value of the investments held at time $t$.
We can further define the gains process of an asset. Consider a partition of the interval $(0, t)$ and suppose the holding $\theta_i$ of asset $i$ remains constant on the interval $[t_{k-1}, t_k]$, equal to $\theta_i(t_{k-1})$. Over this interval, the value of asset $i$ changes by $S_i(t_k) - S_i(t_{k-1})$. The total profit (positive or negative) of the investor due to the variation of the asset value over this particular interval is therefore $\theta_i(t_{k-1})\big[S_i(t_k) - S_i(t_{k-1})\big]$. By summing the previous relation and taking the limit $\Delta t \to 0$, we arrive at the integral $\int_0^t \theta_i(s)\,dS_i(s)$, which gives the gains from asset $i$. Obviously, this integral is an Itô stochastic integral of the stochastic process $\theta_i$ with respect to the Itô diffusion $S_i$. If we want the gains process for the whole portfolio, we sum the gains processes of each asset which constitutes the portfolio. Specifically,
\[ G(t) = \sum_{i=0}^n \int_0^t \theta_i(s)\,dS_i(s). \]
The above reasoning is based on Duffie (2001, p.84-85) and Schweizer (1992, p.5); here, however, a fuller explanation of the gains process of an asset, and of a portfolio in general, has been given. Therefore, we see that stochastic integrals with respect to Brownian motions, or more generally with respect to Itô processes, appear in a natural way in the scientific field of finance. A small numerical sketch of these quantities is given below.
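As a small illustration (not from the thesis; the holdings and price paths below are invented for the example), the portfolio value (4.6) and a Riemann-sum approximation of the gains process can be computed as follows:

# Portfolio value V = sum_i theta_i * S_i and the discrete-sum version of the gains process G(t)
set.seed(8)
n <- 2^8; dt <- 1 / n; times <- seq(0, 1, by = dt)
S0 <- exp(0.03 * times)                                                              # riskless asset
S1 <- exp((0.05 - 0.5 * 0.2^2) * times + 0.2 * c(0, cumsum(sqrt(dt) * rnorm(n))))    # risky asset
S  <- cbind(S0, S1)
theta <- cbind(rep(2, n + 1), rep(5, n + 1))  # constant holdings: 2 units riskless, 5 units risky
V <- rowSums(theta * S)                       # portfolio value V^theta(t_k)
dS <- diff(S)                                 # increments S_i(t_k) - S_i(t_{k-1})
G <- c(0, cumsum(rowSums(theta[-(n + 1), ] * dS)))   # gains: sum_i sum_k theta_i(t_{k-1}) * dS_i
head(cbind(t = times, V = V, G = G))

For constant holdings the identity $V^\theta(t) = V^\theta(0) + G(t)$ holds exactly, which is precisely the self-financing property discussed next.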
Another important concept is the self-financing portfolio.
Definition 4.1.6. A portfolio is said to be self-financing if its value can be determined exclusively by the gains of the assets which constitute it. This value is given by
\[ V^\theta(t) = V^\theta(0) + \int_0^t \theta(s)\cdot dS(s) \]
for $t \in [0, T]$, or in differential form
\[ dV^\theta(t) = \theta(t)\cdot dS(t). \qquad (4.7) \]
According to Mikosch (1998, p.169), the above definition says that the holder of a self-financing portfolio can only reinvest its gains, without adding external amounts. Computing the partial derivatives of $V^\theta(s) = \theta(s)\cdot S(s)$, namely $\frac{\partial V}{\partial S} = \theta$, $\frac{\partial V}{\partial \theta} = S$, $\frac{\partial^2 V}{\partial S^2} = 0$, and applying Itô's lemma we get
\[ dV(s) = \theta\cdot dS + S\cdot d\theta + 0. \]
Then, by (4.7), we deduce that for a self-financing portfolio it holds that
\[ \theta\cdot dS = \theta\cdot dS + S\cdot d\theta \;\Longrightarrow\; S\cdot d\theta = 0. \qquad (4.8) \]
4.2 Normalizing Assets
According to Tavella(2003, p.51), the pricing problem can be better formulated when a
normalizing asset is used. A typical example of such normalization is the discounting
which is defined below.
Example 4.2.1. (Øksendal, 2003, p.250) Discounted prices. We use the value of the riskless asset as a measurement unit for the normalization of the market, i.e. $S_i^* = S_0^{-1}S_i$ for $i = 1, \dots, n$. In this case, the normalized market has the form
\[ S^* = (1, S_1^*, \dots, S_n^*). \qquad (4.9) \]
From (4.5), and the fact that $S_0^* = 1$, it follows that
\[ S_0^{-1}(t) = e^{-\int_0^t r(s, w)\,ds}, \qquad (4.10) \]
or, more simply, when the return of the riskless asset is constant and equal to $r$:
\[ S_0^{-1}(t) = e^{-\int_0^t r\,ds} = e^{-[rs]_0^t} = e^{-rt}. \]
$S_i^*(t)$ is the discounted price of the risky asset $i$. Specifically, if the amount $S_i^*(t)$ were placed in a bank account at time $0$, then at time $t$ it would yield the price $S_i(t)$ of the asset.
The following example is given as a comment by Øksendal(2003, p.251). In this thesis
we provide a more detailed explanation of it.
Example 4.2.2. Discounted value process. Consider that the rate of return of the riskless asset is $r(t, w)$. Multiplying $V^\theta(t)$ by (4.10) on both sides we have
\[ \bar{V}^\theta(t) = V^\theta(t)S_0^{-1}(t) = e^{-\int_0^t r(s, w)\,ds}\,V^\theta(t). \qquad (4.11) \]
Then, if and only if the portfolio is self-financing, we can show that
\[ \bar{V}^\theta(t) = V^\theta(0) + \sum_{i=1}^n \int_0^t \theta_i(s)\,dS_i^*(s). \]
final_report_template
final_report_template
final_report_template
final_report_template
final_report_template
final_report_template
final_report_template
final_report_template
final_report_template
final_report_template
final_report_template
final_report_template
final_report_template
final_report_template
final_report_template
final_report_template
final_report_template
final_report_template
final_report_template
final_report_template
final_report_template
final_report_template
final_report_template
final_report_template
final_report_template
final_report_template
final_report_template
final_report_template

More Related Content

What's hot

Statistics firstfive
Statistics firstfiveStatistics firstfive
Statistics firstfiveSukirti Garg
 
Pricing and hedging of defaultable models
Pricing and hedging of defaultable modelsPricing and hedging of defaultable models
Pricing and hedging of defaultable modelsMarta Leniec
 
A Bilevel Optimization Approach to Machine Learning
A Bilevel Optimization Approach to Machine LearningA Bilevel Optimization Approach to Machine Learning
A Bilevel Optimization Approach to Machine Learningbutest
 
Backtesting Value at Risk and Expected Shortfall with Underlying Fat Tails an...
Backtesting Value at Risk and Expected Shortfall with Underlying Fat Tails an...Backtesting Value at Risk and Expected Shortfall with Underlying Fat Tails an...
Backtesting Value at Risk and Expected Shortfall with Underlying Fat Tails an...Stefano Bochicchio
 
2012-02-17_Vojtech-Seman_Rigorous_Thesis
2012-02-17_Vojtech-Seman_Rigorous_Thesis2012-02-17_Vojtech-Seman_Rigorous_Thesis
2012-02-17_Vojtech-Seman_Rigorous_ThesisVojtech Seman
 
RuinThereticalComparisons
RuinThereticalComparisonsRuinThereticalComparisons
RuinThereticalComparisonsVesa Pekkanen
 
Introduction to artificial_intelligence_syllabus
Introduction to artificial_intelligence_syllabusIntroduction to artificial_intelligence_syllabus
Introduction to artificial_intelligence_syllabusAndres Mendez-Vazquez
 
ExamsGamesAndKnapsacks_RobMooreOxfordThesis
ExamsGamesAndKnapsacks_RobMooreOxfordThesisExamsGamesAndKnapsacks_RobMooreOxfordThesis
ExamsGamesAndKnapsacks_RobMooreOxfordThesisRob Moore
 

What's hot (17)

pro-1
pro-1pro-1
pro-1
 
Statistics firstfive
Statistics firstfiveStatistics firstfive
Statistics firstfive
 
Pricing and hedging of defaultable models
Pricing and hedging of defaultable modelsPricing and hedging of defaultable models
Pricing and hedging of defaultable models
 
A Bilevel Optimization Approach to Machine Learning
A Bilevel Optimization Approach to Machine LearningA Bilevel Optimization Approach to Machine Learning
A Bilevel Optimization Approach to Machine Learning
 
Thesis
ThesisThesis
Thesis
 
probabilidades.pdf
probabilidades.pdfprobabilidades.pdf
probabilidades.pdf
 
Backtesting Value at Risk and Expected Shortfall with Underlying Fat Tails an...
Backtesting Value at Risk and Expected Shortfall with Underlying Fat Tails an...Backtesting Value at Risk and Expected Shortfall with Underlying Fat Tails an...
Backtesting Value at Risk and Expected Shortfall with Underlying Fat Tails an...
 
Machine learning-cheat-sheet
Machine learning-cheat-sheetMachine learning-cheat-sheet
Machine learning-cheat-sheet
 
C++ progrmming language
C++ progrmming languageC++ progrmming language
C++ progrmming language
 
Bjr cimpa
Bjr cimpaBjr cimpa
Bjr cimpa
 
2012-02-17_Vojtech-Seman_Rigorous_Thesis
2012-02-17_Vojtech-Seman_Rigorous_Thesis2012-02-17_Vojtech-Seman_Rigorous_Thesis
2012-02-17_Vojtech-Seman_Rigorous_Thesis
 
RuinThereticalComparisons
RuinThereticalComparisonsRuinThereticalComparisons
RuinThereticalComparisons
 
Introduction to artificial_intelligence_syllabus
Introduction to artificial_intelligence_syllabusIntroduction to artificial_intelligence_syllabus
Introduction to artificial_intelligence_syllabus
 
ExamsGamesAndKnapsacks_RobMooreOxfordThesis
ExamsGamesAndKnapsacks_RobMooreOxfordThesisExamsGamesAndKnapsacks_RobMooreOxfordThesis
ExamsGamesAndKnapsacks_RobMooreOxfordThesis
 
Nishant_thesis_report
Nishant_thesis_report Nishant_thesis_report
Nishant_thesis_report
 
Macro lecture notes
Macro lecture notesMacro lecture notes
Macro lecture notes
 
Vekony & Korneliussen (2016)
Vekony & Korneliussen (2016)Vekony & Korneliussen (2016)
Vekony & Korneliussen (2016)
 

Viewers also liked

Kauppakamari - yrittäjätarina - rakasta onnistumista enemmän kuin pelkää...
Kauppakamari - yrittäjätarina - rakasta onnistumista enemmän kuin pelkää...Kauppakamari - yrittäjätarina - rakasta onnistumista enemmän kuin pelkää...
Kauppakamari - yrittäjätarina - rakasta onnistumista enemmän kuin pelkää...Ilkka O. Lavas
 
Redes sociais e cyberbullying
Redes sociais e cyberbullyingRedes sociais e cyberbullying
Redes sociais e cyberbullyingTeresa Marques
 
The 6th Vital Sign - 'The Human Touch'
The 6th Vital Sign - 'The Human Touch'The 6th Vital Sign - 'The Human Touch'
The 6th Vital Sign - 'The Human Touch'Deborah Stevens
 
Sistema financiero mexicano.
Sistema financiero mexicano.Sistema financiero mexicano.
Sistema financiero mexicano.GIZELLE BERNAL
 
Clase invertida
Clase invertida Clase invertida
Clase invertida Paula1424
 
Arbitragem e metodo_sga_oab29set2009
Arbitragem e metodo_sga_oab29set2009Arbitragem e metodo_sga_oab29set2009
Arbitragem e metodo_sga_oab29set2009Jane Cesca
 
Comfor 2016 11 katalog
Comfor 2016 11 katalogComfor 2016 11 katalog
Comfor 2016 11 katalogcomfor_web
 
Comfor 2016 11 noviny
Comfor 2016 11 novinyComfor 2016 11 noviny
Comfor 2016 11 novinycomfor_web
 
Evangelio de Otra Manera para este domingo 17 de noviembre XXXIII T.O.
Evangelio de Otra Manera para este domingo 17 de noviembre XXXIII T.O.Evangelio de Otra Manera para este domingo 17 de noviembre XXXIII T.O.
Evangelio de Otra Manera para este domingo 17 de noviembre XXXIII T.O.Seminario Metropolitano de Zaragoza
 
19432356 primer-manual-de-nutiricion-conciente
19432356 primer-manual-de-nutiricion-conciente19432356 primer-manual-de-nutiricion-conciente
19432356 primer-manual-de-nutiricion-concienteMayra Martinez Oliva
 
Strategieentwicklung und Optimierung der Social Media Manäle im B2B Umfeld
Strategieentwicklung und Optimierung der Social Media Manäle im B2B UmfeldStrategieentwicklung und Optimierung der Social Media Manäle im B2B Umfeld
Strategieentwicklung und Optimierung der Social Media Manäle im B2B UmfeldLars Schulze
 

Viewers also liked (20)

Kauppakamari - yrittäjätarina - rakasta onnistumista enemmän kuin pelkää...
Kauppakamari - yrittäjätarina - rakasta onnistumista enemmän kuin pelkää...Kauppakamari - yrittäjätarina - rakasta onnistumista enemmän kuin pelkää...
Kauppakamari - yrittäjätarina - rakasta onnistumista enemmän kuin pelkää...
 
Redes sociais e cyberbullying
Redes sociais e cyberbullyingRedes sociais e cyberbullying
Redes sociais e cyberbullying
 
The 6th Vital Sign - 'The Human Touch'
The 6th Vital Sign - 'The Human Touch'The 6th Vital Sign - 'The Human Touch'
The 6th Vital Sign - 'The Human Touch'
 
kripashree
kripashreekripashree
kripashree
 
Sistema financiero mexicano.
Sistema financiero mexicano.Sistema financiero mexicano.
Sistema financiero mexicano.
 
Clase invertida
Clase invertida Clase invertida
Clase invertida
 
Arbitragem e metodo_sga_oab29set2009
Arbitragem e metodo_sga_oab29set2009Arbitragem e metodo_sga_oab29set2009
Arbitragem e metodo_sga_oab29set2009
 
Comfor 2016 11 katalog
Comfor 2016 11 katalogComfor 2016 11 katalog
Comfor 2016 11 katalog
 
Evangelio de Otra Manera domingo 16 de noviembre de 2014
Evangelio de Otra Manera domingo 16 de noviembre de 2014Evangelio de Otra Manera domingo 16 de noviembre de 2014
Evangelio de Otra Manera domingo 16 de noviembre de 2014
 
Domingo XXXI del tiempo ordinario
Domingo XXXI del tiempo ordinarioDomingo XXXI del tiempo ordinario
Domingo XXXI del tiempo ordinario
 
[Sample] Coaching Training
[Sample] Coaching Training[Sample] Coaching Training
[Sample] Coaching Training
 
Comfor 2016 11 noviny
Comfor 2016 11 novinyComfor 2016 11 noviny
Comfor 2016 11 noviny
 
Evangelio de otra manera... domingo de ramos
Evangelio de otra manera... domingo de ramosEvangelio de otra manera... domingo de ramos
Evangelio de otra manera... domingo de ramos
 
Evangelio de Otra Manera para este domingo 17 de noviembre XXXIII T.O.
Evangelio de Otra Manera para este domingo 17 de noviembre XXXIII T.O.Evangelio de Otra Manera para este domingo 17 de noviembre XXXIII T.O.
Evangelio de Otra Manera para este domingo 17 de noviembre XXXIII T.O.
 
Evangelio I Domingo de Adviento - Ciclo C
Evangelio I Domingo de Adviento - Ciclo CEvangelio I Domingo de Adviento - Ciclo C
Evangelio I Domingo de Adviento - Ciclo C
 
Presentación Campaña Día del Seminario
Presentación Campaña Día del SeminarioPresentación Campaña Día del Seminario
Presentación Campaña Día del Seminario
 
19432356 primer-manual-de-nutiricion-conciente
19432356 primer-manual-de-nutiricion-conciente19432356 primer-manual-de-nutiricion-conciente
19432356 primer-manual-de-nutiricion-conciente
 
Evangelio de Otra Manera domingo 16 de febrero de 2014
Evangelio de Otra Manera domingo 16 de febrero de 2014Evangelio de Otra Manera domingo 16 de febrero de 2014
Evangelio de Otra Manera domingo 16 de febrero de 2014
 
Tema 4.-acciones-sep-oefa
Tema 4.-acciones-sep-oefaTema 4.-acciones-sep-oefa
Tema 4.-acciones-sep-oefa
 
Strategieentwicklung und Optimierung der Social Media Manäle im B2B Umfeld
Strategieentwicklung und Optimierung der Social Media Manäle im B2B UmfeldStrategieentwicklung und Optimierung der Social Media Manäle im B2B Umfeld
Strategieentwicklung und Optimierung der Social Media Manäle im B2B Umfeld
 

Similar to final_report_template

Seminar- Robust Regression Methods
Seminar- Robust Regression MethodsSeminar- Robust Regression Methods
Seminar- Robust Regression MethodsSumon Sdb
 
Clustering Financial Time Series and Evidences of Memory E
Clustering Financial Time Series and Evidences of Memory EClustering Financial Time Series and Evidences of Memory E
Clustering Financial Time Series and Evidences of Memory EGabriele Pompa, PhD
 
Compiled Report
Compiled ReportCompiled Report
Compiled ReportSam McStay
 
Lecturenotesstatistics
LecturenotesstatisticsLecturenotesstatistics
LecturenotesstatisticsRekha Goel
 
Thesis Fabian Brull
Thesis Fabian BrullThesis Fabian Brull
Thesis Fabian BrullFabian Brull
 
Introduction to Computational Mathematics (2nd Edition, 2015)
Introduction to Computational Mathematics (2nd Edition, 2015)Introduction to Computational Mathematics (2nd Edition, 2015)
Introduction to Computational Mathematics (2nd Edition, 2015)Xin-She Yang
 
Mathematical modeling models, analysis and applications ( pdf drive )
Mathematical modeling  models, analysis and applications ( pdf drive )Mathematical modeling  models, analysis and applications ( pdf drive )
Mathematical modeling models, analysis and applications ( pdf drive )UsairamSheraz
 
Fundamentals of computational fluid dynamics
Fundamentals of computational fluid dynamicsFundamentals of computational fluid dynamics
Fundamentals of computational fluid dynamicsAghilesh V
 
An Introduction To Mathematical Modelling
An Introduction To Mathematical ModellingAn Introduction To Mathematical Modelling
An Introduction To Mathematical ModellingJoe Osborn
 
An Introduction to Statistical Learning R Fourth Printing.pdf
An Introduction to Statistical Learning R Fourth Printing.pdfAn Introduction to Statistical Learning R Fourth Printing.pdf
An Introduction to Statistical Learning R Fourth Printing.pdfDanielMondragon15
 
A Graph Theoretic Approach To Matrix Functions And Quantum Dynamics (PhD Thesis)
A Graph Theoretic Approach To Matrix Functions And Quantum Dynamics (PhD Thesis)A Graph Theoretic Approach To Matrix Functions And Quantum Dynamics (PhD Thesis)
A Graph Theoretic Approach To Matrix Functions And Quantum Dynamics (PhD Thesis)Amy Cernava
 

Similar to final_report_template (20)

Seminar- Robust Regression Methods
Seminar- Robust Regression MethodsSeminar- Robust Regression Methods
Seminar- Robust Regression Methods
 
Clustering Financial Time Series and Evidences of Memory E
Clustering Financial Time Series and Evidences of Memory EClustering Financial Time Series and Evidences of Memory E
Clustering Financial Time Series and Evidences of Memory E
 
Compiled Report
Compiled ReportCompiled Report
Compiled Report
 
Thesis lebanon
Thesis lebanonThesis lebanon
Thesis lebanon
 
HonsTokelo
HonsTokeloHonsTokelo
HonsTokelo
 
Lecturenotesstatistics
LecturenotesstatisticsLecturenotesstatistics
Lecturenotesstatistics
 
Thesis Fabian Brull
Thesis Fabian BrullThesis Fabian Brull
Thesis Fabian Brull
 
DISS2013
DISS2013DISS2013
DISS2013
 
Introduction to Computational Mathematics (2nd Edition, 2015)
Introduction to Computational Mathematics (2nd Edition, 2015)Introduction to Computational Mathematics (2nd Edition, 2015)
Introduction to Computational Mathematics (2nd Edition, 2015)
 
Thesis augmented
Thesis augmentedThesis augmented
Thesis augmented
 
thesis
thesisthesis
thesis
 
Mathematical modeling models, analysis and applications ( pdf drive )
Mathematical modeling  models, analysis and applications ( pdf drive )Mathematical modeling  models, analysis and applications ( pdf drive )
Mathematical modeling models, analysis and applications ( pdf drive )
 
Final
FinalFinal
Final
 
SMA206_NOTES
SMA206_NOTESSMA206_NOTES
SMA206_NOTES
 
BENVENISTE-DISSERTATION-2014
BENVENISTE-DISSERTATION-2014BENVENISTE-DISSERTATION-2014
BENVENISTE-DISSERTATION-2014
 
Fundamentals of computational fluid dynamics
Fundamentals of computational fluid dynamicsFundamentals of computational fluid dynamics
Fundamentals of computational fluid dynamics
 
Complex
ComplexComplex
Complex
 
An Introduction To Mathematical Modelling
An Introduction To Mathematical ModellingAn Introduction To Mathematical Modelling
An Introduction To Mathematical Modelling
 
An Introduction to Statistical Learning R Fourth Printing.pdf
An Introduction to Statistical Learning R Fourth Printing.pdfAn Introduction to Statistical Learning R Fourth Printing.pdf
An Introduction to Statistical Learning R Fourth Printing.pdf
 
A Graph Theoretic Approach To Matrix Functions And Quantum Dynamics (PhD Thesis)
A Graph Theoretic Approach To Matrix Functions And Quantum Dynamics (PhD Thesis)A Graph Theoretic Approach To Matrix Functions And Quantum Dynamics (PhD Thesis)
A Graph Theoretic Approach To Matrix Functions And Quantum Dynamics (PhD Thesis)
 

final_report_template

  • 1. Stochastic Differetial Equations G1PMD MSc Dissertation in Statistics and Applied Probability Summer 2015/16 School of Mathematical Sciences University of Nottingham Panayiotis Charalampous Supervisor: Dr. Andrew Wood I have read and understood the School and University guidelines on plagiarism. I confirm that this work is my own, apart from the acknowledged references.
  • 2. Acknowledgments First of all, I would like to express my sincere thanks to my supervisor Professor Andrew Wood for his patience, perseverance and valuable guidance which leaded me to fulfil my postgraduate thesis. In addition, huge thanks go to my personal tutor Dr. Simon Preston for his continues support. From the first meeting, I felt his willingness to guide me in order to have a positive academic year. Moreover, I would like to thank all my colleagues for the strong relationship that we created and I would also express my gratitude to Ms Christina Gregoriou for the valuable help in English language. Finally, special thanks to my fianc´e Eirini Tsiarli for her great patience and systematic encouragement and my family for their support and understanding.
  • 3. Abstract Stochastic differential equations have a dominant role in various scientific fields such as Fi- nance, Chemistry, Physics etc. Unfortunately, most of them cannot be solved analytically and, consequently, it is occurred various numerical methods which give their approximate solutions. The main purpose of the current thesis is to study the theory behind stochastic differential equations, to analyse the implementation of numerical methods and to give numerical examples of them. This thesis was conducted by having in mind three main objectives. The analysis of theory in stochastic integration, the detailed explanation and illustration of two known numerical methods, Euler-Maruyama and Milstein, and the pro- vision of applications of SDEs in Finance. Firstly, this thesis points out important details regarding the theory of SDEs and analyses the structure of their solutions. Moreover, it provides significant conclusions through the simulated approximations of the two nu- merical methods and it proves the crucial role of SDEs in the scientific field of Finance. Finally, the numerical approximations have been simulated using the statistical software R. KEYWORDS: Stochastic differential equations, Stochastic processes, Brownian mo- tion, Stochastic integral, Numerical methods, Black-Scholes model.
  • 4. Contents 1 Introduction 3 1.1 Background . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3 1.2 Aim . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5 1.3 Objectives . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5 1.4 Structure of the thesis . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5 2 Review of Probability, Brownian Motion and Stochastic Integration 6 2.1 Stochastic Processes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6 2.2 Brownian Motion (or Wiener process) . . . . . . . . . . . . . . . . . . . . . 11 2.3 Brief mention to Stieltjes integral . . . . . . . . . . . . . . . . . . . . . . . 20 2.4 Itˆo’s integral . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 21 2.4.1 The Itˆo’s integral for a step function . . . . . . . . . . . . . . . . . 22 2.4.2 C’s stochastic integral . . . . . . . . . . . . . . . . . . . . . . . . . 29 2.5 The Itˆo’s Lemma . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 35 3 Stochastic Differential Equations 37 3.1 Basic concepts . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 38 3.2 The Itˆo’s Lemma as a solving method of SDEs . . . . . . . . . . . . . . . . 39 3.3 Strong and Weak Solutions . . . . . . . . . . . . . . . . . . . . . . . . . . . 47 3.4 Numerical methods of SDEs . . . . . . . . . . . . . . . . . . . . . . . . . . 52 3.4.1 The Euler-Maruyama method . . . . . . . . . . . . . . . . . . . . . 53 3.4.2 The Milstein method . . . . . . . . . . . . . . . . . . . . . . . . . . 59 4 Applications in Mathematical Finance 65 4.1 Portfolios and assets . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 65 4.2 Normalizing Assets . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 70 4.3 Arbitrage . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 71 4.4 European options in a general complete market . . . . . . . . . . . . . . . 75 1
4.4.1 The Black-Scholes formula 80
5 Conclusions and Further propositions 83
5.1 Conclusion 83
5.2 Further propositions 84
A Appendix 85
A.1 Dominated convergence theorem 85
A.2 Bounded convergence theorem 85
A.3 The Chebyshev inequality 86
A.4 Banach fixed point theorem 86
A.5 Levy's theorem 87
A.6 The Markov inequality 87
A.7 Itô's representation theorem 88
A.8 The Novikov condition 88
B Coding in the statistical software R 89
B.1 R-coding for Chapter 2 89
B.2 R-coding for Chapter 3 89
1 Introduction

1.1 Background

There is no doubt that Stochastic Analysis is one of the most widespread disciplines in modern Mathematics, both for its applications across the majority of scientific fields and for the many areas of mathematics required for its foundation and development. One of its most significant applications today is the explanation of phenomena that vary over time. This has led to the concept of stochastic models which, in contrast with deterministic models, are designed to assess the interactions between the variables of a phenomenon that evolves in time in the presence of uncertainty. Stochastic modelling uses probability theory to model physical phenomena; models of this kind therefore give a better approximation of the real conditions under which a physical phenomenon evolves, and they make it possible to represent random factors that may influence it. However, describing a physical phenomenon by a stochastic model is not an easy task. Experience shows that modelling a phenomenon, that is, translating it into equations, requires considerable skill: the phenomenon is converted into a differential equation in the hope of ending up with a stochastic model that can be verified experimentally. In Stochastic Analysis, the differential equations used to explain such phenomena are called stochastic differential equations (SDEs). A stochastic differential equation is a differential equation that contains stochastic processes in one or more of its terms; consequently, its solution is a stochastic process as well. According to Mao (2007, p.2), a stochastic differential equation includes a term that represents random white noise. This white noise is regarded as the derivative of Brownian motion, even though several studies have
shown that Brownian motion is nowhere differentiable. Considerable interest has been generated by the fact that this kind of differential equation has applications outside mathematics. For instance, in Chemistry, the reactions occurring between molecules can be modelled by the stochastic differential equation

dX(t) = f(Xn)dt + C(Xn)dBm(t),

where Xn = (X1, ..., Xn)T contains the numbers of molecules of the corresponding species Sn = (S1, ..., Sn)T, Bm(t) = (B1, ..., Bm)T is a vector of m independent Brownian motions and C is an n × m matrix (Allen, 2007, p.168). The term C(Xn)dBm(t) arises from the stochastic modelling of the real conditions of the phenomenon. Another relevant example of stochastic modelling comes from Biology: a stochastic differential equation can model the dynamics of random populations of the same or of different species (Allen, 2007, p.147), taking into account the random variations in the population structure over time. The equation for the dynamics of two interacting populations is

dX(t) = µ(t, X1, X2)dt + D(t, X1, X2)dB2(t),

where X1, X2 are the sizes of the two populations, B2(t) = (B1, B2)T is a two-dimensional Brownian motion and t denotes time. Furthermore, in Finance, the stochastic differential equation for geometric Brownian motion,

dXt = µXtdt + σXtdBt,

describes the stock price in the Black-Scholes option pricing model (Mao, 2007, p.302), (Øksendal, 2003, p.275), (Karatzas and Shreve, 1991, p.371). Finally, it is important to emphasize that some stochastic differential equations can be solved analytically, such as those for geometric Brownian motion and the Ornstein-Uhlenbeck process. Unfortunately, many stochastic differential equations
that appear in applications cannot be solved analytically. Therefore, several numerical methods have been developed over time to enable the approximate solution of almost all stochastic differential equations.

1.2 Aim

The main purpose of this thesis is to study the theory and the particular features of stochastic differential equations and to present computational aspects related to them.

1.3 Objectives

• To analyse the theory behind stochastic integration.
• To explain and illustrate two specific numerical methods for simulating SDEs.
• To present applications of SDEs in the scientific field of Finance.

1.4 Structure of the thesis

The thesis is divided into three main chapters. In Chapter 2 we provide a review of Probability, Brownian Motion and Stochastic Integration: we give the mathematical definition of a stochastic process and then explain in detail the main characteristics of Brownian motion. These notions lead to the development of both the Itô integral and Itô's Lemma. In Chapter 3 we present Itô's Lemma as a method for solving SDEs, with examples, and then give a numerical and graphical analysis of two well-known numerical approximations, the Euler-Maruyama and Milstein methods. In Chapter 4 we apply SDEs in several domains of Finance; in particular, we interpret the use of SDEs in the Black-Scholes model of option pricing. Finally, Chapter 5 presents the overall conclusions of this thesis, together with possible directions for future research.
2 Review of Probability, Brownian Motion and Stochastic Integration

2.1 Stochastic Processes

In this subsection we present some fundamental definitions and properties that are needed to define stochastic processes. These notions are directly related to the general theme of this dissertation and are presented so that the thesis has a proper flow.

Definition 2.1.1. (Mikosch, 1998, p.62). Let Ω be a sample space. A family F of subsets of Ω is called a σ-field if it satisfies the following properties:

(i) Ω ∈ F and ∅ ∈ F.

(ii) If A ∈ F, then Ac ∈ F, where Ac is the complement of A.

(iii) If A1, A2, A3, . . . ∈ F, then ∪_{n=1}^∞ An ∈ F and ∩_{n=1}^∞ An ∈ F.

Example 2.1.2. Some examples to illustrate the above definition and its properties are now given.

1. Let Ω = {1, 2, 3}. Then the collection

F = {Ω, ∅, {2}, {1, 3}}   (2.1)

is a σ-field, since it satisfies Definition 2.1.1:

(i) Ω, ∅ ∈ F;

(ii) the complement of each set belongs to F: indeed, {1, 3} is the complement of {2};

(iii) {2} ∪ {1, 3} = Ω ∈ F and {2} ∩ {1, 3} = ∅ ∈ F.
  • 10. 2. Let assume Ω = {1, 2, 3, 4} Then, the partition: F = {Ω, ∅, {1}, {2}, {1, 3, 4}} (2.2) is not a σ-field since it doesn’t satisfy the properties of definition 2.1.1 as we can see below: (i) Ω, ∅ ∈ F (ii) the complement set of {2} exists in F. Indeed, the {1, 3, 4} is the complement set of {2}. However, the complement set of {1}, which is {2, 3, 4}, doesn’t exist in F. (iii) {1} ∪ {2} = {1, 2} which there isn’t in the partition F. 3. Let assume A1, A2, A3 are subsets of Ω such that: Ai ∩ Aj = ∅, i = j and A1 ∪ A2 ∪ A3 = Ω The collection of: F = {∅, A1, A2, A3, A1 ∩ A2 Ac 3 , A2 ∩ A3 Ac 1 , A1 ∩ A3 Ac 2 , Ω} is a σ-field because it satisfies the above three properties from definition 2.1.1. Namely, the first property is satisfied because ∅ ∈ F and Ω ∈ F. Moreover, as we can see from the elements of F, it contains Ac 1 Ac 2 Ac 3 and therefore the property (ii) of definition is fulfilled. Finally, the property (iii) is also satisfied because the state space Ω is equal to A1 ∪ A2 ∪ A3 and Ai ∩ Aj = ∅ for i = j. This particular σ(F) is the smallest σ -field that include the set of A = {A1, A2, A3}. Definition 2.1.3. (Mao, 2007, p.3). Let assume the set Ω and a σ-field F which are composed by subsets in Ω. The pair of (Ω, F) is called a measurable space. Example 2.1.4. Providing the following examples it is helpful to comprehend the defini- tion 2.1.3.: 7
  • 11. 1. Assume the set Ω = {1, 2, 3} and a σ-field F1 = {Ω, ∅, {2}, {1, 3}}. The pair of (Ω, F1) is a measurable space. 2. Let us consider the same set of Ω and a σ-field F2 = {Ω, ∅, {1}, {2, 3}}. The pair of (Ω, F2) is also a measurable space. Definition 2.1.5. (Mao, 2007, p.4). Let assume the set Ω, a σ-field F which are com- posed by subsets of Ω and a probability measure P. The trinity (Ω, F, P) is called proba- bility space. Example 2.1.6. (i)Let Ω = {1, 2, 3} and a σ-field F1 = {Ω, ∅, {1}, {2}, {3}, {1, 2}{1, 3}{2, 3}}. The pair of (Ω, F1) is a measurable space. (ii)Let us consider the same set of Ω and a σ-field F2 = {Ω, ∅, {1}, {2, 3}}. Let define the probability measure as below: P(∅) = 0,P(Ω) = 1, P({1}) = 1 3 , P({2}) = 1 3 , P({3}) = 1 3 P({1, 3}) = 2 3 , P({2, 3}) = 2 3 . The trinity (Ω, F, P) is a probability space. Afterwards, we are ready to give a definition of a stochastic process. Definition 2.1.7. A stochastic process is defined as a parametrized collection of ran- dom variables {Xt}t∈T which are defined on a probability space (Ω, F, P). It can be im- printed as: (Xt, t ∈ T) = (Xt(ω), t ∈ T, ω ∈ Ω). Therefore, it is remarkable that a stochastic process has two parameters t and ω. If the parameter of time t is fixed then we are going to receive the random variable Xt(ω), ω ∈ Ω. On the other hand, if the parameter of the outcome ω is fixed then we have a function of X(·, ω) which t ∈ T. This latter function is named as a path of Xt. The study of Mikosch(1998, p.23), Karatzas and Shreve(1991, p.1) and Friedman(1975, p.3) enables us to describe clearly what a stochastic process is. It is obvious that the stochastic process is a phenomenon that changes in relation to time or space and this can 8
  • 12. be investigated with the aid of the Probability Theory. For example, this process can describe the mortality of population in a particular area, taking into consideration some historical events of this current group of people. Furthermore, through the stochastic processes we can study and describe a customer service rate depending on the number of employees and the time it takes each of them to serve a customer. Definition 2.1.8. (Mikosch, 1998, p.77). Let Ft be a family of σ-fields on Ω. We call it filtration if: Fs ⊂ Ft for all 0 s t. From the above relation it follows that the σ-field of Ft can be considered as the informa- tion that is available up to the time t. Generally, a filtration is an increase sequence of information as the time goes on. Definition 2.1.9. (Mikosch, 1998, p.77). Let a stochastic process X = {Xt, t ∈ T}. It is said to be adapted to the filtration (Ft, t 0) if the σ- field of Xt is a subset to Ft, ∀ t 0. Generally, this means that the information which refers on the stochastic variable Xt, up to the time t, is included on σ-field Ft. Subsequently, it is important to have a brief reference of a very significant example of stochastic process, the martingale process. Definition 2.1.10. (Ikeda and Watanabe, 1989, p.25), (Mikosch, 1998, p.80). A stochas- tic process X = (Xt, t 0) is called a martingale process with respect to the filtration (Ft, t 0), if: (i) E | Xt |< ∞ for each t 0 (ii) X is adapted to the filtration (Ft)t 0. (iii) E(Xt | Fs) = Xs for every t, s ∈ T such that s < t. Hence, it is remarkable that Xs is the best prediction for (Xt | Fs) . If we substitute (iii) by E(Xt | Fs) Xs, then 9
  • 13. the sequence is called submartingale, and if E(Xt | Fs) Xs, then the sequence is named supermartingale. Example 2.1.11. In Mikosch(1998, p.81, example 1.5.3) it is seen a brief answer of this example. Therefore, let’s give a more detailed solution in order to assimilate the definition 2.1.10 for a discrete time. Let check if a random path S = (Sn)n 0 of independent random variables with finite expectations is a martingale with respect to the filtration (Fn, n 0). We define: S0 = 0, Sn = X0 + X1 + . . . + Xn and Fn = σ(X0, X1, X2, . . . , Xn) for n 0. The property (i) from the definition 2.1.10 is satisfied since Sn is a sum of variables with finite expectations. Hence E | Sn |< ∞. Furthermore, it is noticeable from the definition 2.1.9 that the process Sn is adaptable to Fn since Fn = σ(X0, X1, X2, . . . , Xn) for n 0 σ(Sn) ⊂ Fn and so the property (ii) is also fulfilled. In addition, regarding the property (iii) we have that: E(Sn+1 | Fn) = E(Sn + Xn+1 | Fn) = E(Sn | Fn) + E(Xn+1 | Fn independence ) = Sn + E(Xn+1) = Sn. In the first equality we use the definition of (Sn)n 0 and the second equality is demon- strated from the linearity of the expectation. For the third equality and in accordance of Milkosh(1998, p.71), we apply the rule that if the σ- field σ(Sn) is contained in the filtra- tion F, then E(Sn | Fn) = Sn. Also, in this step we apply the formula of the conditional probability and due to independence between the two elements, we conclude to Xn+1. Fi- nally, on the fourth equality we use that E(Xn) = 0 so E(Xn+1) = 0. Hence, the process S is a martingale. 10
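To make the martingale property tangible, the short R sketch below (illustrative only, not part of the thesis code in Appendix B; the step distribution, the number of paths and the time points are arbitrary choices) simulates many random-walk paths with mean-zero steps and checks empirically that the conditional mean of Sn given the value of Sm, m < n, is approximately Sm itself.

set.seed(1)                     # for reproducibility
M <- 50000                      # number of simulated paths (arbitrary choice)
n <- 20; m <- 10                # two time points with m < n
steps <- matrix(sample(c(-1, 1), M * n, replace = TRUE), nrow = M)  # i.i.d. mean-zero steps
S <- t(apply(steps, 1, cumsum)) # S[i, k] = value of the k-th partial sum on path i

# Group the paths by their value at time m and average their value at time n.
# For a martingale the conditional mean E(S_n | S_m = s) should be close to s.
cond_means <- tapply(S[, n], S[, m], mean)
print(round(cbind(S_m = as.numeric(names(cond_means)), mean_S_n = cond_means), 2))

Each printed row pairs a value of Sm with the average of Sn over the paths attaining that value; the two columns should agree up to Monte Carlo error.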
2.2 Brownian Motion (or Wiener process)

This subsection presents the concept of Brownian Motion (equivalently, the Wiener process), its properties and their proofs. Brownian Motion is one of the most significant stochastic processes in Mathematics. Its name derives from the botanist Robert Brown, who in 1827 observed under a microscope the continuous and irregular movement of particles suspended in water. The Brownian process plays an important role in the theory of stochastic differential equations and constitutes one of the cornerstones of mathematical finance and beyond.

Figure 1: Two sample paths of standard Brownian motion on the time interval [0, 1] [see code B.1.1].

Definition 2.2.1. Brownian Motion (or Wiener Process) (Friedman, 1975, p.36), (Mikosch, 1998, p.33). Assume a stochastic process B = (Bt, t ∈ [0, ∞)). It is called a Brownian motion (or a Wiener process) if it satisfies the following conditions:

(i) B0 = x, i.e. the process starts from the point x.

(ii) It has independent increments: for every sequence with 0 ≤ t1 < t2 < . . . < tn, the increments Btn − Btn−1, . . . , Bt2 − Bt1 are independent random variables.
  • 15. (iii) the increments Bt+h −Bt are normally distributed with expectation zero and variance h.i.e. Bt+h − Bt ∼ N(0, h) (iv) Its sample paths are continuous. Remark 2.2.2. (i) From Embrechts and Maejima(2000 p.6) and Mikosch(1998, p.33) it can be seen that Brownian motion is closely linked to normal distribution. (ii) Through the review of literature it is remarkable the fact that many definitions for the Brownian motion assume that the process has 0 as a starting point. Specifically, the condition (i) of the definition 2.2.1 is changed to B0 = 0, i.e. x = 0 and this case is defined as a standard Brownian motion. Hence, the figure 1 (p. 7) shows sample paths of standard Brownian motion. In this study it is determined any point x to be a possible starting point of Brownian motion. (Mikosch, 1998, p.33), (Mao, 2007, p.15 ), (Friedman, 1975, p.36). (iii) The condition (iv) of definition 2.2.1 can be found in many sources either as part of the definition of Brownian motion, or as a property which follows from conditions (i) − (iii). For instance on Mikosch(1998, p.33) it is written as a condition of the definition. However, it is noticeable in the lecture presentation slides (Chapter 5, p. 8/35) of Professor Philip O’Neill for the module of Stochastic Models that this condition presents as a property of the Brownian motion. Furthermore, in Karatzas and Shreve(1991, p.53-55) by using at first the Kolmogorov-Centsov Theorem and then by considering the conditions (i)−(iii) of definition 2.2.1 to be the prerequisites of the continuity, the desideratum is proved. Corollary 2.2.3. Properties of Brownian Motion: (i) 1. E[Bt | B0 = x] = x 2. V ar[Bt | B0 = x] = t (ii) Sample paths of the Brownian motion are NOT differentiable. 12
  • 16. Proof. (i) From the condition (iii) we know that the probability density function is f(x) = 1√ 2πt e −x2 2t and the expectation of a general random variable g(Bt), where g is a given function, is: E[g(Bt)] = 1 √ 2πt ∞ −∞ g(x)e −x2 2t dx. Using the above formula, it follows that: E[Bt | B0 = x] = E[x + Bt] = 1 √ 2πt ∞ −∞ (x + y)e −y2 2t dy . Therefore: E[(x + Bt)] = 1 √ 2πt ∞ −∞ (x + y)e −y2 2t dy = 1 √ 2πt ∞ −∞ (xe −y2 2t + ye −y2 2t )dy = 1 √ 2πt ∞ −∞ (xe −y2 2t )dy + 1 √ 2πt ∞ −∞ (ye −y2 2t )dy = x √ 2πt ∞ −∞ (e −y2 2t )dy + 1 √ 2πt −te −y2 2t ∞ −∞ = x √ 2πt ∞ −∞ (e −y2 2t )dy + 1 √ 2πt     lim y→∞ (−te −y2 2t ) 0 − lim y→−∞ (−te −y2 2t ) 0     Set z = y √ 2t and dy = dz √ 2t. = x √ 2t √ 2πt ∞ −∞ e−z2 dz √ π = x √ 2t √ π √ 2πt = x, as required. 13
  • 17. Whilst for the variance we have: V ar[(x + Bt)] = 1 √ 2πt ∞ −∞ (x + y)2 e −y2 2t dy = x2 √ 2πt ∞ −∞ e −y2 2t dy + 2x √ 2πt ∞ −∞ (ye −y2 2t )dy + 1 √ 2πt ∞ −∞ y2 e −y2 2t dy = 1 √ 2πt y(−2te −y2 2t ) ∞ −∞ + 1 √ 2πt y(−te −y2 2t ) ∞ −∞ + 1 √ 2πt ∞ −∞ y(−te −y2 2t ) dy = 0 + 0 − 1 √ 2πt ∞ −∞ −te −y2 2t dy Set z = y √ 2t and dy = dz √ 2t. = t √ 2t √ 2πt ∞ −∞ e−z2 dz √ π = t, as required. (ii) The basic consideration for this property is to show that Brownian motion is H- self similar process defined in definition 2.2.5 below. That is the most important point of the whole proof because we will subsequently show that any H-self similar process is nowhere differentiable with probability 1. This particular syllogism can lead us to our initial purpose. Namely, to show that Brownian motion is nowhere differentiable. First of all, it is necessary to refer some helpful definitions. Definition 2.2.4. (Mikosch, 1998, p.30). Let a stochastic process X = (Xt, t ∈ T) and T be a subset of R. The process X has stationary increments if: Xt − Xs d = Xt+h − Xs+h ∀t, s ∈ T and t + h, s + h ∈ T. Definition 2.2.5. [Embrechts and Maejima, 2000, p.3] Assuming a stochastic pro- cess X = (Xt, t = 0, 1, 2, 3, . . .). It is called to be ”H-self-similar” if ∀a > 0, ∃H > 0 such that: {Xat} d = {aH Xt} In plain words, the term of ”self-similarity” can be described as the ability of a graph which if you choose any part of it, in any time length interval, you can earn the similar image as the initial one. It is important to note that you can’t get the 14
  • 18. same image as the original. The following proposition is stated without proof in Mikosch(1998, p.36). A proof of this result is given below. Proposition 2.2.6. (Mikosch, 1998, p.36). Let {Bt, t 0} be a standard Brownian Motion. Then {Bt} is 1 2 -self-similar process. Proof. According to the proof of Embrechts and Maejima [2000, p. 5] it is obvious that the proof which is given has brief description. As a result, in this thesis it is provided a detailed proof to show that {Bt} is 1 2 -self-similar process. From definition 2.2.4, we consider the relation below: {Bat} d = {a 1 2 Bt} ∀a > 0. However, it would be simpler to be proven that: {a−1 2 Bat} d = {Bt} ∀a > 0. Hence, we have to examine if the above relation can satisfy the conditions of the definition 2.2.1. The condition (i) (of definition 2.2.1) is shown as below: For t = 0, {a−1 2 B0} B0=0 = 0. Moreover, to proof the condition (ii) (of definition 2.2.1), it is enough to show that for all 0 t1 t2 . . . tk the random variables: a−1 2 Bat1 − a−1 2 Bat0 , a−1 2 Bat2 − a−1 2 Bat1 , . . . , a−1 2 Batk − a−1 2 Batk−1 have correlation equals to zero. Hence, when ti < tj: E[(a−1 2 Bati − a−1 2 Bati−1 )(a−1 2 Batj − a−1 2 Batj−1 )] = = E[a−1 Bati Batj − a−1 Bati Batj−1 − a−1 Bati−1 Batj + a−1 Bati−1 Batj−1 ] = a−1 [ati − ati − ati−1 + ati−1] = 0 15
  • 19. and we conclude to the deduction that a−1 2 Bt has independent increments. Regards the condition (iii) (of definition 2.2.1) we proceed with the following methods: E[a−1 2 Bat] = a−1 2 E[Bat] = 0 E[(a−1 2 Bat)2 ] = a−1 at = t Finally, the condition (iv) follows from the condition (i) to (iii). Consequently {a−1 2 Bat} d = {Bt} as required. Figure 2: Self-similarity of Brownian motion (Mikosch, 1998, p.37). 16
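The 1/2-self-similarity just proved can also be checked by simulation. The following R sketch (illustrative only; the number of paths, the scaling factor a and the time t are arbitrary choices) builds Brownian paths on [0, at] from independent increments and compares the empirical variance of B_at with a times the empirical variance of B_t, as the scaling property requires.

set.seed(2)
M <- 5000; a <- 4; t <- 0.5                # number of paths, scaling factor, fixed time (arbitrary)
n <- 1000; dt <- a * t / n                 # fine grid on [0, a*t]
incr <- matrix(rnorm(M * n, sd = sqrt(dt)), nrow = M)   # Brownian increments, one row per path
B_at <- rowSums(incr)                      # B_{a t} for each path
B_t  <- rowSums(incr[, 1:(n / a)])         # B_t for each path (first quarter of the horizon)
# 1/2-self-similarity: B_{a t} should have the same distribution as sqrt(a) * B_t
c(var_B_at = var(B_at), a_times_var_B_t = a * var(B_t))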
  • 20. Proposition 2.2.7. (Mikosch, 1998, p.188). Suppose (Xt) is H-self-similar with stationary increments for some H ∈ (0, 1). Thus, ∀ fixed t0 it follows: lim sup t→t0 | Xt − Xt0 | t − t0 = ∞, namely, H-self-similar processes which are composed by sample paths are non-differentiable with probability 1. Proof. This proof is sketched by Mikosch(1998, p.189) but this thesis provides more detailed explanation of it. At first, we want to consider that: | Xt − Xt0 | t − t0 d = (t − t0)H X1 t − t0 d = (t − t0)H−1 X1. Now, by assuming that t0 = 0 it follows from the above quantity that: (t − t0)H−1 X1 d = (t)H−1 X1 d = Xt t . Let’s define the sequence tn such that limn→∞ tn = 0 and the set of: A(tn) = sup 0 s tn | Xs s |> x Therefore, from the claim that tn → 0 we have: A(tn+1) ⊂ A(tn) and hence: P{ lim n→∞ A(tn)} = lim n→∞ P{A(tn)} lim n→∞ P{A(tn+1)} = lim n→∞ P sup 0 s tn+1 | Xs s |> x = lim sup n→∞ P{| Xtn tn |> x} = lim sup n→∞ P{ tH n tn | X1 |> x} = lim sup n→∞ P{tH−1 n | X1 |> x} = lim sup n→∞ P{| X1 |> t1−H n x} = 1 since n → ∞. 17
  • 21. Thus, from the proportions 2.2.6 and 2.2.7 it follows that a Brownian sample path is nowhere differentiable. Therefore, we have just completed the proves of properties which were referred on corollary 2.2.3. Remark 2.2.8. As can be seen from Figure 1, which illustrates two random sample paths of Brownian motion, it is noticeable that there is no regularity in the trend of the paths. Therefore, the non-differentiability of Brownian motion is distinguishable by its graphical representation. Theorem 2.2.9. (Embrecht and Maejima, 2000, p.4). Let a stochastic process {Xt} be H-self-similar and we assume E[X2 (1)] < ∞. Then E[XtXs] = 1 2 {t2H + s2H − |t − s|2H }E[X2 1 ] . Proof. E[X2 t ] = V ar[Xt] = V ar[tH X1] = t2H V ar[X1] = t2H E[X2 1 ] E[X2 s ] = V ar[Xs] = V ar[sH X1] = s2H V ar[X1] = s2H E[X2 1 ] E[(Xt − Xs)2 ] = V ar[(Xt − Xs)] = V ar[X1(| t − s |)H ] =| t − s |2H E[X2 1 ] Therefore, E[XtXs] = 1 2 {E[X2 t ] + E[X2 s ] − E[(Xt − Xs)2 ]} = 1 2 t2H + s2H − |t − s|2H E[X2 1 ] Theorem 2.2.10. E[BtBs] = min {t, s} 18
  • 22. Proof. As we can see in Embrecht and Maejima(2000, p.5) the proof is based on the fact that standard Brownian motion is 1 2 -self-similar process. Therefore, it can be used the formula of the theorem 2.2.9. as below: E[BtBs] = 1 2 t2H + s2H − |t − s|2H ≡ min {t, s} . Thus it helps us to acquire the desired result. However, it can be provided in this thesis a second method in order to prove E[BtBs] = min {t, s}. Let assume 0 < t < s. Then, Bs = Bs + Bt − Bt BtBs = BtBs + B2 t − B2 t BtBs = B2 t + Bt[Bs − Bt] (linearity of expectation) E[BtBs] = E[B2 t ] + E[Bt[Bs − Bt]] (by def.2.2.1, cond. (ii) and (iii)) E[BtBs] = t + 0 = t Now, let assume 0 < s < t. Then, Bt = Bt + Bs − Bs BtBs = BtBs + B2 s − B2 s BtBs = B2 s + Bs[Bt − Bs] (linearity of expectation) E[BtBs] = E[B2 s ] + E[Bs[Bt − Bs]] (by def.2.2.1, cond. (ii) and (iii)) E[BtBs] = s + 0 = s Therefore, it is proved that E[BtBs] = min {t, s}. Example 2.2.11. Find E[(Bt+w − Bt)2 ] for t, w > 0. E[(Bt+w − Bt)2 ] = E[B2 t+w − 2BtBt+w + B2 t ] (linearity of expectation) = E[B2 t+w] − 2E[BtBt+w] + E[B2 t ] (by def.2.2.1, cond. (ii)(iii) and theorem 2.2.10) = t + w − 2t + t = w. 19
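A quick Monte Carlo check of the last two results, Theorem 2.2.10 and Example 2.2.11, can be done along the following lines (an illustrative R sketch with arbitrary sample size and time points, not part of the thesis code in Appendix B).

set.seed(3)
M <- 200000; s <- 0.4; t <- 1.0; w <- 0.3    # arbitrary sample size and time points (s < t)
B_s  <- rnorm(M, sd = sqrt(s))               # B_s ~ N(0, s)
B_t  <- B_s + rnorm(M, sd = sqrt(t - s))     # add an independent increment to reach time t
B_tw <- B_t + rnorm(M, sd = sqrt(w))         # and another one to reach time t + w
c(mean_BtBs = mean(B_t * B_s), min_ts = min(t, s),          # Theorem 2.2.10
  mean_sq_increment = mean((B_tw - B_t)^2), w = w)          # Example 2.2.11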
Example 2.2.11 shows that E[(dBt)²] = dt, where dBt = Bt+dt − Bt (with w playing the role of dt). Specifically, the expectation of (dBt)² equals the difference between t + dt and t. Obviously, Var[Bt+w − Bt] = E[(Bt+w − Bt)²], since E[Bt+w − Bt] = 0.

2.3 Brief mention to the Stieltjes integral

In this subsection we briefly mention the concept of the Stieltjes integral and explain why this concept is not valid as a definition of a stochastic integral. We start with the definition of the Stieltjes integral of a step function.

Definition 2.3.1. (Dragomir, 2001, p.42). Consider a step function

h = h0 1{0} + Σ_{i=1}^{n} hi 1(ti, ti+1],

where {ti} is a partition of the interval [0, t]. Let m be a real-valued function on R+. The Stieltjes integral of h with respect to m is defined by

∫_0^t h dm := Σ_{i=1}^{n} hi [m(ti+1) − m(ti)].

A continuous function h can be approximated by a sequence hn of step functions such that hn → h. From this sequence we obtain a sequence of Stieltjes integrals ∫_0^t hn dm, each of which is defined by Definition 2.3.1. The limit, as n → ∞, exists and, in fact, if hn and h̃n are two different sequences of step functions approximating the same function h, then the limits of ∫_0^t hn dm and ∫_0^t h̃n dm coincide. Consequently:

Definition 2.3.2. Let h be a continuous function and hn a sequence of step functions approximating h. The Stieltjes integral of h with respect to a function m is defined as

∫_0^t h dm := lim_{n→∞} ∫_0^t hn dm.

The function m is called the integrator.
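As a numerical hint of the obstacle discussed in what follows, the R sketch below (illustrative only; the horizon and the grid sizes are arbitrary) evaluates, for one simulated Brownian path on [0, 1], the sums Σ|B_{t_{i+1}} − B_{t_i}| and Σ(B_{t_{i+1}} − B_{t_i})² over finer and finer partitions: the first keeps growing, while the second settles near t = 1.

set.seed(4)
n_max <- 2^18                                   # finest grid used (arbitrary)
dB <- rnorm(n_max, sd = sqrt(1 / n_max))        # increments of one Brownian path on [0, 1]
B <- c(0, cumsum(dB))                           # the path sampled on the finest grid

for (n in c(2^8, 2^12, 2^16, 2^18)) {           # coarser to finer partitions of [0, 1]
  idx <- seq(1, n_max + 1, by = n_max / n)      # indices of the sub-partition
  incr <- diff(B[idx])
  cat(sprintf("n = %6d  total variation = %8.2f  quadratic variation = %.4f\n",
              n, sum(abs(incr)), sum(incr^2)))
}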
The following theorem shows the significance of the total variation of the integrator in the Stieltjes integral. The proof is omitted since it can be found in many references, e.g. Dragomir (2001).

Theorem 2.3.3. Suppose m has locally bounded variation and h is a continuous function. Then the Stieltjes integral ∫_0^t h dm exists and satisfies

| ∫_0^t h dm | ≤ ∫_0^t |h| |dm| ≤ sup_{0 ≤ s ≤ t} |h(s)| ∫_0^t |dm|,

where ∫_0^t |dm| := sup{ Σ_{i=1}^{n} |m(ti+1) − m(ti)| }, {ti} is a partition of [0, t], and the supremum is taken over all partitions.

The above theorem shows that if an integrator does not have finite variation, then the Stieltjes integral is difficult to define. Consider now that the functions h and m are stochastic processes, i.e. h = h(t, ω) and m = m(t, ω), and set m(t, ω) = Bt, so that the integrator is a Brownian motion. Theorem 2.3.3 highlights the main reason why a Stieltjes integral cannot exist with a Brownian motion as integrator: a Brownian motion has unbounded variation. Consequently, the Stieltjes integral cannot be defined in terms of a Brownian motion, since the integrator must have bounded variation; in other words, the Stieltjes integral cannot be used to define stochastic integrals. The key step in solving this problem is to take the limit as n → ∞ in L². This change in the notion of the limit is the vital difference between the Itô integral and the Stieltjes integral.

2.4 Itô's integral

As we have seen in a previous subsection, Brownian motion is nowhere differentiable. However, integration can still be applied to it. Hence, in this subsection we turn to the integration of a stochastic process with
  • 25. Brownian motion as an integrator. During the second world war, the Japanese math- ematician Kyoshi Itˆo indicated the way to define this kind of integral and thus, it was called the Itˆo’s stochastic integral. Today, this kind of integrations have applications in many scientific fields, such as in Finance, Physics etc. Furthermore, on this subsection we will see that the variation of Brownian motion is the main idea regarding the definition of the Itˆo’s integral. Initially, we define the stochastic integral for stochastic step functions and then, for ”suitable” stochastic processes. Definition 2.4.1. A stochastic process f(t), t ∈ [a, b] is called a step function if there is a finite sequence of numbers a = t0 < t1 < . . . < tn = b and a finite sequence of random variables f0, f1, f2, . . . , fn−1 such that: f(t) = fj if t ∈ (tj, tj+1], j = 1, 2, . . . , n − 1 Moreover, this kind of stochastic process f(t) can be written in form: f(t) = n−1 j=0 fj1(tj,tj+1](t) = f01[t0,t1](t) + n−1 j=1 fj1(tj,tj+1](t) (2.3) where, 1(tj,tj+1](t) =    0, if t ∈ (tj, tj+1] 1, if t ∈ (tj, tj+1]. From now on, we will denote the set of step functions on [a, b] as Mstep([a, b]). From the above definition it is important to be mentioned the fact that the indicator function is the reason of the appearance of steps in the graph of a step function. 2.4.1 The Itˆo’s integral for a step function Then, it is defined the Itˆo’s stochastic integral for a step function. Definition 2.4.2. The Itˆo’s integral for a step function Let a stochastic step func- tion f of the form (2.3). The Itˆo’s integral of f in [a, b] with respect to the Brownian 22
  • 26. motion is defined as below: I(f(t)) = b a f(t)dBt = n−1 j=0 fj(Btj+1 − Btj ). Theorem 2.4.3. The Itˆo’s stochastic integral of a step function has the following prop- erties: (i) The Itˆo’s integral is linear, i.e. if f(t) and g(t) are two step functions then: I(λ1f(t) + λ2g(t)) = λ1I(f(t)) + λ2I(g(t)) (ii) E[I(f(t))] = E b a f(t)dBt = 0 (iii) This property is called ”Itˆo’s isometry”: E[I(f(t))]2 = E b a f(t)dBt 2 = b a E[f(t)]2 dt Proof. (i) In Mao(2007, p.20), Allen(2007, p.70) and Mikosch(1998, p.107) the proof leaves to the reader. However, the current thesis includes this proof as below: I(λ1f(t) + λ2g(t)) = b a (λ1f(t) + λ2g(t))dBt by def. 2.4.2 = n−1 j=0 (λ1fj + λ2gj)(Btj+1 − Btj ) = n−1 j=0 λ1fj(Btj+1 − Btj ) + n−1 j=0 λ2gj(Btj+1 − Btj ) = λ1 n−1 j=0 fj(Btj+1 − Btj ) + λ2 n−1 j=0 gj(Btj+1 − Btj ) = λ1 b a f(t)dBt + λ2 b a g(t)dBt = λ1I(f(t)) + λ2I(g(t)) 23
  • 27. (ii) By the definition 2.4.2: E[I(f(t))] = E b a f(t)dBt = E n−1 i=0 fi(Bti+1 − Bti ) = n−1 i=0 E fi(Bti+1 − Bti ) * = n−1 i=0 E[fi]E[Bti+1 − Bti ] by def. 2.2.1 = 0. explanetion of (*): As we know, the variation Bti+1 − Bti of Brownian motion is independent of Fti . Also, due to the fact that the random variable fi is Fti − measurable, we conclude that fi and Bti+1 − Bti are independent. (iii) According to some references, this proof is presented without detailed explanation. Therefore this thesis provides a detailed step by step proof in order to have better understanding and supervision of the methods that are used. Let assume a step function f(t) of the form (2.3). Therefore, by the definition 2.4.2, this function has the following stochastic integral: I(f(t)) = n−1 j=0 fj(Btj+1 − Btj ) Then, it is able to be calculated the expectation of the square of the above quantity. Indeed, by separating the diagonal and non-diagonal terms of the sum we get: | I(f(t)) |2 = n−1 j=0 f2 j (Btj+1 − Btj )2 + 2 k<j fj(Btj+1 − Btj )fk(Btk+1 − Btk ) The variation of the Brownian motion (Btj+1 − Btj ) is independent of anything that happened before of the time tj. Since the random variable fj is Ftj measurable, the random variables (Btj+1 − Btj ) and fj are independent. In addition, any function of these variables have the same condition. Hence: n−1 j=0 E[f2 j (Btj+1 − Btj )2 ] = n−1 j=0 E[f2 j ]E[(Btj+1 − Btj )2 ] = n−1 j=0 E[f2 j ](tj+1 − tj) 24
  • 28. since E[(Btj+1 − Btj )2 ] = (tj+1 − tj) (similar case and steps as the example 2.2.11). Moreover, the random variables tk, Bt+1 −Btk , tj, Bj+1 −Btj are mutually indepen- dent since j < k. Therefore: E[fj(Btj+1 − Btj )fk(Btk+1 − Btk )] = 0 since E[Btj+1 − Btj ] = 0 by the conditions of the Brownian motion. By applying the above steps it follows that: E[I(f(t))]2 = n−1 j=0 E[f2 j ](tj+1 − tj) Afterwords, we proceed to the following step: | f(t) |2 = n−1 j=0 f2 j 1[tj,tj+1)(t) + k<j fkfj1[tj,tj+1)(t)1[tk,tk+1)(t) = n−1 j=0 f2 j 1[tj,tj+1)(t) since 1[tj,tj+1)(t)1[tk,tk+1)(t) = 0 for k < j. Then, following the previous steps we have that: E b a | f(t) |2 dt (2.3) = E b a | n−1 j=0 fj1[tj,tj+1) |2 dt = n−1 j=0 E[f2 j ] b a 1[tj,tj+1)dt = n−1 j=0 E[f2 j ](tj+1 − tj). Consequently the Itˆo Isometry is proved (Mao, 2007, p.19), (Øksendal, 2003, p.29). Example 2.4.4. Let assume {Bt : t 0} to be a standard Brownian motion i.e.B0 = 0. Calculate the expectation and the standard deviation of ∞ 0 e−t dBt. Firstly, we are going to find the expectation. E[ ∞ 0 e−t dBt] = E[Itˆo] = 0. 25
  • 29. We know for the variance that: V ar[X] = E[X2 ] − E[X]2 . Since E[Itˆo] = 0 then: V ar[ ∞ 0 e−t dBt] = E[( ∞ 0 e−t bBt)2 ] = E[( ∞ 0 e−t dBt)( ∞ 0 e−t dBt)] (Itˆo’s isometry) = E[ ∞ 0 e−2t dt] = 1 −2e2t ∞ 0 = 1 2 Finally the standard deviation is equals to 1√ 2 . Subsequently, it will be defined now a more general class of stochastic processes (not necessarily only for step functions) for which we will be able to define the Itˆo’s stochastic integral. Definition 2.4.5. A stochastic process f(t) belongs to the set M2 ([a, b]) if it is adapted to the filtration Fs = σ(Bs, s t) and also it satisfies the condition: E b a | f(t) |2 dt < ∞ The stochastic processes that belongs to M2 ([a, b]) can be approached by step functions as this conclusion is demonstrated by the theorem below: Theorem 2.4.6. For any f ∈ M2 ([a, b]), there is a step function sequence fstep,n(t), such that: lim n→∞ E b a | f(t) − fstep,n(t) |2 dt = 0 i.e. the stochastic process f is the limit of fstep,n(t). Proof. In this proof we follow the three steps according to Mao(2007, p.20) but with more detailed assumptions. Before starting it is considered necessary to have a short reference to the three steps of this proof and the main idea of using them. This reference will be helpful for the reader to understand better the syllogism of this proof and to be adapted 26
  • 30. to its particularities. The first step shows that any stochastic process which belongs to M2 ([a, b]) can be approached by a sequence of bounded processes in M2 ([a, b]). The second step refers to the fact that any bounded stochastic process in M2 ([a, b]) can be approached by a sequence which consists of bounded and continuous processes in M2 ([a, b]). Consequently, any stochastic process in M2 ([a, b]) can be approached by a sequence of bounded and continuous processes in M2 ([a, b]). Finally, the third step demonstrates that any bounded and continuous processes in M2 ([a, b]) can be approached by a step function sequence. The composition of these three steps concludes that any stochastic process which belongs to M2 ([a, b]) can be approached by a step function sequence. Then, we are ready to cite the three steps analytically. 1st step: Let the stochastic process f(t) ∈ M2 ([a, b]). We define a sequence of stochastic processes which is: φn(t) = [−n ∨ f(t)] ∧ n. The sequence φn(t) is bounded by f(t) and so it belongs to M2 ([a, b]) for every n. Furthermore, f(t) = lim n→∞ φn(t), and by the dominated convergence theorem [see A.1] it follows: E b a lim n→∞ | f(t) − φn(t) |2 dt = 0 2nd step: Assume φ(t) ∈ M2 ([a, b]) to be bounded and we are going to create a sequence ψn(t) as follows: ∀n there is a continuous function ρn : R → R+ such that ρn(t) = 0 for 0 t 1 n and ∞ −∞ ρn(t)dt = 1. Therefore, it feasible that: ψn(t) = b a ρn(s − t)φ(s)dt. (2.4) It is remarkable the fact that ψn(t) is a sequence of stochastic processes because φ(s) is a stochastic sequence too. The integral (2.4) is a Riemann-Stieltjes integral and by its properties and the fact that ψ(t) is bounded we conclude that the sequence ψn(t) is 27
  • 31. consisted by continuous functions and ψ(t) is a bounded sequence. Since ψ(t) ∈ M2 ([a, b]) and from the bounded convergence theorem [see A.2] it follows: E b a lim n→∞ | f(t) − φn(t) |2 dt = 0 3rd step: Finally, if ψ(t) ∈ M2 ([a, b]) is bounded and continuous we can create the step function sequence fstep,n as below: fstep,n(t) = ψ(a)1[a,a+b−a a ](t) + n−1 i=1 ψ a + i b − a a 1(a+i b−a a ,a+(i+1)b−a a ](t) The above sequence is bounded. By applying again the bounded convergence theorem [see A.2] it concludes that: lim n→∞ E b a | ψ(t) − fstep,n(t) |2 dt = 0 By the above three steps and the use of the triangle inequality we have the final result: lim n→∞ E b a | f(t) − fstep,n(t) |2 dt = 0. Example 2.4.7. Let’s wonder which will be the choice of the partition tj. One example of such a choice is given below. Assume the initial interval [0, T]. One possible choice of a partition that belongs to this interval would be: 0 = tn 0 < tn 1 < tn 2 < . . . < tn j < . . . < tn n = T, where j = 0, . . . , n and tn j = jT n . By using this partition it would be possible to choose a sequence of step functions as the possible approach of the stochastic process f(t, ω) to the interval [0, T]. This sequence would be: fstep,n(t) = n−1 j=0 f(tj, ω)1[tj,tj+1)(t). If the stochastic process f(t, ω) is continuous with respect to the variable t, it follows that: lim n→∞ E b a | f(t) − fstep,n(t) |2 dt = 0. Namely, the sequence of step functions fstep,n approaches the stochastic process f. For instance, let f(t, ω) = B2 t . An approximation of the stochastic process f is: fstep,n(t) = n−1 j=0 B2 jT n 1[jT n , (j+1)T n ) (t). 28
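The approximation in Example 2.4.7 can be visualised numerically. The R sketch below (illustrative only; the horizon, the fine simulation grid and the values of n are arbitrary choices) builds, for a single simulated path, the left-endpoint step approximation of f(t, ω) = B_t² on the partition t_j = jT/n and evaluates ∫_0^T |f − f_step,n|² dt, which shrinks as n grows. The theorem itself concerns the expectation of this quantity, so a single path only illustrates the behaviour.

set.seed(5)
T <- 1; n_fine <- 2^14                         # horizon and fine simulation grid (arbitrary)
t <- seq(0, T, length.out = n_fine + 1)
B <- c(0, cumsum(rnorm(n_fine, sd = sqrt(T / n_fine))))   # one Brownian path on the fine grid
f <- B^2                                       # the process f(t, w) = B_t^2 of Example 2.4.7
approx_error <- function(n) {                  # integrated squared error of f_step,n for this path
  j <- floor(t / T * n); j[j == n] <- n - 1    # index j with t in [t_j, t_{j+1}), t_j = j T / n
  f_step <- B[j * (n_fine / n) + 1]^2          # value B_{t_j}^2, frozen on [t_j, t_{j+1})
  sum((f - f_step)^2) * (T / n_fine)           # approximates int_0^T | f - f_step,n |^2 dt
}
sapply(c(4, 16, 64, 256), function(n) c(n = n, error = approx_error(n)))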
  • 32. 2.4.2 C’s stochastic integral As we have seen, any stochastic process f ∈ M2 ([a, b]) can be approached by a sequence of step functions fstep,n. Moreover, due to the fact that the stochastic integral is also defined with respect to a stochastic step function, we are ready to define the Itˆo’s stochastic integral as below: Definition 2.4.8. Let f ∈ M2 ([a, b]). The Itˆo’s integral of stochastic process f with respect to Brownian motion is defined by the following limit: I(f(t)) = b a f(t)dBt := lim n→∞ I(fstep,n(t)) = lim n→∞ b a fstep,n(t)dBt where lim n→∞ E | b a f(t) − fstep,n(t)dBt |2 = 0. As we can see, the I(f(t)) = b a f(t)dBt is a random variable and it is the L2 limit of the sequence of random variables I(fstep,n(t)) = limn→∞ b a fstep,n(t)dBt. i.e. lim n→∞ E | I(f(t)) − I(fstep,n(t)) |2 = 0. The existence of the above definition is ensured by the convergence of I(fstep,n(t)) = b a fstep,n(t)dBt in L2 (Ω, F, P). Hence, we conclude to the following theorem: Theorem 2.4.9. Let f ∈ M2 ([a, b]) and fstep,n(t) a step function sequence which ap- proaches the process f(t). Therefore, the sequence of stochastic integrals I(fstep,n(t)) = b a fstep,n(t)dBt goes to a square integrable random variable.i.e. tends to L2 (Ω, F, P). Proof. Because of the completeness of L2 , it is enough to show that the sequence sn := I(fstep,n(t)) is a Cauchy sequence in L2 . i.e. sn − sm L2 → 0 for n, m → ∞. Indeed: 29
  • 33. sn − sm L2 := E | sn − sm |2 1 2 = E | b a fstep,n(t)dBt − b a fstep,m(t)dBt |2 1 2 (Theorm 2.4.3, (i)) = E | b a fstep,n(t) − fstep,m(t)(t)dBt |2 1 2 (Itˆo’s isometry) = E b a | fstep,n(t) − fstep,m(t) |2 dt 1 2 = E b a | fstep,n(t) − f(t) + f(t) − fstep,m(t) |2 dt 1 2 = fstep,n(t) − f(t) + f(t) − fstep,m(t) M2 fstep,n(t) − f(t) M2 + f(t) − fstep,m(t) M2 → 0 for n, m → ∞ because fstep,n(t) → f(t). For that reason, the limit of I(fstep,n(t)) exists in L2 (Ω, F, P) and it is called I(f(t)). That is followed because of the completeness of L2 (Ω, F, P). Hence, through the definition of the stochastic integral I(f(t)) we achieve to overcome the main issue of the infinite variation of Brownian motion. We exploit the result that its quadratic variation is bounded. Corollary 2.4.10. ∀ f, g ∈ M2 ([0, T]) applied that: E[I(f(t))I(g(t))] = E b a f(t)dBt · b a g(t)dBt = E b a f(t)g(t)dt Proof. We use the following formula: ab = 1 4 (| a + b |)2 − | a − b |)2 (2.5) where a = I(f) and b = I(g). By taking expectations and substituting a and b to (2.5) it follows: E[I(f)I(g)] = 1 4 (E[| I(f)+I(g) |2 ]−E[| I(f)−I(g) |2 ]) = 1 4 (E[| I(f+g) |2 ]−E[| I(f−g) |2 ]) (2.6) 30
  • 34. where it has been used the linearity of Itˆo’s stochastic integral. However from Itˆo’s isometry, E | I(f + g) |2 = E T 0 | f + g |2 dt E | I(f − g) |2 = E T 0 | f − g |2 dt . Thus, E[I(f(t))I(g(t))] = 1 4 E T 0 | f + g |2 dt − E T 0 | f − g |2 dt = 1 4 E T 0 | f + g |2 − | f − g |2 dt = 1 4 E T 0 4fgdt = E T 0 fgdt The above proof is presented in different versions by Karatzas and Shreve(1991, p.180) and Mao(2007, p.27). An example to show the way which a stochastic integral is solved by its definition is now given. This particular example is solved by Øksendal(2003, p.29) who uses the hypothesis B0 = 0. However, in this thesis will be provided a more analytical proof without the above assumption. Example 2.4.11. According to the definition of Itˆo’s stochastic integral prove that: I(Bt) = T 0 BtdBt = 1 2 B2 T − T 2 . The integrable stochastic process is f(t, ω) = Bt. We choose the partition 0 = tn 0 < tn 1 < tn 2 < . . . < tn j < . . . < tn n = T, where j = 0, . . . , n and tn j = jT n and its approximation will be: fstep,n(t) = n j=0 Btn j 1[tn j ,tn j+1)(t). Therefore, we take the following sequence of random variables: I(fstep,n) = n−1 j=0 Btn j (Btn j+1 − Btn j ). 31
  • 35. Its limit will be the stochastic integral that we want to find. Subsequently, we use the following formula: a(b − a) = 1 2 (b2 − a2 ) − 1 2 (b − a)2 substituting a = Btn j , b = Btn j+1 . Then, it follows: Btn j (Btn j+1 − Btn j ) = 1 2 (B2 tn j+1 − B2 tn j ) − 1 2 (Btn j+1 − Btn j )2 . By taking sums from j = 0 to j = n − 1 we have that: I(fstep,n) = 1 2 n−1 j=0 (B2 tn j+1 − B2 tn j ) − 1 2 n−1 j=0 (Btn j+1 − Btn j )2 = 1 2 n−1 j=0 (Btn j+1 − Btn j )(Btn j+1 + Btn j ) − 1 2 n−1 j=0 (Btn j+1 − Btn j )2 = 1 2 (Btn n − Btn 0 )(Btn n + Btn 0 ) − 1 2 n−1 j=0 (Btn j+1 − Btn j )2 = 1 2 B2 T − 1 2 n−1 j=0 (Btn j+1 − Btn j )2 . It is therefore sufficient to estimate the L2 limit of random variable n−1 j=0 (Btn j+1 − Btn j )2 . Suppose that this limit would be 1 2 T. Indeed, E | 1 2 n−1 j=0 (Btn j+1 − Btn j )2 − 1 2 T |2 = = E 1 4 n−1 j=0 (Btn j+1 − Btn j )4 + 1 2 k<j (Btn j+1 − Btn j )2 (Btn k+1 − Btn k )2 − T 2 n−1 j=0 (Btn j+1 − Btn j )2 + T2 4 = 1 4 n−1 j=0 E[(Btn j+1 − Btn j )4 ] + 1 2 k<j E[(Btn j+1 − Btn j )2 ]E[(Btn k+1 − Btn k )2 ] − T 2 n−1 j=0 E[(Btn j+1 − Btn j )2 ] + T2 4 = n3T2 4n2 + T2 4 − T2 4n − nT2 2n + T2 4 = T2 2n → 0. since n → ∞. Therefore, T 0 BtdBt = 1 2 B2 T − T 2 . 32
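The identity just derived lends itself to a simple Monte Carlo check: the R sketch below (illustrative only; the partition size and the number of paths are arbitrary) forms the left-endpoint sums Σ B_{t_j}(B_{t_{j+1}} − B_{t_j}) on the partition t_j = jT/n and compares them with B_T²/2 − T/2.

set.seed(6)
T <- 1; n <- 5000; M <- 2000                 # horizon, partition size, number of paths (arbitrary)
dt <- T / n
err <- replicate(M, {
  dB <- rnorm(n, sd = sqrt(dt))
  B  <- c(0, cumsum(dB))                     # the path on the partition t_j = j T / n
  ito_sum <- sum(B[1:n] * dB)                # left-endpoint sum: sum_j B_{t_j} (B_{t_{j+1}} - B_{t_j})
  ito_sum - (B[n + 1]^2 / 2 - T / 2)         # deviation from the closed form B_T^2 / 2 - T / 2
})
c(mean_abs_deviation = mean(abs(err)), rmse = sqrt(mean(err^2)))

Both summaries should be small and shrink further as n increases, consistent with convergence of the sums in L².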
  • 36. The above example indicates that the Itˆo’s stochastic integral has different proper- ties in comparison with the Riemann-Stieltjes integral which is studied in Real Calculus Analysis. This conclusion seems by: T 0 BtdBt = 1 2 B2 T − T 2 unexpected term The undersigned term is an unexpected term for Riemann-Stieltjes integration. The ap- pearance of this term is a result of the properties and definition of Itˆo’s stochastic integral and to the fact that the integration is carried out on the Brownian motion. Specifi- cally, the non-bounded variation of the Brownian motion does not allow the definition of the Riemann-Stieltjes integral to be applied on it. Consequently, the definition of Itˆo’s stochastic integral is imposed by this particularity of Brownian motion as it was seen in the above example. Obviously, the above method that was used to define a stochastic integral is complicated. As a result, in the next subsection is going to show an other method that makes the definition of the stochastic integral much more easier. According to the properties of stochastic integral we are able to mention the following theorem for a stochastic process It. Theorem 2.4.12. Let assume M2 ([0, T]) be a set where any stochastic process, which belongs to it, must satisfy the condition: E T 0 | f(t) |2 dt < ∞ and it has to be adapted to the filtration Fs = σ(Bs, s t). Suppose to have the stochastic process f ∈ M2 ([0, T]), 0 t Tand: It = t 0 f(s)dBs. The following properties hold: (i) The stochastic process It is a square-integrable continuous martingale. (ii) The quadratic variation of It is given by: < I >t= t 0 | f(s) |2 ds 33
  • 37. Proof. (i) In order to show that the stochastic process It is a square-integrable contin- uous martingale we have to proof: (1) E[| It |] < ∞ (2) E[It | Fs] = Is (3) E[supt∈[0,T] | It |2 ] < ∞ For the point (1) we have that: E[| It |] = E[| t 0 f(s)dBs |] (Mean V alue Property) [E[| t 0 f(s)dBs |]2 ] 1 2 (Itˆo’s Isometry) = [E[ t 0 | f(s) |2 ds]] 1 2 < ∞. since f ∈ M2 ([0, T]) and 0 t T. For the point (2) we can initially notice from the definition of stochastic integral that It is adapted to Ft. Furthermore: E[It | Fs] = E[Is + t s f(u)dBu | Fs] = E[Is | Fs] + E[ t s f(u)dBu | Fs] = Is since the variation It − Is = t s f(u)dBu is independent of the filtration Fs. For the point (3) we know that for all t ∈ [0, T] it follows: E[| It |2 ] = E[| t 0 f(s)dBs |2 ] = E[ t 0 | f(s) |2 ds] < ∞. Moreover, because [0, T] is a close interval we have that E[supt∈[0,T] | It |2 ] < ∞. Therefore, the stochastic process It is a square-integrable martingale regarding the filtration Ft. (ii) In order to proof the second property it is sufficient to show that Mt = I2 t − < I >t 34
  • 38. is a continuous martingale which becomes zero on t = 0. Indeed: E[Mt | Fs] = E[I2 t − t 0 | f(u) |2 du | Fs] = E[(Is + t s f(u)dBu)2 − s 0 | f(u) |2 du − t s | f(u) |2 du | Fs] = I2 s + 2IsE[ t s f(u)dBu | Fs] + E[| t s f(u)dBu |2 | Fs] − s 0 | f(u) |2 du − E[ s 0 | f(u) |2 du | Fs] (Itˆo’s Isometry) = I2 s + 2IsE[ t s f(u)dBu | Fs] − s 0 | f(u) |2 du = I2 s + E[Itˆo] =0 − s 0 | f(u) |2 du = Ms Therefore, Mt is a martingale and by the uniqueness of the quadratic variation we conclude to the desired result. 2.5 The Itˆo’s Lemma As we saw in the previous subsection, the calculation of Itˆo’s stochastic integral can be fulfilled by its definition. However, this method is extremely complicated and is used mainly in particular cases. Therefore, it was necessary to be found some other method which can make easier the calculation of this specific type of integrals. This method is known as the Itˆo’s Lemma. Lemma 2.5.1. The Itˆo Lemma. Assume the function f(x, t) which is twice continu- ously differentiable. Hence: df(B(t), t) = ∂f(B(t), t) ∂B(t) dB(t) + ∂f(B(t), t) ∂t dt + 1 2 ∂2 f(B(t), t) ∂2B(t) dt. (2.7) Proof. It is recall that (dB(t))2 = dt, dtdB(t) = dB(t)dt = 0, (dt)2 = 0 and: dB(t) = B(t + h) − B(t). (2.8) 35
  • 39. Furthermore, let that t → t + h and B(t) → B(t + h). According to the formal Taylor expansion we have that: f [B(t + h), t + h] (2.8) = f [dB(t) + B(t), t + h] = f(B(t), t) + ∂f(B(t), t) ∂B(t) dB(t) + ∂f(B(t), t) ∂t dt + 1 2 ∂2 f(B(t), t) ∂2B(t) (dB(t))2 + 1 2 ∂2 f(B(t), t) ∂2t (dt)2 0 = f(B(t), t) + ∂f(B(t), t) ∂B(t) dB(t) + ∂f(B(t), t) ∂t dt + 1 2 ∂2 f(B(t), t) ∂2B(t) dt. Hence, it follows that: df(B(t), t) (2.8) = f(B(t + h), t + h) − f(B(t), t) = f(B(t), t) + ∂f(B(t), t) ∂B(t) dB(t) + ∂f(B(t), t) ∂t dt + 1 2 ∂2 f(B(t), t) ∂2B(t) dt − f(B(t), t) = ∂f(B(t), t) ∂B(t) dB(t) + ∂f(B(t), t) ∂t dt + 1 2 ∂2 f(B(t), t) ∂2B(t) dt. Thus, the Lemma it is proved. An example is given below in order to illustrate the application of the Itˆo’s lemma. Example 2.5.2. Let {B(t) : t 0} be a Brownian motion. Find E[B(t)2 ]. Assume that f(x, t) = x2 . Then X(t) = f(B(t), t) and by Itˆo’s lemma we have that: dX(t) = df(B(t), t) = ∂f(B(t), t) ∂B(t) dB(t) + ∂f(B(t), t) ∂t + 1 2 ∂2 f(B(t), t) ∂2B(t) dt. Therefore, we have to estimate the partial derivatives as below: ∂f(x, t) ∂x = 2x, ∂f(x, t) ∂t = 0, ∂2 f(x, t) ∂2x = 2. 36
  • 40. Then, we substitute: ∂f(B(t), t) ∂B(t) = 2B(t), ∂f(x, t) ∂t = 0, ∂2 f(B(t), t) ∂2B(t) = 2 and from (2.7) it follows that: dB2 (t) = 2B(t)dB(t) + [0 + 1 2 2]dt = 2B(t)dB(t) + dt (integrate both sides) B2 (t) − B2 (0) =0 = t 0 2B(s)dB(s) + t 0 1ds B2 (t) = t 0 2B(s)dB(s) + t 0 1ds B2 (t) = t 0 2B(s)dB(s) + t. It is noticeable that the integral is an Itˆo’s integral. Then, we take expectations on both sides as follows: E[B2 (t)] = E[ t 0 2B(s)dB(s)] + t = E[Itˆo] (theorem2.4.3,prop.(ii)) +t = 0 + t = t. 3 Stochastic Differential Equations This chapter includes an introduction to the theory of stochastic differential equations and their solutions. The Stochastic differential equations arise in various scientific disciplines of science and technology. They arise whenever the relationship between continuously fluctuating quantities and the rate of change over time and space is known. Or other- wise, when such a relationship can be assumed in order to model and describe natural 37
  • 41. phenomena, technical or physical processes, dynamical systems in biology, economy and elsewhere. Specifically, one of the most important application of Stochastic differential equations is in mathematical finance. 3.1 Basic concepts Initially, we start by the definition of a stochastic differential equation. Definition 3.1.1. A Stochastic differential equation is an equation of the form below: dXt = b(t, Xt)dt + σ(t, Xt)dBt (3.1) or equivalently in integral form: Xt = X0 + t 0 b(s, Xs)ds + t 0 σ(s, Xs)dBs where Bt is m-dimensional Brownian motion and σ : Rn → Rn×m and b : Rn → Rn are measurable real functions and Xt ∈ Rn . We say that a Stochastic differential equation has a solution if there is an Itˆo’s stochas- tic process Xt which satisfies it. Such a kind of solutions are called pathwise solutions. A very relevant concept to the pathwise solution is the strong solution. In the current thesis we are going to follow the terminology of strong solution instead of the terminology of pathwise solution. At the time t = 0, the stochastic process X0 is known. Remark 3.1.2. (i) Stochastic differential equation is a differential equation in which one or more components are stochastic processes. We have to identify (calculate) Xt, which is also a stochastic process. (ii) The integral t 0 b(s, Xs)ds is a Riemann-Stieltjes integral and the integral t 0 σ(s, Xs)dBs is an Itˆo stochastic integral. (iii) The above definition 3.1.1 gives the general form of a vector stochastic differential 38
  • 42. equation. The equation (3.1) can be written in a components form as below:          dX1,t dX2,t ... dXn,t          =          b1(t, Xt) b2(t, Xt) ... bn(t, Xt)          dt+          σ11(t, Xt) σ12(t, Xt) . . . σ1m(t, Xt) σ21(t, Xt) σ22(t, Xt) . . . σ2m(t, Xt) ... ... ... σn1(t, Xt) σn2(t, Xt) . . . σnm(t, Xt)                   dB1,t dB2,t ... dBm,t          where m is the number of Brownian motions which contributes to the equation of (3.1) and n is the dimension of the stochastic process Xt = (X1,t, . . . , Xn,t)T ∈ Rn . If we have the special case of n = 1, it follows a scalar stochastic differential equation for the stochastic process Xt ∈ Rn , which has the following form: dXt = b(t, Xt)dt + σ(t, Xt)dBt where b = (b1, . . . , bn)T , σ = (σik)m i,k=1 and dBt = (dB1,t, . . . dBm,t)T . The bjand σik are real-valued functions. Finally, the simplest form is the form which has n, m = 1. It is a scalar stochastic differential equation and it is driven by a Brownian motion. 3.2 The Itˆo’s Lemma as a solving method of SDEs Many stochastic differential equations can be solved by using the Itˆo’s lemma. Example 3.2.1. Geometric Brownian motion. Consider the linear model: dXt dt = αtXt and let us assume that the coefficient αt is a random variable of the form: αt = µ + σ • Bt where • Bt is denoted to be the derivative of the Brownian motion. This seems paradox since, as has been shown by the subsection 2.2, a Brownian motion is nowhere differ- entiable. But according to Mao(2007, p.2), a white noise • Bt can be considered as the derivative of a Brownian motion Bt and can be written as • Bt= dBt dt ⇒ dt • Bt= dBt. 39
  • 43. Therefore: dXt dt = (µ + σ • Bt)Xt ⇔ dXt = µXtdt + σXtdBt (3.2) and it is obvious that: dXt Xt = µdt + σdBt ⇒ t 0 dXt Xt = t 0 µdt + σdBt = µt + σBt (3.3) Then, we apply the Itˆo’s Lemma in the stochastic process Zt = ln(Xt) in order to find a solution for the above stochastic differential equation. Firstly, it is assumed f(x, t) = lnx. Thus: ∂f(x, t) ∂x = 1 x ∂f(x, t) ∂t = 0 ∂2 f(x, t) ∂2x = − 1 x2 by substituting: ∂f(Xt, t) ∂Xt = 1 Xt ∂f(Xt, t) ∂t = 0 ∂2 f(Xt, t) ∂2Xt = − 1 X2 t and from the Itˆo’s formula (2.7) we have that: d(lnXt) = 1 Xt dXt + 1 2 [0 − 1 X2 t ]dt = 1 Xt dXt − 1 2 1 X2 t dt = 1 Xt dXt − 1 2 1 X2 t (dX(t))2 (from 3.2) = 1 Xt dXt − 1 2 1 X2 t σ2 X2 t dt = dXt Xt − 1 2 σ2 dt 40
  • 44. by integrating: Zt = lnXt = t 0 dXt Xt − t 0 1 2 σ2 dt ⇒ t 0 dXt Xt = lnXt + 1 2 σ2 t Subsequently, we substitute the above result into (3.3) and concludes: Xt = X0e(µ−1 2 σ2 )t+σBt The terms of 1 2 σ2 and σBt are due to the stochasticity. The stochastic process Xt is called geometric Brownian motion and is used as a model for stock prices in the field of finance. For that reason, it is customary for the stochastic process Xt to be denoted as St. Corollary 3.2.2. From the above result it is clear that log(Xt X0 ) follows normal distribution with expectation: E log( Xt X0 ) = E µ − 1 2 σ2 t + σ E [Bt] =0 = µ − 1 2 σ2 t and variance: V ar log( Xt X0 ) = V ar µ − 1 2 σ2 t =0 +σ2 V ar [Bt] = σ2 t Specifically, the logarithm of the stock price ratio is modelled as a normal random variable. Furthermore, that model is based on the observation that the return of a stock (i.e. the quantity dXt Xt ) is equal to a constant µ and a variance of the form V ar(σBt) = σ2 dBt, where Bt is a Brownian motion and σ governs the width of those fluctuations. 41
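Figure 3 below was produced with the thesis code referenced as B.2.1, which is not reproduced here. A minimal R sketch of how such a path could be generated directly from the closed-form solution Xt = X0 exp((µ − σ²/2)t + σBt) is the following; the parameter values are arbitrary.

set.seed(7)
mu <- 0.1; sigma <- 0.3; X0 <- 1                    # drift, volatility, initial value (arbitrary)
T <- 1; n <- 1000; dt <- T / n
t <- seq(0, T, by = dt)
B <- c(0, cumsum(rnorm(n, sd = sqrt(dt))))          # a standard Brownian path on [0, T]
X <- X0 * exp((mu - sigma^2 / 2) * t + sigma * B)   # geometric Brownian motion (closed form)
plot(t, X, type = "l", xlab = "t", ylab = expression(X[t]),
     main = "Sample path of geometric Brownian motion")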
  • 45. Figure 3: Sample path of geometric Brownian motion [see code B.2.1]. Example 3.2.3. The Ornstein-Uhlenbeck process. Consider the stochastic differ- ential equation: dXt = −αXtdt + σdBt with initial condition X0 = x. The solution of the above SDE (i.e a stochastic process) is called the Ornstein-Uhlenbeck process and is used to many applications in Physics and Finance. By applying Itˆo’s lemma to f(x, t) = xeαt and with σ ∈ R we get: df(x, t) = eαt dXt + αxeαt dt because: ∂f(x, t) ∂x = eαt , ∂2 f(x, t) ∂2x = 0, ∂f(x, t) ∂t = xαeαt . Then, df(Xt, t) = eαt (−αXtdt + σdBt) + αXteαt dt = eαt σdBt. By integrating on both sides: f(Xt, t) − f(X0, 0) = σ t 0 eαt dBt ⇒ 42
  • 46. eαt Xt − x = σ t 0 eαt dBt, where f(x0, 0) = X0 = x. Hence, we conclude: Xt = xe−αt + e−αt σ t 0 eαt dBt. Corollary 3.2.4. By using the properties of the stochastic integral it is feasible to define the mean value and the variance of Ornstein-Uhlenbeck process. herefore for the mean value: E[Xt] = E[xe−αt + e−αt σ t 0 eαt dBt] = E[xe−αt ] + E[e−αt σ t 0 eαt dBt] = E[xe−αt ] + e−αt σE[ t 0 eαt dBt] = E[xe−αt ] + E[Itˆo] =0 = xe−αt and the variance: V ar[Xt] = E[Xt − E[Xt]]2 . But, Xt − E[Xt] = xe−αt + e−αt σ t 0 eαt dBt − xe−αt = e−αt σ t 0 eαt dBt. Hence: V ar[Xt] = E[e−αt σ t 0 eαt dBt]2 = e−2αt σ2 E[ t 0 eαt dBt]2 = e−2αt σ2 t 0 E[eαt ]2 dt = e−2αt σ2 t 0 e2αt dt = e−2αt σ2 1 2α (e2αt − 1) = σ2 2α (1 − e−2αt ) This stochastic differential equation can be used as a model for interest rates. If we assume the interest rate r as a function of time, then it is possible to have a stochastic process rt 43
  • 47. since the interest rates are not constant and simultaneously random over time. According to Chan, Karolyi, Longstaff, Sanders(1992, p. 1211) the model of Vasicek which is a stochastic differential equation of the form: dXt = (α + βXt)dt + σdBt is a generalization of the stochastic process of Ornstein - Uhlenbeck. Its solution can be found by the same method that we used above for the equation of Ornstein - Uhlenbeck. Figure 4: Sample path of Ornstein - Uhlenbeck process. Left: X0 = 5, α = 2, σ = 1. Right: X0 = 5, α = −2, σ = −1 [see code B.2.2]. Example 3.2.5. Brownian Bridge. A Brownian Bridge is a continuous -time stochas- tic process which is obtained by considering a standard Brownian motion process in a closed time interval e.g. in [0, 1] as we can see on figure 5. Specifically, it has initial condition X0 = 0 with terminal condition X1 = 0. Hence, it follows graphically a shaped bridge which justifies the naming of the process. Now, it is given below another differential equation which is often recommended in probability theory: dXt = − Xt 1 − t dt + dBt (3.4) The solution of the above SDE is called Brownian Bridge. To solve it we apply the Itˆo’s Lemma in the function f(x, t) = g(t)x where g(t) is a C1 function (i.e. its derivative 44
\[ df(X_t,t) = X_t\,g'(t)\,dt + g(t)\,dX_t, \tag{3.5} \]
because
\[ \frac{\partial f}{\partial x} = g(t), \qquad \frac{\partial^2 f}{\partial x^2} = 0, \qquad \frac{\partial f}{\partial t} = g'(t)x. \]
Substituting (3.4) into (3.5) we obtain
\[ df(X_t,t) = \Big(g'(t) - \frac{g(t)}{1-t}\Big)X_t\,dt + g(t)\,dB_t. \]
We choose $g$ such that
\[ g'(t) - \frac{g(t)}{1-t} = 0 \;\Rightarrow\; g'(t) = \frac{g(t)}{1-t}, \]
and integrating (taking the constant of integration to be zero) gives
\[ \ln g(t) = -\ln(1-t) \;\Rightarrow\; g(t) = \frac{1}{1-t}. \]
From the above it follows that
\[ df(X_t,t) = \frac{1}{1-t}\,dB_t, \]
and by integrating,
\[ g(t)X_t - 0 = \int_0^t \frac{dB_s}{1-s} \;\Rightarrow\; \frac{X_t}{1-t} = \int_0^t \frac{dB_s}{1-s}, \]
or
\[ X_t = (1-t)\int_0^t \frac{dB_s}{1-s}. \]
Corollary 3.2.6. Using again the properties of the stochastic integral we can compute the mean value and the variance of the Brownian bridge. Indeed,
\[ E[X_t] = E\Big[(1-t)\int_0^t \frac{dB_s}{1-s}\Big] = (1-t)\,E\Big[\int_0^t \frac{dB_s}{1-s}\Big] = 0, \]
since the Itô integral has zero expectation, and therefore $\mathrm{Var}[X_t] = E[X_t^2]$. By Itô's isometry,
\[ E[X_t^2] = (1-t)^2\,E\Big[\Big(\int_0^t \frac{dB_s}{1-s}\Big)^2\Big] = (1-t)^2\int_0^t \frac{ds}{(1-s)^2} = (1-t)^2\Big[\frac{1}{1-s}\Big]_0^t = (1-t)^2\Big(\frac{1}{1-t} - 1\Big) = t - t^2. \]

Corollary 3.2.7. From the above mean value and variance of $X_t$ it holds that $X_t \xrightarrow{\Pr} 0$ as $t \to 1$.

Proof. $X_t$ is said to converge in probability to $0$ if for every $\varepsilon > 0$, $\lim_{t\to 1} P(|X_t - 0| > \varepsilon) = 0$. Indeed, $P(|X_t - 0| > \varepsilon) = P(|X_t - E[X_t]| > \varepsilon)$, and by Chebyshev's inequality [see A.3],
\[ P\big(|X_t - E[X_t]| > \varepsilon\big) \le \frac{\mathrm{Var}[X_t]}{\varepsilon^2} = \frac{t - t^2}{\varepsilon^2} = \frac{t(1-t)}{\varepsilon^2} \to 0 \quad\text{as } t \to 1. \]
Therefore we have the desideratum.

The Brownian bridge has been used in financial mathematics for the modelling of zero-coupon bonds. Moreover, according to Horne, Garton, Krone and Lewis (2007, p. 2354), the Brownian bridge can be used to estimate movement paths (e.g. animal movements) from discrete location data over short time intervals.
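A quick numerical check of the variance formula $\mathrm{Var}[X_t] = t - t^2$ is possible by discretizing the stochastic integral in the solution $X_t = (1-t)\int_0^t dB_s/(1-s)$ with a left-point sum. The sketch below is illustrative only and is not the thesis code B.2.3; the grid size and number of paths are arbitrary choices.

\begin{verbatim}
# Minimal sketch: approximate X_t = (1 - t) * integral_0^t dB_s / (1 - s) on a grid
# and check Var[X_t] = t - t^2 at t = 0.5 by Monte Carlo (illustrative, not code B.2.3).
set.seed(1)
n <- 1000; t <- seq(0, 1, length.out = n + 1); dt <- 1 / n
npath  <- 2000
X_half <- numeric(npath)

for (k in 1:npath) {
  dB <- sqrt(dt) * rnorm(n)
  I  <- c(0, cumsum(dB / (1 - t[1:n])))   # left-point approximation of the integral
  X  <- (1 - t) * I                       # the factor (1 - t) forces X_1 = 0
  X_half[k] <- X[n / 2 + 1]               # value at t = 0.5
}

plot(t, X, type = "l", main = "Brownian bridge (one approximate path)")
c(var(X_half), 0.5 - 0.5^2)               # empirical variance vs t - t^2 = 0.25
\end{verbatim}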
Figure 5: Sample path of a Brownian bridge [see code B.2.3].

3.3 Strong and Weak Solutions

We will now define the concept of uniqueness for solutions of stochastic differential equations.

Definition 3.3.1. The solution of the stochastic differential equation (3.1) is said to be strongly unique if, given two stochastic processes $X_t$ and $\tilde X_t$ which satisfy equation (3.1), it holds that
\[ P(X_0 = \tilde X_0) = 1 \;\Rightarrow\; P(X_t = \tilde X_t,\ \forall t \ge 0) = 1. \]

Under certain conditions it is possible to show that a stochastic differential equation has a unique strong solution. Such conditions are Lipschitz conditions on the coefficients of the equation.

Theorem 3.3.2. Consider the stochastic differential equation (3.1) with coefficients $b$ and $\sigma$. Suppose these two functions are bounded and satisfy, for some constant $K > 0$, the following conditions:

Lipschitz condition: $|b(t,x) - b(t,y)| + |\sigma(t,x) - \sigma(t,y)| \le K\,|x - y|$ for all $t, x, y$;

Linear growth condition: $|b(t,x)|^2 + |\sigma(t,x)|^2 \le K\,(1 + |x|^2)$ for all $t, x$.
Then the stochastic differential equation has a strong solution, and this solution is unique.

The proof of this theorem is based on the Banach fixed point theorem [see A.4]. For the sake of simplicity, the proof is given for stochastic differential equations in the case $m = n = 1$. Several variations of this proof exist, for example in Øksendal (2003, p. 66), Allen (2007, p. 91) and Mao (2007, p. 51).

Proof. As mentioned above, we use the Banach fixed point theorem to show the existence of a unique strong solution of the SDE. We begin by defining the iterative (Picard) formula
\[ X^{(i+1)}_t = X_0 + \int_0^t b\big(s, X^{(i)}_s\big)\,ds + \int_0^t \sigma\big(s, X^{(i)}_s\big)\,dB_s. \]
If there is a stochastic process for which $X^{(i+1)}_t = X^{(i)}_t$ for all $i$, then this process is the solution of the stochastic differential equation. Consider the function space $M^2$ consisting of the stochastic processes $X_t$ such that $E\big[\int_0^T |X_t|^2\,dt\big] < \infty$, and equip this space with the norm $\|\cdot\|_\lambda$ defined by
\[ \|X\|_\lambda^2 = E\Big[\int_0^T e^{-\lambda t}|X_t|^2\,dt\Big]. \]
This space is complete with respect to this norm, i.e. it is a Banach space. We can therefore define the operator $\mathcal{Y}: M^2 \to M^2$ by
\[ (\mathcal{Y}X)_t = X_0 + \int_0^t b(s, X_s)\,ds + \int_0^t \sigma(s, X_s)\,dB_s. \]
A point $\chi = X$ of the space $M^2$ which satisfies $\mathcal{Y}\chi = \chi$ (equivalently $X_t = (\mathcal{Y}X)_t$) is called a fixed point of the operator $\mathcal{Y}$. In our case this point is a stochastic process $X_t$ and will be the solution of the stochastic differential equation. Thus, in order to prove that the equation can be solved, it suffices to apply the fixed point theorem, which guarantees a unique fixed point for a contraction operator (Jost, 2005, p. 46). The operator $\mathcal{Y}$ is a contraction if for any $\chi_1, \chi_2 \in M^2$,
\[ \|\mathcal{Y}\chi_1 - \mathcal{Y}\chi_2\| \le \theta\,\|\chi_1 - \chi_2\|, \]
where $0 < \theta < 1$; here we work with the norm $\|\cdot\|_\lambda$. Hence it suffices to show that $\mathcal{Y}$ is a contraction operator. Take two points $\chi_1, \chi_2$ of $M^2$, corresponding to the stochastic processes $X_{1,s}$ and $X_{2,s}$. By the definition of the operator $\mathcal{Y}$,
\[ (\mathcal{Y}X_1)_t - (\mathcal{Y}X_2)_t = \int_0^t \big(b(s,X_{1,s}) - b(s,X_{2,s})\big)\,ds + \int_0^t \big(\sigma(s,X_{1,s}) - \sigma(s,X_{2,s})\big)\,dB_s \equiv I_1(t) + I_2(t). \]
We estimate each term separately. By the elementary inequality $(a+b)^2 \le 2(a^2 + b^2)$,
\[ \|\mathcal{Y}X_1 - \mathcal{Y}X_2\|_\lambda^2 = \|I_1 + I_2\|_\lambda^2 \le 2\big(\|I_1\|_\lambda^2 + \|I_2\|_\lambda^2\big). \]
For $I_1$:
\[ \|I_1\|_\lambda^2 = E\Big[\int_0^T e^{-\lambda t}\Big|\int_0^t \big(b(s,X_{1,s}) - b(s,X_{2,s})\big)\,ds\Big|^2 dt\Big] \le E\Big[\int_0^T e^{-\lambda t}\,t\int_0^t \big|b(s,X_{1,s}) - b(s,X_{2,s})\big|^2\,ds\,dt\Big] \]
(by the Cauchy-Schwarz inequality)
\[ \le C^2\,E\Big[\int_0^T e^{-\lambda t}\int_0^t |X_{1,s} - X_{2,s}|^2\,ds\,dt\Big] \tag{$*$} \]
\[ = C^2\,E\Big[\int_0^T \Big(\int_s^T e^{-\lambda t}e^{\lambda s}\,dt\Big)e^{-\lambda s}\,|X_{1,s} - X_{2,s}|^2\,ds\Big] \tag{$**$} \]
\[ \le \frac{C^2}{\lambda}\,E\Big[\int_0^T e^{-\lambda s}|X_{1,s} - X_{2,s}|^2\,ds\Big] = \frac{C^2}{\lambda}\,\|X_1 - X_2\|_\lambda^2. \tag{$***$} \]
$(*)$: we use the Lipschitz property of $b$, i.e. $|b(t,x_1) - b(t,x_2)| \le C|x_1 - x_2|$, and absorb the bounded factor $t \le T$ into the constant $C^2$.
$(**)$: the order of integration is exchanged via
\[ \int_0^T e^{-\lambda t}\int_0^t f(s)\,ds\,dt = \int_0^T e^{-\lambda t}\int_0^T f(s)\,H(t-s)\,ds\,dt = \int_0^T\Big(\int_s^T e^{-\lambda t}e^{\lambda s}\,dt\Big)e^{-\lambda s}f(s)\,ds, \]
where $f(s)$ can be any function of $s$ (in our case $f(s) = |X_{1,s} - X_{2,s}|^2$) and $H$ is the Heaviside function,
\[ H(p) = \begin{cases} 0, & p < 0,\\ 1, & p \ge 0. \end{cases} \]
$(***)$:
\[ \int_s^T e^{-\lambda t}e^{\lambda s}\,dt = e^{\lambda s}\Big[\frac{e^{-\lambda t}}{-\lambda}\Big]_s^T = \frac{1}{\lambda}\big(1 - e^{-\lambda(T-s)}\big) \le \frac{1}{\lambda}. \]
For $I_2$ we repeat the above steps, using Itô's isometry for the stochastic integral:
\[ \|I_2\|_\lambda^2 = E\Big[\int_0^T e^{-\lambda t}\Big|\int_0^t \big(\sigma(s,X_{1,s}) - \sigma(s,X_{2,s})\big)\,dB_s\Big|^2 dt\Big] = E\Big[\int_0^T e^{-\lambda t}\int_0^t \big|\sigma(s,X_{1,s}) - \sigma(s,X_{2,s})\big|^2\,ds\,dt\Big] \]
\[ \le C^2\,E\Big[\int_0^T e^{-\lambda t}\int_0^t |X_{1,s} - X_{2,s}|^2\,ds\,dt\Big] \le \frac{C^2}{\lambda}\,E\Big[\int_0^T e^{-\lambda s}|X_{1,s} - X_{2,s}|^2\,ds\Big] = \frac{C^2}{\lambda}\,\|X_1 - X_2\|_\lambda^2, \]
exactly as for $I_1$. Combining the two estimates,
\[ \|\mathcal{Y}X_1 - \mathcal{Y}X_2\|_\lambda^2 \le 2\big(\|I_1\|_\lambda^2 + \|I_2\|_\lambda^2\big) \le \frac{4C^2}{\lambda}\,\|X_1 - X_2\|_\lambda^2. \]
Choosing $\lambda$ large enough that $4C^2/\lambda < 1$, the operator $\mathcal{Y}$ is a contraction on $M^2$. Consequently, $\mathcal{Y}$ has a unique fixed point in $M^2$, and this fixed point is the solution of the stochastic differential equation.
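The fixed-point argument is constructive: on a fixed Brownian path one can apply the Picard map numerically and watch successive iterates converge. The sketch below is purely illustrative and rests on assumptions not made in the text (a particular choice of coefficients $b(x) = 0.05x$, $\sigma(x) = 0.2x$, and a left-point discretization of both integrals); it is not part of the thesis code.

\begin{verbatim}
# Minimal sketch (illustrative assumptions): Picard iteration for dX = b(X)dt + s(X)dB
# on a fixed Brownian path, with left-point (Ito-type) discretization of both integrals.
set.seed(1)
b <- function(x) 0.05 * x          # drift, assumed for illustration
s <- function(x) 0.20 * x          # diffusion, assumed for illustration
X0 <- 1; Tend <- 1; n <- 500; dt <- Tend / n
dB <- sqrt(dt) * rnorm(n)

X <- rep(X0, n + 1)                # initial guess: the constant process X^{(0)} = X0
for (iter in 1:6) {
  Xnew <- X0 + c(0, cumsum(b(X[1:n]) * dt + s(X[1:n]) * dB))
  cat(sprintf("iteration %d: sup|X_new - X_old| = %.2e\n", iter, max(abs(Xnew - X))))
  X <- Xnew
}
\end{verbatim}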
Definition 3.3.3. Let $f, g$ be given functions. We seek a filtration $\mathcal{F}_t$, a Brownian motion $\tilde B_t$ and an Itô stochastic process $X_t$ such that
\[ X_t = X_0 + \int_0^t f(s, X_s)\,ds + \int_0^t g(s, X_s)\,d\tilde B_s. \]
The desideratum is then the triple $(X_t, \tilde B_t, \mathcal{F}_t)$, which is called a weak solution of the stochastic differential equation.

Proposition 3.3.4. A strong solution of a stochastic differential equation is also a weak solution, but the converse does not hold. The concept of uniqueness can also be generalized to non-strong solutions.

An example is now given to illustrate a possible form of a weak solution of a stochastic differential equation. This example is stated without solution in Mao (2007, p. 78); Karatzas and Shreve (1991, p. 301) and Berestycki (2010, p. 75) give solutions of this problem, and a more detailed version of them is given below.

Example 3.3.5. (A weak solution of a stochastic differential equation.) Consider the stochastic differential equation known as Tanaka's equation,
\[ dX_t = \operatorname{sign}(X_t)\,dB_t, \tag{3.6} \]
where
\[ \operatorname{sign}(x) = \begin{cases} +1, & x \ge 0,\\ -1, & x < 0. \end{cases} \]
Let $\tilde B_t$ be any Brownian motion and let $\mathcal{F}_t$ be a filtration with respect to which $\tilde B_t$ is a Brownian motion, and set $X_t := \tilde B_t$. Define $\hat B_t$ to be the Itô stochastic integral
\[ \hat B_t := \int_0^t \operatorname{sign}(\tilde B_s)\,d\tilde B_s. \tag{3.7} \]
Since $X_t = \tilde B_t$, we can write (3.7) in differential form as
\[ d\hat B_t = \operatorname{sign}(X_t)\,dX_t \;\Rightarrow\; dX_t = \operatorname{sign}(X_t)\,d\hat B_t, \]
using $\operatorname{sign}(x)^2 = 1$. Consequently, $X_t = \tilde B_t$ will be a weak solution of the stochastic differential equation (3.6) provided the stochastic process $\hat B_t$ is a Brownian motion (with respect to the filtration $\mathcal{F}_t$). This can be shown by applying Lévy's Theorem [see A.5] and the properties of the Itô stochastic integral. Indeed, $\hat B_t$ is adapted to the filtration $\mathcal{F}_t$ by its definition. Moreover, from theorem 2.3.12 it is clear that $\hat B_t$ is a martingale, because it can be expressed as an Itô stochastic integral.
For the application of Lévy's theorem it is sufficient to prove that the stochastic process $Z_t := \hat B_t^2 - t$ is also a martingale. We have
\[ E[Z_t \mid \mathcal{F}_s] = E[\hat B_t^2 - t \mid \mathcal{F}_s] = E\Big[\Big(\int_0^t \operatorname{sign}(\tilde B_u)\,d\tilde B_u\Big)^2 - t \,\Big|\, \mathcal{F}_s\Big] \]
\[ = E\Big[\Big(\int_0^s \operatorname{sign}(\tilde B_u)\,d\tilde B_u + \int_s^t \operatorname{sign}(\tilde B_u)\,d\tilde B_u\Big)^2 - t \,\Big|\, \mathcal{F}_s\Big] \]
\[ = \Big(\int_0^s \operatorname{sign}(\tilde B_u)\,d\tilde B_u\Big)^2 + E\Big[\Big(\int_s^t \operatorname{sign}(\tilde B_u)\,d\tilde B_u\Big)^2 \,\Big|\, \mathcal{F}_s\Big] + 2\Big(\int_0^s \operatorname{sign}(\tilde B_u)\,d\tilde B_u\Big)\,E\Big[\int_s^t \operatorname{sign}(\tilde B_u)\,d\tilde B_u \,\Big|\, \mathcal{F}_s\Big] - t \]
\[ = \hat B_s^2 + (t - s) + 0 - t = \hat B_s^2 - s = Z_s. \]
Hence $Z_t$ is a martingale and, by Lévy's Theorem, $\hat B_t$ is a Brownian motion.

3.4 Numerical methods of SDEs

One might hope that analytical methods exist for all stochastic differential equations. However, this is not the case: most stochastic differential equations do not have an analytical solution. Therefore the most appropriate way to obtain their solutions is usually a numerical method. In the current subsection we focus on two such methods, the Euler and Milstein approximations.

Consider the scalar stochastic differential equation
\[ dX_t = b(t, X_t)\,dt + \sigma(t, X_t)\,dB_t \tag{3.8} \]
on the time interval $[\tau_0, T]$ with initial condition $X_{\tau_0} = X_0$. The term $b(t, X_t)$ is called the displacement (drift) coefficient; it is the deterministic part of the stochastic differential equation and characterizes the local trend. The term $\sigma(t, X_t)$ is called the diffusion coefficient and governs the average size of the fluctuations of $X$; these fluctuations originate from the Brownian motion $B_t$.
3.4.1 The Euler-Maruyama method

The simplest stochastic numerical approximation (Schaffter, 2010, p. 4), and the most commonly used, is the Euler-Maruyama approximation (also simply called the Euler method), named after Leonhard Euler and Gisiro Maruyama. For a given partition
\[ \tau_0 = t_0 < t_1 < t_2 < \dots < t_j < \dots < t_n = T \]
of the time interval $[\tau_0, T]$, the Euler approximation is a continuous-time stochastic process $Y = \{Y_t,\ \tau_0 \le t \le T\}$ satisfying the iterative formula
\[ Y_{i+1} = Y_i + b(t_i, Y_i)(t_{i+1} - t_i) + \sigma(t_i, Y_i)(B_{t_{i+1}} - B_{t_i}) \tag{3.9} \]
for $i = 0, 1, 2, \dots, n-1$, where $Y_i = Y_{t_i}$ and the initial value is $Y_0 = X_0$. We denote by $\Delta t_i = t_{i+1} - t_i$ the $i$-th time increment and by $\delta = \max_i \Delta t_i$ the largest of them. Usually a uniform partition (equidistant points in time) is considered: $t_i = \tau_0 + i\delta$, where $\delta = \Delta t_i \equiv \Delta t = (T - \tau_0)/n$ (for a large enough $n \in \mathbb{Z}$ such that $\delta \in (0,1)$). The key question is how to generate the random increments $\Delta B_i = B_{t_{i+1}} - B_{t_i}$ for $i = 0, 1, \dots, n-1$. According to example (2.2.11), $B_t - B_s \sim N(0, t-s)$ for $t > s$; consequently,
\[ B_t - B_s = \sqrt{t-s}\;Z_{s,t}, \]
where $Z_{s,t} \sim N(0,1)$. Applying this in each subinterval $[t_i, t_{i+1}]$ gives
\[ Y_{i+1} = Y_i + b(t_i, Y_i)(t_{i+1} - t_i) + \sigma(t_i, Y_i)\sqrt{\Delta t_i}\,Z_i, \]
where the $Z_i \sim N(0,1)$ are independent random variables, since the increments of Brownian motion over non-overlapping intervals are independent.

Definition 3.4.1. With the above notation, the Euler approximation can be written more simply as
\[ Y_{i+1} = Y_i + b(t_i, Y_i)\,\Delta t_i + \sigma(t_i, Y_i)\,\Delta B_i \tag{3.10} \]
for $i = 0, 1, \dots, n-1$.

The successful implementation of the Euler method in statistical software relies on the recursive structure of the equation, which calculates approximate values of the Itô process only at discrete points in time. The term "form" will be used to describe a recursive algorithm which gives the values of the discrete-time approximation at the given time points. Although the values at the discrete time points are the ones of interest, we regard a discrete-time approximation as a continuous-time stochastic process defined on the entire interval $[\tau_0, T]$; each discrete-time approximation corresponds to a specific "form", i.e. a recursive algorithm.

Remark 3.4.2. Let $X_t$ be a geometric Brownian motion, so that $b(t, X) = bX$ and $\sigma(t, X) = \sigma X$. Then the Euler form becomes
\[ X_{t+\Delta t} = X_t + bX_t\,\Delta t + \sigma X_t(B_{t+\Delta t} - B_t), \]
whilst the exact solution is
\[ X_{t+\Delta t} = X_t\,e^{(b - \frac{1}{2}\sigma^2)\Delta t + \sigma(B_{t+\Delta t} - B_t)}. \]
We know that if $b \in \mathbb{R}$ with $|b|$ sufficiently small, then $1 + b \approx e^b$ is a good approximation.
Consequently, if $\Delta t$ is sufficiently small, then
\[ X_t + bX_t\,\Delta t + \sigma X_t(B_{t+\Delta t} - B_t) = X_t\big[1 + b\,\Delta t + \sigma(B_{t+\Delta t} - B_t)\big] \approx X_t\,e^{b\Delta t + \sigma(B_{t+\Delta t} - B_t)} = X_{t+\Delta t}\,e^{\frac{1}{2}\sigma^2\Delta t}, \]
and for small $\Delta t$ we have $\frac{1}{2}\sigma^2\Delta t \approx 0$.

We now define the convergence of the method. First we define the criterion of absolute error,
\[ \varepsilon = E\big[|X_T - Y_T|\big], \tag{3.11} \]
i.e. the mean of the absolute value of the difference between the Itô stochastic process and the approximation at a finite terminal time $T$. We will usually write $Y^\delta$, where $\delta$ is a given maximum step size in $(0, \delta_0)$ ($\delta_0$ a finite positive number), to show the dependence of the approximation $Y$ on this step size.

Definition 3.4.3. A general discrete-time approximation $Y^\delta$ with maximum step $\delta$ converges strongly with order $\gamma > 0$ to a stochastic process $X$ at time $T$ if there exists a positive constant $C$ such that
\[ \varepsilon_{\mathrm{strong}}(\delta) = E\big(|X_T - Y^\delta_T|\big) \le C\delta^\gamma \quad\text{for each } \delta \in (0, \delta_0). \tag{3.12} \]
This definition appears in many variations in the literature; the inequality (3.12) is a natural generalization of the deterministic case (Schaffter, 2010, p. 6; Higham, 2001, p. 534; Mikosch, 1998, p. 161).

Definition 3.4.4. On the other hand, a method has weak order of convergence $\gamma > 0$ if there exists a positive constant $C$ such that for all smooth functions $f$ in some class,
\[ \varepsilon_{\mathrm{weak}}(\delta) = \big|Ef(X_T) - Ef(Y^\delta_T)\big| \le C\delta^\gamma \quad\text{for each } \delta \in (0, \delta_0). \tag{3.13} \]
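The recursion (3.10) translates directly into R. The following is a minimal sketch of a generic Euler-Maruyama routine (the function name euler_maruyama and the parameter values are illustrative assumptions; the thesis's own code for the experiments below is the one referenced as B.2.4):

\begin{verbatim}
# Minimal sketch of the Euler-Maruyama recursion (3.10) for dX = b(t,X)dt + sigma(t,X)dB
# on [0, T] with a uniform step; coefficients and parameters here are illustrative.
euler_maruyama <- function(b, sigma, X0, Tend, n) {
  dt <- Tend / n
  Y  <- numeric(n + 1)
  Y[1] <- X0
  for (i in 1:n) {
    t_i <- (i - 1) * dt
    dB  <- sqrt(dt) * rnorm(1)                   # Delta B_i ~ N(0, dt)
    Y[i + 1] <- Y[i] + b(t_i, Y[i]) * dt + sigma(t_i, Y[i]) * dB
  }
  Y
}

# Example: geometric Brownian motion dX = mu X dt + sig X dB
set.seed(1)
mu <- 0.05; sig <- 0.2
Y <- euler_maruyama(b = function(t, x) mu * x,
                    sigma = function(t, x) sig * x,
                    X0 = 1, Tend = 1, n = 2^8)
plot(seq(0, 1, length.out = length(Y)), Y, type = "l",
     xlab = "t", ylab = expression(Y[t]), main = "Euler-Maruyama path")
\end{verbatim}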
Remark 3.4.5. According to the literature, if the conditions of theorem 3.3.2 are satisfied (i.e. the coefficients $b$ and $\sigma$ satisfy the Lipschitz condition and the linear growth condition), then the Euler form converges with strong order of convergence $\gamma = \frac{1}{2}$, i.e.
\[ \varepsilon_{\mathrm{strong}}(\delta) = E\big(|X_T - Y^\delta_T|\big) \le C\delta^{\frac{1}{2}} \quad\text{for each } \delta \in (0, \delta_0), \]
and weak order of convergence $\gamma = 1$, i.e.
\[ \varepsilon_{\mathrm{weak}}(\delta) = \big|Ef(X_T) - Ef(Y^\delta_T)\big| \le C\delta^{1} \quad\text{for each } \delta \in (0, \delta_0) \]
(Onskog, 2007, p. 76; Higham, 2001, p. 537; Shores, 2006, p. 77).

Corollary 3.4.6. As mentioned, the Euler method converges strongly with order $\gamma = \frac{1}{2}$. Then, by Markov's inequality [see A.6],
\[ P\big(|X_{t_n} - Y^\delta_n| \ge \Delta t^{\frac{1}{4}}\big) \le \frac{1}{\Delta t^{\frac{1}{4}}}\,E\big(|X_{t_n} - Y^\delta_n|\big) \le C\Delta t^{\frac{1}{4}}, \]
or equivalently
\[ P\big(|X_{t_n} - Y^\delta_n| < \Delta t^{\frac{1}{4}}\big) \ge 1 - C\Delta t^{\frac{1}{4}}, \]
i.e. the error is small with high probability along any path.

Remark 3.4.7. Strong convergence is a pathwise approximation of $X$: it measures the rate at which the mean of the error decays as the step size becomes small, cf. (3.12). Weak convergence, on the other hand, concerns the approximation of the moments of $X$: it measures the rate at which the error of the means decays as the step size becomes small, cf. (3.13).

An example is now given to illustrate the different aspects of simulating a discrete-time approximation to an Itô stochastic process. Through its detailed solution it can also be seen how the Euler method performs in practice.
Example 3.4.8. Consider the Itô stochastic process $X = \{X_t,\ t \ge 0\}$ (geometric Brownian motion) which satisfies the linear stochastic differential equation
\[ dX_t = \mu X_t\,dt + \sigma X_t\,dB_t, \tag{3.14} \]
where $t \in [0, T]$, with initial value $X_0 \in \mathbb{R}$. Clearly $\mu(t,x) = \mu x$ is the displacement coefficient and $\sigma(t,x) = \sigma x$ is the diffusion coefficient. From example 3.2.1 we know that the analytical solution of (3.14) is
\[ X_t = X_0\,e^{(\mu - \frac{1}{2}\sigma^2)t + \sigma B_t} \tag{3.15} \]
for $t \in [0, T]$ and the given Brownian motion $B = \{B_t,\ t \ge 0\}$. Because the solution is known analytically, we can compare the Euler approximation with the exact solution and estimate the error. To simulate a path of the Euler approximation for a given time partition, we start from the initial condition $Y_0 = X_0$ and proceed recursively to produce the next term. The Euler form is
\[ Y_{i+1} = Y_i + \mu Y_i\,\Delta t + \sigma Y_i\,\Delta B_i \tag{3.16} \]
for $i = 0, 1, 2, \dots$, with coefficients $\mu(t,x) = \mu x$ and $\sigma(t,x) = \sigma x$ as above; $\Delta B_i$ is the increment of the Brownian motion over the interval $t_i \le t \le t_{i+1}$. We can evaluate the exact solution (3.15) along the same Brownian path,
\[ X_{\tau_n} = X_0\,e^{(\mu - \frac{1}{2}\sigma^2)\tau_n + \sigma B_{\tau_n}}, \]
where $\{\tau_n : n = 1, 2, \dots, n_1\}$ is a uniform partition of $[0, T]$. Statistical software can simulate the paths of the Brownian motion which supply the increments $B_{t_{i+1}} - B_{t_i}$ required in (3.16). For simplicity, we choose the step $\Delta t$ of the numerical method to be an integer multiple $R \ge 1$ of the increment $\delta t$ of the Brownian path; this guarantees that the set of points $\{\tau_j\}$ contains the points $\{t_j\}$ which define the Euler approximation. We obtain the following graphs:
Figure 6: Euler approximation (red) and the exact solution of geometric Brownian motion (light blue) with $X_0 = 1$, $\mu = 0.05$ and $\sigma = 0.2$ [see code B.2.4 and remark B.2.5].

Figure 6 illustrates Euler approximations with equidistant points in the time interval $[0,1]$ for the Itô stochastic process satisfying (3.14). The top-left panel has step $\delta = 2^{-2}$, the top-right $\delta = 2^{-4}$, the bottom-left $\delta = 2^{-6}$ and the bottom-right $\delta = 2^{-8}$. Clearly, the smaller the step, the better the approximation to the Itô stochastic process.

Next we use example 3.4.8 to examine the criterion of absolute error more precisely and to verify numerically the conclusions drawn from figure 6. Recall that we are dealing with the geometric Brownian motion which satisfies (3.14) and has the analytical solution (3.15). We simulate $N$ paths of Brownian motion and use them to create $N$ approximate solutions $Y$ and $N$ exact solutions $X$. We denote by $Y_{T,\kappa}$ and $X_{T,\kappa}$ the final value of the $\kappa$-th approximate and exact solution respectively.
From (3.11), the absolute error can be estimated by the quantity
\[ \hat\varepsilon = \frac{1}{N}\sum_{\kappa=1}^{N} |X_{T,\kappa} - Y_{T,\kappa}|. \tag{3.17} \]
By simulating $N = 25$ paths of the Itô process satisfying (3.14) we can estimate the absolute error $\hat\varepsilon$ as defined by (3.17). The following results are obtained:

$\delta$: $\quad 2^{-2} \quad\; 2^{-4} \quad\;\; 2^{-6} \quad\;\; 2^{-8}$
$\hat\varepsilon$: $\quad$ 0.01267835 $\quad$ 0.006312543 $\quad$ 0.002848442 $\quad$ 0.001257862

Table 1: Absolute errors $\hat\varepsilon$ for different steps [see code B.2.6 and remark B.2.7].

Comparing the results in table 1, we observe that the estimated absolute error $\hat\varepsilon$ decreases as $\delta$ becomes smaller, which confirms the conclusions drawn from the graphs above.

3.4.2 The Milstein method

We first describe the Taylor-Itô expansion, which is the stochastic counterpart of the Taylor expansion in the deterministic case. In essence, the Taylor-Itô expansion is an iterative application of Itô's Lemma. Consider the scalar stochastic differential equation
\[ dX_t = b(X_t)\,dt + \sigma(X_t)\,dB_t. \tag{3.18} \]
For simplicity, the functions $b$ and $\sigma$ do not depend on the time variable $t$. Applying Itô's Lemma to $f(X_t)$ gives
\[ df(X_t) = \sigma(X_t)\frac{\partial f(X_t)}{\partial x}\,dB_t + \Big(b(X_t)\frac{\partial f(X_t)}{\partial x} + \frac{1}{2}\sigma^2(X_t)\frac{\partial^2 f(X_t)}{\partial x^2}\Big)dt. \tag{3.19} \]
If we define the operators
\[ K_0 \equiv b(X_t)\frac{\partial}{\partial x} + \frac{1}{2}\sigma^2(X_t)\frac{\partial^2}{\partial x^2}, \qquad K_1 \equiv \sigma(X_t)\frac{\partial}{\partial x}, \]
equation (3.19) becomes
\[ df(X_t) = K_0 f(X_t)\,dt + K_1 f(X_t)\,dB_t, \]
or equivalently, in integral form,
\[ f(X_t) = f(X_{t_0}) + \int_{t_0}^t K_0 f(X_s)\,ds + \int_{t_0}^t K_1 f(X_s)\,dB_s. \tag{3.20} \]
Choosing $f(X) = X$, equation (3.20) gives
\[ X_t = X_{t_0} + \int_{t_0}^t b(X_s)\,ds + \int_{t_0}^t \sigma(X_s)\,dB_s, \tag{3.21} \]
i.e. the given stochastic differential equation in integral form. Next we apply Itô's Lemma, in the form (3.20), to the functions appearing inside the integrals. For $f(X) = b(X)$, (3.20) gives
\[ b(X_s) = b(X_{t_0}) + \int_{t_0}^s K_0 b(X_r)\,dr + \int_{t_0}^s K_1 b(X_r)\,dB_r. \tag{3.22} \]
Similarly, for $f(X) = \sigma(X)$,
\[ \sigma(X_s) = \sigma(X_{t_0}) + \int_{t_0}^s K_0 \sigma(X_r)\,dr + \int_{t_0}^s K_1 \sigma(X_r)\,dB_r. \tag{3.23} \]
Substituting (3.22) and (3.23) into (3.21) gives
\[ X_t = X_{t_0} + \int_{t_0}^t \Big(b(X_{t_0}) + \int_{t_0}^s K_0 b(X_r)\,dr + \int_{t_0}^s K_1 b(X_r)\,dB_r\Big)ds + \int_{t_0}^t \Big(\sigma(X_{t_0}) + \int_{t_0}^s K_0 \sigma(X_r)\,dr + \int_{t_0}^s K_1 \sigma(X_r)\,dB_r\Big)dB_s, \tag{3.24} \]
where
\[ K_0 b = b\,b' + \tfrac{1}{2}\sigma^2 b'', \qquad K_0\sigma = b\,\sigma' + \tfrac{1}{2}\sigma^2\sigma'', \qquad K_1 b = \sigma\,b', \qquad K_1\sigma = \sigma\,\sigma'. \]
We now collect the four double integrals together and denote them by $R$ (for "residual"):
\[ X_t = X_{t_0} + b(X_{t_0})\int_{t_0}^t ds + \sigma(X_{t_0})\int_{t_0}^t dB_s + R, \tag{3.25} \]
where
\[ R = \int_{t_0}^t\!\!\int_{t_0}^s K_0 b(X_r)\,dr\,ds + \int_{t_0}^t\!\!\int_{t_0}^s K_1 b(X_r)\,dB_r\,ds + \int_{t_0}^t\!\!\int_{t_0}^s K_0\sigma(X_r)\,dr\,dB_s + \int_{t_0}^t\!\!\int_{t_0}^s K_1\sigma(X_r)\,dB_r\,dB_s. \]
Essentially, the method applies this substitution iteratively in order to obtain constant integrands in terms of increasingly higher order. For instance, for the last term of the residual $R$, applying (3.20) with $f = K_1\sigma$ gives
\[ \int_{t_0}^t\!\!\int_{t_0}^s K_1\sigma(X_r)\,dB_r\,dB_s = \int_{t_0}^t\!\!\int_{t_0}^s \Big(K_1\sigma(X_{t_0}) + \int_{t_0}^r K_0 K_1\sigma(X_q)\,dq + \int_{t_0}^r K_1 K_1\sigma(X_q)\,dB_q\Big)dB_r\,dB_s. \]
The first term on the right-hand side can be written as
\[ K_1\sigma(X_{t_0})\int_{t_0}^t\!\!\int_{t_0}^s dB_r\,dB_s = \sigma(X_{t_0})\sigma'(X_{t_0})\int_{t_0}^t\!\!\int_{t_0}^s dB_r\,dB_s. \]
Consequently, equation (3.25) becomes
\[ X_t = X_{t_0} + b(X_{t_0})\int_{t_0}^t ds + \sigma(X_{t_0})\int_{t_0}^t dB_s + \sigma(X_{t_0})\sigma'(X_{t_0})\int_{t_0}^t\!\!\int_{t_0}^s dB_r\,dB_s + \tilde R, \tag{3.26} \]
where $\tilde R$ is the new residual. The next step is to formulate numerical algorithms based on these equations. For equation (3.26) we evaluate the double integral:
\[ \int_{t_0}^t\!\!\int_{t_0}^s dB_r\,dB_s = \int_{t_0}^t (B_s - B_{t_0})\,dB_s = \int_{t_0}^t B_s\,dB_s - \int_{t_0}^t B_{t_0}\,dB_s \]
\[ = \tfrac{1}{2}\big(B_t^2 - t - B_{t_0}^2 + t_0\big) - B_{t_0}(B_t - B_{t_0}) = \tfrac{1}{2}B_t^2 - B_{t_0}B_t + \tfrac{1}{2}B_{t_0}^2 - \tfrac{1}{2}(t - t_0) = \tfrac{1}{2}(B_t - B_{t_0})^2 - \tfrac{1}{2}(t - t_0). \tag{3.27} \]
Writing $\Delta t := t - t_0$ and $\Delta B_t := B_t - B_{t_0}$, (3.27) becomes
\[ \int_{t_0}^t\!\!\int_{t_0}^s dB_r\,dB_s = \tfrac{1}{2}(\Delta B_t)^2 - \tfrac{1}{2}\Delta t. \]
In general, for an interval of the form $[t_i, t_{i+1}]$ we substitute $t_0 = t_i$, $t = t_{i+1}$ and $\Delta t := t_{i+1} - t_i$, and combining (3.26) and (3.27) we obtain
\[ X_{t_{i+1}} = X_{t_i} + b(X_{t_i})\,\Delta t + \sigma(X_{t_i})\,\Delta B_i + \tfrac{1}{2}\sigma(X_{t_i})\sigma'(X_{t_i})\big((\Delta B_i)^2 - \Delta t\big) + \tilde R. \]
The first three terms of this recurrence,
\[ Y_{i+1} = Y_i + b(Y_i)\,\Delta t + \sigma(Y_i)\,\Delta B_i, \]
are exactly the Euler-Maruyama method. Keeping all the terms,
\[ Y_{i+1} = Y_i + b(Y_i)\,\Delta t + \sigma(Y_i)\,\Delta B_i + \tfrac{1}{2}\sigma(Y_i)\sigma'(Y_i)\big((\Delta B_i)^2 - \Delta t\big), \]
we obtain the Milstein method (Higham, 2001, p. 539; Schaffter, 2010, p. 5; Mikosch, 1998, p. 164; Onskog, 2007, p. 96; Allen, 2007, p. 102). Note that the last term of the Milstein form requires the derivative $\sigma'$ of the diffusion coefficient. An example is presented below to apply the Milstein and Euler approximations to a stochastic differential equation and to examine their accuracy; a minimal code sketch of the Milstein recursion is given first.
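The Milstein recursion translates into R in the same way as the Euler form. The routine below is illustrative only (the function name and the geometric Brownian motion coefficients $b(x) = \mu x$, $\sigma(x) = \sigma x$, $\sigma'(x) = \sigma$ are assumptions for the example; the thesis's own code is the one referenced as B.2.8):

\begin{verbatim}
# Minimal sketch of the Milstein recursion; the coefficients shown are illustrative (GBM).
milstein <- function(b, s, s_prime, X0, Tend, n) {
  dt <- Tend / n
  Y  <- numeric(n + 1); Y[1] <- X0
  for (i in 1:n) {
    dB <- sqrt(dt) * rnorm(1)
    Y[i + 1] <- Y[i] + b(Y[i]) * dt + s(Y[i]) * dB +
                0.5 * s(Y[i]) * s_prime(Y[i]) * (dB^2 - dt)
  }
  Y
}

set.seed(1)
mu <- 0.05; sig <- 0.2
Y <- milstein(b = function(x) mu * x,
              s = function(x) sig * x,
              s_prime = function(x) sig,
              X0 = 1, Tend = 1, n = 2^6)
\end{verbatim}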
Example 3.4.9. Consider again the Itô stochastic process $X = \{X_t,\ t \ge 0\}$ which satisfies the linear stochastic differential equation
\[ dX_t = \mu X_t\,dt + \sigma X_t\,dB_t, \tag{3.28} \]
where $t \in [0, T]$, with initial value $X_0 \in \mathbb{R}$. As we saw in example 3.4.8, the Euler form of this stochastic differential equation is
\[ Y_{i+1} = Y_i(1 + \mu\Delta t) + \sigma Y_i\,\Delta B_i. \]
The Milstein form is
\[ Y_{i+1} = Y_i + \mu Y_i\,\Delta t + \sigma Y_i\,\Delta B_i + \tfrac{1}{2}\sigma^2 Y_i\big((\Delta B_i)^2 - \Delta t\big) = Y_i\Big(1 + \big(\mu - \tfrac{1}{2}\sigma^2\big)\Delta t\Big) + \sigma Y_i\,\Delta B_i + \tfrac{1}{2}\sigma^2 Y_i(\Delta B_i)^2. \]
We obtain the following graph:

Figure 7: Euler approximation (red), Milstein approximation (black) and the exact solution of geometric Brownian motion (light blue) with $\delta = 2^{-6}$, $X_0 = 1$, $\mu = 0.05$ and $\sigma = 0.2$ [see code B.2.8].

A closer look at figure 7 indicates that the black line (Milstein approximation) is closer to the exact solution than the red line (Euler approximation); the Milstein form is thus more accurate than the Euler form. A further illustration is given in the following graph:
Figure 8: Euler approximation (red), Milstein approximation (black) and the exact solution of geometric Brownian motion (light blue) with $\delta = 2^{-8}$, $X_0 = 1$, $\mu = 0.05$ and $\sigma = 0.2$ [see remark B.2.9].

Looking at figure 8, we notice that both approximations are now more accurate. The two forms are very close to the exact solution but, as shown numerically below, the Milstein approximation is again closer to the analytical solution of geometric Brownian motion than the Euler approximation. Indeed, at the terminal point $T = 1$ we find, for $\delta = 2^{-6}$,
\[ \varepsilon(\text{Euler}) = 0.0006773981 \quad\text{and}\quad \varepsilon(\text{Milstein}) = 0.00005194361964777, \]
and similarly, for $\delta = 2^{-8}$,
\[ \varepsilon(\text{Euler}) = 0.000303467591 \quad\text{and}\quad \varepsilon(\text{Milstein}) = 0.00002640726142. \]
Consequently, the error terms of the Milstein approximations are smaller than those of the Euler approximations in these examples, as the theory predicts.
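For completeness, here is a minimal, self-contained R sketch of the kind of comparison behind these numbers (it is not the thesis code B.2.8/B.2.9, and since the Brownian path is random the errors it prints will differ from the values quoted above): both schemes are run on the same Brownian increments and their terminal errors against the exact solution (3.15) are compared.

\begin{verbatim}
# Minimal sketch: terminal errors of Euler and Milstein for GBM (3.28) on one Brownian path.
set.seed(1)
mu <- 0.05; sig <- 0.2; X0 <- 1; Tend <- 1
n  <- 2^8; dt <- Tend / n
dB <- sqrt(dt) * rnorm(n)
B  <- cumsum(dB)

Ye <- Ym <- X0
for (i in 1:n) {
  Ye <- Ye + mu * Ye * dt + sig * Ye * dB[i]                                      # Euler
  Ym <- Ym + mu * Ym * dt + sig * Ym * dB[i] + 0.5 * sig^2 * Ym * (dB[i]^2 - dt)  # Milstein
}
Xexact <- X0 * exp((mu - 0.5 * sig^2) * Tend + sig * B[n])

c(euler = abs(Xexact - Ye), milstein = abs(Xexact - Ym))
\end{verbatim}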
4 Applications in Mathematical Finance

This chapter presents applications of Itô's stochastic calculus to the field of Financial Mathematics. Stochastic processes are a fundamental tool that provides rigorous mathematical models and interpretations of option pricing, arbitrage, portfolio behaviour, etc. For instance, the Black-Scholes formula and European option pricing are two significant mathematical models referred to in this thesis.

4.1 Portfolios and assets

In this subsection we set out some of the basic ideas of finance in continuous time. Specifically, we generalize these fundamental concepts by assuming that financial events can occur over a continuous period of time. The values of the assets involved exhibit fluctuations which can be modelled as diffusions, i.e. by Itô stochastic processes. Suppose we have $n + 1$ assets; their values at time $t$ are denoted by the vector
\[ S(t, \omega) = \big(S_0(t,\omega), S_1(t,\omega), \dots, S_n(t,\omega)\big). \]
This vector consists of a collection of stochastic processes with time parameter $t \in I \subset \mathbb{R}^+$ (where $I$ denotes a time interval). Following Karatzas and Shreve (1991, p. 371), we take the asset values to be Itô stochastic processes of the form
\[ dS_0 = r(t,\omega)S_0\,dt \tag{4.1} \]
\[ dS_i = \mu_i(t,\omega,S_i)\,dt + \sum_{j=1}^m \sigma_{ij}(t,\omega,S_i)\,dB_j(t), \tag{4.2} \]
where $S_i$ is the value of share $i$ and the $B_j$ are independent Brownian motions. It is important to note that the above equation involves more than one Brownian motion; this models the existence of more than one source of randomness. For instance, if $S_1$ represents the share value of a computer company and $S_2$ the share value of a food company, then a natural disaster may influence the share $S_2$ more than the share $S_1$, while the discovery of a new processor may influence $S_1$ and have no effect on $S_2$. Some examples are given below to illustrate applications of the stochastic processes (4.1) and (4.2).

Example 4.1.1. Consider a market consisting of two assets, one riskless asset $S_0(t)$ and one risky asset $S_1(t)$. The evolution of the riskless asset is given by
\[ dS_0(t) = r S_0\,dt, \tag{4.3} \]
where $r$ is the return of the riskless asset (e.g. a bank account, a bond, etc.). If $r$ is constant, then (4.3) is an ordinary differential equation which gives the value of the riskless asset at any time. Solving (4.3),
\[ \int_0^t \frac{dS_0(s)}{S_0(s)} = \int_0^t r\,ds \;\Rightarrow\; \ln S_0(t) - \ln S_0(0) = rt \;\Rightarrow\; \frac{S_0(t)}{S_0(0)} = e^{rt} \;\Rightarrow\; S_0(t) = S_0(0)\,e^{rt}. \]
The evolution of the risky asset is given by the Itô process
\[ dS_1(t) = \mu S_1(t)\,dt + \sigma S_1(t)\,dB_t. \tag{4.4} \]
These equations are also discussed by Mao (2007, p. 302-303) and Øksendal (2003, p. 251).
Briefly, (4.4) says that the return and the fluctuations of the risky asset are proportional to its value, with the fluctuations driven by the increments of a Brownian motion. The coefficient $\sigma$ is called the volatility of the share. We assume that the uncertainty in the share value is introduced by a single Brownian motion $B_t$. The model (4.4) is the geometric Brownian motion: as shown in example 3.2.1, applying Itô's lemma to the function $f(x,t) = \ln x$ with $x = S_1(t)$ leads to the solution
\[ S_1(t) = S_1(0)\,e^{(\mu - \frac{1}{2}\sigma^2)t + \sigma B_t}. \]
This example introduces a typical model in finance, the Black-Scholes model (Davis, 2010, p. 1).

Example 4.1.2. Assume again two assets, one riskless and one risky, denoted $S_0(t)$ and $S_1(t)$ respectively. The evolution of the riskless asset is given, as in the previous example, by $dS_0(t) = rS_0\,dt$, while the risky asset follows the Itô stochastic process
\[ dS_1(t) = \delta\big(S_1(t) - \nu\big)\,dt + \sigma\,dB_t. \]
The risky asset is here an Ornstein-Uhlenbeck process, which is used as a model for real assets (e.g. commodities).

Definition 4.1.3. Another common model is the following. Consider one riskless asset $S_0(t)$ and one risky asset $S_1(t)$. The evolution of the riskless asset is given by $dS_0(t) = rS_0\,dt$; assuming that the return of the riskless asset also depends on time (i.e. the bank rate changes over time), the value of the riskless asset is
\[ S_0(t) = S_0(0)\,e^{\int_0^t r(t')\,dt'}. \tag{4.5} \]
The price of the risky asset is given by the Itô stochastic process
\[ dS_1(t) = \mu S_1(t)\,dt + \sigma S_1(t)^{1-\alpha}\,dB_t, \]
where $0 \le \alpha \le 1$. This model is called the constant elasticity of variance model and was proposed by Cox and Ross in 1976 to account for a phenomenon related to the volatility smile (Cox and Ross, 1976, p. 145-165). In this model the share's volatility equals $\sigma S_1^{-\alpha}$; if $\alpha = 0$ we recover the geometric Brownian motion model. Some important definitions are now given, together with brief interpretations of each.

Definition 4.1.4. A portfolio is an $\mathcal{F}_t$-adapted process of the form
\[ \theta(t,\omega) = \big(\theta_0(t,\omega), \dots, \theta_n(t,\omega)\big) \in \mathbb{R}^{n+1}. \]
The element $\theta_i(t,\omega)$ represents the amount of asset $i$ held by the investor. The fact that the portfolio is adapted means that the investor knows the financial conditions up to time $t$ when deciding how to readjust the portfolio; put simply, the investor is not allowed to look into the future, because the portfolio is adapted to the financial conditions up to time $t$.

Definition 4.1.5. The value of a portfolio $\theta(t,\omega)$ at time $t$ is given by
\[ V^\theta(t,\omega) := \theta(t,\omega)\cdot S(t,\omega) = \sum_{i=0}^n \theta_i(t,\omega)S_i(t,\omega). \tag{4.6} \]
The value of a portfolio is itself a stochastic process adapted to $\mathcal{F}_t$; the relation above states that $V^\theta$ is the total value of the investments held at time $t$.

We can further define the gains process of an asset. Consider a partition of the interval $(0, t)$ and suppose the holding $\theta_i$ of asset $i$ remains constant on the interval $[t_{k-1}, t_k]$, equal to $\theta_i(t_{k-1})$. Over this interval the value of asset $i$ changes by $S_i(t_k) - S_i(t_{k-1})$, so the total profit (positive or negative) of the investor due to this change is $\theta_i(t_{k-1})\big[S_i(t_k) - S_i(t_{k-1})\big]$. Summing over the subintervals and taking the limit $\Delta t \to 0$, we arrive at the integral $\int_0^t \theta_i(s)\,dS_i(s)$, which gives the gains process of
asset $i$. This integral is an Itô stochastic integral of the stochastic process $\theta_i$ with respect to the Itô diffusion $S_i$. To obtain the gains process of the whole portfolio we sum the gains processes of the assets that constitute it:
\[ G(t) = \sum_{i=0}^n \int_0^t \theta_i(s)\,dS_i(s). \]
The above reasoning follows Duffie (2001, p. 84-85) and Schweizer (1992, p. 5), although a fuller explanation of the gains process of an asset, and of a portfolio in general, has been given here. We thus see that stochastic integrals with respect to Brownian motions, or more generally with respect to Itô processes, appear naturally in the field of finance. Another important concept is the self-financing portfolio.

Definition 4.1.6. A portfolio is said to be self-financing if its value is determined exclusively by the gains of the assets which constitute it. This value is
\[ V^\theta(t) = V^\theta(0) + \int_0^t \theta(s)\cdot dS(s) \quad\text{for } t \in [0, T], \]
or in differential form,
\[ dV^\theta(t) = \theta(t)\cdot dS(t). \tag{4.7} \]
According to Mikosch (1998, p. 169), this definition says that the holder of a self-financing portfolio may reinvest only its gains, without adding external funds. Computing the partial derivatives of $V^\theta(s) = \theta(s)\cdot S(s)$,
\[ \frac{\partial V}{\partial S} = \theta, \qquad \frac{\partial V}{\partial \theta} = S, \qquad \frac{\partial^2 V}{\partial S^2} = 0, \]
and applying Itô's lemma gives $dV(s) = \theta\cdot dS + S\cdot d\theta + 0$. Then, by (4.7), for a self-financing portfolio it holds that
\[ \theta\cdot dS = \theta\cdot dS + S\cdot d\theta \;\Rightarrow\; S\cdot d\theta = 0. \tag{4.8} \]
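The gains process and the self-financing identity can be illustrated numerically. The sketch below is an illustration under assumptions not made in the text (a two-asset market as in Example 4.1.1 with arbitrary parameters and a constant buy-and-hold portfolio, for which $V^\theta(t) = V^\theta(0) + G(t)$ holds exactly): it discretizes $G(t) = \sum_i \int_0^t \theta_i\,dS_i$ and checks the identity.

\begin{verbatim}
# Minimal, illustrative sketch: discretized gains process G(t) = sum_i int theta_i dS_i
# for a riskless asset and a GBM share, with a constant (buy-and-hold) portfolio, for
# which the self-financing identity V(t) = V(0) + G(t) holds.
set.seed(1)
r <- 0.03; mu <- 0.05; sig <- 0.2
n <- 1000; Tend <- 1; dt <- Tend / n
t  <- seq(0, Tend, by = dt)
B  <- c(0, cumsum(sqrt(dt) * rnorm(n)))
S0 <- exp(r * t)                                  # riskless asset, S0(0) = 1
S1 <- exp((mu - 0.5 * sig^2) * t + sig * B)       # risky asset,   S1(0) = 1
theta <- c(2, 5)                                  # constant holdings of (S0, S1)

V <- theta[1] * S0 + theta[2] * S1                # portfolio value theta . S
G <- c(0, cumsum(theta[1] * diff(S0) + theta[2] * diff(S1)))  # discretized gains

max(abs(V - (V[1] + G)))                          # ~ 0, i.e. V(t) = V(0) + G(t)
\end{verbatim}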
4.2 Normalizing Assets

According to Tavella (2003, p. 51), the pricing problem can be formulated more conveniently when a normalizing asset is used. A typical example of such a normalization is discounting, defined below.

Example 4.2.1. (Øksendal, 2003, p. 250) Discounted prices. We use the value of the riskless asset as the unit of measurement for normalizing the market, i.e. $S^*_i = S_0^{-1}S_i$ for $i = 1, \dots, n$. The normalized market then has the form
\[ S^* = (1, S^*_1, \dots, S^*_n). \tag{4.9} \]
From (4.5), and since $S^*_0 = 1$, it follows that
\[ S_0^{-1}(t) = e^{-\int_0^t r(s,\omega)\,ds}, \tag{4.10} \]
or, more simply, when the return of the riskless asset is a constant $r$,
\[ S_0^{-1}(t) = e^{-\int_0^t r\,ds} = e^{-[rs]_0^t} = e^{-rt}. \]
$S^*_i(t)$ is the discounted price of the risky asset $i$: if the amount $S^*_i(t)$ were placed in the bank account at time $0$, it would grow to $S_i(t)$ by time $t$, so $S^*_i(t)$ is today's value of the asset price $S_i(t)$.

The following example is given as a comment by Øksendal (2003, p. 251); in this thesis we provide a more detailed explanation of it.

Example 4.2.2. Discounted value process. Suppose the rate of return of the riskless asset is $r(t,\omega)$. Multiplying $V^\theta(t)$ by the discount factor (4.10) gives
\[ \bar V^\theta(t) = V^\theta(t)\,S_0^{-1}(t) = e^{-\int_0^t r(s,\omega)\,ds}\,V^\theta(t). \tag{4.11} \]
Then, if and only if the portfolio is self-financing, one can show that
\[ \bar V^\theta(t) = V^\theta(0) + \sum_{i=1}^n \int_0^t \theta_i(s)\,dS^*_i(s). \]