Random Walk: a nondeterministic (stochastic) process with mean zero.
Examples: the path traced by a molecule as it travels in a liquid or a gas, the fluctuating price of stocks, and the financial status of a gambler.
Econophysics, 4th Year Random Walk
Random Walk
One dimensional discrete case (one-dimensional discrete random walk):
The motion of an object whose position evolves in time nondeterministically, as a stochastic (random) process, is called a random walk.
Let us consider x_i, i = 1, 2, ..., n, to be independent and identically distributed random variables. Suppose that x_i can randomly take the values ±s, where s is the single step size of the random walk, and let Δt be the time of a single step. If the process performs n steps, the total time taken is t = nΔt.
Suppose S_n = x_1 + x_2 + ... + x_n is the sum of the random variables.
Then the first moment of the random walk is
E(x_i) = (Σ x_i)/n = 0,
because x_i can take either +s or −s with equal probability.
The second moment of the random walk is
E(x_i²) = (Σ x_i²)/n = (Σ (±s)²)/n = ns²/n,
so E(x_i²) = s².
The correlation between steps is
E(x_i x_j) = (Σ_{i=1}^n x_i x_j)/n.
Properties of the 1D discrete random walk:
Show that the expectation value of the final position in a random walk is zero, or equivalently, prove that the next step in a random walk cannot be predicted.
Let us consider x_i to be independent and identically distributed random variables which can take the values ±s, s being the step size of the random walk. Let Δt be the time taken to walk one step. Then the final state of the random walk is
S_n = x(nΔt) = x_1 + x_2 + ... + x_n   ...(1)
Now, the expectation value of the final position of the random walk is
E(S_n) = E[x(nΔt)] = E(x_1 + x_2 + ... + x_n)   ...(2)
As the x_i's are independent,
E(x_1 + x_2 + ... + x_n) = E(x_1) + E(x_2) + ... + E(x_n) = Σ_{i=1}^n E(x_i)   ...(3)
Using equations (2) and (3) we obtain
E[x(nΔt)] = Σ_{i=1}^n E(x_i)
But for a random walk E(x_i) = 0 for every i = 1, 2, ..., n, so
E[x(nΔt)] = Σ_{i=1}^n E(x_i) = 0
Hence, the expectation value of the final position of the random walk is zero.
Show that the variance of the random walk process grows linearly with the number of steps n.
Let us consider x_i to be independent and identically distributed random variables which can take the values ±s, s being the step size of the random walk. Let Δt be the time taken to walk one step. Then the final state of the random walk is
S_n = x(nΔt) = x_1 + x_2 + ... + x_n   ...(1)
Now,
E[x(nΔt)²] = E[(x_1 + x_2 + ... + x_n)²]   ...(2)
For a random walk,
E(x_i x_j) = δ_ij s²   ...(3)
where δ_ij = 1 if i = j and 0 otherwise, with i, j = 1, 2, ..., n.
As the x_i are independent (covariance = 0) random variables,
E[(x_1 + x_2 + ... + x_n)²] = E(x_1² + x_2² + x_3² + ... + x_n²)
= E(x_1²) + E(x_2²) + ... + E(x_n²)
= Σ_{i=1}^n E(x_i²)   ...(4)
Equation (4) is valid because the x_i's are independent and the covariance of independent variables is zero, so the cross terms vanish.
Using equations (3) and (4),
Σ_{i=1}^n E(x_i²) = Σ_{i=1}^n s² = ns²   ...(5)
Then from equations (2) and (5) we obtain
E[x(nΔt)²] = ns²
which means the variance of the random walk process grows linearly with the number of steps n.
So the discrete random walk varies symmetrically about the origin but has finite variance, which increases as the number of steps increases.
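The two results above (zero mean, variance ns²) are easy to check by simulation. A minimal sketch in Python; the step size, step count, and trial count are arbitrary choices:

```python
import random

def random_walk_moments(n_steps, s, n_trials, seed=0):
    """Simulate many n-step walks with steps +/-s and return the
    sample mean and sample variance of the final position."""
    rng = random.Random(seed)
    finals = []
    for _ in range(n_trials):
        pos = sum(rng.choice((+s, -s)) for _ in range(n_steps))
        finals.append(pos)
    mean = sum(finals) / n_trials
    var = sum((x - mean) ** 2 for x in finals) / n_trials
    return mean, var

mean, var = random_walk_moments(n_steps=100, s=1.0, n_trials=20000)
print(mean, var)  # mean near 0, variance near n*s^2 = 100
```

Doubling n_steps should roughly double the sample variance, in line with E[x(nΔt)²] = ns².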
Continuous Random Walk (Wiener Process): the continuous limit
A random walk process with a very large number of steps and a very small time step is called a continuous random walk process. The continuous limit of the random walk may be achieved by considering the limits n → ∞ and Δt → 0 such that t = nΔt remains finite; n, t and Δt being the number of steps, the time of the entire walk, and the time step of an individual step.
If x_i, i = 1, 2, ..., n, are independent and identically distributed random variables which can take the values ±s, s being the size of each step, then the variance of the final position x(nΔt) is
E[x²(nΔt)] = ns² = (s²/Δt) t,   since t = nΔt.
Let s² = DΔt, so that D = s²/Δt. Then
E[x²(t)] = Dt   ...(1)
where D is termed the diffusion coefficient, and this linear dependence of the variance of x(t) on t is the characteristic of a diffusive process. This type of stochastic process is called a Wiener process (diffusive random walk).
For n → ∞ and Δt → 0 the stochastic process becomes a Gaussian process, i.e., for n → ∞ and Δt → 0,
random walk → Gaussian walk.
This holds only when n → ∞ and is not generally true in the discrete case when n is finite, since the distribution of S_n is characterized by its probability density function: it is mostly non-Gaussian and assumes the Gaussian shape only asymptotically (symmetrically) with n. The probability density function of the process, P[x(nΔt)] - or equivalently P(S_n) - is a function of n, and P(x_i) is arbitrary.
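The relation D = s²/Δt in equation (1) can be recovered from simulated walks. A minimal sketch; the particular values of s, Δt, and the trial count are arbitrary choices:

```python
import random

def estimate_D(s, dt, n_steps, n_trials, seed=1):
    """Estimate the diffusion coefficient D = E[x^2(t)] / t
    from simulated random walks with step size s and step time dt."""
    rng = random.Random(seed)
    t = n_steps * dt
    mean_sq = 0.0
    for _ in range(n_trials):
        pos = sum(rng.choice((+s, -s)) for _ in range(n_steps))
        mean_sq += pos * pos
    mean_sq /= n_trials
    return mean_sq / t  # should approach s^2 / dt

D_hat = estimate_D(s=0.5, dt=0.25, n_steps=400, n_trials=10000)
print(D_hat)  # theory: s^2/dt = 0.25/0.25 = 1.0
```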
How does the shape of P[x(nΔt)] change with time?
Under the assumption of independence,
P[x(2Δt)] = P(x_1) ⊗ P(x_2),
where ⊗ denotes the convolution.
Figure 1 below shows four different probability density functions P(x):
(i) a delta distribution with P(x) = δ(x + 1)/2 + δ(x − 1)/2,
(ii) a uniform distribution with zero mean and unit standard deviation,
(iii) a Gaussian distribution with zero mean and unit standard deviation, and
(iv) a Lorentzian (or Cauchy) distribution with unit scale factor.
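The convolution P[x(2Δt)] = P(x_1) ⊗ P(x_2) can be checked numerically on a grid. A minimal sketch for the uniform case (grid spacing is an arbitrary choice), showing that the density of the sum of two uniform variables already has a different, triangular, shape:

```python
def convolve(p, q, dx):
    """Discrete approximation of the convolution of two densities
    sampled on a uniform grid with spacing dx."""
    n, m = len(p), len(q)
    out = [0.0] * (n + m - 1)
    for i in range(n):
        for j in range(m):
            out[i + j] += p[i] * q[j] * dx
    return out

# Uniform density on [-0.5, 0.5], sampled on a grid of spacing dx.
dx = 0.01
p = [1.0 for _ in range(101)]  # height 1 => integrates to ~1

p2 = convolve(p, p, dx)   # density of x1 + x2: triangular on (-1, 1)
peak = max(p2)            # triangular peak, height ~1 at the centre
total = sum(p2) * dx      # still integrates to ~1
print(round(peak, 2), round(total, 2))
```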
It is seen that for the delta and uniform distributions the function P(S_n) changes both in scale and in functional form as n increases, while for the Gaussian and Cauchy distributions the functional form is the same for both P(S_n) and P(x). When the functional form of P(S_n) is the same as the functional form of P(x_i), the stochastic process is called stable. The figure below shows the behaviour of P(S_n) for independent and identically distributed variables with n = 1, 2 for the probability density functions of the figure above.
Central Limit Theorem
It states that if
x_i = 1 with probability p, 0 with probability q = 1 − p
are independent random variables, then S_n = x_1 + x_2 + ... + x_n asymptotically follows the normal (Gaussian) distribution; i.e., the standardized sum z = (S_n − np)/√(npq) ~ N(0, 1), with mean zero and variance 1.
Proof:
Here x_i, i = 1, 2, ..., n, are independent random variables distributed identically. Then the moment generating function of x_i is
M_{x_i}(t) = E(e^{t x_i}) = Σ_i e^{t x_i} p(x_i)
= e^{t·1} p + e^{t·0} q
= q + p e^t
Again, the moment generating function of S_n = x_1 + x_2 + ... + x_n is
M_{S_n}(t) = M_{x_1 + x_2 + ... + x_n}(t) = M_{x_1}(t) · M_{x_2}(t) ··· M_{x_n}(t)
[since the x_i's are independent and identically distributed random variables]
M_{S_n}(t) = (q + p e^t)^n   ...(1)
The moment generating function of S_n is exactly the same as that of the binomial distribution. Then, from the principle of equivalence of moment generating functions, we can say that
S_n ~ B(n, p)
Now we can write
E(S_n) = μ = np,   σ_{S_n} = √(npq)
We define a standard variable
z = (S_n − E(S_n))/σ_{S_n} = (S_n − np)/√(npq) = (S_n − μ)/σ
Then the moment generating function of the standard variable is
M_z(t) = E(e^{tz}) = E[e^{t(S_n − μ)/σ}]
= E[e^{S_n (t/σ)} e^{−μt/σ}]
= e^{−μt/σ} M_{S_n}(t/σ)   [since M_{S_n}(t/σ) = E[e^{S_n (t/σ)}]]
As M_{S_n}(t/σ) = (q + p e^{t/σ})^n, we get
M_z(t) = e^{−μt/σ} (q + p e^{t/σ})^n
= e^{−tnp/√(npq)} (q + p e^{t/√(npq)})^n   [since μ = np and σ = √(npq), as S_n ~ B(n, p)]
= [e^{−tp/√(npq)} (q + p e^{t/√(npq)})]^n   ...(2)
Now,
e^{−tp/√(npq)} = 1 − tp/√(npq) + t²p²/(2npq) − ...
and e^{t/√(npq)} = 1 + t/√(npq) + t²/(2npq) + ...
Also, q = 1 − p, so
e^{−tp/√(npq)} (q + p e^{t/√(npq)})
= [1 − tp/√(npq) + t²p²/(2npq) − ...] [1 − p + p(1 + t/√(npq) + t²/(2npq) + ...)]
Neglecting higher-order terms,
= [1 − tp/√(npq) + t²p²/(2npq)] [1 − p + p + tp/√(npq) + t²p/(2npq)]
= 1 − tp/√(npq) + t²p²/(2npq) + tp/√(npq) − t²p²/(npq) + t³p³/(2(npq)^{3/2}) + t²p/(2npq) − t³p²/(2(npq)^{3/2}) + t⁴p³/(4(npq)²)
= 1 + t²/(2nq) − pt²/(nq) + pt²/(2nq) + O(n^{−3/2})
= 1 + (t²/(2nq))(1 − p) + O(n^{−3/2})
= 1 + t²/(2n) + O(n^{−3/2})   ...(3)
where O(n^{−3/2}) represents the terms containing n^{3/2} or higher powers in the denominator.
If n is very large, then O(n^{−3/2}) takes a very small value and can be neglected. So, from equations (2) and (3),
M_z(t) = lim_{n→∞} (1 + t²/(2n))^n = e^{t²/2}
The above equation shows that M_z(t) is equal to that of the normal distribution. So, by the principle of equivalence, we can say that
z ~ N(0, 1)
i.e., (S_n − μ)/σ ~ N(0, 1)
That means S_n ~ N(μ, σ²),
i.e., for a large number of independent random variables the sum takes the Gaussian distribution. Hence proved.
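This de Moivre-Laplace form of the CLT is easy to verify by simulation; a minimal sketch in which the values of n, p, and the trial count are arbitrary choices:

```python
import math
import random

def standardized_binomial_sample(n, p, n_trials, seed=2):
    """Draw S_n = x_1 + ... + x_n with x_i in {0, 1}, P(x_i = 1) = p,
    and return the standardized values z = (S_n - np) / sqrt(npq)."""
    rng = random.Random(seed)
    mu = n * p
    sigma = math.sqrt(n * p * (1 - p))
    zs = []
    for _ in range(n_trials):
        s = sum(1 if rng.random() < p else 0 for _ in range(n))
        zs.append((s - mu) / sigma)
    return zs

zs = standardized_binomial_sample(n=500, p=0.3, n_trials=5000)
mean = sum(zs) / len(zs)
var = sum(z * z for z in zs) / len(zs)
print(mean, var)  # close to 0 and 1, as z ~ N(0, 1)
```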
State and Prove the Central Limit Theorem (Alternative Method)
Statement: The central limit theorem states that if the variable x has any (not necessarily normal) distribution with mean μ and variance σ², then the variate
z = (x̄ − μ)/(σ/√n)
has a Gaussian (normal) distribution as n → ∞.
Proof:
Here x_i, i = 1, 2, ..., n, are independent random variables distributed identically. Then the moment generating function is
M_x(t) = Σ_i e^{t x_i} p(x_i)   ...(1)
Here, the variate corresponding to x̄ is
z = (x̄ − μ)/(σ/√n)
where x̄ = sample mean, μ = population mean, and σ/√n = standard deviation of the sample mean.
Now put σ/√n = h, so that
z = (x̄ − μ)/h,  i.e.,  x̄ = zh + μ.
Then
M_x̄(t) = Σ_i e^{t x̄} p_i = Σ_i e^{t(zh + μ)} p_i = e^{tμ} Σ e^{tzh} p_i
Put th = t′:
M_x̄(t′/h) = e^{μ(t′/h)} Σ e^{z t′} p_i
or, M_x̄(t′/h) = e^{μ(t′/h)} M_z(t′)
or, M_z(t′) = e^{−μ(t′/h)} M_x̄(t′/h)
Replacing t′ by t,
M_z(t) = e^{−μt/h} M_x̄(t/h)
Here M_x̄(t/h) = [M_x(t/(nh))]^n, so
M_z(t) = e^{−μt/h} [M_x(t/(nh))]^n
or, M_z(t) = e^{−μt√n/σ} [M_x(t/(σ√n))]^n
Taking logs on both sides,
log_e M_z(t) = −μt√n/σ + n log_e M_x(t/(σ√n))   ...(2)
We know the moment generating function can be expanded as
M_x(t) = 1 + t μ_1 + (t²/2!) μ_2 + ... + (t^r/r!) μ_r
where μ_1, μ_2, ..., μ_r are the moments. Then equation (2) can be written as
log_e M_z(t) = −μt√n/σ + n log_e [1 + (t/(σ√n)) μ_1 + (t²/(2σ²n)) μ_2 + ...]
= −μt√n/σ + n [ (t/(σ√n)) μ_1 + (t²/(2σ²n)) μ_2 + ... − (1/2)((t/(σ√n)) μ_1 + (t²/(2σ²n)) μ_2 + ...)² + ... ]
[using log(1 + x) = x − x²/2 + x³/3 − ...]
= −μt√n/σ + μ_1 √n t/σ + (1/2) t² μ_2/σ² − (1/2) t² μ_1²/σ² + ...   [since μ = μ_1]
= (1/2) t² μ_2/σ² − (1/2) t² μ_1²/σ²   [neglecting higher terms]
= (1/2)(t²/σ²)(μ_2 − μ_1²)
= (1/2)(t²/σ²) σ²   [since σ² = μ_2 − μ_1²]
= t²/2
Thus log_e M_z(t) = t²/2, and as n → ∞ we have
M_z(t) = e^{t²/2}
This is the moment generating function of the normal distribution. Hence the limiting distribution of the standardized sum s_n = x_1 + x_2 + ... + x_n, defined by
z = (s_n − nμ)/(σ√n) = Σ_{i=1}^n (x_i − μ)/(σ√n),
is the normal distribution as n → ∞.
This proves the central limit theorem.
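The key limit used in both proofs, lim_{n→∞} (1 + t²/(2n))^n = e^{t²/2}, can be checked directly; a minimal sketch with an arbitrary choice of t:

```python
import math

def mgf_approx(t, n):
    """(1 + t^2/(2n))^n, which converges to exp(t^2/2) as n grows."""
    return (1.0 + t * t / (2.0 * n)) ** n

t = 1.5
exact = math.exp(t * t / 2.0)
for n in (10, 100, 10000):
    print(n, mgf_approx(t, n), exact)  # approximation improves with n
```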
Importance of the Central Limit Theorem
1. The mean of the sampling distribution will be equal to the population mean regardless of the sample size, even if the population is not normal.
2. As the sample size increases, the sampling distribution of the mean will approach normality, regardless of the shape of the population distribution.
3. This relationship between the shape of the population distribution and the shape of the sampling distribution of the mean is what the central limit theorem describes.
4. The CLT is perhaps the most important theorem in all of statistical inference. It asserts that the sampling distribution of the mean approaches normality as the sample size increases.
This is why the central limit theorem is so powerful in statistics.
Central limit theorem in the random walk process
Suppose that a random variable s_n is composed of many parts x_i, i.e.,
s_n = Σ_{i=1}^n x_i   ...(1)
where the x_i are independent, such that
E(x_i) = 0 and E(x_i²) = s_i²
Then
σ_n² = E(s_n²) = Σ_{i=1}^n s_i²
Now we define a truncated random variable such that, for every ε > 0,
u_i = x_i when |x_i| ≤ εσ_n, and u_i = 0 otherwise.
Then, provided σ_n → ∞, the Lindeberg condition holds true, i.e.,
(1/σ_n²) Σ_{i=1}^n E(u_i²) → 1
Under this condition the central limit theorem states that
S̃ = s_n/σ_n = (x_1 + ... + x_n)/σ_n,
where σ_n is the standard deviation of the sum, follows the normal distribution N(0, 1) with probability density function
p(S̃) = (1/√(2π)) e^{−S̃²/2}
This can also be written as
p(s_n) = (1/(√(2π) σ_n)) e^{−s_n²/(2σ_n²)}   [since S̃ = s_n/σ_n]
i.e., s_n follows a Gaussian distribution with mean zero and standard deviation σ_n.
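A minimal simulation sketch of this version of the CLT, with independent but not identically distributed steps; the variance pattern s_i = 1 + i/n is an arbitrary choice that satisfies the Lindeberg condition:

```python
import math
import random

def normalized_sum_sample(n, n_trials, seed=3):
    """Sum independent steps x_i = +/- s_i (equal probability) with
    varying s_i, and return s_n / sigma_n for each trial."""
    rng = random.Random(seed)
    s = [1.0 + i / n for i in range(1, n + 1)]     # step sizes s_i
    sigma_n = math.sqrt(sum(si * si for si in s))  # sigma_n^2 = sum s_i^2
    out = []
    for _ in range(n_trials):
        total = sum(rng.choice((+si, -si)) for si in s)
        out.append(total / sigma_n)
    return out

vals = normalized_sum_sample(n=200, n_trials=5000)
mean = sum(vals) / len(vals)
var = sum(v * v for v in vals) / len(vals)
print(mean, var)  # close to 0 and 1, as the normalized sum is ~N(0, 1)
```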
Speed of Convergence
For independent random variables with finite variance, the central limit theorem entails that s_n = Σ_{i=1}^n x_i converges to a stochastic process with probability density function
p(s_n) = (1/(√(2π) σ_n)) e^{−s_n²/(2σ_n²)}
where σ_n² = E(s_n²), with E(x_i) = 0 and E(s_n) = 0.
The natural question that then arises is how fast this convergence is, i.e., how steep is the tail of the distribution of S_n?
Consider S_n = Σ_i x_i to be a sum of independent and identically distributed random variables x_i, with
S̃_n = S_n/σ_n = (x_1 + x_2 + ... + x_n)/(σ√n) = (x̃_1 + x̃_2 + ... + x̃_n)/σ
or, S̃_n = Σ_i x̃_i/σ,  with x̃_i = x_i/√n and σ² = σ_n²/n.
Then the scaled distribution function is given by
f_n(S) = ∫_{−∞}^{S} p̃(S̃_n) dS̃_n   ...(2)   [σ_n² = E(S_n²)]
where p̃(S̃_n) = √n p(S̃_n) and
p(S̃_n) = (1/√(2π)) e^{−S̃_n²/2}   ...(3)
According to Gnedenko and Kolmogorov, the scaled distribution function f_n(s) differs from the asymptotic scaled normal distribution function Φ(s) by an amount
|f_n(s) − Φ(s)| ≤ (e^{−s²/2}/√(2π)) [Q_1(s)/n^{1/2} + Q_2(s)/n^{2/2} + ... + Q_j(s)/n^{j/2}]   ...(4)
where the Q_j(s) are polynomials in s whose coefficients depend on the first (j + 2) moments of the random variables x_i. The larger the value of s, and hence the smaller the difference on the LHS of equation (4), the faster the scaled distribution converges to the corresponding normal distribution, and vice versa.
So equation (4) represents the rate of convergence of the scaled distribution function to the normal (Gaussian) distribution. This is known as Chebyshev's solution to the problem of the rate of convergence, with s being the maximum epoch (time-series data point), i.e., s = (s_n)_max.
Berry-Esseen Theorem
It provides a simpler inequality controlling the absolute difference between the scaled distribution function of the given random process and the asymptotic scaled normal distribution function; i.e., it gives the rate of convergence of the scaled distribution function to the standard normal distribution function in a more easily computable form.
Berry-Esseen Theorem 1:
Let the x_i be independent random variables with a common distribution F such that
E(x_i) = 0;  E(x_i²) = σ² > 0   ...(1)
and E(|x_i|³) = δ < ∞
and let f_n stand for the distribution of the normalized sum
(x_1 + x_2 + ... + x_n)/(σ√n) = Σ_i x_i/(σ√n)
Then for all s and n,
|f_n(s) − Φ(s)| ≤ 3δ/(σ³√n)   ...(2)
[as n increases the difference decreases; when n → ∞, the difference is 0]
where s = max(s_n) and Φ(s) is the asymptotic scaled normal distribution function.
The striking feature of inequality (2) is that it depends only on the first three moments. The statement can be verified with the use of the smoothing inequality.
Inequality (2) tells us that the speed of convergence of the distribution function of s̃_n to its asymptotic Gaussian shape is essentially controlled by the third moment of the absolute value of x_i. It is also seen that the real (observed) distribution function converges to the ideal normal as n → ∞.
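For ±1 coin-flip steps (σ = 1, δ = E|x_i|³ = 1), the bound (2) can be checked against the exact binomial distribution of the sum. A minimal sketch; the particular values of n are arbitrary choices:

```python
import math

def phi(x):
    """Standard normal CDF."""
    return 0.5 * (1.0 + math.erf(x / math.sqrt(2.0)))

def max_cdf_gap(n):
    """sup_s |F_n(s) - Phi(s)| for S_n = sum of n steps of +/-1,
    evaluated on both sides of each jump of the step-function CDF."""
    # P(k positive steps) = C(n, k)/2^n; S_n = 2k - n; s = S_n/sqrt(n)
    gap = 0.0
    cdf = 0.0
    for k in range(n + 1):
        p = math.comb(n, k) / 2.0 ** n
        s = (2 * k - n) / math.sqrt(n)
        gap = max(gap, abs(cdf - phi(s)))  # just before the jump
        cdf += p
        gap = max(gap, abs(cdf - phi(s)))  # just after the jump
    return gap

for n in (16, 64, 256):
    bound = 3.0 / math.sqrt(n)  # 3*delta/(sigma^3 sqrt(n)), delta = sigma = 1
    print(n, max_cdf_gap(n), bound)  # gap stays below the bound
```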
Berry-Esseen Theorem 2:
Let the x_i be any random variables such that
E(x_i) = 0;  E(x_i²) = σ_i²;  E(|x_i|³) = r_i
and define
s_n² = σ_1² + σ_2² + ... + σ_n² = σ²
and
δ_n = r_1 + r_2 + ... + r_n
We use f_n to denote the distribution of the normalized sum
(x_1 + x_2 + ... + x_n)/σ = Σ_i x_i/σ
Then for all s and n,
|f_n(s) − Φ(s)| ≤ 6δ_n/s_n³ = 6 Σ_i r_i / (Σ_i σ_i²)^{3/2}
This also shows that the rate of convergence of f_n(s) to the scaled normalized distribution Φ(s) depends upon the first three moments. This is a generalization to random variables that might not be identically distributed.
Attractor
A point or set of points in any defined functional space, such as phase space, is said to be an attractor if processes with different initial conditions converge to that point as the process evolves with time.
Mathematically, an attractor is a subset A of the phase space characterized by the following three conditions:
1. A is forward invariant under f(a, t): if a is an element of A, then so is a(t) = f(a, t) for all t > 0.
2. There exists a neighbourhood of A, called the basin of attraction B(A), which consists of all points b that enter A in the limit t → ∞.
3. There is no proper (non-empty) subset of A having the first two properties.
[Phase space: the 6N-dimensional space spanned by positions and momenta.]
There are different types of attractors and different basins pertaining to different real processes. The Gaussian distribution plays the role of an attractor for the probability density functions of a broad class of random statistical processes.
Basin of Attraction
An attractor's basin of attraction is the region of phase space, over which iterations are defined, such that any point in that region will eventually be iterated into the attractor. For example, the Gaussian probability density function is an attractor in the functional space of probability density functions for all the probability density functions that fulfil the requirements of the central limit theorem. The set of such probability density functions constitutes the basin of attraction of the Gaussian probability density function.
The functional form of p(s_n) changes with n and, if the hypotheses of the CLT are verified, assumes the Gaussian functional form for asymptotically large values of n. As n increases, the probability density function p(s_n) comes progressively closer to the Gaussian attractor P_G(s). The number of steps required to observe the convergence of P(S_n) to P_G(s) provides an indication of the speed of convergence of the probability density function and the corresponding process, i.e.,
P(S_n) → P_G(s) as n → ∞.
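Convergence toward the Gaussian attractor can be watched directly by convolving a density with itself repeatedly and standardizing the result. A minimal sketch for the uniform density; the grid spacing and the choice of starting density are arbitrary:

```python
import math

def self_convolve(p, dx, n):
    """Density of the sum of n iid variables with density samples p (grid dx)."""
    out = p[:]
    for _ in range(n - 1):
        m = [0.0] * (len(out) + len(p) - 1)
        for i, a in enumerate(out):
            for j, b in enumerate(p):
                m[i + j] += a * b * dx
        out = m
    return out

def sup_distance_to_gaussian(n, dx=0.05):
    """Sup distance between the standardized density of the sum of n
    uniform(-0.5, 0.5) variables and the N(0, 1) density."""
    half = int(round(0.5 / dx))
    p = [1.0] * (2 * half + 1)
    total = sum(p) * dx
    p = [v / total for v in p]              # normalize to integrate to 1
    pn = self_convolve(p, dx, n)
    sigma = math.sqrt(n / 12.0)             # Var(sum) = n * 1/12
    mid = (len(pn) - 1) / 2.0
    dist = 0.0
    for k, val in enumerate(pn):
        z = (k - mid) * dx / sigma
        gauss = math.exp(-z * z / 2.0) / math.sqrt(2.0 * math.pi)
        dist = max(dist, abs(val * sigma - gauss))  # density of S_n/sigma
    return dist

for n in (1, 2, 4, 8):
    print(n, sup_distance_to_gaussian(n))  # distance shrinks as n grows
```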
Numericals
Q. If f(x) has probability density kx², 0 < x < 1, determine k and find the probability that 1/3 < x < 1/2.
Solution:
Since f(x) has probability density kx², we have
f(x) = kx²; 0 < x < 1
The total probability is 1:
∫₀¹ f(x) dx = 1
i.e., ∫₀¹ kx² dx = 1
⇒ k [x³/3]₀¹ = 1
⇒ k = 3
Then the probability density function is f(x) = 3x².
Again,
P(1/3 < x < 1/2) = ∫_{1/3}^{1/2} f(x) dx = ∫_{1/3}^{1/2} 3x² dx
= 3 [x³/3]_{1/3}^{1/2}
= 1/8 − 1/27
= 19/216
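A quick numerical check of both results (the normalization and the probability), using a simple midpoint Riemann sum:

```python
def integrate(f, a, b, n=100000):
    """Midpoint Riemann sum of f over [a, b]."""
    h = (b - a) / n
    return sum(f(a + (k + 0.5) * h) for k in range(n)) * h

f = lambda x: 3 * x ** 2
print(integrate(f, 0, 1))        # ~1 (total probability)
print(integrate(f, 1/3, 1/2))    # ~19/216 = 0.08796...
```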
Numericals
Q. If a function f(x) of x is defined as follows:
f(x) = 0 for x < 2
f(x) = (1/18)(3 + 2x) for 2 ≤ x ≤ 4
f(x) = 0 for x > 4
find the probability within the interval 2 ≤ x ≤ 3 and show that f is a density function.
Solution:
∫_{−∞}^{∞} f(x) dx = ∫_{−∞}^{2} f(x) dx + ∫_{2}^{4} f(x) dx + ∫_{4}^{∞} f(x) dx
= ∫_{−∞}^{2} 0 dx + ∫_{2}^{4} (3 + 2x)/18 dx + ∫_{4}^{∞} 0 dx
= (1/18)[3(4 − 2) + (16 − 4)]
= 1
Now, the probability within (2 ≤ x ≤ 3) is
p = ∫_{2}^{3} f(x) dx = ∫_{2}^{3} (3 + 2x)/18 dx = 4/9
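The same midpoint-rule check works here, confirming both that f integrates to 1 and that the interval probability is 4/9:

```python
def integrate(f, a, b, n=100000):
    """Midpoint Riemann sum of f over [a, b]."""
    h = (b - a) / n
    return sum(f(a + (k + 0.5) * h) for k in range(n)) * h

f = lambda x: (3 + 2 * x) / 18 if 2 <= x <= 4 else 0.0
print(integrate(f, 2, 4))  # ~1 (shows f is a density)
print(integrate(f, 2, 3))  # ~4/9 = 0.4444...
```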
Numericals
Q. A bomber plane carrying three bombs flies directly above a railroad track. A bomb falling within 40 feet of the track will damage it. For a certain bomb sight, the density of the points of impact of a bomb is
f(x) = (100 + x)/10000 for −100 ≤ x ≤ 0
f(x) = (100 − x)/10000 for 0 ≤ x ≤ 100
f(x) = 0 elsewhere
where x represents the vertical deviation from the aiming point, which is the track in this case. If all three bombs are used, what is the probability that the track will be damaged?
Solution:
Since a bomb damages the track when it falls within 40 feet of it on either side,
P(−40 < x < 40) = ∫_{−40}^{40} f(x) dx
= ∫_{−40}^{0} f(x) dx + ∫_{0}^{40} f(x) dx
= ∫_{−40}^{0} (100 + x)/10000 dx + ∫_{0}^{40} (100 − x)/10000 dx
= (1/10000) ( [100x + x²/2]_{−40}^{0} + [100x − x²/2]_{0}^{40} )
= 16/25
So the probability that the track is not damaged by a single bomb is 1 − 16/25 = 9/25. Hence, with three independent bombs, the probability that the track is damaged is 1 − (9/25)³ = 14896/15625 ≈ 0.953.
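The single-bomb probability 16/25 can be checked numerically with the same midpoint rule:

```python
def integrate(f, a, b, n=100000):
    """Midpoint Riemann sum of f over [a, b]."""
    h = (b - a) / n
    return sum(f(a + (k + 0.5) * h) for k in range(n)) * h

def f(x):
    """Triangular impact density from the problem statement."""
    if -100 <= x <= 0:
        return (100 + x) / 10000
    if 0 < x <= 100:
        return (100 - x) / 10000
    return 0.0

p_hit = integrate(f, -40, 40)
print(p_hit)  # ~16/25 = 0.64
```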