Part 13 - Brownian Motion and Gaussian Processes I

Introduction

In this part, we will introduce Brownian motion and Gaussian processes, which are fundamental concepts in stochastic processes and have wide applications in various fields such as physics, finance, and machine learning.

So far, we have looked at discrete-time discrete state space processes (Discrete Markov chains and Branching processes). Then we moved to discrete-time continuous state space processes (in connection to MCMC). Most recently, we studied continuous-time discrete state space processes (Poisson processes and more generally Continuous-time Markov chains).

Now, we will start looking at continuous-time continuous state space processes.

Brownian Motion

Intuition (Brownian Motion)

In a gas, atoms bump into each other and change course randomly. Over time, how does a single atom move, on average? If f(x,t)f(x, t) represents the probability density for the position xx of an atom at time tt moving along a line, Albert Einstein showd that [^1],

ft=122fx2.\frac{\partial f}{\partial t} = \frac{1}{2} \frac{\partial^2 f}{\partial x^2}.

Which has the solution,

f(x,t)=12πtex22t.f(x, t) = \frac{1}{\sqrt{2 \pi t}} e^{-\frac{x^2}{2t}}.

This means that the position of the atom at time tt is normally distributed with mean 00 and variance tt, i.e., XtN(0,t)X_t \sim \mathcal{N}(0, t).

These random movements are called Brownian motion (or Wiener process).

Definition 1 (Brownian Motion)

Brownian motion is a continuous-time stochastic process {Bt}t0\{B_t\}_{t \geq 0} with the following properties:

  1. B0=0B_0 = 0.

  2. For t>0t >0, BtN(0,t)B_t \sim \mathcal{N}(0, t) (i.e., normally distributed with mean 00 and variance tt).

  3. For s,t>0s, t > 0, Bt+sBsN(0,t)B_{t + s} - B_s \sim \mathcal{N}(0, t) (i.e., the increments are stationary).

  4. For 0q<rs<t0 \leq q < r \leq s < t, BtBsB_t - B_s is independent of BrBqB_r - B_q (i.e., the increments are independent).

  5. The function tBtt \mapsto B_t is continuous with probability 11 (almost surely).

Intuition (Simulation of Brownian Motion)

Given time points 0t0<t1<t2<<tn0 \eqqcolon t_0 < t_1 < t_2 < \ldots < t_n, we write for i>0i > 0,

Bti=Bti1+(BtiBti1)=Bti1+Zi,B_{t_i} = B_{t_{i - 1}} + (B_{t_i} - B_{t_{i - 1}}) = B_{t_{i - 1}} + Z_i,

where ZiN(0,titi1)Z_i \sim \mathcal{N}(0, t_i - t_{i - 1}).

Thus, we get for independent Z1,Z2,,ZnZ_1, Z_2, \ldots, Z_n,

Btn=i=1nZi.B_{t_n} = \sum_{i = 1}^{n} Z_i.

This gives a simple way to simulate Brownian motion at discrete time points. A good way to simulate the path tBtt \mapsto B_t on t[0,a]t \in [0, a] is to set ti=aint_i = \frac{a_i}{n}, simulate independently,

ZiN(0,an),Z_i \sim \mathcal{N}\left(0, \frac{a}{n}\right),

and compute,

Bti=j=1iZj,B_{t_i} = \sum_{j = 1}^{i} Z_j,
Note

We could also write Zi=anYiZ_i = \sqrt{\frac{a}{n}} Y_i where YiN(0,1)Y_i \sim \mathcal{N}(0, 1) are standard normal random variables.

Intuition (Fractal Nature of Brownian Motion)

What if we have a Brownian motion path simulated as above, and want to plot it at twice the detail (i.e., at 2n2n points instead of nn)? We have,

Bti+a2nBti=Zi0N(0,a2n)Bti+1Bti+a2n=Zi1N(0,a2n).\begin{align*} B_{t_{i + \frac{a}{2n}} - B_{t_i}} & = Z_{i0} \sim \mathcal{N}\left(0, \frac{a}{2n}\right) \newline B_{t_{i + 1} - B_{t_{i + \frac{a}{2n}}}} & = Z_{i1} \sim \mathcal{N}\left(0, \frac{a}{2n}\right). \newline \end{align*}

Reformulating using Zi=Bti+1BtiZ_i = B_{t_{i + 1}} - B_{t_i}, we have,

Zi0N(0,a2n),ZiZi0N(Zi0,a2n).Z_{i0} \sim \mathcal{N}\left(0, \frac{a}{2n}\right), \quad Z_i \mid Z_{i0} \sim \mathcal{N}\left(Z_{i0}, \frac{a}{2n}\right).

Since we have Normal-Normal conjugacy,

Zi0ZiN(12Zi,a4n).Z_{i0} \mid Z_i \sim \mathcal{N}\left(\frac{1}{2} Z_i, \frac{a}{4n}\right).

So we get Bti+a2nB_{t_{i + \frac{a}{2n}}} by simulating Zi0Z_{i0} given ZiZ_i as above and adding the result to BtiB_{t_i}.

Intuition (Computations with Brownian Motion)

To compute probabilities for Brownian motions, we generally use the properties of the Normal distribution and the independent increments.

Example 1 (Probability Computation with Brownian Motion)

Show that B1+B3+2B7N(0,50)B_1 + B_3 + 2 B_7 \sim \mathcal{N}(0, 50).

Solution

We can write,

B1+B3+2B7=B1+(B3B1)+B1+2(B7B3)+2B3=2B1+(B3B1)+2(B7B3)+2(B3B1)+2B1=4B1+2(B3B1)+2(B7B3).\begin{align*} B_1 + B_3 + 2 B_7 & = B_1 + (B_3 - B_1) + B_1 + 2 (B_7 - B_3) + 2 B_3 \newline & = 2 B_1 + (B_3 - B_1) + 2 (B_7 - B_3) + 2(B_3 - B_1) + 2 B_1 \newline & = 4 B_1 + 2 (B_3 - B_1) + 2 (B_7 - B_3). \end{align*}

Looking at each term separately, we have,

E[4B1]=0Var(4B1)=42Var(B1)=42\begin{align*} \mathbb{E}[4 B_1] & = 0 \newline \mathrm{Var}(4 B_1) & = 4^2 \mathrm{Var}(B_1) = 4^2 \end{align*}

and,

E[2(B3B1)]=0Var(2(B3B1))=22Var(B3B1)=222\begin{align*} \mathbb{E}[2 (B_3 - B_1)] & = 0 \newline \mathrm{Var}(2 (B_3 - B_1)) & = 2^2 \mathrm{Var}(B_3 - B_1) = 2^2 \cdot 2 \end{align*}

and,

E[2(B7B3)]=0Var(2(B7B3))=22Var(B7B3)=224\begin{align*} \mathbb{E}[2 (B_7 - B_3)] & = 0 \newline \mathrm{Var}(2 (B_7 - B_3)) & = 2^2 \mathrm{Var}(B_7 - B_3) = 2^2 \cdot 4 \end{align*}

Since the increments are independent, we have,

E[B1+B3+2B7]=0Var(B1+B3+2B7)=42+222+224=50.\begin{align*} \mathbb{E}[B_1 + B_3 + 2 B_7] & = 0 \newline \mathrm{Var}(B_1 + B_3 + 2 B_7) & = 4^2 + 2^2 \cdot 2 + 2^2 \cdot 4 = 50. \newline \end{align*}

Thus, B1+B3+2B7N(0,50)B_1 + B_3 + 2 B_7 \sim \mathcal{N}(0, 50). _\blacksquare

Example 2 (Conditional Distribution with Brownian Motion)

Show that P(B2>0B1=1)=0.8413P(B_2 > 0 \mid B_1 = 1) = 0.8413.

Solution

We can rewrite the conditional probability as,

P(B2B1>01B1=1).P(B_2 - B_1 > 0 - 1 \mid B_1 = 1).

Since we have independent increments, B2B1B_2 - B_1 is independent of B1B_1.

P(B2B1>1)P(B_2 - B_1 > -1)

Further, B2B1N(0,1)B_2 - B_1 \sim \mathcal{N}(0, 1). Thus, we have,

P(B2B1>1)=P(Z>1)=0.8413,P(B_2 - B_1 > -1) = P\left(Z > -1\right) = 0.8413,

where ZN(0,1)Z \sim \mathcal{N}(0, 1). _\blacksquare (we can compute this with 1 - pnorm(-1, 0, 1) in R).

Example 3 (Covariance Computation with Brownian Motion)

Show that Cov(Bs,Bt)=min(s,t)\mathrm{Cov}(B_s, B_t) = \min(s, t).

Solution

Without loss of generality, assume that sts \leq t. We can write,

Cov(Bs,Bt)=Cov(Bs,BtBs+Bs)=Cov(Bs,BtBs)+Cov(Bs,Bs)=0+Var(Bs)=s\begin{align*} \mathrm{Cov}(B_s, B_t) & = \mathrm{Cov}(B_s, B_t - B_s + B_s) \newline & = \mathrm{Cov}(B_s, B_t - B_s) + \mathrm{Cov}(B_s, B_s) \newline & = 0 + \mathrm{Var}(B_s) \newline & = s \newline \end{align*}

Conversely, if tst \leq s, we would get Cov(Bs,Bt)=t\mathrm{Cov}(B_s, B_t) = t. Thus, we have,

Cov(Bs,Bt)=min(s,t) .\mathrm{Cov}(B_s, B_t) = \min(s, t) \ _\blacksquare .
Intuition (Brownian Motion as Limit of Random Walks)

A random walk is a discrete-time Markov chain S0,S1,S2,S_0, S_1, S_2, \ldots where S0=0S_0 = 0 and,

Sn=Y1+Y2++Yn,S_n = Y_1 + Y_2 + \ldots + Y_n,

and Y1,Y2,Y_1, Y_2, \ldots are i.i.d. random variables. Assume E[Yi]=0\mathbb{E}[Y_i] = 0.

Further, if we assume Var(Yi)=1\mathrm{Var}(Y_i) = 1, we get Var(Sn)=n\mathrm{Var}(S_n) = n.

Interpolating between the values SnS_n we can make this into a continuous-time process StS_t where Var(St)t\mathrm{Var}(S_t) \approx t.

We may scale with an s>0s > 0 to get processes St(s)=SstsS_t^{(s)} = \frac{S_{st}}{\sqrt{s}} where we get limsVar(St(s))=t\lim_{s \to \infty} \mathrm{Var}(S_t^{(s)}) = t.

It turns out that the processes St(s)S_t^{(s)} when ss \to \infty are exactly Brownian motion, no matter what type of YiY_i we start with.

This is the Donsker’s invariance principle, which is a generalization of the central limit theorem to stochastic processes [^2].

Gaussian Processes

Recall (Multivariate Normal Distribution)

A set of random variables X1,X2,,XkX_1, X_2, \ldots, X_k has a multivariate normal distribution if, for all real a1,a2,,aka_1, a_2, \ldots, a_k, a1X1+a2X2++akXka_1 X_1 + a_2 X_2 + \ldots + a_k X_k is normally distributed.

It is completely determined by the expectation vector μ=(E[X1],E[X2],,E[Xk])\mu = (\mathbb{E}[X_1], \mathbb{E}[X_2], \ldots, \mathbb{E}[X_k]) and the (k×k)(k \times k) covariance matrix Σ\Sigma where Σij=Cov(Xi,Xj)\Sigma_{ij} = \mathrm{Cov}(X_i, X_j).

The joint density function on the vector x=(x1,x2,,xk)x = (x_1, x_2, \ldots, x_k) is given by,

π(x)=12πΣ1/2exp(12(xμ)TΣ1(xμ)).\pi(x) = \frac{1}{|2 \pi \Sigma|^{1/2}} \exp\left(-\frac{1}{2} (x - \mu)^T \Sigma^{-1} (x - \mu)\right).

where 2πΣ|2 \pi \Sigma| is the determinant of the matrix 2πΣ2 \pi \Sigma.

Note

All marginal distributions and all conditional distributions are also multivariate normal.

Definition 2 (Gaussian Process)

A Gaussian process is a continuous-time stochastic process {Xt}t0\{X_t\}_{t \geq 0} with the property that for all n1n \geq 1 and 0t1<t2<<tn0 \leq t_1 < t_2 < \ldots < t_n, Xt1,Xt2,,XtnX_{t_1}, X_{t_2}, \ldots, X_{t_n} have a multivariate normal distribution.

Thus, a Gaussian process is completely determined by its mean function E[Xt]\mathbb{E}[X_t] and its covariance function Cov(Xs,Xt)\mathrm{Cov}(X_s, X_t).

Intuition (Brownian Motion as a Gaussian Process)

Brownian motion is a Gaussian process, as we can show that any a1Bt1+a2Bt2++anBtna_1 B_{t_1} + a_2 B_{t_2} + \ldots + a_n B_{t_n} is normally distributed.

A Gaussian process {Xt}t0\{X_t\}_{t \geq 0} is a Brownian motion if and only if,

  1. X0=0X_0 = 0.

  2. E[Xt]=0\mathbb{E}[X_t] = 0 for all tt.

  3. Cov(Xs,Xt)=min(s,t)\mathrm{Cov}(X_s, X_t) = \min(s, t) for all s,ts, t.

  4. The function tXtt \mapsto X_t is continuous with probability 11 (almost surely).

Intuition (Transformations of Brownian Motion)

The following transformations of Brownian motion also yield Brownian motion:

  • {Bt}t0\{-B_t\}_{t \geq 0}, negating the process yields another (reflected) Brownian motion.
  • {Bt+sBs}t0\{B_{t + s} - B_s\}_{t \geq 0} for any fixed s0s \geq 0, shifting the time origin yields another Brownian motion.
  • {1aBat}t0\{\frac{1}{\sqrt{a}} B_{a t}\}_{t \geq 0} for any fixed a>0a > 0, scaling time and space yields another Brownian motion.
  • The process {Xt}t0\{X_t\}_{t \geq 0} where X0=0X_0 = 0 and Xt=tB1tX_t = t B_{\frac{1}{t}} for t>0t > 0, time inversion yields another Brownian motion.
Intuition (Stopping Times)

We saw above that, for any fixed tt (Bt+sBs)t0(B_{t + s} - B_s)_{t \geq 0} is a Brownian motion. Does thius phenomenon also hold if we start the chain anew from TT when TT is random? It depends.

If TT is the largest value less than 1 where BT=0B_T = 0, then (BT+sBT)s0(B_{T + s} - B_T)_{s \geq 0} is not a Brownian motion.

If TT is the smallest value where BT=aB_T = a for some constant aa, then (BT+sBT)s0(B_{T + s} - B_T)_{s \geq 0} is a Brownian motion. The reason is that the event T=tT = t can be determined based on BrB_r where 0rt0 \leq r \leq t.

Random TT’s that have this property are called stopping times. For these BT+sBTB_{T + s} - B_T is a Brownian motion.

Intuition (The Distribution of the First Hitting Time)

Given that a0a \neq 0, what is the distribution of the first hitting time Ta=min{t:Bt=a}T_a = \min\{t : B_t = a\}?

We will prove that,

1TaGamma(12,a22).\frac{1}{T_a} \sim \mathrm{Gamma}\left(\frac{1}{2}, \frac{a^2}{2}\right).

Assuming that a>0a > 0 and using that TaT_a is a stopping time we get for any t>0t > 0 that P(B1t>aTa<1t)=P(B1tTa>0)=12P(B_{\frac{1}{t}} > a \mid T_a < \frac{1}{t}) = P(B_{\frac{1}{t} - T_a} > 0) = \frac{1}{2}.

We also have,

P(B1t>aTa<1t)=P(B1t>a,Ta<1t)P(Ta<1t)=P(B1t>a)P(Ta<1t).\begin{align*} P(B_{\frac{1}{t}} > a \mid T_a < \frac{1}{t}) & = \frac{P(B_{\frac{1}{t}} > a, T_a < \frac{1}{t})}{P(T_a < \frac{1}{t})} \newline & = \frac{P(B_{\frac{1}{t}} > a)}{P(T_a < \frac{1}{t})}. \end{align*}

Further, it follows that P(Ta<1t)=2P(B1t>a)P(T_a < \frac{1}{t}) = 2 P(B_{\frac{1}{t}} > a) and thus,

P(1Tat)=2P(B1t>a)1=2P(B1at)1\begin{align*} P\left(\frac{1}{T_a} \leq t\right) & = 2 P\left(B_{\frac{1}{t}} > a\right) -1 \newline & = 2 P(B_1 \leq a \sqrt{t}) - 1 \newline \end{align*}

Taking the derivative with respect to tt we get the Gamma density,

π1/Ta(t)=212πexp(12(at)2)a2t1/2.\pi_{1/T_a}(t) = 2 \frac{1}{\sqrt{2 \pi}} \exp\left(-\frac{1}{2} \left(a \sqrt{t}\right)^2\right) \frac{a}{2} t^{-1/2}.
Intuition (Maximum of Brownian Motion)

We can define Mtmax0stBsM_t \coloneqq \max_{0 \leq s \leq t} B_s.

We may compute for a>0a > 0 (using the results above),

P(Mt>a)=P(Ta<t)=2P(Bt>a)=P(Bt>a).\begin{align*} P(M_t > a) & = P(T_a < t) \newline & = 2 P(B_t > a) \newline & = P(|B_t| > a). \end{align*}

Thus, MtM_t has the same d istribution as Bt|B_t|, i.e., the absolute value of BtB_t.

Example 4

What is the probability that M3>5M_3 > 5?

Solution

We have,

P(M3>5)=P(B3>5)=2P(B3>5)=2P(Z>53)=0.046.\begin{align*} P(M_3 > 5) & = P(|B_3| > 5) \newline & = 2 P(B_3 > 5) \newline & = 2 P\left(Z > \frac{5}{\sqrt{3}}\right) \newline & = 0.046. \end{align*}

where ZN(0,1)Z \sim \mathcal{N}(0, 1). _\blacksquare (we can compute this with 2 * (1 - pnorm(5 / sqrt(3), 0, 1)) in R).

Example 5

Find tt such that P(Mt4)=0.9P(M_t \leq 4) = 0.9.

Solution

We have,

P(Mt4)=P(Bt4)=12P(Bt>4)=12P(Z>4t).\begin{align*} P(M_t \leq 4) & = P(|B_t| \leq 4) \newline & = 1 - 2 P(B_t > 4) \newline & = 1 - 2 P\left(Z > \frac{4}{\sqrt{t}}\right). \newline \end{align*}

Setting this equal to 0.90.9 we get,

P(Z>4t)=0.05.P\left(Z > \frac{4}{\sqrt{t}}\right) = 0.05.

Looking up in the standard normal table (or using qnorm(0.95, 0, 1) in R) we get 4t=1.645\frac{4}{\sqrt{t}} = 1.645 and thus t=5.92t = 5.92. _\blacksquare