Joe Doob & Change of measures on path-space

SDE
markov
Published

14 05 2024

Modified

14 05 2024

Joseph Doob (1910 – 2004)

Consider a continuous time Markov process Xt on the time interval [0,T] and with value in the state space X. This defines a probability P on the set of X-valued paths. Now, it is often the case that one has to consider a perturbed probability distribution Q defined as

(1)dQdP(x[0,T])=1Zexp[g(XT)]

for a (typically unknown) normalization constant Z and some function g:XR. For example, collecting a noisy observation yTF(XT)+(noise) at time T, the distribution Q defined with the log-likelihood function g(x)=logP(yTXT=x) describes the dynamics of the Markov process Xt conditioned on the observation yT; we will use this interpretation in the following since this is the most common use case and gives the most intuitive interpretation. Doob h-transforms are a powerful tool to describe the dynamics of the conditioned process.

For convenience, let us use the notation Ex[]E[xt=x]. For a test function φ:XR and a time increment δ>0, we have

(2)E[φ(xt+δ)|xt,yT]=Ext[φ(xt+δ)exp(g(xT))]/Ext[exp(g(xT))]=Ext[φ(xt+δ)h(t+δ,xt+δ)]h(t,x).

We have introduced the important function h:[0,T]×XR defined as

h(t,x)=E[exp[g(xT)]xt=x]=P(yTxt=x).

One can readily check that the function h satisfies the Kolmogorov equation

(t+L)h=0

with boundary condition h(T,x)=exp[g(x)]. Furthermore, denoting by L the infinitesimal generator of the Markov process Xt, we have:

(3)Ext[φ(xt+δ)h(t+δ,xt+δ)]φ(xt)h(t,xt)+δ(t+L)[hφ](t,xt)+o(δ).

The infinitesimal generator L of the conditioned process is

Lφ(t,xt)=limδ0+E[φ(xt+δ)|xt,yT]φ(xt)δ.

Plugging Equation 3 within Equation 2 directly gives that

Lφ=Lφ+L[φh]h+φthh.

The generator L describes the dynamics of the conditioned process. In fact, the same computation holds with a more general change of measure of the type (4)dQdP(x[0,T])=1Zexp{0Tf(s,Xs)ds+g(xT)}

for some function f:[0,T]×XR. One can define the function h similarly as

(5)h(t,xt)=E[exp{tTf(Xs)ds+g(xT)}xt].

This function satisfies the Feynman-Kac formula (t+L+f)h=0 and one obtains entirely similarly that the probability distribution Q describes a Markov process with infinitesimal generator

(6)Lφ=Lφ+L[hφ]h+(thh+f)φ.

To see how this works, let us see a few examples:

General diffusions

Consider a diffusion process

dX=b(X)dt+σ(X)dW

with generator Lφ=bφ+12σσ:2φ and initial distribution μ0(dx). We are interested in describing the dynamics of the “conditioned” process given by the probability distribution Q defined in Equation 4. Algebra applied to Equation 5 then shows that

Lφ=Lφ+φ(t+L+f)[h]h=0+σσ(logh)φ

where the function h is described in Equation 5. Since (t+L+f)h=0, this reveals that the probability distribution Q describes a diffusion process X with dynamics

dX=b(X)dt+σ(X){dW+u(t,X)dt}.

The additional drift term σ(X)u(t,X)dt is involves a “control” u(t,X) with (7)u(t,x)=σ(x)logh(t,x).

Note that the initial distribution of the conditioned process is

μ0(dx)=1Zμ0(dx)h(0,x).

Unfortunately, apart from a few straightforward cases such as a Brownian motion or an Ornstein-Uhlenbeck process, the function h is generally intractable. However, there are indeed several numerical methods available to approximate it effectively.

Brownian bridge

What about a Brownian motion in RD conditioned to hit the state xRD at time t=T, i.e. a Brownian bridge? In that case, the function h is given by

h(t,x)=P(BT=xBt=x)=exp{xx22(Tt)}/ZTt

for some irrelevant normalization constant ZTt that only depends on Tt. Plugging this into Equation 7 gives that the conditioned Brownian X motion has dynamics

dX=XxTtdt+dB.

The additional drift term (Xx)/(Tt) is intuitive: it points in the direction of x and gets increasingly large as tT.

Positive Brownian motion

What about a scalar Brownian conditioned to stay positive at all times? Let us consider T and let us condition first on the event that the Brownian motion stays positive within [0,T] and later consider the limit T. The function h reads

h(t,x)=P(Bt stays >0 on [t,T]Bt=x).

This can easily be calculated with the reflection principle. It equals

h(t,x)=12P(BT<0BT=x)=P(Ttξ<x)

for a standard Gaussian ξN(0,1). Plugging this into Equation 7 gives that the additional drift term is

logh(t,x)=exp(x2/(2(Tt)))x1x

as T. This shows that a Brownian motion conditioned to stay positive at all times has a upward drift of size 1/x,

dX=1X+dB.

Incidentally, it is the dynamics of a Bessel process of dimension d=3, i.e. the law of the modulus of a three-dimensional Brownian motion. More generally, if one conditions a Brownian motion to stay within a closed domain D, the conditioned dynamics exhibit a repulsive drift term of size about 1/dist(x,D) near the boundary D of the domain, as described below.

Brownian motion staying in a domain

What about a Brownian motion conditioned to stay within a domain D forever? As before, consider an time horizon T and define the function h as

h(t,x)=P(Bt stays in D on [t,T]Bt=x).

One can see that the function h satisfies the PDE

(t+Δ)h=0

and equals zero on the boundary D of the domain. Furthermore h(t,x)1 as tT for all xD. Consider the eigenfunctions ψk of the negative Laplacian Δ with Dirichlet boundary conditions on D. Recall that Δ is a positive operator with a discrete spectrum λ1λ2 of non-negative eigenvalues. The eigenfunction corresponding to the smallest eigenvalue λ1 is the principal eigenfunction ψ1 and it is standard that it is a positive function within the domain D, as a “slight” generalization of the Perron-Frobenius in linear algebra shows it. Expanding h in the basis of eigenfunctions ψk gives that

h(t,x)=c1eλ1(Tt)ψ1(x)dominant contribution+k2ckeλk(Tt)ψk(x).

Eigenfunctions of the Laplacian

Since we are interested in the regime T, it holds that

xlogh(t,x)logψ1(x).

This shows that the conditioned Brownian motion has a drift term expressed in terms of the principal eigenfunction ψ1 of the Laplacian:

dX=logψ1(X)dt+dB.

For example, if D[0,L] for a 1D Brownian motion, the principal eigenfunction is ψ1(x)=sin(πx/L). This shows that there is a upward drift of size 1/x near x0 and a downward drift of size 1/(Lx) near xL.