Difference between revisions of "Markov process"
(refs format) |
(latex details) |
||
(7 intermediate revisions by 2 users not shown) | |||
Line 1: | Line 1: | ||
+ | <!-- | ||
+ | m0624901.png | ||
+ | $#A+1 = 265 n = 0 | ||
+ | $#C+1 = 265 : ~/encyclopedia/old_files/data/M062/M.0602490 Markov process, | ||
+ | Automatically converted into TeX, above some diagnostics. | ||
+ | Please remove this comment and the {{TEX|auto}} line below, | ||
+ | if TeX found to be correct. | ||
+ | --> | ||
+ | |||
+ | {{TEX|auto}} | ||
+ | {{TEX|done}} | ||
+ | |||
''process without after-effects'' | ''process without after-effects'' | ||
Line 5: | Line 17: | ||
[[Category:Markov processes]] | [[Category:Markov processes]] | ||
− | A [[Stochastic process|stochastic process]] whose evolution after a given time | + | A [[Stochastic process|stochastic process]] whose evolution after a given time $ t $ |
+ | does not depend on the evolution before $ t $, | ||
+ | given that the value of the process at $ t $ | ||
+ | is fixed (briefly; the "future" and "past" of the process are independent of each other for a known "present" ). | ||
− | The defining property of a Markov process is commonly called the [[ | + | The defining property of a Markov process is commonly called the [[Markov property]]; it was first stated by A.A. Markov . However, in the work of L. Bachelier it is already possible to find an attempt to discuss [[Brownian motion]] as a Markov process, an attempt which received justification later in the research of N. Wiener (1923). The basis of the general theory of continuous-time Markov processes was laid by A.N. Kolmogorov . |
==The Markov property.== | ==The Markov property.== | ||
− | There are essentially distinct definitions of a Markov process. One of the more widely used is the following. On a [[Probability space|probability space]] | + | There are essentially distinct definitions of a Markov process. One of the more widely used is the following. On a [[Probability space|probability space]] $ ( \Omega , F , {\mathsf P} ) $ |
+ | let there be given a stochastic process $ X ( t) $, | ||
+ | $ t \in T $, | ||
+ | taking values in a [[Measurable space|measurable space]] $ ( E , {\mathcal B} ) $, | ||
+ | where $ T $ | ||
+ | is a subset of the real line $ \mathbf R $. | ||
+ | Let $ N _ {t} $( | ||
+ | respectively, $ N ^ {t} $) | ||
+ | be the $ \sigma $- | ||
+ | algebra in $ \Omega $ | ||
+ | generated by the variables $ X ( s) $ | ||
+ | for $ s \leq t $( | ||
+ | $ s \geq t $), | ||
+ | where $ s \in T $. | ||
+ | In other words, $ N _ {t} $( | ||
+ | respectively, $ N ^ {t} $) | ||
+ | is the collection of events connected with the evolution of the process up to time (starting from time) $ t $. | ||
+ | $ X ( t) $ | ||
+ | is called a Markov process if (almost certainly) for all $ t \in T $, | ||
+ | $ \Lambda _ {1} , \Lambda _ {2} \in N ^ {t} $ | ||
+ | the Markov property | ||
− | + | $$ \tag{1 } | |
+ | {\mathsf P} \{ \Lambda _ {1} \Lambda _ {2} \mid X ( t) \} = \ | ||
+ | {\mathsf P} \{ \Lambda _ {1} \mid X ( t) \} {\mathsf P} | ||
+ | \{ \Lambda _ {2} \mid X ( t) \} | ||
+ | $$ | ||
− | holds, or, what is the same, if for any | + | holds, or, what is the same, if for any $ t \in T $ |
+ | and $ \Lambda \in N ^ {t} $, | ||
− | + | $$ \tag{2 } | |
+ | {\mathsf P} \{ \Lambda \mid N _ {t} \} = \ | ||
+ | {\mathsf P} \{ \Lambda \mid X ( t) \} . | ||
+ | $$ | ||
− | A Markov process for which | + | A Markov process for which $ T $ |
+ | is contained in the natural numbers is called a [[Markov chain|Markov chain]] (however, the latter term is mostly associated with the case of an at most countable $ E $). | ||
+ | If $ T $ | ||
+ | is an interval in $ \mathbf R $ | ||
+ | and $ E $ | ||
+ | is at most countable, a Markov process is called a continuous-time Markov chain. Examples of continuous-time Markov processes are furnished by diffusion processes (cf. [[Diffusion process|Diffusion process]]) and processes with independent increments (cf. [[Stochastic process with independent increments|Stochastic process with independent increments]]), including Poisson and Wiener processes (cf. [[Poisson process|Poisson process]]; [[Wiener process|Wiener process]]). | ||
− | In what follows the discussion will concern only the case | + | In what follows the discussion will concern only the case $ T = [ 0 , \infty ) $, |
+ | for the sake of being specific. The formulas (1) and (2) give an explicit interpretation of the principle of independence of "past" and "future" events when the "present" is known, but the definition of Markov process based on them has proved to be insufficiently flexible in the numerous situations where one is obliged to consider not one, but a collection of conditions of the type (1) or (2) corresponding to different, but in some sense consistent, measures $ {\mathsf P} $. | ||
+ | Such reasoning has led to the acceptance of the following definitions (see , ). | ||
Suppose one is given: | Suppose one is given: | ||
− | a) a measurable space | + | a) a measurable space $ ( E , {\mathcal B} ) $, |
+ | where the $ \sigma $- | ||
+ | algebra $ {\mathcal B} $ | ||
+ | contains all one-point sets in $ E $; | ||
− | b) a measurable space | + | b) a measurable space $ ( \Omega , F ) $, |
+ | equipped with a family of $ \sigma $- | ||
+ | algebras $ F _ {t} ^ { s } \subset F $, | ||
+ | $ 0 \leq s \leq t \leq \infty $, | ||
+ | such that $ F _ {t} ^ { s } \subset F _ {v} ^ { u } $ | ||
+ | if $ [ s , t ] \subset [ u , v ] $; | ||
− | c) a function ( "trajectory" ) | + | c) a function ( "trajectory" ) $ x _ {t} = x _ {t} ( \omega ) $, |
+ | defining for $ t \in [ 0 , \infty ) $ | ||
+ | and $ v \in [ 0 , t ] $ | ||
+ | a [[Measurable mapping|measurable mapping]] from $ ( \Omega , F _ {t} ^ { v } ) $ | ||
+ | to $ ( E , {\mathcal B} ) $; | ||
− | d) for each | + | d) for each $ s \geq 0 $ |
+ | and $ x \in E $ | ||
+ | a [[Probability measure|probability measure]] $ {\mathsf P} _ {s,x} $ | ||
+ | on the $ \sigma $- | ||
+ | algebra $ F _ \infty ^ { s } $ | ||
+ | such that the function $ {\mathsf P} ( s , \cdot ; t , B ) = {\mathsf P} _ {s , \cdot } \{ x _ {t} \in B \} $ | ||
+ | is measurable with respect to $ {\mathcal B} $, | ||
+ | if $ s \in [ 0 , t ] $ | ||
+ | and $ B \in {\mathcal B} $. | ||
− | The collection | + | The collection $ X ( t) = ( x _ {t} , F _ {t} ^ { s } , {\mathsf P} _ {s , x } ) $ |
+ | is called a (non-terminating) Markov process given on $ ( E , {\mathcal B} ) $ | ||
+ | if $ {\mathsf P} _ {s,x} $- | ||
+ | almost certainly | ||
− | + | $$ \tag{3 } | |
+ | {\mathsf P} _ {s,x} \{ \Lambda \mid F _ {t} ^ { s } \} | ||
+ | = {\mathsf P} _ {t , x _ {t} } \{ \Lambda \} , | ||
+ | $$ | ||
− | for any | + | for any $ 0 \leq s \leq t $ |
+ | and $ \Lambda \in N ^ {t} $. | ||
+ | Here $ \Omega $ | ||
+ | is the space of elementary events, $ ( E , {\mathcal B} ) $ | ||
+ | is the phase space or state space and $ P ( s , x ; t , B ) $ | ||
+ | is the [[Transition function|transition function]] or transition probability of $ X ( t) $. | ||
+ | If $ E $ | ||
+ | is endowed with a topology and $ {\mathcal B} $ | ||
+ | is the collection of Borel sets in $ E $, | ||
+ | then it is commonly said that the Markov process is given on $ E $. | ||
+ | Usually included in the definition of a Markov process is the requirement that $ P ( s , x ; s , \{ x \} ) \equiv 1 $, | ||
+ | and then $ {\mathsf P} _ {s,x} \{ \Lambda \} $, | ||
+ | $ \Lambda \in F _ \infty ^ { s } $, | ||
+ | is interpreted as the probability of $ \Lambda $ | ||
+ | under the condition that $ x _ {s} = x $. | ||
− | The following question arises: Is every Markov transition function | + | The following question arises: Is every Markov transition function $ P ( s, x ; t , B ) $, |
+ | given on a measurable space $ ( E , {\mathcal B} ) $, | ||
+ | the transition function of some Markov process? The answer is affirmative if, for example, $ E $ | ||
+ | is a separable, locally compact space and $ {\mathcal B} $ | ||
+ | is the family of Borel sets in $ E $. | ||
+ | In addition, let $ E $ | ||
+ | be a complete metric space and let | ||
− | + | $$ | |
+ | \lim\limits _ {h \downarrow 0 } \alpha _ \epsilon ( h) = 0 | ||
+ | $$ | ||
− | for any | + | for any $ \epsilon > 0 $, |
+ | where | ||
− | + | $$ | |
+ | \alpha _ \epsilon ( h) = \ | ||
+ | \sup \{ {P ( s , x ; t , V _ \epsilon ( x) ) } : { | ||
+ | x \in E , 0 < t - s < h } \} | ||
+ | $$ | ||
− | and | + | and $ V _ \epsilon ( x) $ |
+ | is the complement of the $ \epsilon $- | ||
+ | neighbourhood of $ x $. | ||
+ | Then the corresponding Markov process can be taken to be right-continuous and having left limits (that is, its trajectories can be chosen so). The existence of a continuous Markov process is guaranteed by the condition $ \alpha _ \epsilon ( h) = o ( h) $ | ||
+ | as $ h \downarrow 0 $( | ||
+ | see , ). | ||
− | In the theory of Markov processes most attention is given to homogeneous (in time) processes. The corresponding definition assumes one is given a system of objects a)–d) with the difference that the parameters | + | In the theory of Markov processes most attention is given to homogeneous (in time) processes. The corresponding definition assumes one is given a system of objects a)–d) with the difference that the parameters $ s $ |
+ | and $ u $ | ||
+ | may now only take the value 0. Even the notation can be simplified: | ||
− | + | $$ | |
+ | {\mathsf P} _ {x} = \ | ||
+ | {\mathsf P} _ {0x} ,\ \ | ||
+ | F _ {t} = F _ {t} ^ { 0 } ,\ \ | ||
+ | P ( t , x , B ) = P | ||
+ | ( 0 , x ; t , B ) , | ||
+ | $$ | ||
− | + | $$ | |
+ | x \in E ,\ t \geq 0 ,\ B \in {\mathcal B} . | ||
+ | $$ | ||
− | Subsequently, homogeneity of | + | Subsequently, homogeneity of $ \Omega $ |
+ | is assumed. That is, it is required that for any $ \omega \in \Omega $ | ||
+ | and $ s \geq 0 $ | ||
+ | there is an $ \omega ^ \prime \in \Omega $ | ||
+ | such that $ x _ {t} ( \omega ^ \prime ) = x _ {t+s} ( \omega ) $ | ||
+ | for $ t \geq 0 $. | ||
+ | Because of this, on the $ \sigma $- | ||
+ | algebra $ N $, | ||
+ | the smallest $ \sigma $- | ||
+ | algebra in $ \Omega $ | ||
+ | containing the events $ \{ \omega : {x _ {s} \in B } \} $, | ||
+ | the time shift operators $ \theta _ {t} $ | ||
+ | are defined, which preserve the operations of union, intersection and difference of sets, and for which | ||
− | + | $$ | |
+ | \theta _ {t} \{ \omega : {x _ {s} \in B } \} | ||
+ | = \{ \omega : {x _ {t+s} | ||
+ | \in B } \} | ||
+ | , | ||
+ | $$ | ||
− | where | + | where $ s , t \geq 0 $, |
+ | $ B \in {\mathcal B} $. | ||
− | The collection | + | The collection $ X ( t) = ( x _ {t} , F _ {t} , {\mathsf P} _ {x} ) $ |
+ | is called a (non-terminating) homogeneous Markov process given on $ ( E , {\mathcal B} ) $ | ||
+ | if $ {\mathsf P} _ {x} $- | ||
+ | almost certainly | ||
− | + | $$ \tag{4 } | |
+ | {\mathsf P} _ {x} \{ \theta _ {t} \Lambda \mid F _ {t} \} = {\mathsf P} _ {x _ {t} } \{ \Lambda \} | ||
+ | $$ | ||
− | for | + | for $ x \in E $, |
+ | $ t \geq 0 $ | ||
+ | and $ \Lambda \in N $. | ||
+ | The transition function of $ X ( t) $ | ||
+ | is taken to be $ P ( t , x , B ) $, | ||
+ | where, unless otherwise indicated, it is required that $ P ( 0 , x , \{ x \} ) \equiv 1 $. | ||
+ | It is useful to bear in mind that in the verification of (4) it is only necessary to consider sets of the form $ \Lambda = \{ \omega : {x _ {s} \in B } \} $, | ||
+ | where $ s \geq 0 $, | ||
+ | $ B \in {\mathcal B} $, | ||
+ | and in (4), $ F _ {t} $ | ||
+ | may always replaced by the $ \sigma $- | ||
+ | algebra $ \overline{F}\; _ {t} $ | ||
+ | equal to the intersection of the completions of $ F _ {t} $ | ||
+ | relative to all possible measures $ {\mathsf P} _ {x} \{ x \in B \} $. | ||
+ | Often, one fixes on $ {\mathcal B} $ | ||
+ | a probability measure $ \mu $( | ||
+ | the "initial distribution" ) and considers a random Markov function $ ( x _ {t} , F _ {t} , {\mathsf P} _ \mu ) $, | ||
+ | where $ {\mathsf P} _ \mu $ | ||
+ | is the measure on $ F _ \infty $ | ||
+ | given by | ||
− | + | $$ | |
+ | {\mathsf P} _ \mu \{ \cdot \} = \int\limits {\mathsf P} _ {x} \{ \cdot \} \mu ( d x ) . | ||
+ | $$ | ||
− | A Markov process | + | A Markov process $ X ( t) = ( x _ {t} , F _ {t} , {\mathsf P} _ {x} ) $ |
+ | is called progressively measurable if for each $ t > 0 $ | ||
+ | the function $ x( s, \omega ) = x _ {s} ( \omega ) $ | ||
+ | induces a measurable mapping from $ ( [ 0 , t ] \times \Omega , {\mathcal B} _ {t} \times F _ {t} ) $ | ||
+ | to $ ( E , {\mathcal B} ) $, | ||
+ | where $ {\mathcal B} _ {t} $ | ||
+ | is the $ \sigma $- | ||
+ | algebra of Borel subsets of $ [ 0 , t ] $. | ||
+ | A right-continuous Markov process is progressively measurable. There is a method for reducing the non-homogeneous case to the homogeneous case (see ), and in what follows homogeneous Markov processes will be discussed. | ||
==The strong Markov property.== | ==The strong Markov property.== | ||
− | Suppose that, on a measurable space | + | Suppose that, on a measurable space $ ( E , {\mathcal B} ) $, |
+ | a Markov process $ X ( t) = ( x _ {t} , F _ {t} , {\mathsf P} _ {x} ) $ | ||
+ | is given. A function $ \tau : \Omega \rightarrow [ 0 , \infty ] $ | ||
+ | is called a [[Markov moment|Markov moment]] (stopping time) if $ \{ \omega : {\tau \leq t } \} \in F _ {t} $ | ||
+ | for $ t \geq 0 $. | ||
+ | Here a set $ \Lambda \subset \Omega _ \tau = \{ \omega : {\tau < \infty } \} $ | ||
+ | is considered in the family $ F _ \tau $ | ||
+ | if $ \Lambda _ \cap \{ \omega : {\tau < t } \} \in F _ {t} $ | ||
+ | for $ t \geq 0 $( | ||
+ | most often $ F _ \tau $ | ||
+ | is interpreted as the family of events connected with the evolution of $ X ( t) $ | ||
+ | up to time $ \tau $). | ||
+ | For $ \Lambda \in N $, | ||
+ | set | ||
− | + | $$ | |
+ | \theta _ \tau \Lambda = \ | ||
+ | \cup _ {t \geq 0 } | ||
+ | [ \theta _ {t} \Lambda \cap \{ \omega : {\tau = t } \} | ||
+ | ] . | ||
+ | $$ | ||
− | A progressively-measurable Markov process | + | A progressively-measurable Markov process $ X $ |
+ | is called a strong Markov process if for any Markov moment $ \tau $ | ||
+ | and all $ t \geq 0 $, | ||
+ | $ x \in E $ | ||
+ | and $ \Lambda \in N $, | ||
+ | the relation | ||
− | + | $$ \tag{5 } | |
+ | {\mathsf P} _ {x} \{ \theta _ \tau \Lambda \mid F _ \tau \} = \ | ||
+ | {\mathsf P} _ {x _ \tau } \{ \Lambda \} | ||
+ | $$ | ||
− | (the strong Markov property) is satisfied | + | (the strong Markov property) is satisfied $ {\mathsf P} _ {x} $- |
+ | almost certainly in $ \Omega _ \tau $. | ||
+ | In the verification of (5) it suffices to consider only sets of the form $ \Lambda = \{ \omega : {x _ {s} \in B } \} $ | ||
+ | where $ s \geq 0 $, | ||
+ | $ B \in {\mathcal B} $; | ||
+ | in this case $ \theta _ \tau \Lambda = \{ \omega : {x _ {s + \tau } \in B } \} $. | ||
+ | For example, any right-continuous Feller–Markov process on a topological space $ E $ | ||
+ | is a strong Markov process. A Markov process is called a Feller–Markov process if the function | ||
− | + | $$ | |
+ | P ^ {t} f ( \cdot ) = \int\limits f ( y) P ( t , \cdot , d y ) | ||
+ | $$ | ||
− | is continuous whenever | + | is continuous whenever $ f $ |
+ | is continuous and bounded. | ||
− | In the case of strong Markov processes various subclasses have been distinguished. Let the Markov transition function | + | In the case of strong Markov processes various subclasses have been distinguished. Let the Markov transition function $ P ( t , x , B ) $, |
+ | given on a locally compact metric space $ E $, | ||
+ | be stochastically continuous: | ||
− | + | $$ | |
+ | \lim\limits _ | ||
+ | {t \downarrow 0 } \ | ||
+ | P ( t , x , U ) = 1 | ||
+ | $$ | ||
− | for any neighbourhood | + | for any neighbourhood $ U $ |
+ | of each point $ x \in B $. | ||
+ | If $ P ^ {t} $ | ||
+ | maps the class of continuous functions that vanish at infinity into itself, then $ P ( t , x , B ) $ | ||
+ | corresponds to a standard Markov process $ X $. | ||
+ | That is, a right-continuous strong Markov process for which: 1) $ F _ {t} = \overline{F}\; _ {t} $ | ||
+ | for $ t \in [ 0 , \infty ) $ | ||
+ | and $ F _ {t} = \cap _ {s>} t F _ {s} $ | ||
+ | for $ t \in [ 0 , \infty ) $; | ||
+ | 2) $ \lim\limits _ {n \rightarrow \infty } x _ {\tau _ {n} } = x _ \tau $, | ||
+ | $ P _ {x} $- | ||
+ | almost certainly on the set $ \{ \omega : {\tau < \infty } \} $, | ||
+ | where $ \tau = \lim\limits _ {n \rightarrow \infty } \tau _ {n} $ | ||
+ | and $ \tau _ {n} $( | ||
+ | $ n \geq 1 $) | ||
+ | are Markov moments that are non-decreasing as $ n $ | ||
+ | increases. | ||
==Terminating Markov processes.== | ==Terminating Markov processes.== | ||
Frequently, a physical system can be best described using a non-terminating Markov process, but only in a time interval of random length. In addition, even simple transformations of a Markov process may lead to processes with trajectories given on random intervals (see [[Functional of a Markov process|Functional of a Markov process]]). Guided by these considerations one introduces the notion of a terminating Markov process. | Frequently, a physical system can be best described using a non-terminating Markov process, but only in a time interval of random length. In addition, even simple transformations of a Markov process may lead to processes with trajectories given on random intervals (see [[Functional of a Markov process|Functional of a Markov process]]). Guided by these considerations one introduces the notion of a terminating Markov process. | ||
− | Let | + | Let $ \widetilde{X} ( t) = ( \widetilde{X} _ {t} , \widetilde{F} _ {t} , \widetilde {\mathsf P} _ {x} ) $ |
+ | be a homogeneous Markov process in a phase space $ ( \widetilde{E} , {\mathcal B} tilde ) $, | ||
+ | having a transition function $ \widetilde{P} ( t , x , N ) $, | ||
+ | and let there be a point $ e \in \widetilde{E} $ | ||
+ | and a function $ \zeta : \Omega \rightarrow [ 0 , \infty ) $ | ||
+ | such that $ \widetilde{x} _ {t} ( \omega ) = e $ | ||
+ | for $ \zeta ( \omega ) \leq t $ | ||
+ | and $ \widetilde{x} _ {t} ( \omega ) \neq e $ | ||
+ | otherwise (unless stated otherwise, take $ \zeta > 0 $). | ||
+ | A new trajectory $ x _ {t} ( \omega ) $ | ||
+ | is given for $ t \in [ 0 , \zeta ( \omega ) ) $ | ||
+ | by the equality $ x _ {t} ( \omega ) = \widetilde{x} _ {t} ( \omega ) $, | ||
+ | and $ F _ {t} $ | ||
+ | is defined as the trace of $ \widetilde{F} _ {t} $ | ||
+ | on the set $ \{ \omega : {\zeta > t } \} $. | ||
− | The collection | + | The collection $ X ( t) = ( x _ {t} , \zeta , F _ {t} , \widetilde {\mathsf P} _ {x} ) $, |
+ | where $ x \in E = \widetilde{E} \setminus \{ e \} $, | ||
+ | is called the terminating Markov process obtained from $ \widetilde{X} ( t) $ | ||
+ | by censoring (or killing) at the time $ \zeta $. | ||
+ | The variable $ \zeta $ | ||
+ | is called the censoring time or lifetime of the terminating Markov process. The phase space of the new process is $ ( E , {\mathcal B} ) $, | ||
+ | where $ {\mathcal B} $ | ||
+ | is the trace of the $ \sigma $- | ||
+ | algebra $ {\mathcal B} tilde $ | ||
+ | in $ E $. | ||
+ | The transition function of a terminating Markov process is the restriction of $ \widetilde{P} ( t , x , B ) $ | ||
+ | to the set $ t \geq 0 $, | ||
+ | $ x \in E $, | ||
+ | $ B \subset {\mathcal B} $. | ||
+ | The process $ X ( t) $ | ||
+ | is called a strong Markov process or a standard Markov process if $ \widetilde{X} ( t) $ | ||
+ | has the corresponding property. A non-terminating Markov process can be considered as a terminating Markov process with censoring time $ \zeta \equiv \infty $. | ||
+ | A non-homogeneous terminating Markov process is defined similarly. | ||
''M.G. Shur'' | ''M.G. Shur'' | ||
==Markov processes and differential equations.== | ==Markov processes and differential equations.== | ||
− | A Markov process of Brownian-motion type is closely connected with partial differential equations of parabolic type. The transition density | + | A Markov process of Brownian-motion type is closely connected with partial differential equations of parabolic type. The transition density $ p ( s , x , t , y ) $ |
+ | of a diffusion process satisfies, under certain additional assumptions, the backward and forward Kolmogorov equations (cf. [[Kolmogorov equation|Kolmogorov equation]]): | ||
+ | |||
+ | $$ \tag{6 } | ||
+ | |||
+ | \frac{\partial p }{\partial s } | ||
+ | |||
+ | + \sum_{k=1}^ { n } | ||
+ | a _ {k} ( s , x ) | ||
+ | \frac{\partial p }{\partial x _ {k} } | ||
+ | + | ||
+ | |||
+ | \frac{1}{2} | ||
− | + | \sum _ {k , j = 1 } ^ { n } | |
+ | b _ {kj} ( s , x ) | ||
− | + | \frac{\partial ^ {2} p }{\partial x _ {k} \partial x _ {j} } | |
+ | = | ||
+ | $$ | ||
− | + | $$ | |
+ | = \ | ||
− | + | \frac{\partial p }{\partial s } | |
+ | + L ( s , x ) p = 0 , | ||
+ | $$ | ||
− | + | $$ \tag{7 } | |
− | The expectations of various functionals of diffusion processes are solutions of boundary value problems for the differential equation . Let | + | \frac{\partial p }{\partial t } |
+ | = - \sum_{k=1}^ { n } | ||
+ | \frac \partial {\partial y _ {k} } | ||
+ | ( a _ {k} ( t , y ) p ) + | ||
+ | $$ | ||
+ | |||
+ | $$ | ||
+ | + | ||
+ | |||
+ | \frac{1}{2} | ||
+ | \sum _ {k , j = 1 } ^ { n } | ||
+ | \frac{\partial ^ {2} }{ | ||
+ | \partial y _ {k} \partial y _ {j} } | ||
+ | ( b _ {kj} ( t , y ) p ) = L ^ {*} ( t , y ) p . | ||
+ | $$ | ||
+ | |||
+ | The function $ p ( s , x , t , y ) $ | ||
+ | is the Green's function of the equations (6)–(7), and the first known methods for constructing diffusion processes were based on existence theorems for this function for the partial differential equations (6)–(7). For a time-homogeneous process the operator $ L ( s , x ) = L ( x ) $ | ||
+ | coincides on smooth functions with the infinitesimal operator of the Markov process (see [[Transition-operator semi-group|Transition-operator semi-group]]). | ||
+ | |||
+ | The expectations of various functionals of diffusion processes are solutions of boundary value problems for the differential equation . Let $ {\mathsf E} _ {s , x } ( \cdot ) $ | ||
+ | be the expectation with respect to the measure $ {\mathsf P} _ {s , x } $. | ||
+ | Then the function $ {\mathsf E} _ {s ,x } \phi ( X ( T) ) = u _ {1} ( s , x ) $ | ||
+ | satisfies (6) for $ s < T $ | ||
+ | and $ u _ {1} ( T , x ) = \phi ( x ) $. | ||
Similarly, the function | Similarly, the function | ||
− | + | $$ | |
+ | u _ {2} ( s , x ) = \ | ||
+ | {\mathsf E} _ {s , x } | ||
+ | \int\limits _ { s } ^ { T } | ||
+ | f ( t , X ( t) ) dt | ||
+ | $$ | ||
− | satisfies, for < | + | satisfies, for $ s < T $, |
− | + | $$ | |
− | + | \frac{\partial u _ {2} }{\partial s } | |
− | + | + L ( s , x ) u _ {2} = \ | |
+ | - f ( s , x ) , | ||
+ | $$ | ||
− | + | and $ u _ {2} ( T , x ) = 0 $. | |
+ | |||
+ | Let $ \tau $ | ||
+ | be the time at which the trajectories of $ X ( t) $ | ||
+ | first hit the boundary $ \partial D $ | ||
+ | of a domain $ D \subset \mathbf R ^ {n} $, | ||
+ | and let $ \tau \wedge T = \min ( \tau , T ) $. | ||
+ | Then, under certain conditions, the function | ||
+ | |||
+ | $$ | ||
+ | u _ {3} ( s , x ) = \ | ||
+ | {\mathsf E} _ {s , x } | ||
+ | \int\limits _ { s } ^ { \tau \wedge T } | ||
+ | f ( t , X ( t) ) dt + | ||
+ | {\mathsf E} _ {s , x } | ||
+ | \phi ( \tau \wedge T , X ( \tau \wedge T )) | ||
+ | $$ | ||
satisfies | satisfies | ||
− | + | $$ | |
− | + | \frac{\partial u }{\partial s } | |
− | + | + L ( s , x ) u = - f | |
+ | $$ | ||
+ | |||
+ | and takes the value $ \phi $ | ||
+ | on the set | ||
+ | |||
+ | $$ | ||
+ | \Gamma = \ | ||
+ | \{ s < T , x \in \partial D \} \cup | ||
+ | \{ s = T , x \in D \} . | ||
+ | $$ | ||
The solution of the first boundary value problem for a general second-order linear parabolic equation | The solution of the first boundary value problem for a general second-order linear parabolic equation | ||
− | + | $$ \tag{8 } | |
+ | \left . | ||
+ | |||
+ | \begin{array}{c} | ||
+ | |||
+ | \frac{\partial u }{\partial s } | ||
+ | |||
+ | + L ( s , x ) u + c ( s , x ) u = - f ( s ,x ) , \\ | ||
+ | u \mid _ \Gamma = \phi , \\ | ||
+ | \end{array} | ||
+ | |||
+ | \right \} | ||
+ | $$ | ||
can, under fairly general assumptions, be described in the form | can, under fairly general assumptions, be described in the form | ||
− | + | $$ \tag{9 } | |
+ | u ( s , x ) = \ | ||
+ | {\mathsf E} _ {s , x } | ||
+ | \int\limits _ { s } ^ { \tau \wedge T } | ||
+ | \mathop{\rm exp} | ||
+ | \left \{ | ||
+ | \int\limits _ { s } ^ { v } | ||
+ | c ( t , X ( t) ) dt | ||
+ | \right \} | ||
+ | f ( v , X ( v) ) dv + | ||
+ | $$ | ||
− | + | $$ | |
+ | + | ||
+ | {\mathsf E} _ {s , x } \left \{ \mathop{\rm exp} | ||
+ | \left \{ \int\limits _ { s } ^ { \tau \wedge T } c ( t , X ( t) ) \ | ||
+ | dt \right \} \phi ( \tau \wedge T , X ( \tau \wedge T ) ) \right \} . | ||
+ | $$ | ||
− | When the operator | + | When the operator $ L $ |
+ | and the functions $ c $ | ||
+ | and $ f $ | ||
+ | do not depend on $ s $, | ||
+ | a representation similar to (9) is possible also for the solution of a linear elliptic equation. More precisely, the function | ||
− | + | $$ \tag{10 } | |
+ | u ( x) = \ | ||
+ | {\mathsf E} _ {x} \int\limits _ { 0 } ^ \tau | ||
+ | \mathop{\rm exp} \left \{ | ||
+ | \int\limits _ { 0 } ^ { v } | ||
+ | c ( X ( t) ) dt \right \} | ||
+ | f ( X ( v) ) dv + | ||
+ | $$ | ||
− | + | $$ | |
+ | + | ||
+ | {\mathsf E} _ {x} \left \{ \mathop{\rm exp} \left \{ \int\limits _ { 0 } ^ \tau c ( X ( t) ) dt \right \} \phi ( X ( \tau ) ) \right \} | ||
+ | $$ | ||
is, under certain assumptions, the solution of | is, under certain assumptions, the solution of | ||
− | + | $$ \tag{11 } | |
+ | L ( x ) u + c ( x) u = \ | ||
+ | - f ( x) ,\ \ | ||
+ | u \mid _ {\partial D } = \phi . | ||
+ | $$ | ||
− | When | + | When $ L $ |
+ | is degenerate $ ( \mathop{\rm det} b ( s , x ) = 0 ) $ | ||
+ | or $ \partial D $ | ||
+ | is not sufficiently "smooth" , the boundary values need not be taken by the functions (9), (10) at individual points or on whole sets. The notion of a [[Regular boundary point|regular boundary point]] for $ L $ | ||
+ | has a probabilistic interpretation. At regular points the boundary values are attained by (9), (10). The solution of (8) and (11) allows one to study the properties of the corresponding diffusion processes and functionals of them. | ||
There are methods for constructing Markov processes which do not rely on the construction of solutions of (6) and (7). For example, the method of stochastic differential equations (cf. [[Stochastic differential equation|Stochastic differential equation]]), of absolutely-continuous change of measure, etc. This situation, together with the formulas (9) and (10), gives a probabilistic route to the construction and study of the properties of boundary value problems for (8) and also to the study of properties of the solutions of the corresponding elliptic equation. | There are methods for constructing Markov processes which do not rely on the construction of solutions of (6) and (7). For example, the method of stochastic differential equations (cf. [[Stochastic differential equation|Stochastic differential equation]]), of absolutely-continuous change of measure, etc. This situation, together with the formulas (9) and (10), gives a probabilistic route to the construction and study of the properties of boundary value problems for (8) and also to the study of properties of the solutions of the corresponding elliptic equation. | ||
− | Since the solution of a stochastic differential equation is insensitive to degeneracy of | + | Since the solution of a stochastic differential equation is insensitive to degeneracy of $ b ( s , x ) $, |
+ | probabilistic methods can be applied to construct solutions of degenerate elliptic and parabolic differential equations. The extension of the averaging principle of N.M. Krylov and N.N. Bogolyubov to stochastic differential equations allows one, with the help of (9), to obtain corresponding results for elliptic and parabolic differential equations. It turns out that certain difficult problems in the investigation of properties of solutions of equations of this type with small parameters in front of the highest derivatives can be solved by probabilistic arguments. Even the solution of the second boundary value problem for (6) has a probabilistic meaning. The formulation of boundary value problems for unbounded domains is closely connected with recurrence in the corresponding diffusion process. | ||
− | In the case of a time-homogeneous process ( | + | In the case of a time-homogeneous process ( $ L $ |
+ | is independent of $ s $), | ||
+ | a positive solution of $ L ^ {*} q = 0 $ | ||
+ | coincides, under certain assumptions and up to a multiplicative constant, with the stationary density of the distribution of a Markov chain. Probabilistic arguments turn out to be useful even for boundary value problems for non-linear parabolic equations. | ||
''R.Z. Khas'minskii'' | ''R.Z. Khas'minskii'' | ||
Line 211: | Line 598: | ||
====Comments==== | ====Comments==== | ||
− | |||
====References==== | ====References==== | ||
Line 221: | Line 607: | ||
|valign="top"|{{Ref|Do}}|| J.L. Doob, "Stochastic processes" , Wiley (1953) {{MR|1570654}} {{MR|0058896}} {{ZBL|0053.26802}} | |valign="top"|{{Ref|Do}}|| J.L. Doob, "Stochastic processes" , Wiley (1953) {{MR|1570654}} {{MR|0058896}} {{ZBL|0053.26802}} | ||
|- | |- | ||
− | |valign="top"|{{Ref|We}}|| A.D. | + | |valign="top"|{{Ref|We}}|| A.D. Wentzell, "A course in the theory of stochastic processes" , McGraw-Hill (1981) (Translated from Russian) {{MR|0781738}} {{MR|0614594}} {{ZBL|0502.60001}} |
|- | |- | ||
|valign="top"|{{Ref|Kur}}|| T.G. Kurtz, "Markov processes" , Wiley (1986) {{MR|0838085}} {{ZBL|0592.60049}} | |valign="top"|{{Ref|Kur}}|| T.G. Kurtz, "Markov processes" , Wiley (1986) {{MR|0838085}} {{ZBL|0592.60049}} |
Latest revision as of 08:38, 21 January 2024
process without after-effects
2020 Mathematics Subject Classification: Primary: 60Jxx [MSN][ZBL]
A stochastic process whose evolution after a given time $ t $ does not depend on the evolution before $ t $, given that the value of the process at $ t $ is fixed (briefly; the "future" and "past" of the process are independent of each other for a known "present" ).
The defining property of a Markov process is commonly called the Markov property; it was first stated by A.A. Markov . However, in the work of L. Bachelier it is already possible to find an attempt to discuss Brownian motion as a Markov process, an attempt which received justification later in the research of N. Wiener (1923). The basis of the general theory of continuous-time Markov processes was laid by A.N. Kolmogorov .
The Markov property.
There are essentially distinct definitions of a Markov process. One of the more widely used is the following. On a probability space $ ( \Omega , F , {\mathsf P} ) $ let there be given a stochastic process $ X ( t) $, $ t \in T $, taking values in a measurable space $ ( E , {\mathcal B} ) $, where $ T $ is a subset of the real line $ \mathbf R $. Let $ N _ {t} $( respectively, $ N ^ {t} $) be the $ \sigma $- algebra in $ \Omega $ generated by the variables $ X ( s) $ for $ s \leq t $( $ s \geq t $), where $ s \in T $. In other words, $ N _ {t} $( respectively, $ N ^ {t} $) is the collection of events connected with the evolution of the process up to time (starting from time) $ t $. $ X ( t) $ is called a Markov process if (almost certainly) for all $ t \in T $, $ \Lambda _ {1} , \Lambda _ {2} \in N ^ {t} $ the Markov property
$$ \tag{1 } {\mathsf P} \{ \Lambda _ {1} \Lambda _ {2} \mid X ( t) \} = \ {\mathsf P} \{ \Lambda _ {1} \mid X ( t) \} {\mathsf P} \{ \Lambda _ {2} \mid X ( t) \} $$
holds, or, what is the same, if for any $ t \in T $ and $ \Lambda \in N ^ {t} $,
$$ \tag{2 } {\mathsf P} \{ \Lambda \mid N _ {t} \} = \ {\mathsf P} \{ \Lambda \mid X ( t) \} . $$
A Markov process for which $ T $ is contained in the natural numbers is called a Markov chain (however, the latter term is mostly associated with the case of an at most countable $ E $). If $ T $ is an interval in $ \mathbf R $ and $ E $ is at most countable, a Markov process is called a continuous-time Markov chain. Examples of continuous-time Markov processes are furnished by diffusion processes (cf. Diffusion process) and processes with independent increments (cf. Stochastic process with independent increments), including Poisson and Wiener processes (cf. Poisson process; Wiener process).
In what follows the discussion will concern only the case $ T = [ 0 , \infty ) $, for the sake of being specific. The formulas (1) and (2) give an explicit interpretation of the principle of independence of "past" and "future" events when the "present" is known, but the definition of Markov process based on them has proved to be insufficiently flexible in the numerous situations where one is obliged to consider not one, but a collection of conditions of the type (1) or (2) corresponding to different, but in some sense consistent, measures $ {\mathsf P} $. Such reasoning has led to the acceptance of the following definitions (see , ).
Suppose one is given:
a) a measurable space $ ( E , {\mathcal B} ) $, where the $ \sigma $- algebra $ {\mathcal B} $ contains all one-point sets in $ E $;
b) a measurable space $ ( \Omega , F ) $, equipped with a family of $ \sigma $- algebras $ F _ {t} ^ { s } \subset F $, $ 0 \leq s \leq t \leq \infty $, such that $ F _ {t} ^ { s } \subset F _ {v} ^ { u } $ if $ [ s , t ] \subset [ u , v ] $;
c) a function ( "trajectory" ) $ x _ {t} = x _ {t} ( \omega ) $, defining for $ t \in [ 0 , \infty ) $ and $ v \in [ 0 , t ] $ a measurable mapping from $ ( \Omega , F _ {t} ^ { v } ) $ to $ ( E , {\mathcal B} ) $;
d) for each $ s \geq 0 $ and $ x \in E $ a probability measure $ {\mathsf P} _ {s,x} $ on the $ \sigma $- algebra $ F _ \infty ^ { s } $ such that the function $ {\mathsf P} ( s , \cdot ; t , B ) = {\mathsf P} _ {s , \cdot } \{ x _ {t} \in B \} $ is measurable with respect to $ {\mathcal B} $, if $ s \in [ 0 , t ] $ and $ B \in {\mathcal B} $.
The collection $ X ( t) = ( x _ {t} , F _ {t} ^ { s } , {\mathsf P} _ {s , x } ) $ is called a (non-terminating) Markov process given on $ ( E , {\mathcal B} ) $ if $ {\mathsf P} _ {s,x} $- almost certainly
$$ \tag{3 } {\mathsf P} _ {s,x} \{ \Lambda \mid F _ {t} ^ { s } \} = {\mathsf P} _ {t , x _ {t} } \{ \Lambda \} , $$
for any $ 0 \leq s \leq t $ and $ \Lambda \in N ^ {t} $. Here $ \Omega $ is the space of elementary events, $ ( E , {\mathcal B} ) $ is the phase space or state space and $ P ( s , x ; t , B ) $ is the transition function or transition probability of $ X ( t) $. If $ E $ is endowed with a topology and $ {\mathcal B} $ is the collection of Borel sets in $ E $, then it is commonly said that the Markov process is given on $ E $. Usually included in the definition of a Markov process is the requirement that $ P ( s , x ; s , \{ x \} ) \equiv 1 $, and then $ {\mathsf P} _ {s,x} \{ \Lambda \} $, $ \Lambda \in F _ \infty ^ { s } $, is interpreted as the probability of $ \Lambda $ under the condition that $ x _ {s} = x $.
The following question arises: Is every Markov transition function $ P ( s, x ; t , B ) $, given on a measurable space $ ( E , {\mathcal B} ) $, the transition function of some Markov process? The answer is affirmative if, for example, $ E $ is a separable, locally compact space and $ {\mathcal B} $ is the family of Borel sets in $ E $. In addition, let $ E $ be a complete metric space and let
$$ \lim\limits _ {h \downarrow 0 } \alpha _ \epsilon ( h) = 0 $$
for any $ \epsilon > 0 $, where
$$ \alpha _ \epsilon ( h) = \ \sup \{ {P ( s , x ; t , V _ \epsilon ( x) ) } : { x \in E , 0 < t - s < h } \} $$
and $ V _ \epsilon ( x) $ is the complement of the $ \epsilon $- neighbourhood of $ x $. Then the corresponding Markov process can be taken to be right-continuous and having left limits (that is, its trajectories can be chosen so). The existence of a continuous Markov process is guaranteed by the condition $ \alpha _ \epsilon ( h) = o ( h) $ as $ h \downarrow 0 $( see , ).
In the theory of Markov processes most attention is given to homogeneous (in time) processes. The corresponding definition assumes one is given a system of objects a)–d) with the difference that the parameters $ s $ and $ u $ may now only take the value 0. Even the notation can be simplified:
$$ {\mathsf P} _ {x} = \ {\mathsf P} _ {0x} ,\ \ F _ {t} = F _ {t} ^ { 0 } ,\ \ P ( t , x , B ) = P ( 0 , x ; t , B ) , $$
$$ x \in E ,\ t \geq 0 ,\ B \in {\mathcal B} . $$
Subsequently, homogeneity of $ \Omega $ is assumed. That is, it is required that for any $ \omega \in \Omega $ and $ s \geq 0 $ there is an $ \omega ^ \prime \in \Omega $ such that $ x _ {t} ( \omega ^ \prime ) = x _ {t+s} ( \omega ) $ for $ t \geq 0 $. Because of this, on the $ \sigma $- algebra $ N $, the smallest $ \sigma $- algebra in $ \Omega $ containing the events $ \{ \omega : {x _ {s} \in B } \} $, the time shift operators $ \theta _ {t} $ are defined, which preserve the operations of union, intersection and difference of sets, and for which
$$ \theta _ {t} \{ \omega : {x _ {s} \in B } \} = \{ \omega : {x _ {t+s} \in B } \} , $$
where $ s , t \geq 0 $, $ B \in {\mathcal B} $.
The collection $ X ( t) = ( x _ {t} , F _ {t} , {\mathsf P} _ {x} ) $ is called a (non-terminating) homogeneous Markov process given on $ ( E , {\mathcal B} ) $ if $ {\mathsf P} _ {x} $- almost certainly
$$ \tag{4 } {\mathsf P} _ {x} \{ \theta _ {t} \Lambda \mid F _ {t} \} = {\mathsf P} _ {x _ {t} } \{ \Lambda \} $$
for $ x \in E $, $ t \geq 0 $ and $ \Lambda \in N $. The transition function of $ X ( t) $ is taken to be $ P ( t , x , B ) $, where, unless otherwise indicated, it is required that $ P ( 0 , x , \{ x \} ) \equiv 1 $. It is useful to bear in mind that in the verification of (4) it is only necessary to consider sets of the form $ \Lambda = \{ \omega : {x _ {s} \in B } \} $, where $ s \geq 0 $, $ B \in {\mathcal B} $, and in (4), $ F _ {t} $ may always replaced by the $ \sigma $- algebra $ \overline{F}\; _ {t} $ equal to the intersection of the completions of $ F _ {t} $ relative to all possible measures $ {\mathsf P} _ {x} \{ x \in B \} $. Often, one fixes on $ {\mathcal B} $ a probability measure $ \mu $( the "initial distribution" ) and considers a random Markov function $ ( x _ {t} , F _ {t} , {\mathsf P} _ \mu ) $, where $ {\mathsf P} _ \mu $ is the measure on $ F _ \infty $ given by
$$ {\mathsf P} _ \mu \{ \cdot \} = \int\limits {\mathsf P} _ {x} \{ \cdot \} \mu ( d x ) . $$
A Markov process $ X ( t) = ( x _ {t} , F _ {t} , {\mathsf P} _ {x} ) $ is called progressively measurable if for each $ t > 0 $ the function $ x( s, \omega ) = x _ {s} ( \omega ) $ induces a measurable mapping from $ ( [ 0 , t ] \times \Omega , {\mathcal B} _ {t} \times F _ {t} ) $ to $ ( E , {\mathcal B} ) $, where $ {\mathcal B} _ {t} $ is the $ \sigma $- algebra of Borel subsets of $ [ 0 , t ] $. A right-continuous Markov process is progressively measurable. There is a method for reducing the non-homogeneous case to the homogeneous case (see ), and in what follows homogeneous Markov processes will be discussed.
The strong Markov property.
Suppose that, on a measurable space $ ( E , {\mathcal B} ) $, a Markov process $ X ( t) = ( x _ {t} , F _ {t} , {\mathsf P} _ {x} ) $ is given. A function $ \tau : \Omega \rightarrow [ 0 , \infty ] $ is called a Markov moment (stopping time) if $ \{ \omega : {\tau \leq t } \} \in F _ {t} $ for $ t \geq 0 $. Here a set $ \Lambda \subset \Omega _ \tau = \{ \omega : {\tau < \infty } \} $ is considered in the family $ F _ \tau $ if $ \Lambda _ \cap \{ \omega : {\tau < t } \} \in F _ {t} $ for $ t \geq 0 $( most often $ F _ \tau $ is interpreted as the family of events connected with the evolution of $ X ( t) $ up to time $ \tau $). For $ \Lambda \in N $, set
$$ \theta _ \tau \Lambda = \ \cup _ {t \geq 0 } [ \theta _ {t} \Lambda \cap \{ \omega : {\tau = t } \} ] . $$
A progressively-measurable Markov process $ X $ is called a strong Markov process if for any Markov moment $ \tau $ and all $ t \geq 0 $, $ x \in E $ and $ \Lambda \in N $, the relation
$$ \tag{5 } {\mathsf P} _ {x} \{ \theta _ \tau \Lambda \mid F _ \tau \} = \ {\mathsf P} _ {x _ \tau } \{ \Lambda \} $$
(the strong Markov property) is satisfied $ {\mathsf P} _ {x} $- almost certainly in $ \Omega _ \tau $. In the verification of (5) it suffices to consider only sets of the form $ \Lambda = \{ \omega : {x _ {s} \in B } \} $ where $ s \geq 0 $, $ B \in {\mathcal B} $; in this case $ \theta _ \tau \Lambda = \{ \omega : {x _ {s + \tau } \in B } \} $. For example, any right-continuous Feller–Markov process on a topological space $ E $ is a strong Markov process. A Markov process is called a Feller–Markov process if the function
$$ P ^ {t} f ( \cdot ) = \int\limits f ( y) P ( t , \cdot , d y ) $$
is continuous whenever $ f $ is continuous and bounded.
In the case of strong Markov processes various subclasses have been distinguished. Let the Markov transition function $ P ( t , x , B ) $, given on a locally compact metric space $ E $, be stochastically continuous:
$$ \lim\limits _ {t \downarrow 0 } \ P ( t , x , U ) = 1 $$
for any neighbourhood $ U $ of each point $ x \in B $. If $ P ^ {t} $ maps the class of continuous functions that vanish at infinity into itself, then $ P ( t , x , B ) $ corresponds to a standard Markov process $ X $. That is, a right-continuous strong Markov process for which: 1) $ F _ {t} = \overline{F}\; _ {t} $ for $ t \in [ 0 , \infty ) $ and $ F _ {t} = \cap _ {s>} t F _ {s} $ for $ t \in [ 0 , \infty ) $; 2) $ \lim\limits _ {n \rightarrow \infty } x _ {\tau _ {n} } = x _ \tau $, $ P _ {x} $- almost certainly on the set $ \{ \omega : {\tau < \infty } \} $, where $ \tau = \lim\limits _ {n \rightarrow \infty } \tau _ {n} $ and $ \tau _ {n} $( $ n \geq 1 $) are Markov moments that are non-decreasing as $ n $ increases.
Terminating Markov processes.
Frequently, a physical system can be best described using a non-terminating Markov process, but only in a time interval of random length. In addition, even simple transformations of a Markov process may lead to processes with trajectories given on random intervals (see Functional of a Markov process). Guided by these considerations one introduces the notion of a terminating Markov process.
Let $ \widetilde{X} ( t) = ( \widetilde{X} _ {t} , \widetilde{F} _ {t} , \widetilde {\mathsf P} _ {x} ) $ be a homogeneous Markov process in a phase space $ ( \widetilde{E} , {\mathcal B} tilde ) $, having a transition function $ \widetilde{P} ( t , x , N ) $, and let there be a point $ e \in \widetilde{E} $ and a function $ \zeta : \Omega \rightarrow [ 0 , \infty ) $ such that $ \widetilde{x} _ {t} ( \omega ) = e $ for $ \zeta ( \omega ) \leq t $ and $ \widetilde{x} _ {t} ( \omega ) \neq e $ otherwise (unless stated otherwise, take $ \zeta > 0 $). A new trajectory $ x _ {t} ( \omega ) $ is given for $ t \in [ 0 , \zeta ( \omega ) ) $ by the equality $ x _ {t} ( \omega ) = \widetilde{x} _ {t} ( \omega ) $, and $ F _ {t} $ is defined as the trace of $ \widetilde{F} _ {t} $ on the set $ \{ \omega : {\zeta > t } \} $.
The collection $ X ( t) = ( x _ {t} , \zeta , F _ {t} , \widetilde {\mathsf P} _ {x} ) $, where $ x \in E = \widetilde{E} \setminus \{ e \} $, is called the terminating Markov process obtained from $ \widetilde{X} ( t) $ by censoring (or killing) at the time $ \zeta $. The variable $ \zeta $ is called the censoring time or lifetime of the terminating Markov process. The phase space of the new process is $ ( E , {\mathcal B} ) $, where $ {\mathcal B} $ is the trace of the $ \sigma $- algebra $ {\mathcal B} tilde $ in $ E $. The transition function of a terminating Markov process is the restriction of $ \widetilde{P} ( t , x , B ) $ to the set $ t \geq 0 $, $ x \in E $, $ B \subset {\mathcal B} $. The process $ X ( t) $ is called a strong Markov process or a standard Markov process if $ \widetilde{X} ( t) $ has the corresponding property. A non-terminating Markov process can be considered as a terminating Markov process with censoring time $ \zeta \equiv \infty $. A non-homogeneous terminating Markov process is defined similarly.
M.G. Shur
Markov processes and differential equations.
A Markov process of Brownian-motion type is closely connected with partial differential equations of parabolic type. The transition density $ p ( s , x , t , y ) $ of a diffusion process satisfies, under certain additional assumptions, the backward and forward Kolmogorov equations (cf. Kolmogorov equation):
$$ \tag{6 } \frac{\partial p }{\partial s } + \sum_{k=1}^ { n } a _ {k} ( s , x ) \frac{\partial p }{\partial x _ {k} } + \frac{1}{2} \sum _ {k , j = 1 } ^ { n } b _ {kj} ( s , x ) \frac{\partial ^ {2} p }{\partial x _ {k} \partial x _ {j} } = $$
$$ = \ \frac{\partial p }{\partial s } + L ( s , x ) p = 0 , $$
$$ \tag{7 } \frac{\partial p }{\partial t } = - \sum_{k=1}^ { n } \frac \partial {\partial y _ {k} } ( a _ {k} ( t , y ) p ) + $$
$$ + \frac{1}{2} \sum _ {k , j = 1 } ^ { n } \frac{\partial ^ {2} }{ \partial y _ {k} \partial y _ {j} } ( b _ {kj} ( t , y ) p ) = L ^ {*} ( t , y ) p . $$
The function $ p ( s , x , t , y ) $ is the Green's function of the equations (6)–(7), and the first known methods for constructing diffusion processes were based on existence theorems for this function for the partial differential equations (6)–(7). For a time-homogeneous process the operator $ L ( s , x ) = L ( x ) $ coincides on smooth functions with the infinitesimal operator of the Markov process (see Transition-operator semi-group).
The expectations of various functionals of diffusion processes are solutions of boundary value problems for the differential equation . Let $ {\mathsf E} _ {s , x } ( \cdot ) $ be the expectation with respect to the measure $ {\mathsf P} _ {s , x } $. Then the function $ {\mathsf E} _ {s ,x } \phi ( X ( T) ) = u _ {1} ( s , x ) $ satisfies (6) for $ s < T $ and $ u _ {1} ( T , x ) = \phi ( x ) $.
Similarly, the function
$$ u _ {2} ( s , x ) = \ {\mathsf E} _ {s , x } \int\limits _ { s } ^ { T } f ( t , X ( t) ) dt $$
satisfies, for $ s < T $,
$$ \frac{\partial u _ {2} }{\partial s } + L ( s , x ) u _ {2} = \ - f ( s , x ) , $$
and $ u _ {2} ( T , x ) = 0 $.
Let $ \tau $ be the time at which the trajectories of $ X ( t) $ first hit the boundary $ \partial D $ of a domain $ D \subset \mathbf R ^ {n} $, and let $ \tau \wedge T = \min ( \tau , T ) $. Then, under certain conditions, the function
$$ u _ {3} ( s , x ) = \ {\mathsf E} _ {s , x } \int\limits _ { s } ^ { \tau \wedge T } f ( t , X ( t) ) dt + {\mathsf E} _ {s , x } \phi ( \tau \wedge T , X ( \tau \wedge T )) $$
satisfies
$$ \frac{\partial u }{\partial s } + L ( s , x ) u = - f $$
and takes the value $ \phi $ on the set
$$ \Gamma = \ \{ s < T , x \in \partial D \} \cup \{ s = T , x \in D \} . $$
The solution of the first boundary value problem for a general second-order linear parabolic equation
$$ \tag{8 } \left . \begin{array}{c} \frac{\partial u }{\partial s } + L ( s , x ) u + c ( s , x ) u = - f ( s ,x ) , \\ u \mid _ \Gamma = \phi , \\ \end{array} \right \} $$
can, under fairly general assumptions, be described in the form
$$ \tag{9 } u ( s , x ) = \ {\mathsf E} _ {s , x } \int\limits _ { s } ^ { \tau \wedge T } \mathop{\rm exp} \left \{ \int\limits _ { s } ^ { v } c ( t , X ( t) ) dt \right \} f ( v , X ( v) ) dv + $$
$$ + {\mathsf E} _ {s , x } \left \{ \mathop{\rm exp} \left \{ \int\limits _ { s } ^ { \tau \wedge T } c ( t , X ( t) ) \ dt \right \} \phi ( \tau \wedge T , X ( \tau \wedge T ) ) \right \} . $$
When the operator $ L $ and the functions $ c $ and $ f $ do not depend on $ s $, a representation similar to (9) is possible also for the solution of a linear elliptic equation. More precisely, the function
$$ \tag{10 } u ( x) = \ {\mathsf E} _ {x} \int\limits _ { 0 } ^ \tau \mathop{\rm exp} \left \{ \int\limits _ { 0 } ^ { v } c ( X ( t) ) dt \right \} f ( X ( v) ) dv + $$
$$ + {\mathsf E} _ {x} \left \{ \mathop{\rm exp} \left \{ \int\limits _ { 0 } ^ \tau c ( X ( t) ) dt \right \} \phi ( X ( \tau ) ) \right \} $$
is, under certain assumptions, the solution of
$$ \tag{11 } L ( x ) u + c ( x) u = \ - f ( x) ,\ \ u \mid _ {\partial D } = \phi . $$
When $ L $ is degenerate $ ( \mathop{\rm det} b ( s , x ) = 0 ) $ or $ \partial D $ is not sufficiently "smooth" , the boundary values need not be taken by the functions (9), (10) at individual points or on whole sets. The notion of a regular boundary point for $ L $ has a probabilistic interpretation. At regular points the boundary values are attained by (9), (10). The solution of (8) and (11) allows one to study the properties of the corresponding diffusion processes and functionals of them.
There are methods for constructing Markov processes which do not rely on the construction of solutions of (6) and (7). For example, the method of stochastic differential equations (cf. Stochastic differential equation), of absolutely-continuous change of measure, etc. This situation, together with the formulas (9) and (10), gives a probabilistic route to the construction and study of the properties of boundary value problems for (8) and also to the study of properties of the solutions of the corresponding elliptic equation.
Since the solution of a stochastic differential equation is insensitive to degeneracy of $ b ( s , x ) $, probabilistic methods can be applied to construct solutions of degenerate elliptic and parabolic differential equations. The extension of the averaging principle of N.M. Krylov and N.N. Bogolyubov to stochastic differential equations allows one, with the help of (9), to obtain corresponding results for elliptic and parabolic differential equations. It turns out that certain difficult problems in the investigation of properties of solutions of equations of this type with small parameters in front of the highest derivatives can be solved by probabilistic arguments. Even the solution of the second boundary value problem for (6) has a probabilistic meaning. The formulation of boundary value problems for unbounded domains is closely connected with recurrence in the corresponding diffusion process.
In the case of a time-homogeneous process ( $ L $ is independent of $ s $), a positive solution of $ L ^ {*} q = 0 $ coincides, under certain assumptions and up to a multiplicative constant, with the stationary density of the distribution of a Markov chain. Probabilistic arguments turn out to be useful even for boundary value problems for non-linear parabolic equations.
R.Z. Khas'minskii
References to all sections are given below.
References
[M] | A.A. Markov, "?", Izv. Fiz.-Mat. Obshch. Kazan. Univ. , 15 : 4 (1906) pp. 135–156 |
[B] | L. Bachelier, "?", Ann. Sci. Ecole Norm. Sup. , 17 (1900) pp. 21–86 |
[Ko] | A.N. Kolmogorov, "Ueber die analytischen Methoden in der Wahrscheinlichkeitsrechnung" Math. Ann. , 104 (1931) pp. 415–458 |
[C] | K.L. Chung, "Markov chains with stationary transition probabilities" , Springer (1967) MR0217872 Zbl 0146.38401 |
[F] | W. Feller, "The general diffusion operator and positivity-preserving semi-groups in one dimension" Ann. of Math. , 60 (1954) pp. 417–436 MR0065809 |
[DY] | E.B. Dynkin, A.A. Yushkevich, "Strong Markov processs" Theor. Veroyatnost. i Primenen. , 1 : 1 (1956) pp. 149–155 (In Russian) (English abstract) |
[H] | G.A. Hunt, "Markov processes and potentials I" Illinois J. Math. , 1 (1957) pp. 44–93 |
[H2] | G.A. Hunt, "Markov processes and potentials II" Illinois J. Math. , 1 (1957) pp. 316–369 |
[H3] | G.A. Hunt, "Markov processes and potentials III" Illinois J. Math. , 2 (1958) pp. 151–213 |
[De] | C. Dellacherie, "Capacités et processus stochastiques" , Springer (1972) MR0448504 Zbl 0246.60032 |
[Dy] | E.B. Dynkin, "Theory of Markov processes" , Pergamon (1960) (Translated from Russian) MR2305744 MR1009436 MR0245096 MR1531923 MR0131900 MR0131898 Zbl 1116.60001 Zbl 0689.60003 Zbl 0222.60048 Zbl 0091.13605 |
[D2] | E.B. Dynkin, "Markov processes" , 1 , Springer (1965) (Translated from Russian) MR0193671 Zbl 0132.37901 |
[GS] | I.I. Gihman, A.V. Skorohod, "The theory of stochastic processes" , 2 , Springer (1979) (Translated from Russian) MR0651014 MR0651015 Zbl 0404.60061 |
[Fr] | M.I. Freidlin, "Markov processes and differential equations" Itogi Nauk. Teor. Veroyatnost. Mat. Statist. Teoret. Kibernet. 1966 (1967) pp. 7–58 (In Russian) MR0235308 Zbl 0863.60049 |
[Kh] | R.Z. Khas'minskii, "Principle of averaging for parabolic and elliptic partial differential equations and for Markov processes with small diffusion" Theor. Probab. Appl. , 8 (1963) pp. 1–21 Teor. Veroyatnost. i Primenen. , 8 : 1 (1963) pp. 3–25 |
[VF] | A.D. Venttsel', M.I. Freidlin, "Random perturbations of dynamical systems" , Springer (1984) (Translated from Russian) MR722136 |
[BG] | R.M. Blumenthal, R.K. Getoor, "Markov processes and potential theory" , Acad. Press (1968) MR0264757 Zbl 0169.49204 |
[G] | R.K. Getoor, "Markov processes: Ray processes and right processes" , Lect. notes in math. , 440 , Springer (1975) MR0405598 Zbl 0299.60051 |
[Kuz] | S.E. Kuznetsov, "Any Markov process in a Borel space has a transition function" Theor. Prob. Appl. , 25 (1980) pp. 384–388 Teor. Veroyatnost. i Primenen. , 25 : 2 (1980) pp. 389–393 MR0572574 Zbl 0456.60077 Zbl 0431.60071 |
Comments
References
[SV] | D.W. Stroock, S.R.S. Varadhan, "Multidimensional diffusion processes" , Springer (1979) MR0532498 Zbl 0426.60069 |
[C2] | K.L. Chung, "Lectures from Markov processes to Brownian motion" , Springer (1982) MR0648601 Zbl 0503.60073 |
[Do] | J.L. Doob, "Stochastic processes" , Wiley (1953) MR1570654 MR0058896 Zbl 0053.26802 |
[We] | A.D. Wentzell, "A course in the theory of stochastic processes" , McGraw-Hill (1981) (Translated from Russian) MR0781738 MR0614594 Zbl 0502.60001 |
[Kur] | T.G. Kurtz, "Markov processes" , Wiley (1986) MR0838085 Zbl 0592.60049 |
[F2] | W. Feller, "An introduction to probability theory and its applications" , 1–2 , Wiley (1966) MR0210154 Zbl 0138.10207 |
[Wa] | N. Wax (ed.), Selected papers on noise and stochastic processes , Dover, reprint (1954) Zbl 0059.11903 |
[Ka] | M. Kac, "Probability and related topics in physical sciences" , Interscience (1959) pp. Chapt. 4 MR0102849 Zbl 0087.33003 |
[Le] | P. Lévy, "Processus stochastiques et mouvement Brownien" , Gauthier-Villars (1965) MR0190953 Zbl 0137.11602 |
[Lo] | M. Loève, "Probability theory" , II , Springer (1978) MR0651017 MR0651018 Zbl 0385.60001 |
Markov process. Encyclopedia of Mathematics. URL: http://encyclopediaofmath.org/index.php?title=Markov_process&oldid=26570