Fubini-Tonelli, Stochastic Kernel
The Fubini-Tonelli theorem is concerned with the construction of product measures on finite product spaces and with their properties. The construction relies on Carathéodory's extension Theorem. In the following, we consider the two-dimensional case.
Let \((\Omega_1,\mathcal{F}_1)\) and \((\Omega_2, \mathcal{F}_2)\) be two measurable spaces and define \(\Omega =\Omega_1 \times \Omega_2\) endowed with the product \(\sigma\)-algebra \(\mathcal{F}=\mathcal{F}_1\otimes \mathcal{F}_2\).
Proposition
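Let \(A\) be an event in the product \(\sigma\)-algebra \(\mathcal{F}=\mathcal{F}_1\otimes \mathcal{F}_2\) and \(X\colon \Omega \to \mathbb{R}\) be a measurable function. For \(\omega_1 \in \Omega_1\) and \(\omega_2 \in \Omega_2\), define the sections
\[A_{\omega_1}=\{\omega_2 \in \Omega_2\colon (\omega_1,\omega_2)\in A\},\qquad A_{\omega_2}=\{\omega_1 \in \Omega_1\colon (\omega_1,\omega_2)\in A\},\]
\[X_{\omega_1}\colon \omega_2\mapsto X(\omega_1,\omega_2),\qquad X_{\omega_2}\colon \omega_1\mapsto X(\omega_1,\omega_2).\]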
Then it holds that \(A_{\omega_1}\) and \(A_{\omega_2}\) are events in \(\mathcal{F}_2\) and \(\mathcal{F}_1\) respectively. Furthermore, \(X_{\omega_1}\) is \(\mathcal{F}_2\)-measurable, and \(X_{\omega_2}\) is \(\mathcal{F}_1\)-measurable.
Proof
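Let \(\mathcal{C}\) be the collection of those \(A\in\mathcal{F}\) such that \(A_{\omega_1}\in \mathcal{F}_2\) and \(A_{\omega_2}\in \mathcal{F}_1\) for all \(\omega_1\in \Omega_1\) and \(\omega_2 \in \Omega_2\). For a rectangle \(A=A_1\times A_2\) in \(\mathcal{F}_1\times \mathcal{F}_2\), the section \(A_{\omega_1}\) is either \(A_2\) or \(\emptyset\), and \(A_{\omega_2}\) is either \(A_1\) or \(\emptyset\), so that \(\mathcal{F}_1\times \mathcal{F}_2\subseteq \mathcal{C}\). Since taking sections commutes with complements and countable unions, \(\mathcal{C}\) is a \(\sigma\)-algebra, and therefore
\[\mathcal{F}=\sigma(\mathcal{F}_1\times \mathcal{F}_2)\subseteq \mathcal{C}\subseteq \mathcal{F},\]
showing the first assertion.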
As for the second point, let \(B\) be a Borel set in \(\mathbb{R}\). Then \(\{X_{\omega_1}\in B\}=\{X\in B\}_{\omega_1}\), which is an element of \(\mathcal{F}_2\) by what has just been shown, since \(\{X\in B\}\) is in \(\mathcal{F}\). Hence, \(X_{\omega_1}\) is \(\mathcal{F}_2\)-measurable. The same argument applies to \(X_{\omega_2}\).
Definition: Stochastic Kernel
A stochastic kernel on \(\Omega_1\times \mathcal{F}_2\) is a function \(K\colon \Omega_1\times \mathcal{F}_2\to [0,1]\) such that
- \(\omega_1\mapsto K(\omega_1,A_2)\) is \(\mathcal{F}_1\)-measurable for every event \(A_2\) in \(\mathcal{F}_2\).
- \(A_2\mapsto K(\omega_1,A_2)\) is a probability measure on \(\mathcal{F}_2\) for every \(\omega_1 \in \Omega_1\).
A stochastic kernel is, in some sense, a measurable family of probability measures on \(\mathcal{F}_2\), one for each state \(\omega_1\) in \(\Omega_1\). A special case of a stochastic kernel is the constant one \(K(\omega_1,\cdot)=P_2\) for all states \(\omega_1\) in \(\Omega_1\), where \(P_2\) is a probability measure on \(\mathcal{F}_2\).
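To make the definition concrete, here is a minimal sketch on finite spaces, where a stochastic kernel is simply a table whose rows are probability vectors. The state spaces and the numbers in `KERNEL` are hypothetical and chosen only for illustration; on finite spaces endowed with their power sets, the measurability condition is automatically satisfied.

```python
from fractions import Fraction as F

# Hypothetical finite state spaces (not from the text).
OMEGA_1 = [0, 1]
OMEGA_2 = ["a", "b", "c"]

# Row omega_1 lists the point masses K(omega_1, {omega_2}).
KERNEL = {
    0: {"a": F(1, 2), "b": F(1, 2), "c": F(0)},
    1: {"a": F(1, 4), "b": F(1, 4), "c": F(1, 2)},
}


def K(omega_1, A_2):
    """K(omega_1, A_2): mass that the measure K(omega_1, .) gives to the event A_2."""
    return sum((KERNEL[omega_1][w2] for w2 in A_2), F(0))


def is_stochastic_kernel():
    """Check that every row is a probability measure on OMEGA_2."""
    return all(
        K(w1, OMEGA_2) == 1 and all(p >= 0 for p in KERNEL[w1].values())
        for w1 in OMEGA_1
    )
```

For instance, `K(1, {"a", "c"})` adds the masses of `"a"` and `"c"` in the row of state `1`.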
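Given a probability measure \(P_1\) on \(\mathcal{F}_1\), we want to define a probability measure \(P\) on the product \(\sigma\)-algebra \(\mathcal{F}\) such that
\[P[A_1\times A_2]=\int_{A_1}K(\omega_1,A_2)\,P_1(d\omega_1)\]
for all \(A_1\in \mathcal{F}_1\) and \(A_2 \in \mathcal{F}_2\).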
This is the subject of the following theorem.
Stochastic variant of Tonelli's Theorem
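Let \(P_1\) be a probability measure on \(\mathcal{F}_1\) and \(K\) a stochastic kernel on \(\Omega_1\times \mathcal{F}_2\). Then there exists a unique probability measure \(P\) on \(\mathcal{F}\) such that for every positive random variable \(X\colon \Omega \to \mathbb{R}\), it holds
\[\int_{\Omega}X\,dP=\int_{\Omega_1}\left(\int_{\Omega_2}X(\omega_1,\omega_2)\,K(\omega_1,d\omega_2)\right)P_1(d\omega_1).\]
In particular,
\[P[A]=\int_{\Omega_1}K(\omega_1,A_{\omega_1})\,P_1(d\omega_1)\]
for any event \(A\) in \(\mathcal{F}\).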
Proof
Define \(\mathcal{R}=\mathcal{F}_1\times \mathcal{F}_2\) and the set function \(P\colon \mathcal{R}\to [0,1]\) given by
\[P[A_1\times A_2]=\int_{A_1}K(\omega_1,A_2)\,P_1(d\omega_1)\]
for any element \(A_1 \times A_2\) in \(\mathcal{R}\). Inspection shows that \(\mathcal{R}\) is a semi-ring that contains \(\Omega\). To apply Carathéodory's extension Theorem, we must show that \(P\) is a \(\sigma\)-additive pre-measure.
It clearly holds that \(P[\emptyset]=0\) and
\[P[\Omega]=\int_{\Omega_1}K(\omega_1,\Omega_2)\,P_1(d\omega_1)=P_1[\Omega_1]=1.\]
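Let \((A_1^n\times A_2^n)\) be a sequence of pairwise disjoint elements of \(\mathcal{R}\) such that \(\cup A_1^n\times A_2^n=A_1\times A_2\) for some \(A_1\in \mathcal{F}_1\) and \(A_2 \in \mathcal{F}_2\), and define the functions
\[f_n(\omega_1)=1_{A_1^n}(\omega_1)K(\omega_1,A_2^n)\quad\text{and}\quad f(\omega_1)=1_{A_1}(\omega_1)K(\omega_1,A_2),\]
which are positive and \(\mathcal{F}_1\)-measurable.
Furthermore, due to the pairwise disjointness of \((A_1^n\times A_2^n)\), it holds \(1_{A_1}(\omega_1)1_{A_2}(\omega_2)=\sum_n 1_{A_1^n}(\omega_1)1_{A_2^n}(\omega_2)\). Integrating in \(\omega_2\) with respect to \(K(\omega_1,\cdot)\), it follows from monotone convergence that
\[f(\omega_1)=\sum_n f_n(\omega_1)\]
for any state \(\omega_1\) in \(\Omega_1\).
Hence, once again, monotone convergence yields
\[P[A_1\times A_2]=\int_{\Omega_1}f\,dP_1=\sum_n\int_{\Omega_1}f_n\,dP_1=\sum_n P[A_1^n\times A_2^n],\]
showing the \(\sigma\)-additivity.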
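It follows that we can apply Carathéodory's extension Theorem, ensuring the existence of a unique measure \(P\) on \(\mathcal{F}\) satisfying
\[P[A_1\times A_2]=\int_{A_1}K(\omega_1,A_2)\,P_1(d\omega_1)\]
for all \(A_1\in \mathcal{F}_1\) and \(A_2\in \mathcal{F}_2\).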
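Let us now show that
\[P[A]=\int_{\Omega_1}K(\omega_1,A_{\omega_1})\,P_1(d\omega_1)\]
holds for every event \(A\) in \(\mathcal{F}\).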
Define the collection \(\mathcal{C}\) of those events \(A\) in \(\mathcal{F}\) such that this relation holds.
For \(A=A_1\times A_2\), it follows that \(A_{\omega_1}=A_2\) if \(\omega_1 \in A_1\) and \(\emptyset\) otherwise.
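It follows that
\[\int_{\Omega_1}K(\omega_1,A_{\omega_1})\,P_1(d\omega_1)=\int_{A_1}K(\omega_1,A_2)\,P_1(d\omega_1)=P[A_1\times A_2],\]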
showing that \(\mathcal{F}_1\times \mathcal{F}_2\subseteq \mathcal{C}\).
In particular, \(\Omega \in \mathcal{C}\).
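Furthermore, for every pairwise disjoint sequence \((A^n)\) of elements in \(\mathcal{C}\), denoting \(A=\cup A^n\), it follows from monotone convergence that
\[P[A]=\sum_n P[A^n]=\sum_n\int_{\Omega_1}K(\omega_1,A^n_{\omega_1})\,P_1(d\omega_1)=\int_{\Omega_1}\sum_n K(\omega_1,A^n_{\omega_1})\,P_1(d\omega_1)=\int_{\Omega_1}K(\omega_1,A_{\omega_1})\,P_1(d\omega_1),\]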
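showing that \(A \in \mathcal{C}\). Finally, for \(A \in \mathcal{C}\), since \((A^c)_{\omega_1}=(A_{\omega_1})^c\), it follows that
\[P[A^c]=1-P[A]=\int_{\Omega_1}\left(1-K(\omega_1,A_{\omega_1})\right)P_1(d\omega_1)=\int_{\Omega_1}K(\omega_1,(A^c)_{\omega_1})\,P_1(d\omega_1),\]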
showing that \(A^c \in \mathcal{C}\). Hence, \(\mathcal{C}\) is a \(\lambda\)-system that contains the \(\pi\)-system \(\mathcal{F}_1\times \mathcal{F}_2\). By Dynkin's \(\pi\)-\(\lambda\) lemma, it follows that
\[\mathcal{F}=\sigma(\mathcal{F}_1\times \mathcal{F}_2)\subseteq \mathcal{C}\subseteq \mathcal{F},\]
showing that \(\mathcal{C}=\mathcal{F}\), that is, showing that the second equation of the Theorem holds for any event \(A\) in \(\mathcal{F}\).
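As for the expectation equality of the theorem, it holds for indicator functions by what has just been shown, extends to positive step functions by linearity, and finally to every positive random variable \(X\colon\Omega \to \mathbb{R}\) by monotone convergence, since such an \(X\) is the pointwise limit of an increasing sequence of positive step functions. This ends the proof.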
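On finite spaces, the measure \(P\) of the theorem can be written down explicitly, which gives a quick sanity check of both formulas. The following sketch assumes hypothetical spaces, weights `P1`, and kernel table `KERNEL` chosen only for illustration. It compares the measure of an event computed from point masses with the section formula, and the expectation of a positive function with the iterated integral.

```python
from fractions import Fraction as F
from itertools import product

# Hypothetical finite model (not from the text).
OMEGA_1 = [0, 1]
OMEGA_2 = ["a", "b", "c"]
P1 = {0: F(1, 3), 1: F(2, 3)}
KERNEL = {
    0: {"a": F(1, 2), "b": F(1, 2), "c": F(0)},
    1: {"a": F(1, 4), "b": F(1, 4), "c": F(1, 2)},
}


def joint():
    """Point masses of P: P[{(w1, w2)}] = P1[{w1}] * K(w1, {w2})."""
    return {(w1, w2): P1[w1] * KERNEL[w1][w2] for w1, w2 in product(OMEGA_1, OMEGA_2)}


def prob_via_sections(A):
    """Section formula: sum over w1 of K(w1, A_{w1}) * P1[{w1}]."""
    total = F(0)
    for w1 in OMEGA_1:
        section = [w2 for w2 in OMEGA_2 if (w1, w2) in A]  # the section A_{w1}
        total += sum((KERNEL[w1][w2] for w2 in section), F(0)) * P1[w1]
    return total


def expectation_iterated(X):
    """Inner integral against K(w1, .), then outer integral against P1."""
    total = F(0)
    for w1 in OMEGA_1:
        inner = sum((X(w1, w2) * KERNEL[w1][w2] for w2 in OMEGA_2), F(0))
        total += P1[w1] * inner
    return total


def expectation_joint(X):
    """Integral of X against the joint measure P."""
    return sum((X(w1, w2) * p for (w1, w2), p in joint().items()), F(0))
```

Exact rational arithmetic via `fractions.Fraction` is used so that both sides can be compared with `==` rather than up to rounding.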
Definition: Product Measure
With the notation of the previous theorem, we denote this unique measure by \(P=P_1\otimes K\).
If \(K(\omega_1,\cdot)=P_2\) for all \(\omega_1 \in \Omega_1\), for some probability measure \(P_2\) on \(\mathcal{F}_2\), then \(P\) is called the product measure of \(P_1\) and \(P_2\) on the product space and is denoted by \(P=P_1\otimes P_2\).
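In the case of a product measure, due to symmetry, it holds in particular
\[\int_{\Omega}X\,d(P_1\otimes P_2)=\int_{\Omega_1}\left(\int_{\Omega_2}X(\omega_1,\omega_2)\,P_2(d\omega_2)\right)P_1(d\omega_1)=\int_{\Omega_2}\left(\int_{\Omega_1}X(\omega_1,\omega_2)\,P_1(d\omega_1)\right)P_2(d\omega_2)\]
for every positive random variable \(X\colon \Omega\to \mathbb{R}\).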
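For a product measure on finite spaces, the interchange of the order of integration reduces to swapping two finite sums. The sketch below uses hypothetical weights `P1` and `P2` to illustrate this. It is only a numerical illustration of the symmetry, not a proof.

```python
from fractions import Fraction as F

# Hypothetical marginal distributions (not from the text).
P1 = {0: F(1, 3), 1: F(2, 3)}
P2 = {"a": F(1, 2), "b": F(1, 4), "c": F(1, 4)}


def integrate_omega2_then_omega1(X):
    """Inner integral against P2, outer integral against P1."""
    total = F(0)
    for w1, p1 in P1.items():
        inner = sum((X(w1, w2) * p2 for w2, p2 in P2.items()), F(0))
        total += p1 * inner
    return total


def integrate_omega1_then_omega2(X):
    """Inner integral against P1, outer integral against P2."""
    total = F(0)
    for w2, p2 in P2.items():
        inner = sum((X(w1, w2) * p1 for w1, p1 in P1.items()), F(0))
        total += p2 * inner
    return total
```

Both functions return the same value for any positive function `X` of the pair `(w1, w2)`.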
Corollary
Let \(X\) be a positive random variable on some probability space \((\Omega,\mathcal{F},P)\). Then it holds
\[E[X]=\int_{0}^{\infty}P[X>x]\,\lambda(dx),\]
where \(\lambda\) is the Lebesgue measure on \(\mathbb{R}\).
Proof
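For almost all states \(\omega\) in \(\Omega\), we have \(X(\omega)\geq 0\), and therefore
\[X(\omega)=\int_{0}^{X(\omega)}\lambda(dx)=\int_{0}^{\infty}1_{\{X(\omega)>x\}}\,\lambda(dx),\]
where \(\lambda\) is the Lebesgue measure on \(\mathbb{R}\). Since \((\omega,x)\mapsto 1_{\{X(\omega)>x\}}\) is a \(\mathcal{F}\otimes \mathcal{B}(\mathbb{R}_+)\)-measurable function, by Fubini-Tonelli for the product measure \(P\otimes \lambda\) (the interchange of integrals remains valid for \(\sigma\)-finite measures such as \(\lambda\)), it holds
\[E[X]=\int_{\Omega}\left(\int_{0}^{\infty}1_{\{X(\omega)>x\}}\,\lambda(dx)\right)P(d\omega)=\int_{0}^{\infty}\left(\int_{\Omega}1_{\{X(\omega)>x\}}\,P(d\omega)\right)\lambda(dx)=\int_{0}^{\infty}P[X>x]\,\lambda(dx).\]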
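The corollary can be checked by hand on a random variable with finitely many non-negative integer values, where \(x\mapsto P[X>x]\) is a step function that is constant on each interval \([k,k+1)\). The distribution `PMF` below is a hypothetical example, used only to illustrate the identity.

```python
from fractions import Fraction as F

# Hypothetical distribution of X on {0, 1, 2, 3} (not from the text).
PMF = {0: F(1, 8), 1: F(3, 8), 2: F(3, 8), 3: F(1, 8)}


def expectation():
    """E[X] computed directly from the point masses."""
    return sum((x * p for x, p in PMF.items()), F(0))


def survival(x):
    """P[X > x]."""
    return sum((p for v, p in PMF.items() if v > x), F(0))


def integral_of_survival():
    """Integral of x -> P[X > x] over [0, infinity).

    For integer-valued X the integrand is constant on each [k, k+1) and
    vanishes beyond the largest value, so the integral is a finite sum.
    """
    return sum((survival(k) for k in range(max(PMF))), F(0))
```

Here both `expectation()` and `integral_of_survival()` evaluate to `3/2`.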
We now address the stochastic variant of Fubini's theorem, which involves a stochastic kernel instead of a single probability measure. Let \(X\) and \(Y\) be two random variables on some probability space \((\Omega,\mathcal{F},P)\). We consider the probability measure \(P_{(X,Y)}\) on the product Borel \(\sigma\)-algebra of \(\mathbb{R}^2\) given by
\[P_{(X,Y)}[B]=P[(X,Y)\in B]\]
for any Borel set \(B\) of \(\mathbb{R}^2\). We suppose that this joint distribution \(P_{(X,Y)}\) can be decomposed into \(P_1\otimes K\) for some probability measure \(P_1\) on \(\mathcal{B}(\mathbb{R})\) and a stochastic kernel \(K\) on \(\mathbb{R}\times \mathcal{B}(\mathbb{R})\). We will see later that this is always the case. Note that by Tonelli's Theorem, for every Borel set \(A\) of \(\mathbb{R}\), it holds
\[P_X[A]=P[X\in A]=P_{(X,Y)}[A\times \mathbb{R}]=\int_{A}K(x,\mathbb{R})\,P_1(dx)=P_1[A],\]
showing that \(P_X=P_1\), justifying therefore the notation \(P_{(X,Y)}=P_X\otimes K_{Y|X}\).
Theorem
Let \(X\) and \(Y\) be two random variables whose joint distribution is given by \(P_X\otimes K_{Y|X}\), where \(P_X\) is the distribution of \(X\) and \(K_{Y|X}\) is a stochastic kernel on \(\mathbb{R}\times \mathcal{B}(\mathbb{R})\).
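For every positive measurable function \(g\colon \mathbb{R}^2\to \mathbb{R}_+\) such that \(g(X,Y)\) is integrable, it holds
\[E\left[g(X,Y)\,|\,X\right]=\int_{\mathbb{R}}g(X,y)\,K_{Y|X}(X,dy)\]
\(P\)-almost surely.
This relation means that for \(P\)-almost all \(\omega \in \Omega\), it holds
\[E\left[g(X,Y)\,|\,X\right](\omega)=\int_{\mathbb{R}}g(X(\omega),y)\,K_{Y|X}(X(\omega),dy).\]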
Proof
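From the proof of Tonelli's Theorem, the function \(x\mapsto h(x):=\int_{\mathbb{R}}g(x,y)K_{Y|X}(x,dy)\) for \(x\) in \(\mathbb{R}\), is measurable, and therefore
\[h(X)=\int_{\mathbb{R}}g(X,y)\,K_{Y|X}(X,dy)\]
is a positive \(\sigma(X)\)-measurable random variable. Let \(A\) be an event in \(\sigma(X)\). It follows that \(A=X^{-1}(B)\) for some Borel set \(B \subseteq \mathbb{R}\). Therefore,
\[E\left[1_A g(X,Y)\right]=\int_{\mathbb{R}^2}1_B(x)g(x,y)\,P_{(X,Y)}(dx,dy)=\int_{\mathbb{R}}1_B(x)\left(\int_{\mathbb{R}}g(x,y)\,K_{Y|X}(x,dy)\right)P_X(dx)=\int_{\mathbb{R}}1_B(x)h(x)\,P_X(dx)=E\left[1_A h(X)\right].\]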
This concludes the proof.
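In a discrete setting, the theorem says that the conditional expectation of \(g(X,Y)\) given \(X=x\) is obtained by integrating \(g(x,\cdot)\) against the row of the kernel indexed by \(x\). The sketch below checks the defining property of the conditional expectation used in the proof, on a hypothetical model whose values, weights, and function `g` are chosen only for illustration.

```python
from fractions import Fraction as F

# Hypothetical discrete model: X takes values in {0, 1}, Y in {0, 1, 2}.
P_X = {0: F(1, 3), 1: F(2, 3)}
K_Y_GIVEN_X = {
    0: {0: F(1, 2), 1: F(1, 2), 2: F(0)},
    1: {0: F(1, 4), 1: F(1, 4), 2: F(1, 2)},
}


def g(x, y):
    """A positive function of the pair (x, y), chosen arbitrarily."""
    return (x + 1) * y * y


def h(x):
    """h(x): integral of g(x, y) against the kernel row K_{Y|X}(x, dy)."""
    return sum((g(x, y) * k for y, k in K_Y_GIVEN_X[x].items()), F(0))


def lhs(B):
    """E[1_B(X) g(X, Y)] under the joint law P_X (x) K_{Y|X}."""
    return sum(
        (P_X[x] * k * g(x, y) for x in B for y, k in K_Y_GIVEN_X[x].items()),
        F(0),
    )


def rhs(B):
    """E[1_B(X) h(X)]."""
    return sum((P_X[x] * h(x) for x in B), F(0))
```

The equality `lhs(B) == rhs(B)` for every subset `B` of the values of `X` is exactly the statement that `h(X)` is a version of the conditional expectation.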
Remark
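As in the previous theorem, let \(X\) and \(Y\) be two random variables whose joint distribution is given by \(P_{(X,Y)}\). Suppose that \(P_{(X,Y)}\) is absolutely continuous with respect to the Lebesgue measure on \(\mathbb{R}^2\). It follows that there exists a Lebesgue-almost everywhere unique positive measurable function \(f_{(X,Y)}\colon \mathbb{R}^2\to \mathbb{R}\) with total integral \(1\) such that
\[P_{(X,Y)}[B]=\int_{B}f_{(X,Y)}(x,y)\,dx\,dy\]
for every Borel set \(B\) of \(\mathbb{R}^2\).
It follows that the densities of \(X\) and \(Y\) are respectively given by
\[f_X(x)=\int_{\mathbb{R}}f_{(X,Y)}(x,y)\,dy\quad\text{and}\quad f_Y(y)=\int_{\mathbb{R}}f_{(X,Y)}(x,y)\,dx.\]
By defining
\[f_{Y|X}(x,y)=\begin{cases}\dfrac{f_{(X,Y)}(x,y)}{f_X(x)} & \text{if } 0<f_X(x)<\infty,\\[1ex] f_Y(y) & \text{otherwise},\end{cases}\]
where the choice of \(f_Y\) in the second case is arbitrary since it only concerns a \(P_X\)-null set of values \(x\), inspection shows that
\[K_{Y|X}(x,B)=\int_{B}f_{Y|X}(x,y)\,dy\]
defines a kernel. It holds
\[\int_{A}K_{Y|X}(x,B)\,P_X(dx)=\int_{A}\int_{B}f_{Y|X}(x,y)f_X(x)\,dy\,dx=\int_{A\times B}f_{(X,Y)}(x,y)\,dx\,dy=P_{(X,Y)}[A\times B]\]
for every \(A\) and \(B\) in \(\mathcal{B}(\mathbb{R})\). From the uniqueness statement of the Fubini-Tonelli theorem, it follows that
\[P_{(X,Y)}=P_X\otimes K_{Y|X}.\]
By the previous theorem, it follows that
\[E\left[g(X,Y)\,|\,X\right]=\int_{\mathbb{R}}g(X,y)f_{Y|X}(X,y)\,dy\]
\(P\)-almost surely, for every positive measurable function \(g\) such that \(g(X,Y)\) is integrable.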
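As a worked illustration of the conditional density, consider the hypothetical joint density \(f_{(X,Y)}(x,y)=x+y\) on the unit square, which is not taken from the text. In this case \(f_X(x)=x+1/2\) and the conditional mean of \(Y\) given \(X=x\) equals \((x/2+1/3)/(x+1/2)\). The sketch below recovers these values numerically with a simple midpoint rule; the helper `midpoint_integral` is an assumption of this example, not a library function.

```python
def joint_density(x, y):
    """Hypothetical joint density f_(X,Y)(x, y) = x + y on the unit square."""
    return x + y if 0.0 <= x <= 1.0 and 0.0 <= y <= 1.0 else 0.0


def midpoint_integral(func, a=0.0, b=1.0, n=2000):
    """Midpoint-rule approximation of the integral of func over [a, b]."""
    step = (b - a) / n
    return step * sum(func(a + (i + 0.5) * step) for i in range(n))


def marginal_x(x):
    """f_X(x): integral of the joint density over y."""
    return midpoint_integral(lambda y: joint_density(x, y))


def conditional_density(x):
    """Return y -> f_{Y|X}(x, y), assuming f_X(x) > 0."""
    fx = marginal_x(x)
    return lambda y: joint_density(x, y) / fx


def conditional_mean(x):
    """Integral of y * f_{Y|X}(x, y) dy, that is, the conditional mean of Y given X = x."""
    density = conditional_density(x)
    return midpoint_integral(lambda y: y * density(y))
```

For example, `conditional_mean(0.5)` is close to \(7/12\), and `conditional_density(x)` integrates to approximately \(1\) for every `x` in the unit interval, as expected of a kernel.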