Grimmett & Stirzaker 7.11.31: Log-likelihood estimation of Markov transition matrix
I have trouble understanding the notation in exercise 7.11.31:
I don't understand what $\lambda(\mathbf{P})$ is. Let's take the factor $f_{X_0}$. My current understanding is as follows. We know that $\mathbb{P}(X_n=i)$ and $\mathbb{P}(X_n(\omega)=i)$ are shorthand for $\mathbb{P}(\{\omega \mid X_n(\omega)=i\})$. Based on this I introduce another shorthand notation $\mathbb{P}_\omega(X_n(\omega) = i) = \mathbb{P}(X_n = i)$. Using this notation I interpret
$$
f_{X_0} = f_{X_0}(\beta) = \mathbb{P}_{\alpha}(X_0(\alpha)=X_0(\beta)),
$$
which is itself a random variable having the probability mass function
$$
\mathbb{P}_\beta\bigl(\mathbb{P}_{\alpha}(X_0(\alpha)=X_0(\beta)) = p\bigr).
$$
Similarly, if I pick the factor $p_{X_0,X_1}$ then I interpret
$$
p_{X_0,X_1} = p_{X_0,X_1}(\beta) = \mathbb{P}_\alpha(X_{1}(\alpha) = X_1(\beta) \mid X_0(\alpha) = X_0(\beta)),
$$
which is a random variable having probability mass function
$$
\mathbb{P}_\beta\bigl(\mathbb{P}_\alpha(X_{1}(\alpha) = X_1(\beta) \mid X_0(\alpha) = X_0(\beta)) = p\bigr).
$$
If this is correct, how do I interpret the random variables and their respective probability mass functions?
Tags: probability, probability-theory, notation, markov-chains, self-learning
asked Dec 19 '18 at 20:18 by Angelos; edited Dec 20 '18 at 1:07 by grand_chat
1 Answer
You should interpret an expression such as
$$
\log (f_{X_0}\,p_{X_0,X_1}\,p_{X_1,X_2})\tag{1}
$$
as the function
$$
h(i, j, k):= \log(f_i\,p_{i,j}\,p_{j,k})\tag{2}
$$
evaluated at $i=X_0,\ j=X_1,\ k=X_2$. In other words, take the function $h$ defined in (2), which is a function of three integer inputs that returns a real-valued output, and plug in $X_0, X_1, X_2$ in place of those inputs. The result is $h(X_0, X_1, X_2)$, a random variable, which we write in the form (1).
Analogously, the log-likelihood function $\lambda(\mathbf{P})$ that involves $X_0, X_1,\ldots,X_n$ is obtained by plugging $X_0, X_1,\ldots, X_n$ into a function that takes $n+1$ integer inputs.
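For concreteness, here is a minimal sketch of what "plugging the chain into that function" means, using a hypothetical two-state chain that is not from the exercise: simulate $X_0,\ldots,X_n$ and evaluate the natural extension of (1), namely $\log f_{X_0} + \sum_{k=1}^n \log p_{X_{k-1},X_k}$, at the realised values. Rerunning the simulation gives a different number each time, which is exactly the sense in which $\lambda(\mathbf{P})$ is a random variable.

```python
import numpy as np

# Hypothetical two-state chain, chosen only to illustrate the notation
# (these numbers are not from the exercise).
f = np.array([0.5, 0.5])              # initial distribution f_i
P = np.array([[0.9, 0.1],
              [0.2, 0.8]])            # transition matrix p_{ij}

rng = np.random.default_rng(0)

def simulate_chain(n):
    """Draw a realisation x_0, ..., x_n of the chain (f, P)."""
    x = [rng.choice(2, p=f)]
    for _ in range(n):
        x.append(rng.choice(2, p=P[x[-1]]))
    return np.array(x)

def log_likelihood(x, f, P):
    """The deterministic function of n+1 integer inputs, as in form (2):
    log f_{x_0} + sum_k log p_{x_{k-1}, x_k}."""
    return np.log(f[x[0]]) + np.log(P[x[:-1], x[1:]]).sum()

x = simulate_chain(100)
print(log_likelihood(x, f, P))   # one realisation of lambda(P); rerunning the
                                 # simulation gives another, hence a random variable
```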
I'm not sure where the authors are headed with this, but if the goal is to estimate the matrix $\mathbf{P}$ of transition probabilities, the typical setup is to observe the Markov chain for a while, obtaining observed values $x_0, x_1, \ldots, x_n$. These are plugged into the log-likelihood, and the log-likelihood is then manipulated to obtain plausible estimates of the transition probabilities. For the duration of this exercise we imagine the observed values are frozen, and therefore treated as constants, as if we were manipulating form (2) (the deterministic version) instead of form (1) (the random version). In reality the end results (the estimated transition probabilities) are functions of the data and can be considered random variables, so they are estimators in the statistical sense, and we can study properties of these estimators such as expectation and variance.
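Continuing the sketch above, and assuming the standard closed form that this maximisation is known to produce (namely $\hat p_{ij} = n_{ij}/\sum_k n_{ik}$, where $n_{ij}$ counts the observed transitions from $i$ to $j$; this is the usual result, not a quotation from the book), the estimation step looks like this:

```python
def estimate_P(x, num_states):
    """Maximum-likelihood estimate of the transition matrix from one observed path.

    Counts n_{ij} = #{k >= 1 : x_{k-1} = i, x_k = j} and normalises each row;
    rows for states that were never visited are left as zeros.
    """
    counts = np.zeros((num_states, num_states))
    for a, b in zip(x[:-1], x[1:]):
        counts[a, b] += 1.0
    row_sums = counts.sum(axis=1, keepdims=True)
    return np.divide(counts, row_sums,
                     out=np.zeros_like(counts), where=row_sums > 0)

print(estimate_P(x, 2))   # close to P above when the observed path is long
```

The printed matrix is a function of the simulated data, so rerunning the simulation changes it: each maximisation is carried out with the data held fixed, yet the resulting estimator is itself a random variable.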
It is true that (1) is a random variable and each of the factors $f_{X_0}$, $p_{X_0,X_1}$, $p_{X_1, X_2}$ is as well. However, the probability mass functions of these random variables are not relevant to the manipulations that lead to the estimates of the transition probabilities.
answered Dec 20 '18 at 1:06 by grand_chat
Is the following a correct unpacking of the notation of equation (2) evaluated at $i = X_0$, $j = X_1$ and $k=X_2$: $$ \log\Big(\overbrace{\mathbb{P}_\beta(\mathbb{P}_\alpha(X_0(\alpha) = X_0(\beta)))}^{f_{X_0}}\, \overbrace{\mathbb{P}_\beta (\mathbb{P}_\alpha(X_{1}(\alpha) =X_{1}(\beta) \mid X_0(\alpha) = X_0(\beta)))}^{p_{X_0,X_1}}\, \overbrace{\mathbb{P}_\beta (\mathbb{P}_\alpha(X_{2}(\alpha) =X_{2}(\beta) \mid X_1(\alpha) = X_1(\beta)))}^{p_{X_1,X_2}}\Big)$$ Yes, the authors want to derive an estimator of the transition probabilities.
– Angelos
Dec 23 '18 at 14:17
Why do we derive the estimator from the deterministic version?
– Angelos
Dec 23 '18 at 14:24
@Angelos We derive the estimator from the deterministic version in the sense that all these calculations are performed pointwise, the same way that we interpret addition of random variables pointwise, or any other manipulation of random variables. If you keep this in mind then it is fine to work with form (1) instead of form (2).
– grand_chat
Dec 24 '18 at 5:32
@Angelos Regarding your unpacking of the notation of (2), I do not understand what you mean by $\mathbb{P}_\alpha$ or $\mathbb{P}_\beta$. You should interpret $f_{X_0}$ as $f_i$ evaluated at $i=X_0$, similar to how you interpret $h(Y)$ when $h:\mathbb{Z}\to\mathbb{R}$ is a function from the integers to the reals and $Y$ is a random variable; then $h(Y)$ is a random variable, the composition of $h$ and $Y$. In the case of $f_{X_0}$ we have $h(i) := f_i$.
– grand_chat
Dec 24 '18 at 5:38