Combining Gaussian Distributions
$begingroup$
My question illustrated in a problem:
After giving and grading a statistics exam, the professor found the mean and standard deviation of points students got for each problem. One of the professors students had to miss the exam, and scheduled to take it later. With the calculated distributions only, what is the probability that the late student gets at least X% on the test?
The number of problems and percent cutoff are not predefined, and the points, mean, and standard deviation of each problem are independent to other problems.
You can assume each distribution is purely gaussian, if that helps.
For whatever reason, say that the professor doesn't have access to the individual students' data anymore, so he/she couldn't just count the number of students with a certain overall score, make a new distribution of overall scores, or any other workaround. The professor only has a set of independent gaussian distributions.
In short: summing together one random value per independent gaussian distribution, what is the probability of getting over a certain total value?
probability statistics
$endgroup$
add a comment |
$begingroup$
My question illustrated in a problem:
After giving and grading a statistics exam, the professor found the mean and standard deviation of points students got for each problem. One of the professors students had to miss the exam, and scheduled to take it later. With the calculated distributions only, what is the probability that the late student gets at least X% on the test?
The number of problems and percent cutoff are not predefined, and the points, mean, and standard deviation of each problem are independent to other problems.
You can assume each distribution is purely gaussian, if that helps.
For whatever reason, say that the professor doesn't have access to the individual students' data anymore, so he/she couldn't just count the number of students with a certain overall score, make a new distribution of overall scores, or any other workaround. The professor only has a set of independent gaussian distributions.
In short: summing together one random value per independent gaussian distribution, what is the probability of getting over a certain total value?
probability statistics
$endgroup$
$begingroup$
You seem to be suggests individual students' scores on each question are independent of their scores on other questions; that would be a curious exam which suggests that individuals' overall skills and efforts do not affect results. But if so, you should be aware that (a) the sum of independent Gaussian random variables have a Gaussian distribution, and (b) the sum of random variables has an expected value equal to the sum of the individual expected values, and (c) the sum of independent random variables has a variance equal to the sum of the individual variances
$endgroup$
– Henry
Dec 23 '18 at 1:50
$begingroup$
Haha yeah. This was just an example scenario to put my question into context; don't take the independent scores too seriously ;) (a) Is that so? Since each problem has a different number of points, the expected value isn't the same. So summing together the distributions could potentially be quite lumpy. (b) I am aware of that, but I'm not totally sure how that would help, unless all expected values are the same. (c) I did not know this. If the points for each problem WERE the same, could you then just combine them somehow? Multiply their standard deviations together or something?
$endgroup$
– Antonio
Dec 23 '18 at 2:04
$begingroup$
That's my bad; I mixed my vocab way up. I think I get what you're hinting at, and I think I got a solution. I did my best to validate it by generating random gaussians and adding them together, and it seems to be good. Please do let me know if I messed anything up.
$endgroup$
– Antonio
Dec 23 '18 at 5:28
add a comment |
$begingroup$
My question illustrated in a problem:
After giving and grading a statistics exam, the professor found the mean and standard deviation of points students got for each problem. One of the professors students had to miss the exam, and scheduled to take it later. With the calculated distributions only, what is the probability that the late student gets at least X% on the test?
The number of problems and percent cutoff are not predefined, and the points, mean, and standard deviation of each problem are independent to other problems.
You can assume each distribution is purely gaussian, if that helps.
For whatever reason, say that the professor doesn't have access to the individual students' data anymore, so he/she couldn't just count the number of students with a certain overall score, make a new distribution of overall scores, or any other workaround. The professor only has a set of independent gaussian distributions.
In short: summing together one random value per independent gaussian distribution, what is the probability of getting over a certain total value?
probability statistics
$endgroup$
My question illustrated in a problem:
After giving and grading a statistics exam, the professor found the mean and standard deviation of points students got for each problem. One of the professors students had to miss the exam, and scheduled to take it later. With the calculated distributions only, what is the probability that the late student gets at least X% on the test?
The number of problems and percent cutoff are not predefined, and the points, mean, and standard deviation of each problem are independent to other problems.
You can assume each distribution is purely gaussian, if that helps.
For whatever reason, say that the professor doesn't have access to the individual students' data anymore, so he/she couldn't just count the number of students with a certain overall score, make a new distribution of overall scores, or any other workaround. The professor only has a set of independent gaussian distributions.
In short: summing together one random value per independent gaussian distribution, what is the probability of getting over a certain total value?
probability statistics
probability statistics
asked Dec 23 '18 at 1:42
AntonioAntonio
1
1
$begingroup$
You seem to be suggests individual students' scores on each question are independent of their scores on other questions; that would be a curious exam which suggests that individuals' overall skills and efforts do not affect results. But if so, you should be aware that (a) the sum of independent Gaussian random variables have a Gaussian distribution, and (b) the sum of random variables has an expected value equal to the sum of the individual expected values, and (c) the sum of independent random variables has a variance equal to the sum of the individual variances
$endgroup$
– Henry
Dec 23 '18 at 1:50
$begingroup$
Haha yeah. This was just an example scenario to put my question into context; don't take the independent scores too seriously ;) (a) Is that so? Since each problem has a different number of points, the expected value isn't the same. So summing together the distributions could potentially be quite lumpy. (b) I am aware of that, but I'm not totally sure how that would help, unless all expected values are the same. (c) I did not know this. If the points for each problem WERE the same, could you then just combine them somehow? Multiply their standard deviations together or something?
$endgroup$
– Antonio
Dec 23 '18 at 2:04
$begingroup$
That's my bad; I mixed my vocab way up. I think I get what you're hinting at, and I think I got a solution. I did my best to validate it by generating random gaussians and adding them together, and it seems to be good. Please do let me know if I messed anything up.
$endgroup$
– Antonio
Dec 23 '18 at 5:28
add a comment |
$begingroup$
You seem to be suggests individual students' scores on each question are independent of their scores on other questions; that would be a curious exam which suggests that individuals' overall skills and efforts do not affect results. But if so, you should be aware that (a) the sum of independent Gaussian random variables have a Gaussian distribution, and (b) the sum of random variables has an expected value equal to the sum of the individual expected values, and (c) the sum of independent random variables has a variance equal to the sum of the individual variances
$endgroup$
– Henry
Dec 23 '18 at 1:50
$begingroup$
Haha yeah. This was just an example scenario to put my question into context; don't take the independent scores too seriously ;) (a) Is that so? Since each problem has a different number of points, the expected value isn't the same. So summing together the distributions could potentially be quite lumpy. (b) I am aware of that, but I'm not totally sure how that would help, unless all expected values are the same. (c) I did not know this. If the points for each problem WERE the same, could you then just combine them somehow? Multiply their standard deviations together or something?
$endgroup$
– Antonio
Dec 23 '18 at 2:04
$begingroup$
That's my bad; I mixed my vocab way up. I think I get what you're hinting at, and I think I got a solution. I did my best to validate it by generating random gaussians and adding them together, and it seems to be good. Please do let me know if I messed anything up.
$endgroup$
– Antonio
Dec 23 '18 at 5:28
$begingroup$
You seem to be suggests individual students' scores on each question are independent of their scores on other questions; that would be a curious exam which suggests that individuals' overall skills and efforts do not affect results. But if so, you should be aware that (a) the sum of independent Gaussian random variables have a Gaussian distribution, and (b) the sum of random variables has an expected value equal to the sum of the individual expected values, and (c) the sum of independent random variables has a variance equal to the sum of the individual variances
$endgroup$
– Henry
Dec 23 '18 at 1:50
$begingroup$
You seem to be suggests individual students' scores on each question are independent of their scores on other questions; that would be a curious exam which suggests that individuals' overall skills and efforts do not affect results. But if so, you should be aware that (a) the sum of independent Gaussian random variables have a Gaussian distribution, and (b) the sum of random variables has an expected value equal to the sum of the individual expected values, and (c) the sum of independent random variables has a variance equal to the sum of the individual variances
$endgroup$
– Henry
Dec 23 '18 at 1:50
$begingroup$
Haha yeah. This was just an example scenario to put my question into context; don't take the independent scores too seriously ;) (a) Is that so? Since each problem has a different number of points, the expected value isn't the same. So summing together the distributions could potentially be quite lumpy. (b) I am aware of that, but I'm not totally sure how that would help, unless all expected values are the same. (c) I did not know this. If the points for each problem WERE the same, could you then just combine them somehow? Multiply their standard deviations together or something?
$endgroup$
– Antonio
Dec 23 '18 at 2:04
$begingroup$
Haha yeah. This was just an example scenario to put my question into context; don't take the independent scores too seriously ;) (a) Is that so? Since each problem has a different number of points, the expected value isn't the same. So summing together the distributions could potentially be quite lumpy. (b) I am aware of that, but I'm not totally sure how that would help, unless all expected values are the same. (c) I did not know this. If the points for each problem WERE the same, could you then just combine them somehow? Multiply their standard deviations together or something?
$endgroup$
– Antonio
Dec 23 '18 at 2:04
$begingroup$
That's my bad; I mixed my vocab way up. I think I get what you're hinting at, and I think I got a solution. I did my best to validate it by generating random gaussians and adding them together, and it seems to be good. Please do let me know if I messed anything up.
$endgroup$
– Antonio
Dec 23 '18 at 5:28
$begingroup$
That's my bad; I mixed my vocab way up. I think I get what you're hinting at, and I think I got a solution. I did my best to validate it by generating random gaussians and adding them together, and it seems to be good. Please do let me know if I messed anything up.
$endgroup$
– Antonio
Dec 23 '18 at 5:28
add a comment |
1 Answer
1
active
oldest
votes
$begingroup$
Alright, I think I figured it out.
I'm not marking this as accepted, though (or at least not for a while), just in case I did something wrong and someone can correct me, as my statistical skills are borderline nonexistent.
You can combine all gaussian distributions into a single gaussian where $mu = sum_{i=1}^nmu_i$ and $sigma = sqrt{sum_{i=0}^nsigma_i^2}$, where $n$ denotes the number of gaussian distributions to sum over, and $mu_i$ and $sigma_i$ denote the $i^{th}$ gaussians mean and standard deviation, respectively.
This is all I really care about, but just for completeness, to find the probability of the student getting above $X$%, you need to do a z table lookup. To do that, you must first get your z score. This is done via $Z = frac{X - mu}{sigma}$. After finding your $P$ with the corresponding $Z$ in the z table, because we want everything ABOVE $X$%, not below, we get our final probability as $1 - P$.
In one equation, the solution for $n$ gaussian distributions with a total score of at least $X$% is:$$1-Phi(frac{X - sum_{i=1}^nmu_i}{sqrt{sum_{i=0}^nsigma_i^2}})$$
$endgroup$
add a comment |
Your Answer
StackExchange.ifUsing("editor", function () {
return StackExchange.using("mathjaxEditing", function () {
StackExchange.MarkdownEditor.creationCallbacks.add(function (editor, postfix) {
StackExchange.mathjaxEditing.prepareWmdForMathJax(editor, postfix, [["$", "$"], ["\\(","\\)"]]);
});
});
}, "mathjax-editing");
StackExchange.ready(function() {
var channelOptions = {
tags: "".split(" "),
id: "69"
};
initTagRenderer("".split(" "), "".split(" "), channelOptions);
StackExchange.using("externalEditor", function() {
// Have to fire editor after snippets, if snippets enabled
if (StackExchange.settings.snippets.snippetsEnabled) {
StackExchange.using("snippets", function() {
createEditor();
});
}
else {
createEditor();
}
});
function createEditor() {
StackExchange.prepareEditor({
heartbeatType: 'answer',
autoActivateHeartbeat: false,
convertImagesToLinks: true,
noModals: true,
showLowRepImageUploadWarning: true,
reputationToPostImages: 10,
bindNavPrevention: true,
postfix: "",
imageUploader: {
brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
allowUrls: true
},
noCode: true, onDemand: true,
discardSelector: ".discard-answer"
,immediatelyShowMarkdownHelp:true
});
}
});
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
StackExchange.ready(
function () {
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fmath.stackexchange.com%2fquestions%2f3049992%2fcombining-gaussian-distributions%23new-answer', 'question_page');
}
);
Post as a guest
Required, but never shown
1 Answer
1
active
oldest
votes
1 Answer
1
active
oldest
votes
active
oldest
votes
active
oldest
votes
$begingroup$
Alright, I think I figured it out.
I'm not marking this as accepted, though (or at least not for a while), just in case I did something wrong and someone can correct me, as my statistical skills are borderline nonexistent.
You can combine all gaussian distributions into a single gaussian where $mu = sum_{i=1}^nmu_i$ and $sigma = sqrt{sum_{i=0}^nsigma_i^2}$, where $n$ denotes the number of gaussian distributions to sum over, and $mu_i$ and $sigma_i$ denote the $i^{th}$ gaussians mean and standard deviation, respectively.
This is all I really care about, but just for completeness, to find the probability of the student getting above $X$%, you need to do a z table lookup. To do that, you must first get your z score. This is done via $Z = frac{X - mu}{sigma}$. After finding your $P$ with the corresponding $Z$ in the z table, because we want everything ABOVE $X$%, not below, we get our final probability as $1 - P$.
In one equation, the solution for $n$ gaussian distributions with a total score of at least $X$% is:$$1-Phi(frac{X - sum_{i=1}^nmu_i}{sqrt{sum_{i=0}^nsigma_i^2}})$$
$endgroup$
add a comment |
$begingroup$
Alright, I think I figured it out.
I'm not marking this as accepted, though (or at least not for a while), just in case I did something wrong and someone can correct me, as my statistical skills are borderline nonexistent.
You can combine all gaussian distributions into a single gaussian where $mu = sum_{i=1}^nmu_i$ and $sigma = sqrt{sum_{i=0}^nsigma_i^2}$, where $n$ denotes the number of gaussian distributions to sum over, and $mu_i$ and $sigma_i$ denote the $i^{th}$ gaussians mean and standard deviation, respectively.
This is all I really care about, but just for completeness, to find the probability of the student getting above $X$%, you need to do a z table lookup. To do that, you must first get your z score. This is done via $Z = frac{X - mu}{sigma}$. After finding your $P$ with the corresponding $Z$ in the z table, because we want everything ABOVE $X$%, not below, we get our final probability as $1 - P$.
In one equation, the solution for $n$ gaussian distributions with a total score of at least $X$% is:$$1-Phi(frac{X - sum_{i=1}^nmu_i}{sqrt{sum_{i=0}^nsigma_i^2}})$$
$endgroup$
add a comment |
$begingroup$
Alright, I think I figured it out.
I'm not marking this as accepted, though (or at least not for a while), just in case I did something wrong and someone can correct me, as my statistical skills are borderline nonexistent.
You can combine all gaussian distributions into a single gaussian where $mu = sum_{i=1}^nmu_i$ and $sigma = sqrt{sum_{i=0}^nsigma_i^2}$, where $n$ denotes the number of gaussian distributions to sum over, and $mu_i$ and $sigma_i$ denote the $i^{th}$ gaussians mean and standard deviation, respectively.
This is all I really care about, but just for completeness, to find the probability of the student getting above $X$%, you need to do a z table lookup. To do that, you must first get your z score. This is done via $Z = frac{X - mu}{sigma}$. After finding your $P$ with the corresponding $Z$ in the z table, because we want everything ABOVE $X$%, not below, we get our final probability as $1 - P$.
In one equation, the solution for $n$ gaussian distributions with a total score of at least $X$% is:$$1-Phi(frac{X - sum_{i=1}^nmu_i}{sqrt{sum_{i=0}^nsigma_i^2}})$$
$endgroup$
Alright, I think I figured it out.
I'm not marking this as accepted, though (or at least not for a while), just in case I did something wrong and someone can correct me, as my statistical skills are borderline nonexistent.
You can combine all gaussian distributions into a single gaussian where $mu = sum_{i=1}^nmu_i$ and $sigma = sqrt{sum_{i=0}^nsigma_i^2}$, where $n$ denotes the number of gaussian distributions to sum over, and $mu_i$ and $sigma_i$ denote the $i^{th}$ gaussians mean and standard deviation, respectively.
This is all I really care about, but just for completeness, to find the probability of the student getting above $X$%, you need to do a z table lookup. To do that, you must first get your z score. This is done via $Z = frac{X - mu}{sigma}$. After finding your $P$ with the corresponding $Z$ in the z table, because we want everything ABOVE $X$%, not below, we get our final probability as $1 - P$.
In one equation, the solution for $n$ gaussian distributions with a total score of at least $X$% is:$$1-Phi(frac{X - sum_{i=1}^nmu_i}{sqrt{sum_{i=0}^nsigma_i^2}})$$
answered Dec 23 '18 at 5:26
AntonioAntonio
1
1
add a comment |
add a comment |
Thanks for contributing an answer to Mathematics Stack Exchange!
- Please be sure to answer the question. Provide details and share your research!
But avoid …
- Asking for help, clarification, or responding to other answers.
- Making statements based on opinion; back them up with references or personal experience.
Use MathJax to format equations. MathJax reference.
To learn more, see our tips on writing great answers.
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
StackExchange.ready(
function () {
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fmath.stackexchange.com%2fquestions%2f3049992%2fcombining-gaussian-distributions%23new-answer', 'question_page');
}
);
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
$begingroup$
You seem to be suggests individual students' scores on each question are independent of their scores on other questions; that would be a curious exam which suggests that individuals' overall skills and efforts do not affect results. But if so, you should be aware that (a) the sum of independent Gaussian random variables have a Gaussian distribution, and (b) the sum of random variables has an expected value equal to the sum of the individual expected values, and (c) the sum of independent random variables has a variance equal to the sum of the individual variances
$endgroup$
– Henry
Dec 23 '18 at 1:50
$begingroup$
Haha yeah. This was just an example scenario to put my question into context; don't take the independent scores too seriously ;) (a) Is that so? Since each problem has a different number of points, the expected value isn't the same. So summing together the distributions could potentially be quite lumpy. (b) I am aware of that, but I'm not totally sure how that would help, unless all expected values are the same. (c) I did not know this. If the points for each problem WERE the same, could you then just combine them somehow? Multiply their standard deviations together or something?
$endgroup$
– Antonio
Dec 23 '18 at 2:04
$begingroup$
That's my bad; I mixed my vocab way up. I think I get what you're hinting at, and I think I got a solution. I did my best to validate it by generating random gaussians and adding them together, and it seems to be good. Please do let me know if I messed anything up.
$endgroup$
– Antonio
Dec 23 '18 at 5:28