Combining Gaussian Distributions












0












$begingroup$


My question illustrated in a problem:



After giving and grading a statistics exam, the professor found the mean and standard deviation of points students got for each problem. One of the professors students had to miss the exam, and scheduled to take it later. With the calculated distributions only, what is the probability that the late student gets at least X% on the test?



The number of problems and percent cutoff are not predefined, and the points, mean, and standard deviation of each problem are independent to other problems.



You can assume each distribution is purely gaussian, if that helps.



For whatever reason, say that the professor doesn't have access to the individual students' data anymore, so he/she couldn't just count the number of students with a certain overall score, make a new distribution of overall scores, or any other workaround. The professor only has a set of independent gaussian distributions.





In short: summing together one random value per independent gaussian distribution, what is the probability of getting over a certain total value?










share|cite|improve this question









$endgroup$












  • $begingroup$
    You seem to be suggests individual students' scores on each question are independent of their scores on other questions; that would be a curious exam which suggests that individuals' overall skills and efforts do not affect results. But if so, you should be aware that (a) the sum of independent Gaussian random variables have a Gaussian distribution, and (b) the sum of random variables has an expected value equal to the sum of the individual expected values, and (c) the sum of independent random variables has a variance equal to the sum of the individual variances
    $endgroup$
    – Henry
    Dec 23 '18 at 1:50










  • $begingroup$
    Haha yeah. This was just an example scenario to put my question into context; don't take the independent scores too seriously ;⁠) (a) Is that so? Since each problem has a different number of points, the expected value isn't the same. So summing together the distributions could potentially be quite lumpy. (b) I am aware of that, but I'm not totally sure how that would help, unless all expected values are the same. (c) I did not know this. If the points for each problem WERE the same, could you then just combine them somehow? Multiply their standard deviations together or something?
    $endgroup$
    – Antonio
    Dec 23 '18 at 2:04










  • $begingroup$
    That's my bad; I mixed my vocab way up. I think I get what you're hinting at, and I think I got a solution. I did my best to validate it by generating random gaussians and adding them together, and it seems to be good. Please do let me know if I messed anything up.
    $endgroup$
    – Antonio
    Dec 23 '18 at 5:28
















0












$begingroup$


My question illustrated in a problem:



After giving and grading a statistics exam, the professor found the mean and standard deviation of points students got for each problem. One of the professors students had to miss the exam, and scheduled to take it later. With the calculated distributions only, what is the probability that the late student gets at least X% on the test?



The number of problems and percent cutoff are not predefined, and the points, mean, and standard deviation of each problem are independent to other problems.



You can assume each distribution is purely gaussian, if that helps.



For whatever reason, say that the professor doesn't have access to the individual students' data anymore, so he/she couldn't just count the number of students with a certain overall score, make a new distribution of overall scores, or any other workaround. The professor only has a set of independent gaussian distributions.





In short: summing together one random value per independent gaussian distribution, what is the probability of getting over a certain total value?










share|cite|improve this question









$endgroup$












  • $begingroup$
    You seem to be suggests individual students' scores on each question are independent of their scores on other questions; that would be a curious exam which suggests that individuals' overall skills and efforts do not affect results. But if so, you should be aware that (a) the sum of independent Gaussian random variables have a Gaussian distribution, and (b) the sum of random variables has an expected value equal to the sum of the individual expected values, and (c) the sum of independent random variables has a variance equal to the sum of the individual variances
    $endgroup$
    – Henry
    Dec 23 '18 at 1:50










  • $begingroup$
    Haha yeah. This was just an example scenario to put my question into context; don't take the independent scores too seriously ;⁠) (a) Is that so? Since each problem has a different number of points, the expected value isn't the same. So summing together the distributions could potentially be quite lumpy. (b) I am aware of that, but I'm not totally sure how that would help, unless all expected values are the same. (c) I did not know this. If the points for each problem WERE the same, could you then just combine them somehow? Multiply their standard deviations together or something?
    $endgroup$
    – Antonio
    Dec 23 '18 at 2:04










  • $begingroup$
    That's my bad; I mixed my vocab way up. I think I get what you're hinting at, and I think I got a solution. I did my best to validate it by generating random gaussians and adding them together, and it seems to be good. Please do let me know if I messed anything up.
    $endgroup$
    – Antonio
    Dec 23 '18 at 5:28














0












0








0





$begingroup$


My question illustrated in a problem:



After giving and grading a statistics exam, the professor found the mean and standard deviation of points students got for each problem. One of the professors students had to miss the exam, and scheduled to take it later. With the calculated distributions only, what is the probability that the late student gets at least X% on the test?



The number of problems and percent cutoff are not predefined, and the points, mean, and standard deviation of each problem are independent to other problems.



You can assume each distribution is purely gaussian, if that helps.



For whatever reason, say that the professor doesn't have access to the individual students' data anymore, so he/she couldn't just count the number of students with a certain overall score, make a new distribution of overall scores, or any other workaround. The professor only has a set of independent gaussian distributions.





In short: summing together one random value per independent gaussian distribution, what is the probability of getting over a certain total value?










share|cite|improve this question









$endgroup$




My question illustrated in a problem:



After giving and grading a statistics exam, the professor found the mean and standard deviation of points students got for each problem. One of the professors students had to miss the exam, and scheduled to take it later. With the calculated distributions only, what is the probability that the late student gets at least X% on the test?



The number of problems and percent cutoff are not predefined, and the points, mean, and standard deviation of each problem are independent to other problems.



You can assume each distribution is purely gaussian, if that helps.



For whatever reason, say that the professor doesn't have access to the individual students' data anymore, so he/she couldn't just count the number of students with a certain overall score, make a new distribution of overall scores, or any other workaround. The professor only has a set of independent gaussian distributions.





In short: summing together one random value per independent gaussian distribution, what is the probability of getting over a certain total value?







probability statistics






share|cite|improve this question













share|cite|improve this question











share|cite|improve this question




share|cite|improve this question










asked Dec 23 '18 at 1:42









AntonioAntonio

1




1












  • $begingroup$
    You seem to be suggests individual students' scores on each question are independent of their scores on other questions; that would be a curious exam which suggests that individuals' overall skills and efforts do not affect results. But if so, you should be aware that (a) the sum of independent Gaussian random variables have a Gaussian distribution, and (b) the sum of random variables has an expected value equal to the sum of the individual expected values, and (c) the sum of independent random variables has a variance equal to the sum of the individual variances
    $endgroup$
    – Henry
    Dec 23 '18 at 1:50










  • $begingroup$
    Haha yeah. This was just an example scenario to put my question into context; don't take the independent scores too seriously ;⁠) (a) Is that so? Since each problem has a different number of points, the expected value isn't the same. So summing together the distributions could potentially be quite lumpy. (b) I am aware of that, but I'm not totally sure how that would help, unless all expected values are the same. (c) I did not know this. If the points for each problem WERE the same, could you then just combine them somehow? Multiply their standard deviations together or something?
    $endgroup$
    – Antonio
    Dec 23 '18 at 2:04










  • $begingroup$
    That's my bad; I mixed my vocab way up. I think I get what you're hinting at, and I think I got a solution. I did my best to validate it by generating random gaussians and adding them together, and it seems to be good. Please do let me know if I messed anything up.
    $endgroup$
    – Antonio
    Dec 23 '18 at 5:28


















  • $begingroup$
    You seem to be suggests individual students' scores on each question are independent of their scores on other questions; that would be a curious exam which suggests that individuals' overall skills and efforts do not affect results. But if so, you should be aware that (a) the sum of independent Gaussian random variables have a Gaussian distribution, and (b) the sum of random variables has an expected value equal to the sum of the individual expected values, and (c) the sum of independent random variables has a variance equal to the sum of the individual variances
    $endgroup$
    – Henry
    Dec 23 '18 at 1:50










  • $begingroup$
    Haha yeah. This was just an example scenario to put my question into context; don't take the independent scores too seriously ;⁠) (a) Is that so? Since each problem has a different number of points, the expected value isn't the same. So summing together the distributions could potentially be quite lumpy. (b) I am aware of that, but I'm not totally sure how that would help, unless all expected values are the same. (c) I did not know this. If the points for each problem WERE the same, could you then just combine them somehow? Multiply their standard deviations together or something?
    $endgroup$
    – Antonio
    Dec 23 '18 at 2:04










  • $begingroup$
    That's my bad; I mixed my vocab way up. I think I get what you're hinting at, and I think I got a solution. I did my best to validate it by generating random gaussians and adding them together, and it seems to be good. Please do let me know if I messed anything up.
    $endgroup$
    – Antonio
    Dec 23 '18 at 5:28
















$begingroup$
You seem to be suggests individual students' scores on each question are independent of their scores on other questions; that would be a curious exam which suggests that individuals' overall skills and efforts do not affect results. But if so, you should be aware that (a) the sum of independent Gaussian random variables have a Gaussian distribution, and (b) the sum of random variables has an expected value equal to the sum of the individual expected values, and (c) the sum of independent random variables has a variance equal to the sum of the individual variances
$endgroup$
– Henry
Dec 23 '18 at 1:50




$begingroup$
You seem to be suggests individual students' scores on each question are independent of their scores on other questions; that would be a curious exam which suggests that individuals' overall skills and efforts do not affect results. But if so, you should be aware that (a) the sum of independent Gaussian random variables have a Gaussian distribution, and (b) the sum of random variables has an expected value equal to the sum of the individual expected values, and (c) the sum of independent random variables has a variance equal to the sum of the individual variances
$endgroup$
– Henry
Dec 23 '18 at 1:50












$begingroup$
Haha yeah. This was just an example scenario to put my question into context; don't take the independent scores too seriously ;⁠) (a) Is that so? Since each problem has a different number of points, the expected value isn't the same. So summing together the distributions could potentially be quite lumpy. (b) I am aware of that, but I'm not totally sure how that would help, unless all expected values are the same. (c) I did not know this. If the points for each problem WERE the same, could you then just combine them somehow? Multiply their standard deviations together or something?
$endgroup$
– Antonio
Dec 23 '18 at 2:04




$begingroup$
Haha yeah. This was just an example scenario to put my question into context; don't take the independent scores too seriously ;⁠) (a) Is that so? Since each problem has a different number of points, the expected value isn't the same. So summing together the distributions could potentially be quite lumpy. (b) I am aware of that, but I'm not totally sure how that would help, unless all expected values are the same. (c) I did not know this. If the points for each problem WERE the same, could you then just combine them somehow? Multiply their standard deviations together or something?
$endgroup$
– Antonio
Dec 23 '18 at 2:04












$begingroup$
That's my bad; I mixed my vocab way up. I think I get what you're hinting at, and I think I got a solution. I did my best to validate it by generating random gaussians and adding them together, and it seems to be good. Please do let me know if I messed anything up.
$endgroup$
– Antonio
Dec 23 '18 at 5:28




$begingroup$
That's my bad; I mixed my vocab way up. I think I get what you're hinting at, and I think I got a solution. I did my best to validate it by generating random gaussians and adding them together, and it seems to be good. Please do let me know if I messed anything up.
$endgroup$
– Antonio
Dec 23 '18 at 5:28










1 Answer
1






active

oldest

votes


















0












$begingroup$

Alright, I think I figured it out.



I'm not marking this as accepted, though (or at least not for a while), just in case I did something wrong and someone can correct me, as my statistical skills are borderline nonexistent.



You can combine all gaussian distributions into a single gaussian where $mu = sum_{i=1}^nmu_i$ and $sigma = sqrt{sum_{i=0}^nsigma_i^2}$, where $n$ denotes the number of gaussian distributions to sum over, and $mu_i$ and $sigma_i$ denote the $i^{th}$ gaussians mean and standard deviation, respectively.



This is all I really care about, but just for completeness, to find the probability of the student getting above $X$%, you need to do a z table lookup. To do that, you must first get your z score. This is done via $Z = frac{X - mu}{sigma}$. After finding your $P$ with the corresponding $Z$ in the z table, because we want everything ABOVE $X$%, not below, we get our final probability as $1 - P$.



In one equation, the solution for $n$ gaussian distributions with a total score of at least $X$% is:$$1-Phi(frac{X - sum_{i=1}^nmu_i}{sqrt{sum_{i=0}^nsigma_i^2}})$$






share|cite|improve this answer









$endgroup$













    Your Answer





    StackExchange.ifUsing("editor", function () {
    return StackExchange.using("mathjaxEditing", function () {
    StackExchange.MarkdownEditor.creationCallbacks.add(function (editor, postfix) {
    StackExchange.mathjaxEditing.prepareWmdForMathJax(editor, postfix, [["$", "$"], ["\\(","\\)"]]);
    });
    });
    }, "mathjax-editing");

    StackExchange.ready(function() {
    var channelOptions = {
    tags: "".split(" "),
    id: "69"
    };
    initTagRenderer("".split(" "), "".split(" "), channelOptions);

    StackExchange.using("externalEditor", function() {
    // Have to fire editor after snippets, if snippets enabled
    if (StackExchange.settings.snippets.snippetsEnabled) {
    StackExchange.using("snippets", function() {
    createEditor();
    });
    }
    else {
    createEditor();
    }
    });

    function createEditor() {
    StackExchange.prepareEditor({
    heartbeatType: 'answer',
    autoActivateHeartbeat: false,
    convertImagesToLinks: true,
    noModals: true,
    showLowRepImageUploadWarning: true,
    reputationToPostImages: 10,
    bindNavPrevention: true,
    postfix: "",
    imageUploader: {
    brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
    contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
    allowUrls: true
    },
    noCode: true, onDemand: true,
    discardSelector: ".discard-answer"
    ,immediatelyShowMarkdownHelp:true
    });


    }
    });














    draft saved

    draft discarded


















    StackExchange.ready(
    function () {
    StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fmath.stackexchange.com%2fquestions%2f3049992%2fcombining-gaussian-distributions%23new-answer', 'question_page');
    }
    );

    Post as a guest















    Required, but never shown

























    1 Answer
    1






    active

    oldest

    votes








    1 Answer
    1






    active

    oldest

    votes









    active

    oldest

    votes






    active

    oldest

    votes









    0












    $begingroup$

    Alright, I think I figured it out.



    I'm not marking this as accepted, though (or at least not for a while), just in case I did something wrong and someone can correct me, as my statistical skills are borderline nonexistent.



    You can combine all gaussian distributions into a single gaussian where $mu = sum_{i=1}^nmu_i$ and $sigma = sqrt{sum_{i=0}^nsigma_i^2}$, where $n$ denotes the number of gaussian distributions to sum over, and $mu_i$ and $sigma_i$ denote the $i^{th}$ gaussians mean and standard deviation, respectively.



    This is all I really care about, but just for completeness, to find the probability of the student getting above $X$%, you need to do a z table lookup. To do that, you must first get your z score. This is done via $Z = frac{X - mu}{sigma}$. After finding your $P$ with the corresponding $Z$ in the z table, because we want everything ABOVE $X$%, not below, we get our final probability as $1 - P$.



    In one equation, the solution for $n$ gaussian distributions with a total score of at least $X$% is:$$1-Phi(frac{X - sum_{i=1}^nmu_i}{sqrt{sum_{i=0}^nsigma_i^2}})$$






    share|cite|improve this answer









    $endgroup$


















      0












      $begingroup$

      Alright, I think I figured it out.



      I'm not marking this as accepted, though (or at least not for a while), just in case I did something wrong and someone can correct me, as my statistical skills are borderline nonexistent.



      You can combine all gaussian distributions into a single gaussian where $mu = sum_{i=1}^nmu_i$ and $sigma = sqrt{sum_{i=0}^nsigma_i^2}$, where $n$ denotes the number of gaussian distributions to sum over, and $mu_i$ and $sigma_i$ denote the $i^{th}$ gaussians mean and standard deviation, respectively.



      This is all I really care about, but just for completeness, to find the probability of the student getting above $X$%, you need to do a z table lookup. To do that, you must first get your z score. This is done via $Z = frac{X - mu}{sigma}$. After finding your $P$ with the corresponding $Z$ in the z table, because we want everything ABOVE $X$%, not below, we get our final probability as $1 - P$.



      In one equation, the solution for $n$ gaussian distributions with a total score of at least $X$% is:$$1-Phi(frac{X - sum_{i=1}^nmu_i}{sqrt{sum_{i=0}^nsigma_i^2}})$$






      share|cite|improve this answer









      $endgroup$
















        0












        0








        0





        $begingroup$

        Alright, I think I figured it out.



        I'm not marking this as accepted, though (or at least not for a while), just in case I did something wrong and someone can correct me, as my statistical skills are borderline nonexistent.



        You can combine all gaussian distributions into a single gaussian where $mu = sum_{i=1}^nmu_i$ and $sigma = sqrt{sum_{i=0}^nsigma_i^2}$, where $n$ denotes the number of gaussian distributions to sum over, and $mu_i$ and $sigma_i$ denote the $i^{th}$ gaussians mean and standard deviation, respectively.



        This is all I really care about, but just for completeness, to find the probability of the student getting above $X$%, you need to do a z table lookup. To do that, you must first get your z score. This is done via $Z = frac{X - mu}{sigma}$. After finding your $P$ with the corresponding $Z$ in the z table, because we want everything ABOVE $X$%, not below, we get our final probability as $1 - P$.



        In one equation, the solution for $n$ gaussian distributions with a total score of at least $X$% is:$$1-Phi(frac{X - sum_{i=1}^nmu_i}{sqrt{sum_{i=0}^nsigma_i^2}})$$






        share|cite|improve this answer









        $endgroup$



        Alright, I think I figured it out.



        I'm not marking this as accepted, though (or at least not for a while), just in case I did something wrong and someone can correct me, as my statistical skills are borderline nonexistent.



        You can combine all gaussian distributions into a single gaussian where $mu = sum_{i=1}^nmu_i$ and $sigma = sqrt{sum_{i=0}^nsigma_i^2}$, where $n$ denotes the number of gaussian distributions to sum over, and $mu_i$ and $sigma_i$ denote the $i^{th}$ gaussians mean and standard deviation, respectively.



        This is all I really care about, but just for completeness, to find the probability of the student getting above $X$%, you need to do a z table lookup. To do that, you must first get your z score. This is done via $Z = frac{X - mu}{sigma}$. After finding your $P$ with the corresponding $Z$ in the z table, because we want everything ABOVE $X$%, not below, we get our final probability as $1 - P$.



        In one equation, the solution for $n$ gaussian distributions with a total score of at least $X$% is:$$1-Phi(frac{X - sum_{i=1}^nmu_i}{sqrt{sum_{i=0}^nsigma_i^2}})$$







        share|cite|improve this answer












        share|cite|improve this answer



        share|cite|improve this answer










        answered Dec 23 '18 at 5:26









        AntonioAntonio

        1




        1






























            draft saved

            draft discarded




















































            Thanks for contributing an answer to Mathematics Stack Exchange!


            • Please be sure to answer the question. Provide details and share your research!

            But avoid



            • Asking for help, clarification, or responding to other answers.

            • Making statements based on opinion; back them up with references or personal experience.


            Use MathJax to format equations. MathJax reference.


            To learn more, see our tips on writing great answers.




            draft saved


            draft discarded














            StackExchange.ready(
            function () {
            StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fmath.stackexchange.com%2fquestions%2f3049992%2fcombining-gaussian-distributions%23new-answer', 'question_page');
            }
            );

            Post as a guest















            Required, but never shown





















































            Required, but never shown














            Required, but never shown












            Required, but never shown







            Required, but never shown

































            Required, but never shown














            Required, but never shown












            Required, but never shown







            Required, but never shown







            Popular posts from this blog

            Måne

            Storängen

            VLT Carioca