{"id":1766,"date":"2020-06-05T18:44:26","date_gmt":"2020-06-05T18:44:26","guid":{"rendered":"https:\/\/opentextbc.ca\/businesstechnicalmath\/chapter\/9-2-standard-deviation\/"},"modified":"2021-08-31T21:30:03","modified_gmt":"2021-08-31T21:30:03","slug":"9-2-standard-deviation","status":"publish","type":"chapter","link":"https:\/\/opentextbc.ca\/businesstechnicalmath\/chapter\/9-2-standard-deviation\/","title":{"raw":"8.2  Measures of Dispersion","rendered":"8.2  Measures of Dispersion"},"content":{"raw":"<img class=\"aligncenter wp-image-1748 size-large\" title=\"https:\/\/pixabay.com\/illustrations\/dispersion-flower-effect-flora-6184855\/\" src=\"https:\/\/opentextbc.ca\/oerdiscipline\/wp-content\/uploads\/sites\/361\/2020\/06\/8.2-intro-image-flower-1-1024x614.jpg\" alt=\"\" width=\"1024\" height=\"614\">\n<div class=\"textbox textbox--learning-objectives\"><header class=\"textbox__header\">\n<p class=\"textbox__title\">Learning Objectives<\/p>\n\n<\/header>\n<div class=\"textbox__content\">\n\nBy the end of this section the student should be able to:\n<ul>\n \t<li>Determine the range for a data set.<\/li>\n \t<li>Determine the standard deviation for a data set.<\/li>\n \t<li>Determine the standard deviation from a histogram<\/li>\n<\/ul>\n<\/div>\n<\/div>\n<h1>Measures of Dispersion or Spread<\/h1>\nWe have seen that<strong> measures of central tendency,<\/strong> including the\u00a0[pb_glossary id=\"1845\"]mean[\/pb_glossary]\u00a0and [pb_glossary id=\"1844\"]median[\/pb_glossary], \u00a0are used to identify a central position within a data set. They indicate where the data clusters.\n\nConsider student A's scores on five tests:\u00a0 32%\u00a0 \u00a095%\u00a0 \u00a089%\u00a0 \u00a074%\u00a0 \u00a055%\u00a0\u00a0The <strong>mean,<\/strong> or average, is\u00a0 (32 + 95 + 89 + 74 + 55)\/5 = 69%\u00a0 and the <strong>median<\/strong> is 74%.\n\nConsider\u00a0 student B's scores on the same five tests:\u00a0 68%\u00a0 \u00a069%\u00a0 \u00a072%\u00a0 \u00a074%\u00a0 \u00a062%\u00a0 \u00a0The <strong>mean,<\/strong> or average, is\u00a0 (68 + 69 + 72 + 74 + 62)\/5 = 69%\u00a0 and the <strong>median<\/strong> is 69%.\n\nBoth student's have the same test average of 69% but there is a substantial difference in the spread or dispersion of their scores. Student A's test scores range\u00a0from a low score of 32% to a high score of\u00a0 95% so the spread in marks is 63 percentage points. Student B's test scores range from a low score of 62% to a high score of\u00a0 74% so the spread in marks is 12 percentage points.\n\nWhen we analyze data it is important to consider how <strong>dispersed<\/strong> or spread out the data values are. In this section we will consider two measures of dispersion.\n<h1>Range<\/h1>\nRange is one <strong>measure of dispersion. <\/strong>A measure of dispersion is used to describe the spread of data.\n<div class=\"textbox textbox--key-takeaways\"><header class=\"textbox__header\">\n<p class=\"textbox__title\">Range<\/p>\n\n<\/header>\n<div class=\"textbox__content\">\n\nThe <strong>range<\/strong> indicates the total spread in data values. It is the difference between the highest and lowest data values.\n\nRange = highest data value - lowest data value\n\n<\/div>\n<\/div>\n<div class=\"textbox textbox--examples\"><header class=\"textbox__header\">\n<p class=\"textbox__title\">EXAMPLE 1<\/p>\n\n<\/header>\n<div class=\"textbox__content\">\n\nThe table shows the daily high temperature (\u00b0C) over a three week period.\n\nDetermine the highest temperature, lowest temperature, and the range\u00a0 in daily high temperatures over the three weeks.\n\n<img class=\"alignnone wp-image-1749 size-large\" src=\"https:\/\/opentextbc.ca\/oerdiscipline\/wp-content\/uploads\/sites\/361\/2021\/08\/8.2-Ex1-Range-1024x212.png\" alt=\"\" width=\"1024\" height=\"212\">\n\n<strong>Solution<\/strong>\n\nThe highest temperature was 29\u00ba, the lowest temperature was 16\u00b0, and the range in temperatures was\u00a0 29 - 16 =\u00a0 13\u00ba.\n\n<\/div>\n<\/div>\n<div class=\"textbox textbox--exercises\"><header class=\"textbox__header\">\n<p class=\"textbox__title\">TRY IT 1<\/p>\n\n<\/header>\n<div class=\"textbox__content\">\n\nThe table shows the test score for a group of fourteen students.\n\nDetermine the highest test score, the lowest test score, and the range\u00a0 in test scores for the group of 14 students.\n\n<img class=\"alignnone wp-image-1750 size-large\" src=\"https:\/\/opentextbc.ca\/oerdiscipline\/wp-content\/uploads\/sites\/361\/2021\/08\/8.2-Try-It-1-Range-1024x114.png\" alt=\"\" width=\"1024\" height=\"114\">\n\n<details><summary>Show answer<\/summary>The highest score was 89%, the lowest score was 56%, and the range in scores was 33%.\n\n<\/details><\/div>\n<\/div>\nAn advantage of using the range as a measure of dispersion is that it involves a simple calculation. A disadvantage is that the range only provides a measure between the highest and lowest values so it disregards all other data values. If the highest or lowest data value is an [pb_glossary id=\"1846\"]outlier[\/pb_glossary] then the range will not provide a true measure of the spread in the typical values.\n<div class=\"textbox textbox--examples\"><header class=\"textbox__header\">\n<p class=\"textbox__title\">EXAMPLE 2<\/p>\n\n<\/header>\n<div class=\"textbox__content\">\n\nA student wrote five tests and earned the following five scores:\n<p style=\"text-align: center;\">92%\u00a0 \u00a095%\u00a0 \u00a089%\u00a0 \u00a094%\u00a0 \u00a035%<\/p>\n\n<ol>\n \t<li>Determine the mean, median and the range for these five scores.<\/li>\n \t<li>Which of the five scores is an outlier?<\/li>\n \t<li>Remove the outlier and recalculate the mean, median and the range for the four remaining scores.<\/li>\n \t<li>Comparing the results for the five scores versus four scores, which of the three measures was least impacted by the outlier?<\/li>\n \t<li>Comparing the results for the five scores versus four scores, which of the three measures was most impacted by the outlier?<\/li>\n<\/ol>\n<strong>Solution<\/strong>\n\n1. The student's mean score: (92 + 95 + 89 + 94 + 35) \/ 5 = 81%\n\nThe median score is 92%.\n\nThe range in marks is 95% - 35% = 60%.\n\n2. The score of 35% is an outlier.\n\n3. The student's mean score: (92 + 95 + 89 + 94) \/ 4 = 92.5%\n\nThe median score is 93%.\n\nThe range in marks is 95% - 89% = 6%.\n\n4. The median was least impacted by the removal of the outlier.\n\n5. The range was most impacted by the outlier.\n\n<\/div>\n&nbsp;\n\n<\/div>\n<div class=\"textbox textbox--exercises\"><header class=\"textbox__header\">\n<p class=\"textbox__title\">TRY IT 2<\/p>\n\n<\/header>\n<div class=\"textbox__content\">\n\nThe following seven values are salaries at a local computer company.\n<p style=\"text-align: center;\">$62,000\u00a0 \u00a0 \u00a0$95, 000\u00a0 \u00a0$120,000\u00a0 \u00a0$101, 000\u00a0 \u00a0 \u00a0$99,000\u00a0 \u00a0$98,000\u00a0 \u00a0 \u00a0$110,000<\/p>\n\n<ol>\n \t<li>Determine the mean, median and the range for these seven salaries.<\/li>\n \t<li>Which of the seven salaries is an outlier?<\/li>\n \t<li>Remove the outlier and recalculate the mean, median and the range for the six remaining salaries.<\/li>\n \t<li>Comparing the results for the seven versus six salaries, which of the three measures was least impacted by the outlier?<\/li>\n \t<li>Comparing the results for the seven versus six salaries, which of the three measures was most impacted by the outlier?<\/li>\n<\/ol>\n<details open=\"open\"><summary>Show answer<\/summary>\n<ol>\n \t<li>mean salary $97,857; median salary $99,000;\u00a0 range in salaries $58,000<\/li>\n \t<li>\u00a0$62,000<\/li>\n \t<li>mean salary $103, 833; median salary $100,000;\u00a0 range in salaries $25,000<\/li>\n \t<li>the median<\/li>\n \t<li>the range<\/li>\n<\/ol>\n<\/details><\/div>\n<\/div>\nRefer back to Example 2 and the measures that were calculated for five test scores.The student's mean score is 81% and the range in marks is 60%.\u00a0 The range of 60% does not capture the fact that if the outlier is removed then there is a spread of only 6% for the four remaining data values. The range depends on only the <strong>highest<\/strong> and <strong>lowest<\/strong> data values. The existence of an outlier can result in a misleading representation of the spread in data values.\n\nAn alternative measure of dispersion is called the <strong>standard deviation<\/strong>. It depends on <strong>all<\/strong> data values rather than on only the highest and lowest data values.\n<h1>Standard Deviation<\/h1>\nStandard deviation\u00a0 measures the\u00a0 <strong>dispersion of the data values around the mean<\/strong>. Unlike the range, its value depends on every data value in the data set. The standard deviation is found by determining how much each data value differs from the mean.\n\nWhat does the standard deviation actually tell us? Consider two sets of test scores:\n\nSet A:\u00a0 \u00a0 76%\u00a0 \u00a074%\u00a0 \u00a086%\u00a0 \u00a084%\u00a0 \u00a085%\n\nSet<span style=\"font-size: 14pt;\">\u00a0<\/span><span style=\"font-size: 14pt;\">B:\u00a0 \u00a0 53%\u00a0 \u00a095%\u00a0 \u00a062%\u00a0 \u00a099%\u00a0 \u00a096%<\/span>\n\nRefer to <a href=\"#figure1\">Figure 1<\/a>. For both sets the mean is 81%. If we plot the scores (indicated by the <strong>*<\/strong>) on a scale of 0% to 100% we see that the scores in Set A are much less spread out around\u00a0 the mean. The scores from\u00a0 Set B are much more dispersed.<a id=\"#figure1\" href=\"\"><\/a>\n\n[caption id=\"attachment_1760\" align=\"aligncenter\" width=\"1024\"]<img class=\"wp-image-1751 size-large\" title=\"Created by Kim Moshenko\" src=\"https:\/\/opentextbc.ca\/oerdiscipline\/wp-content\/uploads\/sites\/361\/2021\/08\/8.2-Sets-AB-image-1024x417.png\" alt=\"\" width=\"1024\" height=\"417\"> Fig. 1[\/caption]\n\nIf we compare the standard deviations for the two data sets we should find that although the mean is the same, the standard deviation for Set B will be greater since the data is more widely spread out from the mean.\n<h1>Population versus Sample Standard Deviation<\/h1>\nWhen working with standard deviation it is important to distinguish whether you are working with the entire population or a sample of the population.\u00a0Statisticians generally survey a [pb_glossary id=\"1847\"]sample[\/pb_glossary] of the population because it is often impossible to survey the entire population.\u00a0 As an example, assume your university wants to determine food preferences for its entire student body. The population would be all students enrolled in the university. Rather than attempt to survey every student, the university will survey only a [pb_glossary id=\"1847\"]sample[\/pb_glossary].\n\nThe symbolic representation of standard deviation is different for a population versus a sample. If\u00a0 you are working with an entire <strong>population,<\/strong> the symbol for standard deviation is the Greek letter sigma, <strong>\u03c3.<\/strong> The symbol for the standard deviation of a <strong>sample<\/strong> is\u00a0 <strong><em>s<\/em><\/strong>. Similarly, the calculation of standard deviation is different for a population versus a sample.\u00a0 Unless otherwise indicated in this chapter, we will assume that we are working with\u00a0 a sample, rather than an entire population.\n<div class=\"textbox textbox--key-takeaways\"><header class=\"textbox__header\">\n<p class=\"textbox__title\">Standard Deviation Formula<\/p>\n\n<\/header>\n<div class=\"textbox__content\">\n\n<strong>Standard Deviation<\/strong>\u00a0<strong>Formula for a Population<\/strong>\n\n<img class=\"aligncenter wp-image-1752 \" src=\"https:\/\/opentextbc.ca\/oerdiscipline\/wp-content\/uploads\/sites\/361\/2021\/08\/8.2-pop-std-dev-formula.png\" alt=\"\" width=\"510\" height=\"156\">\n\nThe Greek letter sigma\u00a0 \u2211 is the summation symbol. It indicates that all of the values (x \u2013 \u03bc)<sup>2<\/sup> must be added.\n\n&nbsp;\n\n<strong>Standard Deviation<\/strong>\u00a0<strong>Formula for a Sample<\/strong>\n\n<img class=\"aligncenter wp-image-1753 \" src=\"https:\/\/opentextbc.ca\/oerdiscipline\/wp-content\/uploads\/sites\/361\/2021\/08\/8.2-sample-std-dev-formula.png\" alt=\"\" width=\"507\" height=\"189\">\n\n<\/div>\n<\/div>\n<h1>Calculating the Standard Deviation<\/h1>\nWhen calculating the standard deviation with the aid of a scientific calculator it is helpful to record the steps using a table.\n<div class=\"textbox textbox--key-takeaways\"><header class=\"textbox__header\">\n<p class=\"textbox__title\">Calculating Standard Deviation<\/p>\n\n<\/header>\n<div class=\"textbox__content\">\n\nWe will use the formula for finding the standard deviation of a sample:\n\n<img class=\"aligncenter\" src=\"https:\/\/opentextbc.ca\/oerdiscipline\/wp-content\/uploads\/sites\/361\/2021\/08\/8.2-sample-std-dev-formula-1.png\" width=\"427\" height=\"136\">\n\nTo determine the standard deviation for a sample we will use the following steps :\n<ol>\n \t<li>Find the mean<\/li>\n \t<li>Create a table with three columns:\u00a0 \u00a0 data value,\u00a0 \u00a0 data value - mean,\u00a0 \u00a0 \u00a0(data value - mean)<sup>2<\/sup><\/li>\n \t<li>Fill in the <strong>data value<\/strong> column with all values from the sample<\/li>\n \t<li>Subtract the mean from each data value:\u00a0 (data value - mean)<\/li>\n \t<li>Square the results from step 4:\u00a0 \u00a0(data value - mean)<sup>2<\/sup><\/li>\n \t<li>Sum the results in column 3 (from step 5)<\/li>\n \t<li>Divide the sum (from step 6) by (n - 1)<\/li>\n \t<li>Find the square root of the result in step 7<\/li>\n<\/ol>\n<\/div>\n<\/div>\n<div class=\"textbox textbox--examples\"><header class=\"textbox__header\">\n<p class=\"textbox__title\">EXAMPLE 3<\/p>\n\n<\/header>\n<div class=\"textbox__content\">\n\na) Determine the standard deviation for the sample set A test scores:\u00a0 <span style=\"font-size: 14pt;\">76%\u00a0 \u00a074%\u00a0 \u00a086%\u00a0 \u00a084%\u00a0 \u00a085%<\/span>\n\nb) Determine the standard deviation for the sample set B test scores: <span style=\"font-size: 14pt;\">\u00a0 \u00a0 53%\u00a0 \u00a095%\u00a0 \u00a062%\u00a0 \u00a099%\u00a0 \u00a096%<\/span>\n\nc) Compare the means and standard deviations for Set A and Set B. Which set is more spread out (dispersed)?\n\n<strong>Solution<\/strong>\n\n<span style=\"text-align: initial; font-size: 0.9em; word-spacing: normal;\">a)<\/span>\n\n<\/div>\n<div class=\"textbox__content\">\n\n<img class=\"aligncenter wp-image-1755 \" src=\"https:\/\/opentextbc.ca\/oerdiscipline\/wp-content\/uploads\/sites\/361\/2021\/08\/8.2-Example-3-1024x701.png\" alt=\"\" width=\"603\" height=\"413\">\n\nThe standard deviation for Set A is 5.57 (rounded to 2 decimal places)\n\nb)\n\n<img class=\"aligncenter wp-image-1756 size-large\" src=\"https:\/\/opentextbc.ca\/oerdiscipline\/wp-content\/uploads\/sites\/361\/2021\/08\/8.2-Example-4-1024x699.png\" alt=\"\" width=\"1024\" height=\"699\">\n\nThe standard deviation for Set B is 21.74 (rounded to 2 decimal places).\n\nc)\n\nIn comparing the two data sets A and B:\n<ul>\n \t<li>The means are the same value of\u00a0 81.<\/li>\n \t<li>The standard deviation for Set A is\u00a0 s = 5.57\u00a0 and for Set B it is\u00a0 s = 21.74.\u00a0 \u00a0The much larger standard deviation for Set B indicates that there is a much greater spread in the data values around the mean of 81%.<\/li>\n<\/ul>\n<\/div>\n<\/div>\nNote that the sum of the middle column (data value - mean) is 0. This will always be the case. That is why we must square the values before we add them, as is done in column 3.\n<div class=\"textbox textbox--exercises\"><header class=\"textbox__header\">\n<p class=\"textbox__title\">TRY IT 3<\/p>\n\n<\/header>\n<div class=\"textbox__content\">\n\nThe average high temperatures (in \u00baC) for one week in April for two different cities in Canada were as follows:\n\nCity A:\u00a0 \u00a0 15\u00a0 \u00a0 \u00a0 19\u00a0 \u00a0 \u00a022\u00a0 \u00a0 26\u00a0 \u00a0 21\u00a0 \u00a0 19\u00a0 \u00a0 18\n\nCity B:\u00a0 \u00a0 \u00a06\u00a0 \u00a0 \u00a0 \u00a09\u00a0 \u00a0 \u00a0 15\u00a0 \u00a0 18\u00a0 \u00a0 \u00a020\u00a0 \u00a0 19\u00a0 \u00a0 21\n\na) Calculate the mean high temperature (if necessary round to 2 decimal places) for each city. Which of the two cities appears to have a wider spread in temperatures around their means?\n\nb) Calculate the standard deviation for each temperature set (if necessary round to 2 decimal places) to see if your observation is correct.\n\n<details><summary>Show answer<\/summary>The mean for City A is 20 \u02daC and the mean for City B was 15.43\u02daC.\n\nThe standard deviation for City A is 3.46 and for City B is 5.80. City B\u2019s temperatures are more widely spread out from the mean temperature.\n\n<\/details><\/div>\n<\/div>\nWe have seen that two measures of spread or dispersion are the [pb_glossary id=\"1848\"]range[\/pb_glossary]\u00a0 and the [pb_glossary id=\"1850\"]standard deviation[\/pb_glossary]. Although the range is a much simpler calculation, it only takes into consideration the highest and lowest data values. The existence of an outlier can result in a range that is not truly indicative of the spread in data values. The standard deviation is a more complex calculation but takes into consideration all data values. It is important to note that technology is often used to calculate the standard deviation which eliminates the need for tedious calculations.\n<div class=\"textbox textbox--examples\"><header class=\"textbox__header\">\n<p class=\"textbox__title\">EXAMPLE 4<\/p>\n\n<\/header>\n<div class=\"textbox__content\">\n\nIn Example 1 the range for the following set of temperature was determined to be\u00a0 13 \u00baC.\n\na) Determine the mean and the standard deviation.\n\nb) Explain why the values for the range and standard deviation are different.\n\n<img class=\"alignnone wp-image-1749 size-large\" src=\"https:\/\/opentextbc.ca\/oerdiscipline\/wp-content\/uploads\/sites\/361\/2021\/08\/8.2-Ex1-Range-1024x212.png\" alt=\"\" width=\"1024\" height=\"212\">\n\n<strong>Solution<\/strong>\n\na) The mean (average) is 22.29 \u02daC and the standard deviation is 3.86.\nb) The reason for the difference in values is that the range only tells us the difference between the highest and lowest temperature whereas the standard deviation tells us how widespread the temperatures are in relation to the mean temperature of 22.29 \u02daC\n\n<\/div>\n<\/div>\n<div class=\"textbox textbox--exercises\"><header class=\"textbox__header\">\n<p class=\"textbox__title\">TRY IT 4<\/p>\n\n<\/header>\n<div class=\"textbox__content\">\n\nRefer back to TRY IT 1 and the test scores for a group of 14 students. The range was determined to be 33.\n\nDetermine the mean (rounded to the nearest whole number) and the standard deviation (rounded to the nearest 2 decimal places) for this set of test scores.\n\n<img class=\"alignnone wp-image-1750 size-large\" src=\"https:\/\/opentextbc.ca\/oerdiscipline\/wp-content\/uploads\/sites\/361\/2021\/08\/8.2-Try-It-1-Range-1024x114.png\" alt=\"\" width=\"1024\" height=\"114\">\n\n&nbsp;\n\n<details><summary>Show answer<\/summary>The mean is 72 and the standard deviation is 8.19.\n\n<\/details><\/div>\n<\/div>\n<h6><strong>Histograms and the Dispersion of Data Values<\/strong><\/h6>\nWe have seen that for two data sets with the same mean, when the standard deviation is larger the data values are more spread out. A [pb_glossary id=\"1851\"]histogram[\/pb_glossary] can be used to illustrate the spread of data values.\n\nConsider a dance competition where teams comprised of seven dancers compete for the prize money in several different dance categories. There were four teams entered in the elite category. The dancers must be between the ages of\u00a0 18-24 years old. The age breakdown for the members of the four teams in the elite category is:\n\nTeam Unity:\u00a0 all seven dancers are age 21\n\nTeam Harmony:\u00a0 2 dancers are 20, 3 dancers are 21 and 2 dancers\u00a0 are 22\n\nTeam Mix:\u00a0 1 dancer is 19, 2 dancers are 20, 1 dancer is 21, 2 dancers are 22 and 1 dancer is 23\n\nTeam Extend: 3 dancers are 18, 1 dancer is 21, and 3 dancers are 24\n\nThe <strong>mean<\/strong> age for all four teams is 21 but the standard deviations for each of the four teams are different. Team Unity has a standard deviation of 0, Team Harmony has a standard deviation of 0.82, Team Mix has a standard deviation of 1.412 and Team Extend has a standard deviation of\u00a0 3.\n\nThe histograms for each of the teams appears in Figures 2a through Figures 2d below.\n\n&nbsp;\n\n[caption id=\"attachment_1760\" align=\"aligncenter\" width=\"505\"]<img class=\"wp-image-1757 \" src=\"https:\/\/opentextbc.ca\/oerdiscipline\/wp-content\/uploads\/sites\/361\/2021\/08\/8.3-Team-Unity-Histogram.png\" alt=\"\" width=\"505\" height=\"301\"> <strong>Fig. 2a. Team Unity with\u00a0 s = 0<\/strong>[\/caption]\n\nIn Figure 2a\u00a0 Team Unity has a standard deviation of 0 since all ages are the same. None of the ages spread out from the mean of 21.\n\n&nbsp;\n\n[caption id=\"attachment_1760\" align=\"aligncenter\" width=\"486\"]<img class=\"wp-image-1758 \" style=\"font-size: 14pt;\" src=\"https:\/\/opentextbc.ca\/oerdiscipline\/wp-content\/uploads\/sites\/361\/2021\/08\/8.3-Team-Harmony-Histogram.png\" alt=\"\" width=\"486\" height=\"293\"> <strong>Fig. 2b. Team Harmony with\u00a0 s = 0.82<\/strong>[\/caption]\n\nIn Figure 2b Team Harmony has a standard deviation of 0.82 years. The histogram illustrates that the ages are closely clustered around the mean of 21.\n\n&nbsp;\n\n[caption id=\"attachment_1760\" align=\"aligncenter\" width=\"490\"]<img class=\"wp-image-1759 \" src=\"https:\/\/opentextbc.ca\/oerdiscipline\/wp-content\/uploads\/sites\/361\/2021\/08\/8.3-Team-Mix-Histogram.png\" alt=\"\" width=\"490\" height=\"294\"> <strong>Fig. 2c Team Mix\u00a0 with\u00a0 s = 1.41<\/strong>[\/caption]\n\nIn Figure 2c Team Mix has a standard deviation of 1.41.\u00a0 The histogram illustrates that the data (age) spread is greater than for Team Unity and Team Harmony.\n\n&nbsp;\n\n[caption id=\"attachment_1760\" align=\"aligncenter\" width=\"422\"]<img class=\"wp-image-1760 \" src=\"https:\/\/opentextbc.ca\/oerdiscipline\/wp-content\/uploads\/sites\/361\/2021\/08\/8.3-Team-Extend-Histogram.png\" alt=\"\" width=\"422\" height=\"251\"> <strong>Fig. 2d. Team Extend\u00a0 \u00a0with\u00a0 s = 3<\/strong>[\/caption]\n\nIn Figure 2d Team Extend\u00a0 has a standard deviation of 3. The histogram clearly illustrates that the ages for this team are the most spread out from the mean of 21.\n<div class=\"textbox textbox--examples\"><header class=\"textbox__header\">\n<p class=\"textbox__title\">EXAMPLE 5<\/p>\n\n<\/header>\n<div class=\"textbox__content\">\n\nTen participants in a group fitness class were asked to rank the class on a scale from 1 to 5. Determine the mean and standard deviation for the evaluation scores as depicted in the histogram below.\n\n<a href=\"https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-content\/uploads\/sites\/794\/2020\/06\/8.2-Example5-4.png\"><img class=\"aligncenter wp-image-1761\" src=\"https:\/\/opentextbc.ca\/oerdiscipline\/wp-content\/uploads\/sites\/361\/2021\/08\/8.2-Example5-4.png\" alt=\"\" width=\"405\" height=\"238\"><\/a>\n\n<strong style=\"text-align: initial; word-spacing: normal; font-size: 0.9em;\">Solution<\/strong>\n\n<\/div>\n<div class=\"textbox__content\">\n\n<a href=\"https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-content\/uploads\/sites\/794\/2020\/06\/8.2-ex-5-solution.png\"><img class=\"aligncenter wp-image-1762\" src=\"https:\/\/opentextbc.ca\/oerdiscipline\/wp-content\/uploads\/sites\/361\/2021\/08\/8.2-ex-5-solution-1024x1019.png\" alt=\"\" width=\"546\" height=\"543\"><\/a>\n\nThe mean score is 3 and the standard deviation is 1.49\n\n<\/div>\n<\/div>\n<div class=\"textbox textbox--exercises\"><header class=\"textbox__header\">\n<p class=\"textbox__title\">TRY IT 5<\/p>\n\n<\/header>\n<div class=\"textbox__content\">\n\nThe hourly wages for ten employees at a small coffee shop are illustrated in the histogram. Determine the mean and standard deviation for the employee hourly wages.\n\n<img class=\"wp-image-1763 aligncenter\" src=\"https:\/\/opentextbc.ca\/oerdiscipline\/wp-content\/uploads\/sites\/361\/2021\/08\/8.2-Try-It-5-Histogram-1.png\" alt=\"\" width=\"639\" height=\"380\">\n\n<details><summary>Show answer<\/summary>The mean is $17.80 and the standard deviation is\u00a0 $1.32\n\n<\/details><\/div>\n<\/div>\nWe have seen that standard deviation provides us with a measure of the spread of data values in relation to the mean.\u00a0 We have learned how to calculate the standard deviation for\u00a0 a data set but we\u00a0 have not explored the significance or meaning of these calculated values. In the next section we will explore the significance of the calculated values as we consider the relationship between the standard deviation and the distribution of the data values.\n<h1>Key Concepts<\/h1>\n<ul>\n \t<li>Two measures of dispersion or spread in data values are:\n<ul>\n \t<li>Range = highest data value - lowest data value<\/li>\n \t<li>Standard deviation. \u00a0Unlike the range, its value depends on every data value in the data set. It is found by determining how much each data value differs from the mean.<\/li>\n<\/ul>\n<\/li>\n<\/ul>\n<h1><strong>Glossary<\/strong><\/h1>\n<div class=\"textbox shaded\">\n\n<strong>histogram<\/strong>\n\nrepresents the frequency distribution (number of occurrences) of each data value. The data values are grouped into intervals or \"bins\".\n\n<strong>outlier<\/strong>\n\nA data observation that is deemed to be unusual based on the pattern of the other data values.\n\n<strong>range<\/strong>\n\nindicates the total spread in data values. It is the difference between the highest and lowest data values.\n\n<strong>sigma<\/strong>\n\nis the uppercase Greek letter written\u00a0 \u2211. It is used to indicate the sum of a series of values.\n\n<strong>standard deviation<\/strong>\n\nmeasures the\u00a0 dispersion of the data values around the mean.\n\n<\/div>\n<h1>8.2 Exercise Set<\/h1>\n<ol>\n \t<li>The daily high temperature (in degrees C) for Calgary AB was recorded over a period of two weeks:\n<table style=\"border-collapse: collapse; width: 100%;\" border=\"0\">\n<tbody>\n<tr>\n<td style=\"width: 14.2857%;\">28<\/td>\n<td style=\"width: 14.2857%;\">25<\/td>\n<td style=\"width: 14.2857%;\">26<\/td>\n<td style=\"width: 14.2857%;\">27<\/td>\n<td style=\"width: 14.2857%;\">27<\/td>\n<td style=\"width: 14.2857%;\">29<\/td>\n<td style=\"width: 14.2857%;\">30<\/td>\n<\/tr>\n<tr>\n<td style=\"width: 14.2857%;\">30<\/td>\n<td style=\"width: 14.2857%;\">12<\/td>\n<td style=\"width: 14.2857%;\">20<\/td>\n<td style=\"width: 14.2857%;\">22<\/td>\n<td style=\"width: 14.2857%;\">25<\/td>\n<td style=\"width: 14.2857%;\">24<\/td>\n<td style=\"width: 14.2857%;\">25<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<ol type=\"a\">\n \t<li>Determine the mean, median and the range for these fourteen temperatures.<\/li>\n \t<li>Which of the temperatures appears to be an outlier?<\/li>\n \t<li>Remove the outlier and recalculate the mean, median and the range for the thirteen remaining values.<\/li>\n \t<li>Comparing the results for the fourteen temperature values versus thirteen temperature values, which of the three measures was most impacted by the outlier?<\/li>\n<\/ol>\n<\/li>\n \t<li>The population of Cache Creek, B.C. for the years 2011 to 2019 is provided in the table below (<em>Demographic Analysis Section, BC Stats)<\/em><img class=\"aligncenter wp-image-1764 size-large\" src=\"https:\/\/opentextbc.ca\/oerdiscipline\/wp-content\/uploads\/sites\/361\/2021\/08\/8.2-Ex-Set1-Table-1024x118.png\" alt=\"\" width=\"1024\" height=\"118\">\n<ol type=\"a\">\n \t<li>Determine the mean and median population.<\/li>\n \t<li>Determine the range.<\/li>\n \t<li>Determine the standard deviation.<\/li>\n<\/ol>\n<\/li>\n \t<li>Set A:\u00a0 6\u00a0 7\u00a0 7\u00a0 10\u00a0 10\u00a0 11Set B:\u00a0 10\u00a0 11\u00a0 12\u00a0 13\u00a0 14\u00a0 15Set C:\u00a0\u00a0 10\u00a0 12\u00a0 16\u00a0 11\u00a0 12\u00a0 13\n<ol type=\"a\">\n \t<li>For each of the three data samples below, calculate the range, mean, and standard deviation.<\/li>\n \t<li>Based on these calculations, determine any similarities that exist between any \u00a0of the data samples.<\/li>\n \t<li>Which of the three measures is different for every set?<\/li>\n<\/ol>\n<ol type=\"a\">\n \t<li>Given the two data samples <strong>Set A<\/strong> and <strong>Set B<\/strong> below, which appears to have the greatest spread in values?\n<ul>\n \t<li><strong>Set A<\/strong>:\u00a0 69\u00a0\u00a0 86\u00a0\u00a0 74\u00a0\u00a0 60\u00a0\u00a0 67\u00a0\u00a0 65<\/li>\n \t<li><strong>Set B<\/strong>:\u00a0 50 \u00a0\u00a0\u00a051\u00a0\u00a0 86\u00a0\u00a0 50\u00a0\u00a0 52\u00a0\u00a0 51<\/li>\n<\/ul>\n<\/li>\n \t<li>For each sample calculate the mean, range and standard deviation. If necessary round final answers to 2 decimal places.<\/li>\n \t<li>Based on these calculations which sample has the larger standard deviation?<\/li>\n \t<li>If the 86 in each data set is changed to a 56 which set would you predict would have the greatest <strong>change<\/strong> in standard deviation? Recalculate the standard deviations with a value of 56 instead of 86. Is your prediction correct?\n<ul>\n \t<li><strong>Set A<\/strong>:\u00a0 69\u00a0 \u00a0<span style=\"color: #0000ff;\">56<\/span>\u00a0\u00a0 74\u00a0\u00a0 60\u00a0\u00a0 67\u00a0\u00a0 65<\/li>\n \t<li><strong>Set B<\/strong>:\u00a0 50 \u00a0\u00a0\u00a051\u00a0 \u00a0<span style=\"color: #0000ff;\">56<\/span>\u00a0\u00a0 50\u00a0\u00a0 52\u00a0\u00a0 51<\/li>\n<\/ul>\n<\/li>\n<\/ol>\n<\/li>\n \t<li>The standard deviation for a sample is calculated to be 0. What can you conclude about the data values?<\/li>\n \t<li>The maximum hourly wage (in dollars\/hour) for pipefitters and carpenters in the ten Canadian provinces is listed below.(Source: <a href=\"https:\/\/www.jobbank.gc.ca\/wagereport\/occupation\/21460\">Wages for Steamfitters, pipefitters and sprinkler system installers from the Canadian Job Bank<\/a>)\n<ul>\n \t<li><strong>Pipefitter<\/strong>:\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0 48\u00a0\u00a0 47\u00a0\u00a0 45\u00a0\u00a0 45\u00a0\u00a0 40\u00a0\u00a0 48\u00a0\u00a0 42\u00a0\u00a0 50\u00a0\u00a0 46\u00a0\u00a0 43<\/li>\n \t<li><strong>Carpenter<\/strong>:\u00a0\u00a0\u00a0\u00a0\u00a0 36\u00a0\u00a0 33\u00a0\u00a0 25\u00a0\u00a0\u00a0 28\u00a0\u00a0 39\u00a0\u00a0 36\u00a0\u00a0 33\u00a0\u00a0 35\u00a0\u00a0 39\u00a0\u00a0 35<\/li>\n<\/ul>\n<ol type=\"a\">\n \t<li>Without calculating the average, which <strong>occupation<\/strong> appears to have the higher average maximum hourly wage?<\/li>\n \t<li>Calculate the average (mean) maximum hourly wage (to the nearest cent) for each occupation.<\/li>\n \t<li>Determine the range in maximum hourly wages for each occupation. Which occupation has a greater range in hourly wages?<\/li>\n \t<li>Calculate the standard deviation (to the nearest cent) for each occupation. Which occupation has a maximum hourly wage that is more spread out?<\/li>\n<\/ol>\n<\/li>\n \t<li>Revisit question #1. The daily high temperature (in degrees C) for Calgary AB was recorded over a period of two weeks:\n<table style=\"border-collapse: collapse; width: 100%;\" border=\"0\">\n<tbody>\n<tr>\n<td style=\"width: 14.2857%;\">28<\/td>\n<td style=\"width: 14.2857%;\">25<\/td>\n<td style=\"width: 14.2857%;\">26<\/td>\n<td style=\"width: 14.2857%;\">27<\/td>\n<td style=\"width: 14.2857%;\">27<\/td>\n<td style=\"width: 14.2857%;\">29<\/td>\n<td style=\"width: 14.2857%;\">30<\/td>\n<\/tr>\n<tr>\n<td style=\"width: 14.2857%;\">30<\/td>\n<td style=\"width: 14.2857%;\">12<\/td>\n<td style=\"width: 14.2857%;\">20<\/td>\n<td style=\"width: 14.2857%;\">22<\/td>\n<td style=\"width: 14.2857%;\">25<\/td>\n<td style=\"width: 14.2857%;\">24<\/td>\n<td style=\"width: 14.2857%;\">25<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<ol type=\"a\">\n \t<li>Calculate the standard deviation for the fourteen temperature values.<\/li>\n \t<li>Remove the outlier and recalculate the standard deviation for the thirteen temperature values. How has the standard deviation changed with the removal of the outlier?<\/li>\n<\/ol>\n<\/li>\n \t<li>Two histograms are provided below:<img class=\"aligncenter wp-image-1765 \" src=\"https:\/\/opentextbc.ca\/oerdiscipline\/wp-content\/uploads\/sites\/361\/2021\/08\/8.2-Ex-Set6-Histograms-1024x441.png\" alt=\"\" width=\"497\" height=\"214\">\n<ol type=\"a\">\n \t<li><span style=\"font-size: 14pt;\">For each histogram determine the range, mean, and standard deviation.<\/span><\/li>\n \t<li>Considering only the means and ranges, what can you conclude about the spread or dispersion of the data values for each set?<\/li>\n \t<li>Considering only the means and standard deviations, what can you conclude about the spread or dispersion of the data values for each set?<\/li>\n<\/ol>\n<\/li>\n<\/ol>\n<h1>Answers<\/h1>\n<ol>\n \t<li style=\"list-style-type: none;\">\n<ol type=\"a\">\n \t<li>mean 25; median 25.5, range 18<\/li>\n \t<li>outlier is 12<\/li>\n \t<li>mean 26; median 26, range 10<\/li>\n \t<li>The range was most impacted by the removal of the outlier.<\/li>\n<\/ol>\n<ol type=\"a\">\n \t<li>mean is 1022; median is\u00a0 1021<\/li>\n \t<li>58<\/li>\n \t<li>standard deviation = 21.40<\/li>\n<\/ol>\n<ol type=\"a\">\n \t<li>Set A:\u00a0 range is 5;\u00a0 mean is 8.5;\u00a0 standard deviation is 2.07Set B:\u00a0 range is 5;\u00a0\u00a0 mean is 12.5;\u00a0 standard deviation is 1.87Set C:\u00a0 range is 6;\u00a0 mean is 12.33;\u00a0 standard deviation is 2.07<\/li>\n \t<li>Set A and Set B\u00a0 have identical ranges but different means and standard deviations.Set A and Set C\u00a0 have the same standard deviation but different ranges.<\/li>\n \t<li>Each set has a different mean.<\/li>\n<\/ol>\n<ol type=\"a\">\n \t<li>Answers may vary. Set A scores appear to be more spread out; aside from one score of 86, the scores in Set B are less spread out.<\/li>\n \t<li>Set A: range is 26; mean is 70.17; standard deviation is 9.02Set B:\u00a0 range is 36;\u00a0 mean is 56.67;\u00a0 standard deviation is 14.39<\/li>\n \t<li>Set B has a larger standard deviation.<\/li>\n \t<li>Predictions will vary.\u00a0 For Set A the new standard deviation is 6.43 (compared to 9.02);\u00a0 Set B has a new standard deviation of 2.25 (compared to 14.39) so set B had the greatest change in standard deviation.<\/li>\n<\/ol>\n<\/li>\n \t<li>All data values are equal.\n<ol type=\"a\">\n \t<li><span style=\"font-size: 14pt;\">pipefitter<\/span><\/li>\n \t<li>pipefitter \u2018s top hourly wage on average is $45.40\/hour; carpenter\u2019s top hourly wage on average is $33.90\/hour<\/li>\n \t<li>pipefitter range of $10\/hour for the 10 provincescarpenter range of $14\/hour for the 10 provincesThe carpenter has a greater range in wages<\/li>\n \t<li>pipefitter has a standard deviation of $3.06 and the carpenter has a standard deviation of $4.46. The carpenter has a higher standard deviation and therefore the carpenter\u2019s hourly wage is more spread out.<\/li>\n<\/ol>\n<ol type=\"a\">\n \t<li>Standard deviation is 4.71<\/li>\n \t<li>Standard deviation is 2.97; With the removal of the outlier this value has become smaller<\/li>\n<\/ol>\n<ol type=\"a\">\n \t<li>Histogram A: range is 40, mean is 75, standard deviation is 11.88Histogram B:\u00a0 range is 40, mean is 75, standard deviation is 15.34<\/li>\n \t<li>Considering only the means and ranges, these are both identical so it would appear that the data values are equally dispersed.<\/li>\n \t<li>Histogram B has a greater dispersion of data as indicated by the larger standard deviation.<\/li>\n<\/ol>\n<\/li>\n<\/ol>","rendered":"<p><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter wp-image-1748 size-large\" title=\"https:\/\/pixabay.com\/illustrations\/dispersion-flower-effect-flora-6184855\/\" src=\"https:\/\/opentextbc.ca\/oerdiscipline\/wp-content\/uploads\/sites\/361\/2020\/06\/8.2-intro-image-flower-1-1024x614.jpg\" alt=\"\" width=\"1024\" height=\"614\" srcset=\"https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-content\/uploads\/sites\/361\/2020\/06\/8.2-intro-image-flower-1-1024x614.jpg 1024w, https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-content\/uploads\/sites\/361\/2020\/06\/8.2-intro-image-flower-1-300x180.jpg 300w, https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-content\/uploads\/sites\/361\/2020\/06\/8.2-intro-image-flower-1-768x461.jpg 768w, https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-content\/uploads\/sites\/361\/2020\/06\/8.2-intro-image-flower-1-1536x922.jpg 1536w, https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-content\/uploads\/sites\/361\/2020\/06\/8.2-intro-image-flower-1-65x39.jpg 65w, https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-content\/uploads\/sites\/361\/2020\/06\/8.2-intro-image-flower-1-225x135.jpg 225w, https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-content\/uploads\/sites\/361\/2020\/06\/8.2-intro-image-flower-1-350x210.jpg 350w, https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-content\/uploads\/sites\/361\/2020\/06\/8.2-intro-image-flower-1.jpg 1920w\" sizes=\"auto, (max-width: 1024px) 100vw, 1024px\" \/><\/p>\n<div class=\"textbox textbox--learning-objectives\">\n<header class=\"textbox__header\">\n<p class=\"textbox__title\">Learning Objectives<\/p>\n<\/header>\n<div class=\"textbox__content\">\n<p>By the end of this section the student should be able to:<\/p>\n<ul>\n<li>Determine the range for a data set.<\/li>\n<li>Determine the standard deviation for a data set.<\/li>\n<li>Determine the standard deviation from a histogram<\/li>\n<\/ul>\n<\/div>\n<\/div>\n<h1>Measures of Dispersion or Spread<\/h1>\n<p>We have seen that<strong> measures of central tendency,<\/strong> including the\u00a0mean\u00a0and median, \u00a0are used to identify a central position within a data set. They indicate where the data clusters.<\/p>\n<p>Consider student A&#8217;s scores on five tests:\u00a0 32%\u00a0 \u00a095%\u00a0 \u00a089%\u00a0 \u00a074%\u00a0 \u00a055%\u00a0\u00a0The <strong>mean,<\/strong> or average, is\u00a0 (32 + 95 + 89 + 74 + 55)\/5 = 69%\u00a0 and the <strong>median<\/strong> is 74%.<\/p>\n<p>Consider\u00a0 student B&#8217;s scores on the same five tests:\u00a0 68%\u00a0 \u00a069%\u00a0 \u00a072%\u00a0 \u00a074%\u00a0 \u00a062%\u00a0 \u00a0The <strong>mean,<\/strong> or average, is\u00a0 (68 + 69 + 72 + 74 + 62)\/5 = 69%\u00a0 and the <strong>median<\/strong> is 69%.<\/p>\n<p>Both student&#8217;s have the same test average of 69% but there is a substantial difference in the spread or dispersion of their scores. Student A&#8217;s test scores range\u00a0from a low score of 32% to a high score of\u00a0 95% so the spread in marks is 63 percentage points. Student B&#8217;s test scores range from a low score of 62% to a high score of\u00a0 74% so the spread in marks is 12 percentage points.<\/p>\n<p>When we analyze data it is important to consider how <strong>dispersed<\/strong> or spread out the data values are. In this section we will consider two measures of dispersion.<\/p>\n<h1>Range<\/h1>\n<p>Range is one <strong>measure of dispersion. <\/strong>A measure of dispersion is used to describe the spread of data.<\/p>\n<div class=\"textbox textbox--key-takeaways\">\n<header class=\"textbox__header\">\n<p class=\"textbox__title\">Range<\/p>\n<\/header>\n<div class=\"textbox__content\">\n<p>The <strong>range<\/strong> indicates the total spread in data values. It is the difference between the highest and lowest data values.<\/p>\n<p>Range = highest data value &#8211; lowest data value<\/p>\n<\/div>\n<\/div>\n<div class=\"textbox textbox--examples\">\n<header class=\"textbox__header\">\n<p class=\"textbox__title\">EXAMPLE 1<\/p>\n<\/header>\n<div class=\"textbox__content\">\n<p>The table shows the daily high temperature (\u00b0C) over a three week period.<\/p>\n<p>Determine the highest temperature, lowest temperature, and the range\u00a0 in daily high temperatures over the three weeks.<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"alignnone wp-image-1749 size-large\" src=\"https:\/\/opentextbc.ca\/oerdiscipline\/wp-content\/uploads\/sites\/361\/2021\/08\/8.2-Ex1-Range-1024x212.png\" alt=\"\" width=\"1024\" height=\"212\" srcset=\"https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-content\/uploads\/sites\/361\/2021\/08\/8.2-Ex1-Range-1024x212.png 1024w, https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-content\/uploads\/sites\/361\/2021\/08\/8.2-Ex1-Range-300x62.png 300w, https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-content\/uploads\/sites\/361\/2021\/08\/8.2-Ex1-Range-768x159.png 768w, https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-content\/uploads\/sites\/361\/2021\/08\/8.2-Ex1-Range-1536x319.png 1536w, https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-content\/uploads\/sites\/361\/2021\/08\/8.2-Ex1-Range-65x13.png 65w, https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-content\/uploads\/sites\/361\/2021\/08\/8.2-Ex1-Range-225x47.png 225w, https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-content\/uploads\/sites\/361\/2021\/08\/8.2-Ex1-Range-350x73.png 350w, https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-content\/uploads\/sites\/361\/2021\/08\/8.2-Ex1-Range.png 1793w\" sizes=\"auto, (max-width: 1024px) 100vw, 1024px\" \/><\/p>\n<p><strong>Solution<\/strong><\/p>\n<p>The highest temperature was 29\u00ba, the lowest temperature was 16\u00b0, and the range in temperatures was\u00a0 29 &#8211; 16 =\u00a0 13\u00ba.<\/p>\n<\/div>\n<\/div>\n<div class=\"textbox textbox--exercises\">\n<header class=\"textbox__header\">\n<p class=\"textbox__title\">TRY IT 1<\/p>\n<\/header>\n<div class=\"textbox__content\">\n<p>The table shows the test score for a group of fourteen students.<\/p>\n<p>Determine the highest test score, the lowest test score, and the range\u00a0 in test scores for the group of 14 students.<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"alignnone wp-image-1750 size-large\" src=\"https:\/\/opentextbc.ca\/oerdiscipline\/wp-content\/uploads\/sites\/361\/2021\/08\/8.2-Try-It-1-Range-1024x114.png\" alt=\"\" width=\"1024\" height=\"114\" srcset=\"https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-content\/uploads\/sites\/361\/2021\/08\/8.2-Try-It-1-Range-1024x114.png 1024w, https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-content\/uploads\/sites\/361\/2021\/08\/8.2-Try-It-1-Range-300x33.png 300w, https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-content\/uploads\/sites\/361\/2021\/08\/8.2-Try-It-1-Range-768x85.png 768w, https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-content\/uploads\/sites\/361\/2021\/08\/8.2-Try-It-1-Range-1536x171.png 1536w, https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-content\/uploads\/sites\/361\/2021\/08\/8.2-Try-It-1-Range-65x7.png 65w, https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-content\/uploads\/sites\/361\/2021\/08\/8.2-Try-It-1-Range-225x25.png 225w, https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-content\/uploads\/sites\/361\/2021\/08\/8.2-Try-It-1-Range-350x39.png 350w, https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-content\/uploads\/sites\/361\/2021\/08\/8.2-Try-It-1-Range.png 1771w\" sizes=\"auto, (max-width: 1024px) 100vw, 1024px\" \/><\/p>\n<details>\n<summary>Show answer<\/summary>\n<p>The highest score was 89%, the lowest score was 56%, and the range in scores was 33%.<\/p>\n<\/details>\n<\/div>\n<\/div>\n<p>An advantage of using the range as a measure of dispersion is that it involves a simple calculation. A disadvantage is that the range only provides a measure between the highest and lowest values so it disregards all other data values. If the highest or lowest data value is an outlier then the range will not provide a true measure of the spread in the typical values.<\/p>\n<div class=\"textbox textbox--examples\">\n<header class=\"textbox__header\">\n<p class=\"textbox__title\">EXAMPLE 2<\/p>\n<\/header>\n<div class=\"textbox__content\">\n<p>A student wrote five tests and earned the following five scores:<\/p>\n<p style=\"text-align: center;\">92%\u00a0 \u00a095%\u00a0 \u00a089%\u00a0 \u00a094%\u00a0 \u00a035%<\/p>\n<ol>\n<li>Determine the mean, median and the range for these five scores.<\/li>\n<li>Which of the five scores is an outlier?<\/li>\n<li>Remove the outlier and recalculate the mean, median and the range for the four remaining scores.<\/li>\n<li>Comparing the results for the five scores versus four scores, which of the three measures was least impacted by the outlier?<\/li>\n<li>Comparing the results for the five scores versus four scores, which of the three measures was most impacted by the outlier?<\/li>\n<\/ol>\n<p><strong>Solution<\/strong><\/p>\n<p>1. The student&#8217;s mean score: (92 + 95 + 89 + 94 + 35) \/ 5 = 81%<\/p>\n<p>The median score is 92%.<\/p>\n<p>The range in marks is 95% &#8211; 35% = 60%.<\/p>\n<p>2. The score of 35% is an outlier.<\/p>\n<p>3. The student&#8217;s mean score: (92 + 95 + 89 + 94) \/ 4 = 92.5%<\/p>\n<p>The median score is 93%.<\/p>\n<p>The range in marks is 95% &#8211; 89% = 6%.<\/p>\n<p>4. The median was least impacted by the removal of the outlier.<\/p>\n<p>5. The range was most impacted by the outlier.<\/p>\n<\/div>\n<p>&nbsp;<\/p>\n<\/div>\n<div class=\"textbox textbox--exercises\">\n<header class=\"textbox__header\">\n<p class=\"textbox__title\">TRY IT 2<\/p>\n<\/header>\n<div class=\"textbox__content\">\n<p>The following seven values are salaries at a local computer company.<\/p>\n<p style=\"text-align: center;\">$62,000\u00a0 \u00a0 \u00a0$95, 000\u00a0 \u00a0$120,000\u00a0 \u00a0$101, 000\u00a0 \u00a0 \u00a0$99,000\u00a0 \u00a0$98,000\u00a0 \u00a0 \u00a0$110,000<\/p>\n<ol>\n<li>Determine the mean, median and the range for these seven salaries.<\/li>\n<li>Which of the seven salaries is an outlier?<\/li>\n<li>Remove the outlier and recalculate the mean, median and the range for the six remaining salaries.<\/li>\n<li>Comparing the results for the seven versus six salaries, which of the three measures was least impacted by the outlier?<\/li>\n<li>Comparing the results for the seven versus six salaries, which of the three measures was most impacted by the outlier?<\/li>\n<\/ol>\n<details open=\"open\">\n<summary>Show answer<\/summary>\n<ol>\n<li>mean salary $97,857; median salary $99,000;\u00a0 range in salaries $58,000<\/li>\n<li>\u00a0$62,000<\/li>\n<li>mean salary $103, 833; median salary $100,000;\u00a0 range in salaries $25,000<\/li>\n<li>the median<\/li>\n<li>the range<\/li>\n<\/ol>\n<\/details>\n<\/div>\n<\/div>\n<p>Refer back to Example 2 and the measures that were calculated for five test scores.The student&#8217;s mean score is 81% and the range in marks is 60%.\u00a0 The range of 60% does not capture the fact that if the outlier is removed then there is a spread of only 6% for the four remaining data values. The range depends on only the <strong>highest<\/strong> and <strong>lowest<\/strong> data values. The existence of an outlier can result in a misleading representation of the spread in data values.<\/p>\n<p>An alternative measure of dispersion is called the <strong>standard deviation<\/strong>. It depends on <strong>all<\/strong> data values rather than on only the highest and lowest data values.<\/p>\n<h1>Standard Deviation<\/h1>\n<p>Standard deviation\u00a0 measures the\u00a0 <strong>dispersion of the data values around the mean<\/strong>. Unlike the range, its value depends on every data value in the data set. The standard deviation is found by determining how much each data value differs from the mean.<\/p>\n<p>What does the standard deviation actually tell us? Consider two sets of test scores:<\/p>\n<p>Set A:\u00a0 \u00a0 76%\u00a0 \u00a074%\u00a0 \u00a086%\u00a0 \u00a084%\u00a0 \u00a085%<\/p>\n<p>Set<span style=\"font-size: 14pt;\">\u00a0<\/span><span style=\"font-size: 14pt;\">B:\u00a0 \u00a0 53%\u00a0 \u00a095%\u00a0 \u00a062%\u00a0 \u00a099%\u00a0 \u00a096%<\/span><\/p>\n<p>Refer to <a href=\"#figure1\">Figure 1<\/a>. For both sets the mean is 81%. If we plot the scores (indicated by the <strong>*<\/strong>) on a scale of 0% to 100% we see that the scores in Set A are much less spread out around\u00a0 the mean. The scores from\u00a0 Set B are much more dispersed.<a id=\"#figure1\" href=\"\"><\/a><\/p>\n<figure id=\"attachment_1760\" aria-describedby=\"caption-attachment-1760\" style=\"width: 1024px\" class=\"wp-caption aligncenter\"><img loading=\"lazy\" decoding=\"async\" class=\"wp-image-1751 size-large\" title=\"Created by Kim Moshenko\" src=\"https:\/\/opentextbc.ca\/oerdiscipline\/wp-content\/uploads\/sites\/361\/2021\/08\/8.2-Sets-AB-image-1024x417.png\" alt=\"\" width=\"1024\" height=\"417\" srcset=\"https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-content\/uploads\/sites\/361\/2021\/08\/8.2-Sets-AB-image-1024x417.png 1024w, https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-content\/uploads\/sites\/361\/2021\/08\/8.2-Sets-AB-image-300x122.png 300w, https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-content\/uploads\/sites\/361\/2021\/08\/8.2-Sets-AB-image-768x313.png 768w, https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-content\/uploads\/sites\/361\/2021\/08\/8.2-Sets-AB-image-1536x626.png 1536w, https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-content\/uploads\/sites\/361\/2021\/08\/8.2-Sets-AB-image-65x26.png 65w, https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-content\/uploads\/sites\/361\/2021\/08\/8.2-Sets-AB-image-225x92.png 225w, https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-content\/uploads\/sites\/361\/2021\/08\/8.2-Sets-AB-image-350x143.png 350w, https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-content\/uploads\/sites\/361\/2021\/08\/8.2-Sets-AB-image.png 1990w\" sizes=\"auto, (max-width: 1024px) 100vw, 1024px\" \/><figcaption id=\"caption-attachment-1760\" class=\"wp-caption-text\">Fig. 1<\/figcaption><\/figure>\n<p>If we compare the standard deviations for the two data sets we should find that although the mean is the same, the standard deviation for Set B will be greater since the data is more widely spread out from the mean.<\/p>\n<h1>Population versus Sample Standard Deviation<\/h1>\n<p>When working with standard deviation it is important to distinguish whether you are working with the entire population or a sample of the population.\u00a0Statisticians generally survey a sample of the population because it is often impossible to survey the entire population.\u00a0 As an example, assume your university wants to determine food preferences for its entire student body. The population would be all students enrolled in the university. Rather than attempt to survey every student, the university will survey only a sample.<\/p>\n<p>The symbolic representation of standard deviation is different for a population versus a sample. If\u00a0 you are working with an entire <strong>population,<\/strong> the symbol for standard deviation is the Greek letter sigma, <strong>\u03c3.<\/strong> The symbol for the standard deviation of a <strong>sample<\/strong> is\u00a0 <strong><em>s<\/em><\/strong>. Similarly, the calculation of standard deviation is different for a population versus a sample.\u00a0 Unless otherwise indicated in this chapter, we will assume that we are working with\u00a0 a sample, rather than an entire population.<\/p>\n<div class=\"textbox textbox--key-takeaways\">\n<header class=\"textbox__header\">\n<p class=\"textbox__title\">Standard Deviation Formula<\/p>\n<\/header>\n<div class=\"textbox__content\">\n<p><strong>Standard Deviation<\/strong>\u00a0<strong>Formula for a Population<\/strong><\/p>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter wp-image-1752\" src=\"https:\/\/opentextbc.ca\/oerdiscipline\/wp-content\/uploads\/sites\/361\/2021\/08\/8.2-pop-std-dev-formula.png\" alt=\"\" width=\"510\" height=\"156\" srcset=\"https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-content\/uploads\/sites\/361\/2021\/08\/8.2-pop-std-dev-formula.png 891w, https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-content\/uploads\/sites\/361\/2021\/08\/8.2-pop-std-dev-formula-300x92.png 300w, https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-content\/uploads\/sites\/361\/2021\/08\/8.2-pop-std-dev-formula-768x234.png 768w, https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-content\/uploads\/sites\/361\/2021\/08\/8.2-pop-std-dev-formula-65x20.png 65w, https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-content\/uploads\/sites\/361\/2021\/08\/8.2-pop-std-dev-formula-225x69.png 225w, https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-content\/uploads\/sites\/361\/2021\/08\/8.2-pop-std-dev-formula-350x107.png 350w\" sizes=\"auto, (max-width: 510px) 100vw, 510px\" \/><\/p>\n<p>The Greek letter sigma\u00a0 \u2211 is the summation symbol. It indicates that all of the values (x \u2013 \u03bc)<sup>2<\/sup> must be added.<\/p>\n<p>&nbsp;<\/p>\n<p><strong>Standard Deviation<\/strong>\u00a0<strong>Formula for a Sample<\/strong><\/p>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter wp-image-1753\" src=\"https:\/\/opentextbc.ca\/oerdiscipline\/wp-content\/uploads\/sites\/361\/2021\/08\/8.2-sample-std-dev-formula.png\" alt=\"\" width=\"507\" height=\"189\" srcset=\"https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-content\/uploads\/sites\/361\/2021\/08\/8.2-sample-std-dev-formula.png 879w, https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-content\/uploads\/sites\/361\/2021\/08\/8.2-sample-std-dev-formula-300x112.png 300w, https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-content\/uploads\/sites\/361\/2021\/08\/8.2-sample-std-dev-formula-768x286.png 768w, https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-content\/uploads\/sites\/361\/2021\/08\/8.2-sample-std-dev-formula-65x24.png 65w, https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-content\/uploads\/sites\/361\/2021\/08\/8.2-sample-std-dev-formula-225x84.png 225w, https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-content\/uploads\/sites\/361\/2021\/08\/8.2-sample-std-dev-formula-350x130.png 350w\" sizes=\"auto, (max-width: 507px) 100vw, 507px\" \/><\/p>\n<\/div>\n<\/div>\n<h1>Calculating the Standard Deviation<\/h1>\n<p>When calculating the standard deviation with the aid of a scientific calculator it is helpful to record the steps using a table.<\/p>\n<div class=\"textbox textbox--key-takeaways\">\n<header class=\"textbox__header\">\n<p class=\"textbox__title\">Calculating Standard Deviation<\/p>\n<\/header>\n<div class=\"textbox__content\">\n<p>We will use the formula for finding the standard deviation of a sample:<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter\" src=\"https:\/\/opentextbc.ca\/oerdiscipline\/wp-content\/uploads\/sites\/361\/2021\/08\/8.2-sample-std-dev-formula-1.png\" width=\"427\" height=\"136\" alt=\"image\" \/><\/p>\n<p>To determine the standard deviation for a sample we will use the following steps :<\/p>\n<ol>\n<li>Find the mean<\/li>\n<li>Create a table with three columns:\u00a0 \u00a0 data value,\u00a0 \u00a0 data value &#8211; mean,\u00a0 \u00a0 \u00a0(data value &#8211; mean)<sup>2<\/sup><\/li>\n<li>Fill in the <strong>data value<\/strong> column with all values from the sample<\/li>\n<li>Subtract the mean from each data value:\u00a0 (data value &#8211; mean)<\/li>\n<li>Square the results from step 4:\u00a0 \u00a0(data value &#8211; mean)<sup>2<\/sup><\/li>\n<li>Sum the results in column 3 (from step 5)<\/li>\n<li>Divide the sum (from step 6) by (n &#8211; 1)<\/li>\n<li>Find the square root of the result in step 7<\/li>\n<\/ol>\n<\/div>\n<\/div>\n<div class=\"textbox textbox--examples\">\n<header class=\"textbox__header\">\n<p class=\"textbox__title\">EXAMPLE 3<\/p>\n<\/header>\n<div class=\"textbox__content\">\n<p>a) Determine the standard deviation for the sample set A test scores:\u00a0 <span style=\"font-size: 14pt;\">76%\u00a0 \u00a074%\u00a0 \u00a086%\u00a0 \u00a084%\u00a0 \u00a085%<\/span><\/p>\n<p>b) Determine the standard deviation for the sample set B test scores: <span style=\"font-size: 14pt;\">\u00a0 \u00a0 53%\u00a0 \u00a095%\u00a0 \u00a062%\u00a0 \u00a099%\u00a0 \u00a096%<\/span><\/p>\n<p>c) Compare the means and standard deviations for Set A and Set B. Which set is more spread out (dispersed)?<\/p>\n<p><strong>Solution<\/strong><\/p>\n<p><span style=\"text-align: initial; font-size: 0.9em; word-spacing: normal;\">a)<\/span><\/p>\n<\/div>\n<div class=\"textbox__content\">\n<p><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter wp-image-1755\" src=\"https:\/\/opentextbc.ca\/oerdiscipline\/wp-content\/uploads\/sites\/361\/2021\/08\/8.2-Example-3-1024x701.png\" alt=\"\" width=\"603\" height=\"413\" srcset=\"https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-content\/uploads\/sites\/361\/2021\/08\/8.2-Example-3-1024x701.png 1024w, https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-content\/uploads\/sites\/361\/2021\/08\/8.2-Example-3-300x205.png 300w, https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-content\/uploads\/sites\/361\/2021\/08\/8.2-Example-3-768x526.png 768w, https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-content\/uploads\/sites\/361\/2021\/08\/8.2-Example-3-65x45.png 65w, https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-content\/uploads\/sites\/361\/2021\/08\/8.2-Example-3-225x154.png 225w, https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-content\/uploads\/sites\/361\/2021\/08\/8.2-Example-3-350x240.png 350w, https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-content\/uploads\/sites\/361\/2021\/08\/8.2-Example-3.png 1317w\" sizes=\"auto, (max-width: 603px) 100vw, 603px\" \/><\/p>\n<p>The standard deviation for Set A is 5.57 (rounded to 2 decimal places)<\/p>\n<p>b)<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter wp-image-1756 size-large\" src=\"https:\/\/opentextbc.ca\/oerdiscipline\/wp-content\/uploads\/sites\/361\/2021\/08\/8.2-Example-4-1024x699.png\" alt=\"\" width=\"1024\" height=\"699\" srcset=\"https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-content\/uploads\/sites\/361\/2021\/08\/8.2-Example-4-1024x699.png 1024w, https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-content\/uploads\/sites\/361\/2021\/08\/8.2-Example-4-300x205.png 300w, https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-content\/uploads\/sites\/361\/2021\/08\/8.2-Example-4-768x524.png 768w, https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-content\/uploads\/sites\/361\/2021\/08\/8.2-Example-4-65x44.png 65w, https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-content\/uploads\/sites\/361\/2021\/08\/8.2-Example-4-225x154.png 225w, https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-content\/uploads\/sites\/361\/2021\/08\/8.2-Example-4-350x239.png 350w, https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-content\/uploads\/sites\/361\/2021\/08\/8.2-Example-4.png 1362w\" sizes=\"auto, (max-width: 1024px) 100vw, 1024px\" \/><\/p>\n<p>The standard deviation for Set B is 21.74 (rounded to 2 decimal places).<\/p>\n<p>c)<\/p>\n<p>In comparing the two data sets A and B:<\/p>\n<ul>\n<li>The means are the same value of\u00a0 81.<\/li>\n<li>The standard deviation for Set A is\u00a0 s = 5.57\u00a0 and for Set B it is\u00a0 s = 21.74.\u00a0 \u00a0The much larger standard deviation for Set B indicates that there is a much greater spread in the data values around the mean of 81%.<\/li>\n<\/ul>\n<\/div>\n<\/div>\n<p>Note that the sum of the middle column (data value &#8211; mean) is 0. This will always be the case. That is why we must square the values before we add them, as is done in column 3.<\/p>\n<div class=\"textbox textbox--exercises\">\n<header class=\"textbox__header\">\n<p class=\"textbox__title\">TRY IT 3<\/p>\n<\/header>\n<div class=\"textbox__content\">\n<p>The average high temperatures (in \u00baC) for one week in April for two different cities in Canada were as follows:<\/p>\n<p>City A:\u00a0 \u00a0 15\u00a0 \u00a0 \u00a0 19\u00a0 \u00a0 \u00a022\u00a0 \u00a0 26\u00a0 \u00a0 21\u00a0 \u00a0 19\u00a0 \u00a0 18<\/p>\n<p>City B:\u00a0 \u00a0 \u00a06\u00a0 \u00a0 \u00a0 \u00a09\u00a0 \u00a0 \u00a0 15\u00a0 \u00a0 18\u00a0 \u00a0 \u00a020\u00a0 \u00a0 19\u00a0 \u00a0 21<\/p>\n<p>a) Calculate the mean high temperature (if necessary round to 2 decimal places) for each city. Which of the two cities appears to have a wider spread in temperatures around their means?<\/p>\n<p>b) Calculate the standard deviation for each temperature set (if necessary round to 2 decimal places) to see if your observation is correct.<\/p>\n<details>\n<summary>Show answer<\/summary>\n<p>The mean for City A is 20 \u02daC and the mean for City B was 15.43\u02daC.<\/p>\n<p>The standard deviation for City A is 3.46 and for City B is 5.80. City B\u2019s temperatures are more widely spread out from the mean temperature.<\/p>\n<\/details>\n<\/div>\n<\/div>\n<p>We have seen that two measures of spread or dispersion are the range\u00a0 and the standard deviation. Although the range is a much simpler calculation, it only takes into consideration the highest and lowest data values. The existence of an outlier can result in a range that is not truly indicative of the spread in data values. The standard deviation is a more complex calculation but takes into consideration all data values. It is important to note that technology is often used to calculate the standard deviation which eliminates the need for tedious calculations.<\/p>\n<div class=\"textbox textbox--examples\">\n<header class=\"textbox__header\">\n<p class=\"textbox__title\">EXAMPLE 4<\/p>\n<\/header>\n<div class=\"textbox__content\">\n<p>In Example 1 the range for the following set of temperature was determined to be\u00a0 13 \u00baC.<\/p>\n<p>a) Determine the mean and the standard deviation.<\/p>\n<p>b) Explain why the values for the range and standard deviation are different.<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"alignnone wp-image-1749 size-large\" src=\"https:\/\/opentextbc.ca\/oerdiscipline\/wp-content\/uploads\/sites\/361\/2021\/08\/8.2-Ex1-Range-1024x212.png\" alt=\"\" width=\"1024\" height=\"212\" srcset=\"https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-content\/uploads\/sites\/361\/2021\/08\/8.2-Ex1-Range-1024x212.png 1024w, https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-content\/uploads\/sites\/361\/2021\/08\/8.2-Ex1-Range-300x62.png 300w, https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-content\/uploads\/sites\/361\/2021\/08\/8.2-Ex1-Range-768x159.png 768w, https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-content\/uploads\/sites\/361\/2021\/08\/8.2-Ex1-Range-1536x319.png 1536w, https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-content\/uploads\/sites\/361\/2021\/08\/8.2-Ex1-Range-65x13.png 65w, https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-content\/uploads\/sites\/361\/2021\/08\/8.2-Ex1-Range-225x47.png 225w, https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-content\/uploads\/sites\/361\/2021\/08\/8.2-Ex1-Range-350x73.png 350w, https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-content\/uploads\/sites\/361\/2021\/08\/8.2-Ex1-Range.png 1793w\" sizes=\"auto, (max-width: 1024px) 100vw, 1024px\" \/><\/p>\n<p><strong>Solution<\/strong><\/p>\n<p>a) The mean (average) is 22.29 \u02daC and the standard deviation is 3.86.<br \/>\nb) The reason for the difference in values is that the range only tells us the difference between the highest and lowest temperature whereas the standard deviation tells us how widespread the temperatures are in relation to the mean temperature of 22.29 \u02daC<\/p>\n<\/div>\n<\/div>\n<div class=\"textbox textbox--exercises\">\n<header class=\"textbox__header\">\n<p class=\"textbox__title\">TRY IT 4<\/p>\n<\/header>\n<div class=\"textbox__content\">\n<p>Refer back to TRY IT 1 and the test scores for a group of 14 students. The range was determined to be 33.<\/p>\n<p>Determine the mean (rounded to the nearest whole number) and the standard deviation (rounded to the nearest 2 decimal places) for this set of test scores.<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"alignnone wp-image-1750 size-large\" src=\"https:\/\/opentextbc.ca\/oerdiscipline\/wp-content\/uploads\/sites\/361\/2021\/08\/8.2-Try-It-1-Range-1024x114.png\" alt=\"\" width=\"1024\" height=\"114\" srcset=\"https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-content\/uploads\/sites\/361\/2021\/08\/8.2-Try-It-1-Range-1024x114.png 1024w, https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-content\/uploads\/sites\/361\/2021\/08\/8.2-Try-It-1-Range-300x33.png 300w, https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-content\/uploads\/sites\/361\/2021\/08\/8.2-Try-It-1-Range-768x85.png 768w, https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-content\/uploads\/sites\/361\/2021\/08\/8.2-Try-It-1-Range-1536x171.png 1536w, https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-content\/uploads\/sites\/361\/2021\/08\/8.2-Try-It-1-Range-65x7.png 65w, https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-content\/uploads\/sites\/361\/2021\/08\/8.2-Try-It-1-Range-225x25.png 225w, https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-content\/uploads\/sites\/361\/2021\/08\/8.2-Try-It-1-Range-350x39.png 350w, https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-content\/uploads\/sites\/361\/2021\/08\/8.2-Try-It-1-Range.png 1771w\" sizes=\"auto, (max-width: 1024px) 100vw, 1024px\" \/><\/p>\n<p>&nbsp;<\/p>\n<details>\n<summary>Show answer<\/summary>\n<p>The mean is 72 and the standard deviation is 8.19.<\/p>\n<\/details>\n<\/div>\n<\/div>\n<h6><strong>Histograms and the Dispersion of Data Values<\/strong><\/h6>\n<p>We have seen that for two data sets with the same mean, when the standard deviation is larger the data values are more spread out. A histogram can be used to illustrate the spread of data values.<\/p>\n<p>Consider a dance competition where teams comprised of seven dancers compete for the prize money in several different dance categories. There were four teams entered in the elite category. The dancers must be between the ages of\u00a0 18-24 years old. The age breakdown for the members of the four teams in the elite category is:<\/p>\n<p>Team Unity:\u00a0 all seven dancers are age 21<\/p>\n<p>Team Harmony:\u00a0 2 dancers are 20, 3 dancers are 21 and 2 dancers\u00a0 are 22<\/p>\n<p>Team Mix:\u00a0 1 dancer is 19, 2 dancers are 20, 1 dancer is 21, 2 dancers are 22 and 1 dancer is 23<\/p>\n<p>Team Extend: 3 dancers are 18, 1 dancer is 21, and 3 dancers are 24<\/p>\n<p>The <strong>mean<\/strong> age for all four teams is 21 but the standard deviations for each of the four teams are different. Team Unity has a standard deviation of 0, Team Harmony has a standard deviation of 0.82, Team Mix has a standard deviation of 1.412 and Team Extend has a standard deviation of\u00a0 3.<\/p>\n<p>The histograms for each of the teams appears in Figures 2a through Figures 2d below.<\/p>\n<p>&nbsp;<\/p>\n<figure id=\"attachment_1760\" aria-describedby=\"caption-attachment-1760\" style=\"width: 505px\" class=\"wp-caption aligncenter\"><img loading=\"lazy\" decoding=\"async\" class=\"wp-image-1757\" src=\"https:\/\/opentextbc.ca\/oerdiscipline\/wp-content\/uploads\/sites\/361\/2021\/08\/8.3-Team-Unity-Histogram.png\" alt=\"\" width=\"505\" height=\"301\" srcset=\"https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-content\/uploads\/sites\/361\/2021\/08\/8.3-Team-Unity-Histogram.png 944w, https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-content\/uploads\/sites\/361\/2021\/08\/8.3-Team-Unity-Histogram-300x179.png 300w, https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-content\/uploads\/sites\/361\/2021\/08\/8.3-Team-Unity-Histogram-768x457.png 768w, https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-content\/uploads\/sites\/361\/2021\/08\/8.3-Team-Unity-Histogram-65x39.png 65w, https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-content\/uploads\/sites\/361\/2021\/08\/8.3-Team-Unity-Histogram-225x134.png 225w, https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-content\/uploads\/sites\/361\/2021\/08\/8.3-Team-Unity-Histogram-350x208.png 350w\" sizes=\"auto, (max-width: 505px) 100vw, 505px\" \/><figcaption id=\"caption-attachment-1760\" class=\"wp-caption-text\"><strong>Fig. 2a. Team Unity with\u00a0 s = 0<\/strong><\/figcaption><\/figure>\n<p>In Figure 2a\u00a0 Team Unity has a standard deviation of 0 since all ages are the same. None of the ages spread out from the mean of 21.<\/p>\n<p>&nbsp;<\/p>\n<figure id=\"attachment_1760\" aria-describedby=\"caption-attachment-1760\" style=\"width: 486px\" class=\"wp-caption aligncenter\"><img loading=\"lazy\" decoding=\"async\" class=\"wp-image-1758\" style=\"font-size: 14pt;\" src=\"https:\/\/opentextbc.ca\/oerdiscipline\/wp-content\/uploads\/sites\/361\/2021\/08\/8.3-Team-Harmony-Histogram.png\" alt=\"\" width=\"486\" height=\"293\" srcset=\"https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-content\/uploads\/sites\/361\/2021\/08\/8.3-Team-Harmony-Histogram.png 939w, https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-content\/uploads\/sites\/361\/2021\/08\/8.3-Team-Harmony-Histogram-300x181.png 300w, https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-content\/uploads\/sites\/361\/2021\/08\/8.3-Team-Harmony-Histogram-768x462.png 768w, https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-content\/uploads\/sites\/361\/2021\/08\/8.3-Team-Harmony-Histogram-65x39.png 65w, https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-content\/uploads\/sites\/361\/2021\/08\/8.3-Team-Harmony-Histogram-225x135.png 225w, https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-content\/uploads\/sites\/361\/2021\/08\/8.3-Team-Harmony-Histogram-350x211.png 350w\" sizes=\"auto, (max-width: 486px) 100vw, 486px\" \/><figcaption id=\"caption-attachment-1760\" class=\"wp-caption-text\"><strong>Fig. 2b. Team Harmony with\u00a0 s = 0.82<\/strong><\/figcaption><\/figure>\n<p>In Figure 2b Team Harmony has a standard deviation of 0.82 years. The histogram illustrates that the ages are closely clustered around the mean of 21.<\/p>\n<p>&nbsp;<\/p>\n<figure id=\"attachment_1760\" aria-describedby=\"caption-attachment-1760\" style=\"width: 490px\" class=\"wp-caption aligncenter\"><img loading=\"lazy\" decoding=\"async\" class=\"wp-image-1759\" src=\"https:\/\/opentextbc.ca\/oerdiscipline\/wp-content\/uploads\/sites\/361\/2021\/08\/8.3-Team-Mix-Histogram.png\" alt=\"\" width=\"490\" height=\"294\" srcset=\"https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-content\/uploads\/sites\/361\/2021\/08\/8.3-Team-Mix-Histogram.png 942w, https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-content\/uploads\/sites\/361\/2021\/08\/8.3-Team-Mix-Histogram-300x180.png 300w, https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-content\/uploads\/sites\/361\/2021\/08\/8.3-Team-Mix-Histogram-768x461.png 768w, https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-content\/uploads\/sites\/361\/2021\/08\/8.3-Team-Mix-Histogram-65x39.png 65w, https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-content\/uploads\/sites\/361\/2021\/08\/8.3-Team-Mix-Histogram-225x135.png 225w, https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-content\/uploads\/sites\/361\/2021\/08\/8.3-Team-Mix-Histogram-350x210.png 350w\" sizes=\"auto, (max-width: 490px) 100vw, 490px\" \/><figcaption id=\"caption-attachment-1760\" class=\"wp-caption-text\"><strong>Fig. 2c Team Mix\u00a0 with\u00a0 s = 1.41<\/strong><\/figcaption><\/figure>\n<p>In Figure 2c Team Mix has a standard deviation of 1.41.\u00a0 The histogram illustrates that the data (age) spread is greater than for Team Unity and Team Harmony.<\/p>\n<p>&nbsp;<\/p>\n<figure id=\"attachment_1760\" aria-describedby=\"caption-attachment-1760\" style=\"width: 422px\" class=\"wp-caption aligncenter\"><img loading=\"lazy\" decoding=\"async\" class=\"wp-image-1760\" src=\"https:\/\/opentextbc.ca\/oerdiscipline\/wp-content\/uploads\/sites\/361\/2021\/08\/8.3-Team-Extend-Histogram.png\" alt=\"\" width=\"422\" height=\"251\" srcset=\"https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-content\/uploads\/sites\/361\/2021\/08\/8.3-Team-Extend-Histogram.png 942w, https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-content\/uploads\/sites\/361\/2021\/08\/8.3-Team-Extend-Histogram-300x178.png 300w, https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-content\/uploads\/sites\/361\/2021\/08\/8.3-Team-Extend-Histogram-768x456.png 768w, https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-content\/uploads\/sites\/361\/2021\/08\/8.3-Team-Extend-Histogram-65x39.png 65w, https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-content\/uploads\/sites\/361\/2021\/08\/8.3-Team-Extend-Histogram-225x134.png 225w, https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-content\/uploads\/sites\/361\/2021\/08\/8.3-Team-Extend-Histogram-350x208.png 350w\" sizes=\"auto, (max-width: 422px) 100vw, 422px\" \/><figcaption id=\"caption-attachment-1760\" class=\"wp-caption-text\"><strong>Fig. 2d. Team Extend\u00a0 \u00a0with\u00a0 s = 3<\/strong><\/figcaption><\/figure>\n<p>In Figure 2d Team Extend\u00a0 has a standard deviation of 3. The histogram clearly illustrates that the ages for this team are the most spread out from the mean of 21.<\/p>\n<div class=\"textbox textbox--examples\">\n<header class=\"textbox__header\">\n<p class=\"textbox__title\">EXAMPLE 5<\/p>\n<\/header>\n<div class=\"textbox__content\">\n<p>Ten participants in a group fitness class were asked to rank the class on a scale from 1 to 5. Determine the mean and standard deviation for the evaluation scores as depicted in the histogram below.<\/p>\n<p><a href=\"https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-content\/uploads\/sites\/794\/2020\/06\/8.2-Example5-4.png\"><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter wp-image-1761\" src=\"https:\/\/opentextbc.ca\/oerdiscipline\/wp-content\/uploads\/sites\/361\/2021\/08\/8.2-Example5-4.png\" alt=\"\" width=\"405\" height=\"238\" srcset=\"https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-content\/uploads\/sites\/361\/2021\/08\/8.2-Example5-4.png 934w, https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-content\/uploads\/sites\/361\/2021\/08\/8.2-Example5-4-300x176.png 300w, https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-content\/uploads\/sites\/361\/2021\/08\/8.2-Example5-4-768x451.png 768w, https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-content\/uploads\/sites\/361\/2021\/08\/8.2-Example5-4-65x38.png 65w, https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-content\/uploads\/sites\/361\/2021\/08\/8.2-Example5-4-225x132.png 225w, https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-content\/uploads\/sites\/361\/2021\/08\/8.2-Example5-4-350x206.png 350w\" sizes=\"auto, (max-width: 405px) 100vw, 405px\" \/><\/a><\/p>\n<p><strong style=\"text-align: initial; word-spacing: normal; font-size: 0.9em;\">Solution<\/strong><\/p>\n<\/div>\n<div class=\"textbox__content\">\n<p><a href=\"https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-content\/uploads\/sites\/794\/2020\/06\/8.2-ex-5-solution.png\"><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter wp-image-1762\" src=\"https:\/\/opentextbc.ca\/oerdiscipline\/wp-content\/uploads\/sites\/361\/2021\/08\/8.2-ex-5-solution-1024x1019.png\" alt=\"\" width=\"546\" height=\"543\" srcset=\"https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-content\/uploads\/sites\/361\/2021\/08\/8.2-ex-5-solution-1024x1019.png 1024w, https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-content\/uploads\/sites\/361\/2021\/08\/8.2-ex-5-solution-300x298.png 300w, https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-content\/uploads\/sites\/361\/2021\/08\/8.2-ex-5-solution-150x150.png 150w, https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-content\/uploads\/sites\/361\/2021\/08\/8.2-ex-5-solution-768x764.png 768w, https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-content\/uploads\/sites\/361\/2021\/08\/8.2-ex-5-solution-65x65.png 65w, https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-content\/uploads\/sites\/361\/2021\/08\/8.2-ex-5-solution-225x224.png 225w, https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-content\/uploads\/sites\/361\/2021\/08\/8.2-ex-5-solution-350x348.png 350w, https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-content\/uploads\/sites\/361\/2021\/08\/8.2-ex-5-solution.png 1315w\" sizes=\"auto, (max-width: 546px) 100vw, 546px\" \/><\/a><\/p>\n<p>The mean score is 3 and the standard deviation is 1.49<\/p>\n<\/div>\n<\/div>\n<div class=\"textbox textbox--exercises\">\n<header class=\"textbox__header\">\n<p class=\"textbox__title\">TRY IT 5<\/p>\n<\/header>\n<div class=\"textbox__content\">\n<p>The hourly wages for ten employees at a small coffee shop are illustrated in the histogram. Determine the mean and standard deviation for the employee hourly wages.<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"wp-image-1763 aligncenter\" src=\"https:\/\/opentextbc.ca\/oerdiscipline\/wp-content\/uploads\/sites\/361\/2021\/08\/8.2-Try-It-5-Histogram-1.png\" alt=\"\" width=\"639\" height=\"380\" srcset=\"https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-content\/uploads\/sites\/361\/2021\/08\/8.2-Try-It-5-Histogram-1.png 950w, https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-content\/uploads\/sites\/361\/2021\/08\/8.2-Try-It-5-Histogram-1-300x178.png 300w, https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-content\/uploads\/sites\/361\/2021\/08\/8.2-Try-It-5-Histogram-1-768x457.png 768w, https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-content\/uploads\/sites\/361\/2021\/08\/8.2-Try-It-5-Histogram-1-65x39.png 65w, https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-content\/uploads\/sites\/361\/2021\/08\/8.2-Try-It-5-Histogram-1-225x134.png 225w, https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-content\/uploads\/sites\/361\/2021\/08\/8.2-Try-It-5-Histogram-1-350x208.png 350w\" sizes=\"auto, (max-width: 639px) 100vw, 639px\" \/><\/p>\n<details>\n<summary>Show answer<\/summary>\n<p>The mean is $17.80 and the standard deviation is\u00a0 $1.32<\/p>\n<\/details>\n<\/div>\n<\/div>\n<p>We have seen that standard deviation provides us with a measure of the spread of data values in relation to the mean.\u00a0 We have learned how to calculate the standard deviation for\u00a0 a data set but we\u00a0 have not explored the significance or meaning of these calculated values. In the next section we will explore the significance of the calculated values as we consider the relationship between the standard deviation and the distribution of the data values.<\/p>\n<h1>Key Concepts<\/h1>\n<ul>\n<li>Two measures of dispersion or spread in data values are:\n<ul>\n<li>Range = highest data value &#8211; lowest data value<\/li>\n<li>Standard deviation. \u00a0Unlike the range, its value depends on every data value in the data set. It is found by determining how much each data value differs from the mean.<\/li>\n<\/ul>\n<\/li>\n<\/ul>\n<h1><strong>Glossary<\/strong><\/h1>\n<div class=\"textbox shaded\">\n<p><strong>histogram<\/strong><\/p>\n<p>represents the frequency distribution (number of occurrences) of each data value. The data values are grouped into intervals or &#8220;bins&#8221;.<\/p>\n<p><strong>outlier<\/strong><\/p>\n<p>A data observation that is deemed to be unusual based on the pattern of the other data values.<\/p>\n<p><strong>range<\/strong><\/p>\n<p>indicates the total spread in data values. It is the difference between the highest and lowest data values.<\/p>\n<p><strong>sigma<\/strong><\/p>\n<p>is the uppercase Greek letter written\u00a0 \u2211. It is used to indicate the sum of a series of values.<\/p>\n<p><strong>standard deviation<\/strong><\/p>\n<p>measures the\u00a0 dispersion of the data values around the mean.<\/p>\n<\/div>\n<h1>8.2 Exercise Set<\/h1>\n<ol>\n<li>The daily high temperature (in degrees C) for Calgary AB was recorded over a period of two weeks:<br \/>\n<table style=\"border-collapse: collapse; width: 100%;\">\n<tbody>\n<tr>\n<td style=\"width: 14.2857%;\">28<\/td>\n<td style=\"width: 14.2857%;\">25<\/td>\n<td style=\"width: 14.2857%;\">26<\/td>\n<td style=\"width: 14.2857%;\">27<\/td>\n<td style=\"width: 14.2857%;\">27<\/td>\n<td style=\"width: 14.2857%;\">29<\/td>\n<td style=\"width: 14.2857%;\">30<\/td>\n<\/tr>\n<tr>\n<td style=\"width: 14.2857%;\">30<\/td>\n<td style=\"width: 14.2857%;\">12<\/td>\n<td style=\"width: 14.2857%;\">20<\/td>\n<td style=\"width: 14.2857%;\">22<\/td>\n<td style=\"width: 14.2857%;\">25<\/td>\n<td style=\"width: 14.2857%;\">24<\/td>\n<td style=\"width: 14.2857%;\">25<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<ol type=\"a\">\n<li>Determine the mean, median and the range for these fourteen temperatures.<\/li>\n<li>Which of the temperatures appears to be an outlier?<\/li>\n<li>Remove the outlier and recalculate the mean, median and the range for the thirteen remaining values.<\/li>\n<li>Comparing the results for the fourteen temperature values versus thirteen temperature values, which of the three measures was most impacted by the outlier?<\/li>\n<\/ol>\n<\/li>\n<li>The population of Cache Creek, B.C. for the years 2011 to 2019 is provided in the table below (<em>Demographic Analysis Section, BC Stats)<\/em><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter wp-image-1764 size-large\" src=\"https:\/\/opentextbc.ca\/oerdiscipline\/wp-content\/uploads\/sites\/361\/2021\/08\/8.2-Ex-Set1-Table-1024x118.png\" alt=\"\" width=\"1024\" height=\"118\" srcset=\"https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-content\/uploads\/sites\/361\/2021\/08\/8.2-Ex-Set1-Table-1024x118.png 1024w, https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-content\/uploads\/sites\/361\/2021\/08\/8.2-Ex-Set1-Table-300x34.png 300w, https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-content\/uploads\/sites\/361\/2021\/08\/8.2-Ex-Set1-Table-768x88.png 768w, https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-content\/uploads\/sites\/361\/2021\/08\/8.2-Ex-Set1-Table-65x7.png 65w, https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-content\/uploads\/sites\/361\/2021\/08\/8.2-Ex-Set1-Table-225x26.png 225w, https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-content\/uploads\/sites\/361\/2021\/08\/8.2-Ex-Set1-Table-350x40.png 350w, https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-content\/uploads\/sites\/361\/2021\/08\/8.2-Ex-Set1-Table.png 1159w\" sizes=\"auto, (max-width: 1024px) 100vw, 1024px\" \/>\n<ol type=\"a\">\n<li>Determine the mean and median population.<\/li>\n<li>Determine the range.<\/li>\n<li>Determine the standard deviation.<\/li>\n<\/ol>\n<\/li>\n<li>Set A:\u00a0 6\u00a0 7\u00a0 7\u00a0 10\u00a0 10\u00a0 11Set B:\u00a0 10\u00a0 11\u00a0 12\u00a0 13\u00a0 14\u00a0 15Set C:\u00a0\u00a0 10\u00a0 12\u00a0 16\u00a0 11\u00a0 12\u00a0 13\n<ol type=\"a\">\n<li>For each of the three data samples below, calculate the range, mean, and standard deviation.<\/li>\n<li>Based on these calculations, determine any similarities that exist between any \u00a0of the data samples.<\/li>\n<li>Which of the three measures is different for every set?<\/li>\n<\/ol>\n<ol type=\"a\">\n<li>Given the two data samples <strong>Set A<\/strong> and <strong>Set B<\/strong> below, which appears to have the greatest spread in values?\n<ul>\n<li><strong>Set A<\/strong>:\u00a0 69\u00a0\u00a0 86\u00a0\u00a0 74\u00a0\u00a0 60\u00a0\u00a0 67\u00a0\u00a0 65<\/li>\n<li><strong>Set B<\/strong>:\u00a0 50 \u00a0\u00a0\u00a051\u00a0\u00a0 86\u00a0\u00a0 50\u00a0\u00a0 52\u00a0\u00a0 51<\/li>\n<\/ul>\n<\/li>\n<li>For each sample calculate the mean, range and standard deviation. If necessary round final answers to 2 decimal places.<\/li>\n<li>Based on these calculations which sample has the larger standard deviation?<\/li>\n<li>If the 86 in each data set is changed to a 56 which set would you predict would have the greatest <strong>change<\/strong> in standard deviation? Recalculate the standard deviations with a value of 56 instead of 86. Is your prediction correct?\n<ul>\n<li><strong>Set A<\/strong>:\u00a0 69\u00a0 \u00a0<span style=\"color: #0000ff;\">56<\/span>\u00a0\u00a0 74\u00a0\u00a0 60\u00a0\u00a0 67\u00a0\u00a0 65<\/li>\n<li><strong>Set B<\/strong>:\u00a0 50 \u00a0\u00a0\u00a051\u00a0 \u00a0<span style=\"color: #0000ff;\">56<\/span>\u00a0\u00a0 50\u00a0\u00a0 52\u00a0\u00a0 51<\/li>\n<\/ul>\n<\/li>\n<\/ol>\n<\/li>\n<li>The standard deviation for a sample is calculated to be 0. What can you conclude about the data values?<\/li>\n<li>The maximum hourly wage (in dollars\/hour) for pipefitters and carpenters in the ten Canadian provinces is listed below.(Source: <a href=\"https:\/\/www.jobbank.gc.ca\/wagereport\/occupation\/21460\">Wages for Steamfitters, pipefitters and sprinkler system installers from the Canadian Job Bank<\/a>)\n<ul>\n<li><strong>Pipefitter<\/strong>:\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0 48\u00a0\u00a0 47\u00a0\u00a0 45\u00a0\u00a0 45\u00a0\u00a0 40\u00a0\u00a0 48\u00a0\u00a0 42\u00a0\u00a0 50\u00a0\u00a0 46\u00a0\u00a0 43<\/li>\n<li><strong>Carpenter<\/strong>:\u00a0\u00a0\u00a0\u00a0\u00a0 36\u00a0\u00a0 33\u00a0\u00a0 25\u00a0\u00a0\u00a0 28\u00a0\u00a0 39\u00a0\u00a0 36\u00a0\u00a0 33\u00a0\u00a0 35\u00a0\u00a0 39\u00a0\u00a0 35<\/li>\n<\/ul>\n<ol type=\"a\">\n<li>Without calculating the average, which <strong>occupation<\/strong> appears to have the higher average maximum hourly wage?<\/li>\n<li>Calculate the average (mean) maximum hourly wage (to the nearest cent) for each occupation.<\/li>\n<li>Determine the range in maximum hourly wages for each occupation. Which occupation has a greater range in hourly wages?<\/li>\n<li>Calculate the standard deviation (to the nearest cent) for each occupation. Which occupation has a maximum hourly wage that is more spread out?<\/li>\n<\/ol>\n<\/li>\n<li>Revisit question #1. The daily high temperature (in degrees C) for Calgary AB was recorded over a period of two weeks:<br \/>\n<table style=\"border-collapse: collapse; width: 100%;\">\n<tbody>\n<tr>\n<td style=\"width: 14.2857%;\">28<\/td>\n<td style=\"width: 14.2857%;\">25<\/td>\n<td style=\"width: 14.2857%;\">26<\/td>\n<td style=\"width: 14.2857%;\">27<\/td>\n<td style=\"width: 14.2857%;\">27<\/td>\n<td style=\"width: 14.2857%;\">29<\/td>\n<td style=\"width: 14.2857%;\">30<\/td>\n<\/tr>\n<tr>\n<td style=\"width: 14.2857%;\">30<\/td>\n<td style=\"width: 14.2857%;\">12<\/td>\n<td style=\"width: 14.2857%;\">20<\/td>\n<td style=\"width: 14.2857%;\">22<\/td>\n<td style=\"width: 14.2857%;\">25<\/td>\n<td style=\"width: 14.2857%;\">24<\/td>\n<td style=\"width: 14.2857%;\">25<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<ol type=\"a\">\n<li>Calculate the standard deviation for the fourteen temperature values.<\/li>\n<li>Remove the outlier and recalculate the standard deviation for the thirteen temperature values. How has the standard deviation changed with the removal of the outlier?<\/li>\n<\/ol>\n<\/li>\n<li>Two histograms are provided below:<img loading=\"lazy\" decoding=\"async\" class=\"aligncenter wp-image-1765\" src=\"https:\/\/opentextbc.ca\/oerdiscipline\/wp-content\/uploads\/sites\/361\/2021\/08\/8.2-Ex-Set6-Histograms-1024x441.png\" alt=\"\" width=\"497\" height=\"214\" srcset=\"https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-content\/uploads\/sites\/361\/2021\/08\/8.2-Ex-Set6-Histograms-1024x441.png 1024w, https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-content\/uploads\/sites\/361\/2021\/08\/8.2-Ex-Set6-Histograms-300x129.png 300w, https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-content\/uploads\/sites\/361\/2021\/08\/8.2-Ex-Set6-Histograms-768x331.png 768w, https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-content\/uploads\/sites\/361\/2021\/08\/8.2-Ex-Set6-Histograms-65x28.png 65w, https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-content\/uploads\/sites\/361\/2021\/08\/8.2-Ex-Set6-Histograms-225x97.png 225w, https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-content\/uploads\/sites\/361\/2021\/08\/8.2-Ex-Set6-Histograms-350x151.png 350w, https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-content\/uploads\/sites\/361\/2021\/08\/8.2-Ex-Set6-Histograms.png 1158w\" sizes=\"auto, (max-width: 497px) 100vw, 497px\" \/>\n<ol type=\"a\">\n<li><span style=\"font-size: 14pt;\">For each histogram determine the range, mean, and standard deviation.<\/span><\/li>\n<li>Considering only the means and ranges, what can you conclude about the spread or dispersion of the data values for each set?<\/li>\n<li>Considering only the means and standard deviations, what can you conclude about the spread or dispersion of the data values for each set?<\/li>\n<\/ol>\n<\/li>\n<\/ol>\n<h1>Answers<\/h1>\n<ol>\n<li style=\"list-style-type: none;\">\n<ol type=\"a\">\n<li>mean 25; median 25.5, range 18<\/li>\n<li>outlier is 12<\/li>\n<li>mean 26; median 26, range 10<\/li>\n<li>The range was most impacted by the removal of the outlier.<\/li>\n<\/ol>\n<ol type=\"a\">\n<li>mean is 1022; median is\u00a0 1021<\/li>\n<li>58<\/li>\n<li>standard deviation = 21.40<\/li>\n<\/ol>\n<ol type=\"a\">\n<li>Set A:\u00a0 range is 5;\u00a0 mean is 8.5;\u00a0 standard deviation is 2.07Set B:\u00a0 range is 5;\u00a0\u00a0 mean is 12.5;\u00a0 standard deviation is 1.87Set C:\u00a0 range is 6;\u00a0 mean is 12.33;\u00a0 standard deviation is 2.07<\/li>\n<li>Set A and Set B\u00a0 have identical ranges but different means and standard deviations.Set A and Set C\u00a0 have the same standard deviation but different ranges.<\/li>\n<li>Each set has a different mean.<\/li>\n<\/ol>\n<ol type=\"a\">\n<li>Answers may vary. Set A scores appear to be more spread out; aside from one score of 86, the scores in Set B are less spread out.<\/li>\n<li>Set A: range is 26; mean is 70.17; standard deviation is 9.02Set B:\u00a0 range is 36;\u00a0 mean is 56.67;\u00a0 standard deviation is 14.39<\/li>\n<li>Set B has a larger standard deviation.<\/li>\n<li>Predictions will vary.\u00a0 For Set A the new standard deviation is 6.43 (compared to 9.02);\u00a0 Set B has a new standard deviation of 2.25 (compared to 14.39) so set B had the greatest change in standard deviation.<\/li>\n<\/ol>\n<\/li>\n<li>All data values are equal.\n<ol type=\"a\">\n<li><span style=\"font-size: 14pt;\">pipefitter<\/span><\/li>\n<li>pipefitter \u2018s top hourly wage on average is $45.40\/hour; carpenter\u2019s top hourly wage on average is $33.90\/hour<\/li>\n<li>pipefitter range of $10\/hour for the 10 provincescarpenter range of $14\/hour for the 10 provincesThe carpenter has a greater range in wages<\/li>\n<li>pipefitter has a standard deviation of $3.06 and the carpenter has a standard deviation of $4.46. The carpenter has a higher standard deviation and therefore the carpenter\u2019s hourly wage is more spread out.<\/li>\n<\/ol>\n<ol type=\"a\">\n<li>Standard deviation is 4.71<\/li>\n<li>Standard deviation is 2.97; With the removal of the outlier this value has become smaller<\/li>\n<\/ol>\n<ol type=\"a\">\n<li>Histogram A: range is 40, mean is 75, standard deviation is 11.88Histogram B:\u00a0 range is 40, mean is 75, standard deviation is 15.34<\/li>\n<li>Considering only the means and ranges, these are both identical so it would appear that the data values are equally dispersed.<\/li>\n<li>Histogram B has a greater dispersion of data as indicated by the larger standard deviation.<\/li>\n<\/ol>\n<\/li>\n<\/ol>\n<div class=\"glossary\"><span class=\"screen-reader-text\" id=\"definition\">definition<\/span><template id=\"term_1766_1845\"><div class=\"glossary__definition\" role=\"dialog\" data-id=\"term_1766_1845\"><div tabindex=\"-1\"><\/div><button><span aria-hidden=\"true\">&times;<\/span><span class=\"screen-reader-text\">Close definition<\/span><\/button><\/div><\/template><template id=\"term_1766_1844\"><div class=\"glossary__definition\" role=\"dialog\" data-id=\"term_1766_1844\"><div tabindex=\"-1\"><\/div><button><span aria-hidden=\"true\">&times;<\/span><span class=\"screen-reader-text\">Close definition<\/span><\/button><\/div><\/template><template id=\"term_1766_1846\"><div class=\"glossary__definition\" role=\"dialog\" data-id=\"term_1766_1846\"><div tabindex=\"-1\"><\/div><button><span aria-hidden=\"true\">&times;<\/span><span class=\"screen-reader-text\">Close definition<\/span><\/button><\/div><\/template><template id=\"term_1766_1847\"><div class=\"glossary__definition\" role=\"dialog\" data-id=\"term_1766_1847\"><div tabindex=\"-1\"><\/div><button><span aria-hidden=\"true\">&times;<\/span><span class=\"screen-reader-text\">Close definition<\/span><\/button><\/div><\/template><template id=\"term_1766_1848\"><div class=\"glossary__definition\" role=\"dialog\" data-id=\"term_1766_1848\"><div tabindex=\"-1\"><\/div><button><span aria-hidden=\"true\">&times;<\/span><span class=\"screen-reader-text\">Close definition<\/span><\/button><\/div><\/template><template id=\"term_1766_1850\"><div class=\"glossary__definition\" role=\"dialog\" data-id=\"term_1766_1850\"><div tabindex=\"-1\"><\/div><button><span aria-hidden=\"true\">&times;<\/span><span class=\"screen-reader-text\">Close definition<\/span><\/button><\/div><\/template><template id=\"term_1766_1851\"><div class=\"glossary__definition\" role=\"dialog\" data-id=\"term_1766_1851\"><div tabindex=\"-1\"><\/div><button><span aria-hidden=\"true\">&times;<\/span><span class=\"screen-reader-text\">Close definition<\/span><\/button><\/div><\/template><\/div>","protected":false},"author":125,"menu_order":2,"template":"","meta":{"pb_show_title":"on","pb_short_title":"","pb_subtitle":"","pb_authors":[],"pb_section_license":""},"chapter-type":[],"contributor":[],"license":[],"class_list":["post-1766","chapter","type-chapter","status-publish","hentry"],"part":1811,"_links":{"self":[{"href":"https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-json\/pressbooks\/v2\/chapters\/1766","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-json\/pressbooks\/v2\/chapters"}],"about":[{"href":"https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-json\/wp\/v2\/types\/chapter"}],"author":[{"embeddable":true,"href":"https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-json\/wp\/v2\/users\/125"}],"version-history":[{"count":2,"href":"https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-json\/pressbooks\/v2\/chapters\/1766\/revisions"}],"predecessor-version":[{"id":1861,"href":"https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-json\/pressbooks\/v2\/chapters\/1766\/revisions\/1861"}],"part":[{"href":"https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-json\/pressbooks\/v2\/parts\/1811"}],"metadata":[{"href":"https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-json\/pressbooks\/v2\/chapters\/1766\/metadata\/"}],"wp:attachment":[{"href":"https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-json\/wp\/v2\/media?parent=1766"}],"wp:term":[{"taxonomy":"chapter-type","embeddable":true,"href":"https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-json\/pressbooks\/v2\/chapter-type?post=1766"},{"taxonomy":"contributor","embeddable":true,"href":"https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-json\/wp\/v2\/contributor?post=1766"},{"taxonomy":"license","embeddable":true,"href":"https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-json\/wp\/v2\/license?post=1766"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}