F Distribution and OneWay ANOVA
66 Facts About the F Distribution
Here are some facts about the F distribution.
 The curve is not symmetrical but skewed to the right.
 There is a different curve for each set of degrees of freedom.
 The F statistic is greater than or equal to zero.
 As the degrees of freedom for the numerator and for the denominator get larger, the curve approximates the normal as can be seen in the two figures below. Figure (b) with more degrees of freedom is more closely approaching the normal distribution, but remember that the F cannot ever be less than zero so the distribution does not have a tail that goes to infinity on the left as the normal distribution does.
 Other uses for the F distribution include comparing two variances and twoway Analysis of Variance. TwoWay Analysis is beyond the scope of this chapter.
References
Data from a fourth grade classroom in 1994 in a private K – 12 school in San Jose, CA.
Hand, D.J., F. Daly, A.D. Lunn, K.J. McConway, and E. Ostrowski. A Handbook of Small Datasets: Data for Fruitfly Fecundity. London: Chapman & Hall, 1994.
Hand, D.J., F. Daly, A.D. Lunn, K.J. McConway, and E. Ostrowski. A Handbook of Small Datasets. London: Chapman & Hall, 1994, pg. 50.
Hand, D.J., F. Daly, A.D. Lunn, K.J. McConway, and E. Ostrowski. A Handbook of Small Datasets. London: Chapman & Hall, 1994, pg. 118.
“MLB Standings – 2012.” Available online at http://espn.go.com/mlb/standings/_/year/2012.
Mackowiak, P. A., Wasserman, S. S., and Levine, M. M. (1992), “A Critical Appraisal of 98.6 Degrees F, the Upper Limit of the Normal Body Temperature, and Other Legacies of Carl Reinhold August Wunderlich,” Journal of the American Medical Association, 268, 15781580.
Chapter Review
The graph of the F distribution is always positive and skewed right, though the shape can be mounded or exponential depending on the combination of numerator and denominator degrees of freedom. The F statistic is the ratio of a measure of the variation in the group means to a similar measure of the variation within the groups. If the null hypothesis is correct, then the numerator should be small compared to the denominator. A small F statistic will result, and the area under the F curve to the right will be large, representing a large pvalue. When the null hypothesis of equal group means is incorrect, then the numerator should be large compared to the denominator, giving a large F statistic and a small area (small pvalue) to the right of the statistic under the F curve.
When the data have unequal group sizes (unbalanced data), then techniques from (Figure) need to be used for hand calculations. In the case of balanced data (the groups are the same size) however, simplified calculations based on group means and variances may be used. In practice, of course, software is usually employed in the analysis. As in any analysis, graphs of various sorts should be used in conjunction with numerical techniques. Always look at your data!
An F statistic can have what values?
What happens to the curves as the degrees of freedom for the numerator and the denominator get larger?
The curves approximate the normal distribution.
Use the following information to answer the next seven exercise. Four basketball teams took a random sample of players regarding how high each player can jump (in inches). The results are shown in (Figure).
Team 1  Team 2  Team 3  Team 4  Team 5 

36  32  48  38  41 
42  35  50  44  39 
51  38  39  46  40 
What is the df(num)?
What is the df(denom)?
ten
What are the Sum of Squares and Mean Squares Factors?
What are the Sum of Squares and Mean Squares Errors?
SS = 237.33; MS = 23.73
What is the F statistic?
What is the pvalue?
0.1614
At the 5% significance level, is there a difference in the mean jump heights among the teams?
Use the following information to answer the next seven exercises. A video game developer is testing a new game on three different groups. Each group represents a different target market for the game. The developer collects scores from a random sample from each group. The results are shown in (Figure)
Group A  Group B  Group C 

101  151  101 
108  149  109 
98  160  198 
107  112  186 
111  126  160 
What is the df(num)?
two
What is the df(denom)?
What are the SS_{between} and MS_{between}?
SS = 5,700.4;
MS = 2,850.2
What are the SS_{within} and MS_{within}?
What is the F Statistic?
3.6101
What is the pvalue?
At the 10% significance level, are the scores among the different groups different?
Yes, there is enough evidence to show that the scores among the groups are statistically significant at the 10% level.
Use the following information to answer the next three exercises. Suppose a group is interested in determining whether teenagers obtain their drivers licenses at approximately the same average age across the country. Suppose that the following data are randomly collected from five teenagers in each region of the country. The numbers represent the age at which teenagers obtained their drivers licenses.
Northeast  South  West  Central  East  

16.3  16.9  16.4  16.2  17.1  
16.1  16.5  16.5  16.6  17.2  
16.4  16.4  16.6  16.5  16.6  
16.5  16.2  16.1  16.4  16.8  
________  ________  ________  ________  ________  
________  ________  ________  ________  ________ 
Enter the data into your calculator or computer.
pvalue = ______
State the decisions and conclusions (in complete sentences) for the following preconceived levels of α.
α = 0.05
a. Decision: ____________________________
b. Conclusion: ____________________________
α = 0.01
a. Decision: ____________________________
b. Conclusion: ____________________________
Homework
Three students, Linda, Tuan, and Javier, are given five laboratory rats each for a nutritional experiment. Each rat’s weight is recorded in grams. Linda feeds her rats Formula A, Tuan feeds his rats Formula B, and Javier feeds his rats Formula C. At the end of a specified time period, each rat is weighed again, and the net gain in grams is recorded. Using a significance level of 10%, test the hypothesis that the three formulas produce the same mean weight gain.
Linda’s rats  Tuan’s rats  Javier’s rats 

43.5  47.0  51.2 
39.4  40.5  40.9 
41.3  38.9  37.9 
46.0  46.3  45.0 
38.2  44.2  48.6 
 H_{0}: µ_{L} = µ_{T} = µ_{J}
 H_{a}: at least any two of the means are different
 df(num) = 2; df(denom) = 12
 F distribution
 0.67
 0.5305
 Check student’s solution.
 Decision:Cannot reject null hypothesis; Conclusion: There is insufficient evidence to conclude that the means are different.
A grassroots group opposed to a proposed increase in the gas tax claimed that the increase would hurt workingclass people the most, since they commute the farthest to work. Suppose that the group randomly surveyed 24 individuals and asked them their daily oneway commuting mileage. The results are in (Figure). Using a 5% significance level, test the hypothesis that the three mean commuting mileages are the same.
Workingclass  Professional (middle incomes)  Professional (wealthy) 

17.8  16.5  8.5 
26.7  17.4  6.3 
49.4  22.0  4.6 
9.4  7.4  12.6 
65.4  9.4  11.0 
47.1  2.1  28.6 
19.5  6.4  15.4 
51.2  13.9  9.3 
Use the following information to answer the next two exercises.(Figure) lists the number of pages in four different types of magazines.
Home decorating  News  Health  Computer 

172  87  82  104 
286  94  153  136 
163  123  87  98 
205  106  103  207 
197  101  96  146 
Using a significance level of 5%, test the hypothesis that the four magazine types have the same mean length.
Eliminate one magazine type that you now feel has a mean length different from the others. Redo the hypothesis test, testing that the remaining three means are statistically the same. Use a new solution sheet. Based on this test, are the mean lengths for the remaining three magazines statistically the same?
 H_{a}: µ_{c} = µ_{n} = µ_{h}
 At least any two of the magazines have different mean lengths.
 df(num) = 2, df(denom) = 12
 F distribtuion
 F = 15.28
 pvalue = 0.001
 Check student’s solution.

 Alpha: 0.05
 Decision: Cannot accept the null hypothesis.
 Reason for decision: pvalue < alpha
 Conclusion: There is sufficient evidence to conclude that the mean lengths of the magazines are different.
A researcher wants to know if the mean times (in minutes) that people watch their favorite news station are the same. Suppose that (Figure) shows the results of a study.
CNN  FOX  Local 

45  15  72 
12  43  37 
18  68  56 
38  50  60 
23  31  51 
35  22 
Assume that all distributions are normal, the four population standard deviations are approximately the same, and the data were collected independently and randomly. Use a level of significance of 0.05.
Are the means for the final exams the same for all statistics class delivery types? (Figure) shows the scores on final exams from several randomly selected classes that used the different delivery types.
Online  Hybrid  FacetoFace 

72  83  80 
84  73  78 
77  84  84 
80  81  81 
81  86  
79  
82 
Assume that all distributions are normal, the four population standard deviations are approximately the same, and the data were collected independently and randomly. Use a level of significance of 0.05.
 H_{0}: μ_{o} = μ_{h} = μ_{f}
 At least two of the means are different.
 df(n) = 2, df(d) = 13
 F_{2,13}
 0.64
 0.5437
 Check student’s solution.

 Alpha: 0.05
 Decision: Cannot reject the null hypothesis.
 Reason for decision: pvalue > alpha
 Conclusion: The mean scores of different class delivery are not different.
Are the mean number of times a month a person eats out the same for whites, blacks, Hispanics and Asians? Suppose that (Figure) shows the results of a study.
White  Black  Hispanic  Asian 

6  4  7  8 
8  1  3  3 
2  5  5  5 
4  2  4  1 
6  6  7 
Assume that all distributions are normal, the four population standard deviations are approximately the same, and the data were collected independently and randomly. Use a level of significance of 0.05.
Are the mean numbers of daily visitors to a ski resort the same for the three types of snow conditions? Suppose that (Figure) shows the results of a study.
Powder  Machine Made  Hard Packed 

1,210  2,107  2,846 
1,080  1,149  1,638 
1,537  862  2,019 
941  1,870  1,178 
1,528  2,233  
1,382 
Assume that all distributions are normal, the four population standard deviations are approximately the same, and the data were collected independently and randomly. Use a level of significance of 0.05.
 H_{0}: μ_{p} = μ_{m} = μ_{h}
 At least any two of the means are different.
 df(n) = 2, df(d) = 12
 F_{2,12}
 3.13
 0.0807
 Check student’s solution.

 Alpha: 0.05
 Decision: Cannot reject the null hypothesis.
 Reason for decision: pvalue > alpha
 Conclusion: There is not sufficient evidence to conclude that the mean numbers of daily visitors are different.
Sanjay made identical paper airplanes out of three different weights of paper, light, medium and heavy. He made four airplanes from each of the weights, and launched them himself across the room. Here are the distances (in meters) that his planes flew.
Paper type/Trial  Trial 1  Trial 2  Trial 3  Trial 4 

Heavy  5.1 meters  3.1 meters  4.7 meters  5.3 meters 
Medium  4 meters  3.5 meters  4.5 meters  6.1 meters 
Light  3.1 meters  3.3 meters  2.1 meters  1.9 meters 
 Take a look at the data in the graph. Look at the spread of data for each group (light, medium, heavy). Does it seem reasonable to assume a normal distribution with the same variance for each group? Yes or No.
 Why is this a balanced design?
 Calculate the sample mean and sample standard deviation for each group.
 Does the weight of the paper have an effect on how far the plane will travel? Use a 1% level of significance. Complete the test using the method shown in the bean plant example in (Figure).
 variance of the group means __________
 MS_{between}= ___________
 mean of the three sample variances ___________
 MS_{within} = _____________
 F statistic = ____________
 df(num) = __________, df(denom) = ___________
 number of groups _______
 number of observations _______
 pvalue = __________ (P(F > _______) = __________)
 Graph the pvalue.
 decision: _______________________
 conclusion: _______________________________________________________________
DDT is a pesticide that has been banned from use in the United States and most other areas of the world. It is quite effective, but persisted in the environment and over time became seen as harmful to higherlevel organisms. Famously, egg shells of eagles and other raptors were believed to be thinner and prone to breakage in the nest because of ingestion of DDT in the food chain of the birds.
An experiment was conducted on the number of eggs (fecundity) laid by female fruit flies. There are three groups of flies. One group was bred to be resistant to DDT (the RS group). Another was bred to be especially susceptible to DDT (SS). Finally there was a control line of nonselected or typical fruitflies (NS). Here are the data:
RS  SS  NS  RS  SS  NS 

12.8  38.4  35.4  22.4  23.1  22.6 
21.6  32.9  27.4  27.5  29.4  40.4 
14.8  48.5  19.3  20.3  16  34.4 
23.1  20.9  41.8  38.7  20.1  30.4 
34.6  11.6  20.3  26.4  23.3  14.9 
19.7  22.3  37.6  23.7  22.9  51.8 
22.6  30.2  36.9  26.1  22.5  33.8 
29.6  33.4  37.3  29.5  15.1  37.9 
16.4  26.7  28.2  38.6  31  29.5 
20.3  39  23.4  44.4  16.9  42.4 
29.3  12.8  33.7  23.2  16.1  36.6 
14.9  14.6  29.2  23.6  10.8  47.4 
27.3  12.2  41.7 
The values are the average number of eggs laid daily for each of 75 flies (25 in each group) over the first 14 days of their lives. Using a 1% level of significance, are the mean rates of egg selection for the three strains of fruitfly different? If so, in what way? Specifically, the researchers were interested in whether or not the selectively bred strains were different from the nonselected line, and whether the two selected lines were different from each other.
Here is a chart of the three groups:
The data appear normally distributed from the chart and of similar spread. There do not appear to be any serious outliers, so we may proceed with our ANOVA calculations, to see if we have good evidence of a difference between the three groups.
;
some
Define μ_{1}, μ_{2}, μ_{3}, as the population mean number of eggs laid by the three groups of fruit flies.
F statistic = 8.6657;
pvalue = 0.0004
Decision: Since the pvalue is less than the level of significance of 0.01, we reject the null hypothesis.
Conclusion: We have good evidence that the average number of eggs laid during the first 14 days of life for these three strains of fruitflies are different.
Interestingly, if you perform a two sample ttest to compare the RS and NS groups they are significantly different (p = 0.0013). Similarly, SS and NS are significantly different (p = 0.0006). However, the two selected groups, RS and SS are not significantly different (p = 0.5176). Thus we appear to have good evidence that selection either for resistance or for susceptibility involves a reduced rate of egg production (for these specific strains) as compared to flies that were not selected for resistance or susceptibility to DDT. Here, genetic selection has apparently involved a loss of fecundity.
The data shown is the recorded body temperatures of 130 subjects as estimated from available histograms.
Traditionally we are taught that the normal human body temperature is 98.6 F. This is not quite correct for everyone. Are the mean temperatures among the four groups different?
Calculate 95% confidence intervals for the mean body temperature in each group and comment about the confidence intervals.
FL  FH  ML  MH  FL  FH  ML  MH 

96.4  96.8  96.3  96.9  98.4  98.6  98.1  98.6 
96.7  97.7  96.7  97  98.7  98.6  98.1  98.6 
97.2  97.8  97.1  97.1  98.7  98.6  98.2  98.7 
97.2  97.9  97.2  97.1  98.7  98.7  98.2  98.8 
97.4  98  97.3  97.4  98.7  98.7  98.2  98.8 
97.6  98  97.4  97.5  98.8  98.8  98.2  98.8 
97.7  98  97.4  97.6  98.8  98.8  98.3  98.9 
97.8  98  97.4  97.7  98.8  98.8  98.4  99 
97.8  98.1  97.5  97.8  98.8  98.9  98.4  99 
97.9  98.3  97.6  97.9  99.2  99  98.5  99 
97.9  98.3  97.6  98  99.3  99  98.5  99.2 
98  98.3  97.8  98  99.1  98.6  99.5  
98.2  98.4  97.8  98  99.1  98.6  
98.2  98.4  97.8  98.3  99.2  98.7  
98.2  98.4  97.9  98.4  99.4  99.1  
98.2  98.4  98  98.4  99.9  99.3  
98.2  98.5  98  98.6  100  99.4  
98.2  98.6  98  98.6  100.8 