{"id":1708,"date":"2019-08-07T14:06:20","date_gmt":"2019-08-07T14:06:20","guid":{"rendered":"https:\/\/opentextbc.ca\/businesstechnicalmath\/chapter\/considerations\/"},"modified":"2021-08-31T21:22:07","modified_gmt":"2021-08-31T21:22:07","slug":"considerations","status":"publish","type":"chapter","link":"https:\/\/opentextbc.ca\/businesstechnicalmath\/chapter\/considerations\/","title":{"raw":"7.3 Collecting Data","rendered":"7.3 Collecting Data"},"content":{"raw":"<h6><img class=\"aligncenter wp-image-1707 size-full\" src=\"https:\/\/opentextbc.ca\/oerdiscipline\/wp-content\/uploads\/sites\/361\/2019\/08\/7.4-intro-dice.jpg\" alt=\"\" width=\"1000\" height=\"500\"><\/h6>\n<div class=\"textbox textbox--learning-objectives\"><header class=\"textbox__header\">\n<p class=\"textbox__title\">Learning Objectives<\/p>\n\n<\/header>\n<div class=\"textbox__content\">\n\nBy the end of this section it is expected that you will be able to:\n<ul>\n \t<li>State whether data is quantitative or qualitative<\/li>\n \t<li>Describe the random sampling methods: simple random sampling, systematic sampling, cluster sampling and convenience sampling<\/li>\n \t<li>Discuss potential problems that might arise when sampling from a population<\/li>\n<\/ul>\n<\/div>\n<\/div>\n<h1>Populations and Samples<\/h1>\nIn statistics, we generally want to study a population. You can think of a population as a collection of persons, things, or objects under study. It is often not feasible or possible to study the entire population. Instead we can select a sample. The idea of sampling is to select a portion (or subset) of the larger population and study that portion (the sample) to gain information about the population. Data are the result of sampling from a population.\nBecause it takes a lot of time and money to examine an entire population, sampling is a very practical technique. If you wished to compute the overall grade point average at your school, it would make sense to select a sample of students who attend the school. The data collected from the sample would be the students' grade point averages. In elections, opinion poll samples of 1,000\u20132,000 people are taken. The opinion poll is supposed to represent the views of the people in the entire country.\n<h1>Types of Data<\/h1>\nMost data can be categorized as <strong>qualitative<\/strong> or <strong>quantitative.<\/strong>\n\n<strong>Qualitative data<\/strong> are the result of categorizing or describing attributes of a population using our senses such as sight or touch. Hair color, blood type, ethnic group, the car model that a person drives, and the street a person lives on are examples of qualitative data. Qualitative data are generally described by words or letters. For instance, hair color might be black, dark brown, light brown, blonde, gray, or red. Blood type might be AB+, O-, or B+.\n\n<strong>Quantitative data<\/strong> are always numbers. Quantitative data are the result of counting or measuring attributes of a population. Amount of money, pulse rate, weight, number of people living in your town, and number of students who take statistics are examples of quantitative data. Researchers often prefer to use quantitative data over qualitative data because it lends itself more easily to mathematical analysis. For example, it does not make sense to find an average hair color or median blood type.\n<div class=\"textbox textbox--examples\"><header class=\"textbox__header\">\n<p class=\"textbox__title\">EXAMPLE 1<\/p>\n\n<\/header>\n<div class=\"textbox__content\">\n\nConsider a high school math class and a sample of five student's backpacks. Determine whether the data is quantitative or qualitative.\n\n1.\u00a0 One data set is the number of books students carry in their backpacks.Two students carry three books, one student carries four books, one student carries two books, and one student carries one book.\n\n2.\u00a0 For the sample of five backpacks you weigh the backpacks and contents. The weights (in kilograms) of their backpacks are 3.2, 5, 4.8, 5.1, 2.3.\n\n3. For the sample of five students you record the colour of the backpacks. The books are red, blue or black.\n\n<strong>Solution:<\/strong>\n<ol>\n \t<li>This is\u00a0 quantitative data.<\/li>\n \t<li>This is quantitative data.<\/li>\n \t<li>This is qualitative data.<\/li>\n<\/ol>\n<\/div>\n<\/div>\n<div class=\"textbox textbox--exercises\"><header class=\"textbox__header\">\n<p class=\"textbox__title\">TRY IT 1<\/p>\n\n<\/header>\n<div class=\"textbox__content\">\n\nDetermine the correct data type (quantitative or qualitative).\n<ol type=\"a\">\n \t<li>\u00a0the number of pairs of shoes you own<\/li>\n \t<li>\u00a0the colour of vehicle you drive<\/li>\n \t<li>the distance it is from your home to the nearest grocery store<\/li>\n \t<li>the number of classes you take per school year.<\/li>\n \t<li>the model of calculator you use<\/li>\n \t<li>weights of sumo wrestlers<\/li>\n \t<li>total number of correct answers on a quiz<\/li>\n \t<li>IQ scores<\/li>\n<\/ol>\n<details><summary>Show answer<\/summary>Items a, c, d, f, g and h are\u00a0 quantitative; items b and e are qualitative.\n\n<\/details><\/div>\n<\/div>\nIt is often possible to assign both qualitative and quantitative measures to one set of data.\n<div class=\"textbox textbox--examples\"><header class=\"textbox__header\">\n<p class=\"textbox__title\">EXAMPLE 2<\/p>\n\n<\/header>\n<div class=\"textbox__content\">\n\nYou go to the supermarket and purchase three cans of soup (350 ml tomato, 400 ml lentil, and 250 ml chicken noodle), four different kinds of vegetables (broccoli, cauliflower, spinach, and carrots), and two containers pf ice cream (pistachio ice cream and vanilla ice cream).\nName the data sets that are qualitative.\n\n<strong>Solution<\/strong>\n\nThe types of soups, vegetables and desserts are qualitative data because they are categorical. They are not measured or counted.\n\n<\/div>\n<\/div>\n<div class=\"textbox textbox--exercises\"><header class=\"textbox__header\">\n<p class=\"textbox__title\">TRY IT 2<\/p>\n\n<\/header>\n<div class=\"textbox__content\">\n\nYou go to the supermarket and purchase three cans of soup (350 ml tomato, 400 ml lentil, and 250 ml chicken noodle),\u00a0 four different kinds of vegetables (broccoli, cauliflower, spinach, and carrots), and two containers pf ice cream (pistachio ice cream and vanilla ice cream).\nName the data sets that are quantitative.\n\n<details><summary>Show answer<\/summary>The three cans of soup, four kinds of vegetables and two ice creams are quantitative data because you count them. The weights of the soups are quantitative because you measure weights as precisely as possible.\n\n<\/details><\/div>\n<\/div>\n<h1>Sampling<\/h1>\n<span style=\"color: #000000;\">Gathering information about an entire population often costs too much or is virtually impossible. Instead, we use a sample of the population. A sample should have the same characteristics as the population it is representing. There are several different methods of random sampling. This section will describe four of the most common methods.\u00a0 In each form of random sampling, each member of a population initially has an equal chance of being selected for the sample.<\/span>\n<h1><span style=\"color: #000000;\">Simple Random Sampling<\/span><\/h1>\n<span style=\"color: #000000;\">The easiest method to describe is called a simple random sample. Any group of 'n' individuals is equally likely to be chosen as any other group of 'n' individuals if the simple random sampling technique is used. In other words, each sample of the same size has an equal chance of being selected. <\/span>\n\n<span style=\"color: #000000;\">For example, suppose Lisa wants to form a four-person study group (herself and three other people) from her pre-calculus class, which has 31 members not including Lisa. To choose a simple random sample of size three from the othe<\/span><span style=\"color: #000000;\">r members of her class, Lisa could put all 31 names in a hat, shake the hat, close her eyes, and pick out three names. An alternative is for Lisa to alphabetically list the last names of the members of her class and number each with a two-digit number\u00a0 01, 02, 03, 04, 05, 06,...31.<span style=\"color: #001000;\">\u00a0\u00a0<\/span><\/span><span style=\"font-size: 14pt;\">Lisa can use a table of random numbers (found in many statistics books) a calculator, or a computer to generate random numbers.<\/span>\n<div class=\"textbox textbox--examples\"><header class=\"textbox__header\">\n<p class=\"textbox__title\">EXAMPLE 3<\/p>\n\n<\/header>\n<div class=\"textbox__content\">\n\nHow can Lisa determine three group mates from a numbered list of 31 students?\n\n<strong>Solution<\/strong>\n\nLisa can generate random numbers from a calculator.\n\nThe calculator generates the first seven random numbers as follows:\u00a0 0.943\u00a0 0.230\u00a0 0.046\u00a0 0.514\u00a0 0.405\u00a0 0.733\u00a0 \u00a00.983\u00a0 \u00a0Lisa reads two-digit groups until she has chosen three class members. Each random number may only contribute one class member.\n\nThe first random number\u00a0 0.943 is read as the numbers 94 and 43. Neither of these\u00a0 corresponds to the students' assigned numbers (01 to 31).\n\nThe random number 0.230 is read as 23\u00a0 and 30. Although both of these numbers corresponds to a student, only the first number, 23, will be used. The first student will be number 23.\n\nThe random number 0.046 is read as 04 and 46\u00a0 which corresponds to student 04. The second student will be student number 4.\n\nThe third student will correspond to the number 14 which is read from the random number 0.514 (since there is no student numbered 51).\n\nThe three names that correspond to the two-digit numbers 23, 04 and 14 will form Lisa's group. If she needed to, Lisa could have generated more random numbers.\n\n<\/div>\n<\/div>\n<div class=\"textbox textbox--exercises\"><header class=\"textbox__header\">\n<p class=\"textbox__title\">TRY IT 3<\/p>\n\n<\/header>\n<div class=\"textbox__content\">\n\nA fitness studio plans to purchase new equipment and wants to conduct a survey of\u00a0 its membership. There are over 700 members and the studio wishes to survey only a portion of this membership. Upon purchasing a membership, every\u00a0 member has been assigned a 3 digit membership number.\u00a0 Decribe how the studio can use the membership numbers to select a <strong>simple random sample<\/strong> of 80 members.\n\n<details><summary>Show answer<\/summary>A random number generator is used to generate a list of three digit numbers. Each random number that is generated will be compared with the membership numbers. If the number has been assignd to a member then that member will be one of the survey group. If the random number has not been assigned then the next random number is considered until 80 members have been selected.\n\n<\/details><\/div>\n<\/div>\n<h1><strong>Systematic Sampling<\/strong><\/h1>\nSystematic sampling is where the first\u00a0 sample member from a larger population is selected according to a random starting point. Additional sample members are then selected based on a fixed interval. The interval is calculated by dividing the population size by the desired sample size. If the population consists of 500 members and the desired sample size is 50, then the interval would be 500\/50 = 10.\u00a0 Every tenth member of the population would be part of the sample.\n<div class=\"textbox textbox--examples\"><header class=\"textbox__header\">\n<p class=\"textbox__title\">EXAMPLE 4<\/p>\n\n<\/header>\n<div class=\"textbox__content\">\n\nA high school counsellor is conducting a survey of\u00a0 the graduating class which consists of 1243 students. Describe how the counsellor can select a systematic sample of 50 students.\n\n<strong>Solution<\/strong>\n\nThe counsellor can interview 50 students. The interval is calculated as 1243 students\/50 = 24.86 which rounds up to 25. This determines the interval increment as 25 so every 25th student will be in the sample.\n\nTo obtain the sample, the counsellor accesses the alphabetical list of graduates and generates a random number. Suppose the number is 03. The counsellor will interview the 3rd student on the list followed by every 25th student on the list: This will yield a sample of student 3, 28, 53, 78, and so on until 50 names have been chosen.\n\n<\/div>\n<\/div>\n<div class=\"textbox textbox--exercises\"><header class=\"textbox__header\">\n<p class=\"textbox__title\">TRY IT 4<\/p>\n\n<\/header>\n<div class=\"textbox__content\">\n\nA fitness studio plans to purchase new equipment and wants to conduct a survey of\u00a0 its membership. There are over 700 members and the studio wishes to survey only a portion of this membership. Upon purchasing a membership, every\u00a0 member has been assigned a 4 digit membership number.\u00a0 Decribe how the studio can use the membership numbers to select a <strong>systematic sample<\/strong> of 80 members.\n\n<details><summary>Show answer<\/summary>Since 80 members are needed for the survey, the total number of members will be divided by 80. Assume there are 724 members, then 724\/80 = 9.05 which rounds to an increment of 9. This determines the increment for the intervals. A list of 3-digit random numbers is generated to determine the first member in the survey group and every 9th member will be included in the survey group.\u00a0 If the first member has a number 546, then every 9th member counting from 546 will be chosen. When the end of the membership list is reached the increments will continue counting from the beginning of the list unil 80 members are selected.\n\n<\/details><\/div>\n<\/div>\n<h1>Cluster Sampling<\/h1>\nTo choose a cluster sample, divide the population into clusters (groups) and then randomly select some of the clusters. Every member from each of the selected clusters will be in the cluster sample. This type of sampling works best in populations that can be grouped into distinct groups. In a 50 floor apartment building, each floor could represent a cluster. In a hockey league, each team could be a cluster.\n<div class=\"textbox textbox--examples\"><header class=\"textbox__header\">\n<p class=\"textbox__title\">EXAMPLE 5<\/p>\n\n<\/header>\n<div class=\"textbox__content\">\n\nA textbook publisher plans to conduct a survey of the faculty at a college campus. There are 23 departments at the college. Describe how the publisher can use the departments to select four cluster samples.\n\n<strong>Solution<\/strong>\n\nLet each department represents one cluster. The publisher numbers the departments from one to twenty-three and randomly\u00a0 selects 4 numbers which determine the four departments. Only these four departments will form the cluster sample and all faculty within the four departments (clusters) will be surveyed.\n\n<\/div>\n<\/div>\n<div class=\"textbox textbox--exercises\"><header class=\"textbox__header\">\n<p class=\"textbox__title\">TRY IT 5<\/p>\n\n<\/header>\n<div class=\"textbox__content\">\n\nA textbook publisher plans to conduct a survey of the students at a college campus. There are 45 program areas ranging from 18 to 40 students in each program.\u00a0 Decribe how the publisher can use the program areas to select a <strong>cluster sample<\/strong> of at least 100 students.\n\n<details><summary>Show answer<\/summary>The publisher numbers the program areas from one to forty-five and generates random numbers. The first random number is used to determine the first program area (cluster). Additional random numbers are assigned to clusters until\u00a0 there are at least 100 students for the survey. Only the students in the selected programs (clusters) will be surveyed.\n\n<\/details><\/div>\n<\/div>\nCluster sampling can reduce the need for resources and may be more efficient. Disadvantages are that it can introduce biases or it may not represent the total population. In example 5, perhaps the textbook publisher is seeking feedback on its textbooks. If one or more of the chosen clusters does not use textbooks then the results may not be reliable.\n<h1>Convenience Sampling<\/h1>\nA type of sampling that is non-random is called convenience sampling. Convenience sampling involves using results that are readily available or convenient.\n<div class=\"textbox textbox--examples\"><header class=\"textbox__header\">\n<p class=\"textbox__title\">EXAMPLE 6<\/p>\n\n<\/header>\n<div class=\"textbox__content\">\n\nA computer software developer seeks to determine which of its new video games are the most popular among females. Describe how the developer can select a convenience sample.\n\n<strong>Solution<\/strong>\n\nThe developer can conduct a marketing study by going to a local electronic gaming store and ask all female shoppers as they enter the store if they will participate in a 3 minute survey on video games.\n\n<\/div>\n<\/div>\n<div class=\"textbox textbox--exercises\"><header class=\"textbox__header\">\n<p class=\"textbox__title\">TRY IT 6<\/p>\n\n<\/header>\n<div class=\"textbox__content\">\n\nA fitness studio plans to purchase new equipment and wants to conduct a survey of\u00a0 its membership. There are over 700 members and the studio wishes to survey 100 of its members.\u00a0 Decribe how the studio can select a <strong>convenience sample<\/strong> of 80 members.\n\n<details><summary>Show answer<\/summary>The studio owner prepares a survey and distributes it to all members who visit the studio over a 3-day period.\n\n<\/details><\/div>\n<\/div>\nThis form of sampling may be appealing due to its convenience but the results can be misleading. This type of surveying may be good in some cases but it can also be highly biased (favor certain outcomes) in others.\n<div class=\"textbox textbox--examples\"><header class=\"textbox__header\">\n<p class=\"textbox__title\">EXAMPLE 7<\/p>\n\n<\/header>\n<div class=\"textbox__content\">\n\nA study is done to determine the average tuition that undergraduate students pay per semester. Each student in the following samples is asked how much tuition he or she paid for the Fall semester. What is the type of sampling in each case?\u00a0(simple random, systematic, cluster, or convenience)\n<ol type=\"a\">\n \t<li>A random number generator is used to select a student from the alphabetically numbered email listing of all undergraduate students in the Fall semester. Starting with that student, every 50th student is chosen until 75 students are included in the sample.<\/li>\n \t<li>A random number generator is used to select 75 student ID numbers.<\/li>\n \t<li>The freshman, sophomore, junior, and senior years are numbered one, two, three, and four, respectively. A random number generator is used to pick two of those years. All students in those two years are in the sample.<\/li>\n \t<li>An administrative assistant is asked to stand in front of the library one day and to ask the first 100 undergraduate students he encounters what they paid for tuition in the Fall semester.<\/li>\n<\/ol>\n<strong>Solution<\/strong>\n<ol type=\"a\">\n \t<li>systematic<\/li>\n \t<li>simple random<\/li>\n \t<li>cluster<\/li>\n \t<li>convenience<\/li>\n<\/ol>\n<\/div>\n<\/div>\n<div class=\"textbox textbox--exercises\"><header class=\"textbox__header\">\n<p class=\"textbox__title\">TRY IT 7<\/p>\n\n<\/header>\n<div class=\"textbox__content\">\n\nDetermine the type of sampling used (simple random, systematic, cluster, or convenience).\n<ol type=\"a\">\n \t<li>A pollster interviews all human resource personnel in five different high tech companies.<\/li>\n \t<li>A medical researcher interviews every third cancer patient from a list of cancer patients at a local hospital.<\/li>\n \t<li>A high school counselor uses a computer to generate 50 random numbers and then picks students whose names correspond to the numbers.<\/li>\n \t<li>A student interviews classmates in his algebra class to determine how many pairs of jeans a student owns, on the average.<\/li>\n<\/ol>\n<details><summary>Show answer<\/summary>\n<ol type=\"a\">\n \t<li>cluster<\/li>\n \t<li>systematic<\/li>\n \t<li>simple random<\/li>\n \t<li>convenience<\/li>\n<\/ol>\n<\/details><\/div>\n<\/div>\n<h1>Potential Survey Issues<\/h1>\nUsers of statistical studies should be aware of the sampling method before accepting the results of the studies. Common problems to be aware of include:\n<ol>\n \t<li>Nonrepresentative samples: A sample must be representative of the population under study. A sample that is not representative of the population is biased. Biased samples that are not representative of the population give results that are inaccurate and not valid. An example of a biased sample would be a survey on violence in sports where only the female students in a coed high school are surveyed.<\/li>\n \t<li>Self-selected samples: Surveys where responses are voluntary, such as call-in surveys, are often unreliable.<\/li>\n \t<li>Sample size issues: Samples that are too small may be unreliable. Larger samples are better, if possible. In some situations, having small samples is unavoidable and can still be used to draw conclusions. Examples would include crash testing of cars or medical testing for rare conditions.<\/li>\n \t<li>Undue influence: collecting data or asking questions in a way that influences the response. An example would be conducting a taste test of two sodas where one is refrigerated and the other is served at room temperature.<\/li>\n \t<li>Non-response or refusal of a subject to participate: The collected responses may no longer be representative of the population. Often, people with strong positive or negative opinions may answer surveys, which can affect the results. As an example, reviewers on Internet travel sites may not be representative of the entire population.<\/li>\n \t<li>Misleading use of data: Be aware of improperly displayed graphs, incomplete data, or lack of context.<\/li>\n<\/ol>\n<h1 style=\"text-align: left;\" data-type=\"title\">Key Concepts<\/h1>\nWhen conducting a survey we can choose from several sampling methods:\n<ul>\n \t<li>Simple random sampling is where a member of the population <span style=\"color: #000000;\">is equally as likely to be chosen as any other member from the population.\u00a0<\/span><\/li>\n \t<li>Systematic sampling is where the first\u00a0 sample member from a larger population is selected according to a random starting point. Additional sample members are then selected based on a fixed interval.<\/li>\n \t<li><span style=\"text-align: initial; font-size: 14pt;\">Cluster sampling is where the population is divided\u00a0 into clusters (groups) and then a specific number of clusters is randomly selected. Every member from each of the selected clusters will be in the cluster sample.\u00a0<\/span><\/li>\n \t<li>\u00a0Convenience sampling is where the selection is made from a part of the population that is easy to access.<\/li>\n<\/ul>\n<h1>Glossary<\/h1>\n<div class=\"textbox shaded\">\n\n<strong>qualitative data<\/strong>\n\nare the result of categorizing or describing attributes of a population using our senses such as sight or touch.\n\n<strong>quantitative data <\/strong>\n\nare the result of counting or measuring a specific attribute of a population.\n\n<\/div>\n<h1>7.3 Exercise Set<\/h1>\n<ol>\n \t<li>\u00a0Shoppers at a farmer's market were surveyed to determine how environmentally and market friendly they were. The survey recorded the <strong>A)<\/strong> type of bag (cloth, plastic, none, wicker, other)\u00a0 <strong>B)<\/strong> the number of bags (0, 1, 2, 3, more than 3)<strong>\u00a0 C)<\/strong> the number of market visits per year\u00a0 <strong>\u00a0D)\u00a0<\/strong> Average amount of money per visit spent at the market\u00a0<strong> E)<\/strong> preferred vendor(s) .\u00a0 Which of A, B, C, D, E are qualitative and which are quantitative?<\/li>\n \t<li>A census yields a wide variety of data. State whether each of the following questions would provide qualitative or quantitative data.\n<ol type=\"a\">\n \t<li>What province do you live in?<\/li>\n \t<li>How many years have you lived at your current address?<\/li>\n \t<li>What type of dwelling do you live in (house, apartment, condo, mobile home, other)?<\/li>\n \t<li>How many people live in your home?<\/li>\n \t<li>How many years languages do you speak?<\/li>\n \t<li>What languages do you speak?<\/li>\n \t<li>What is your occupation?<\/li>\n \t<li>What is your annual salary?<\/li>\n<\/ol>\n<\/li>\n \t<li>Consider a typical classroom in college or university. Name two types of qualitative data and two types of quantitative data that could be collected. e.g. qualitative - score from an entrance exam;\u00a0 quantitative - country of birth<\/li>\n \t<li>A study is done to determine the food outlet preferences for all students living on campus in the fall semester.\u00a0 Each student in the sample will be asked the same set of 10 questions.\u00a0 Four different sampling techniques are described below. What is the type of sampling in each case? (simple random, systematic, cluster, or convenience).\n<ol type=\"a\">\n \t<li>There are 8 different student residences on campus. Two residences are randomly selected and every student living in those two residences is surveyed.<\/li>\n \t<li>The surnames of all students living on campus are arranged alphabetically and numbered from 1 to n (where n is the number of students living on campus).\u00a0 A random number generator is used to determine a number between 1 and 50. This number is matched to a student with the same number.\u00a0 Starting with that student, every 50th student is chosen until the required number of\u00a0 students is chosen for the sample.<\/li>\n \t<li>One of the food outlets is chosen by drawing one outlet name. Over a four hour period one day, four helpers stop all students entering that food outlet and if they live on campus they are administered the survey.<\/li>\n \t<li>A computer is used to generate random numbers that have the same format as the students' ID numbers.\u00a0 Random numbers are generated until 100 random numbers are matched by student number to a student living on campus. These 100 students are contacted and arrangements are made for the interviewer to meet with the student.<\/li>\n<\/ol>\n<\/li>\n \t<li>State one advantage and one disadvantage for each of systematic sampling,\u00a0 cluster sampling and convenience sampling.<\/li>\n \t<li>A marketing company wants to determine which is more popular - its lemonade or a competitor's lemonade. The company sets up a booth at a local arena the evening of a Professional Boxing Match. Anyone who visits the booth is asked to choose their favourite lemonade\u00a0 from two unmarked glasses of lemonade. The marketing company's lemonade is made onsite and served with ice and a fresh slice of lemon; the competitor's lemonade is poured straight from a bottle. All taste testers receive a chance to win a television. Name at least three problems with the methodology used for this marketing company's taste test.<\/li>\n<\/ol>\n<h1>Answers<\/h1>\n<ol>\n \t<li>\u00a0Qualitative is A &amp; E; Quantitative is B, C D<\/li>\n \t<li>Qualitative is a, c, f, g;\u00a0 Quantitative is b, d, e, h<\/li>\n \t<li>Answers will vary<\/li>\n \t<li>\n<ol type=\"a\">\n \t<li>Cluster<\/li>\n \t<li>Systematic<\/li>\n \t<li>Convenience<\/li>\n \t<li>Simple Random<\/li>\n<\/ol>\n<\/li>\n \t<li>Answers may vary.\u00a0 Systematic Sampling avoids bias but it involves a commitment in time.\u00a0 Cluster sampling involves less time to determine the sample but it can be biased.\u00a0 Convenience sampling can involve less effort but it may be non representative\u00a0 of the population<\/li>\n \t<li>\u00a0Non representative sample -\u00a0 attendees at a boxing match may not be interested in lemonade ;\u00a0 \u00a0Possibly not a big enough sample; Undue Influence - the two lemonades are served up very differently;\u00a0 \u00a0Not random but instead involves self-selection by the participants (the taste testers must choose to go to the booth); Testers might participate only for the chance to win a TV and may not provide reliable feedback.<\/li>\n<\/ol>\n<h1>Attribution<\/h1>\nThis chapter has been adapted from \u201c<span class=\"os-text\" data-type=\"\">Data, Sampling, and Variation in Data and Sampling<\/span>\u201d in Introductory Statistics (OpenStax) by Barbara Illowsky and Susan Dean which is under a CC BY 4.0 Licence. Adapted by Kim Moshenko. See the Copyright page for more information.","rendered":"<h6><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter wp-image-1707 size-full\" src=\"https:\/\/opentextbc.ca\/oerdiscipline\/wp-content\/uploads\/sites\/361\/2019\/08\/7.4-intro-dice.jpg\" alt=\"\" width=\"1000\" height=\"500\" srcset=\"https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-content\/uploads\/sites\/361\/2019\/08\/7.4-intro-dice.jpg 1000w, https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-content\/uploads\/sites\/361\/2019\/08\/7.4-intro-dice-300x150.jpg 300w, https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-content\/uploads\/sites\/361\/2019\/08\/7.4-intro-dice-768x384.jpg 768w, https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-content\/uploads\/sites\/361\/2019\/08\/7.4-intro-dice-65x33.jpg 65w, https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-content\/uploads\/sites\/361\/2019\/08\/7.4-intro-dice-225x113.jpg 225w, https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-content\/uploads\/sites\/361\/2019\/08\/7.4-intro-dice-350x175.jpg 350w\" sizes=\"auto, (max-width: 1000px) 100vw, 1000px\" \/><\/h6>\n<div class=\"textbox textbox--learning-objectives\">\n<header class=\"textbox__header\">\n<p class=\"textbox__title\">Learning Objectives<\/p>\n<\/header>\n<div class=\"textbox__content\">\n<p>By the end of this section it is expected that you will be able to:<\/p>\n<ul>\n<li>State whether data is quantitative or qualitative<\/li>\n<li>Describe the random sampling methods: simple random sampling, systematic sampling, cluster sampling and convenience sampling<\/li>\n<li>Discuss potential problems that might arise when sampling from a population<\/li>\n<\/ul>\n<\/div>\n<\/div>\n<h1>Populations and Samples<\/h1>\n<p>In statistics, we generally want to study a population. You can think of a population as a collection of persons, things, or objects under study. It is often not feasible or possible to study the entire population. Instead we can select a sample. The idea of sampling is to select a portion (or subset) of the larger population and study that portion (the sample) to gain information about the population. Data are the result of sampling from a population.<br \/>\nBecause it takes a lot of time and money to examine an entire population, sampling is a very practical technique. If you wished to compute the overall grade point average at your school, it would make sense to select a sample of students who attend the school. The data collected from the sample would be the students&#8217; grade point averages. In elections, opinion poll samples of 1,000\u20132,000 people are taken. The opinion poll is supposed to represent the views of the people in the entire country.<\/p>\n<h1>Types of Data<\/h1>\n<p>Most data can be categorized as <strong>qualitative<\/strong> or <strong>quantitative.<\/strong><\/p>\n<p><strong>Qualitative data<\/strong> are the result of categorizing or describing attributes of a population using our senses such as sight or touch. Hair color, blood type, ethnic group, the car model that a person drives, and the street a person lives on are examples of qualitative data. Qualitative data are generally described by words or letters. For instance, hair color might be black, dark brown, light brown, blonde, gray, or red. Blood type might be AB+, O-, or B+.<\/p>\n<p><strong>Quantitative data<\/strong> are always numbers. Quantitative data are the result of counting or measuring attributes of a population. Amount of money, pulse rate, weight, number of people living in your town, and number of students who take statistics are examples of quantitative data. Researchers often prefer to use quantitative data over qualitative data because it lends itself more easily to mathematical analysis. For example, it does not make sense to find an average hair color or median blood type.<\/p>\n<div class=\"textbox textbox--examples\">\n<header class=\"textbox__header\">\n<p class=\"textbox__title\">EXAMPLE 1<\/p>\n<\/header>\n<div class=\"textbox__content\">\n<p>Consider a high school math class and a sample of five student&#8217;s backpacks. Determine whether the data is quantitative or qualitative.<\/p>\n<p>1.\u00a0 One data set is the number of books students carry in their backpacks.Two students carry three books, one student carries four books, one student carries two books, and one student carries one book.<\/p>\n<p>2.\u00a0 For the sample of five backpacks you weigh the backpacks and contents. The weights (in kilograms) of their backpacks are 3.2, 5, 4.8, 5.1, 2.3.<\/p>\n<p>3. For the sample of five students you record the colour of the backpacks. The books are red, blue or black.<\/p>\n<p><strong>Solution:<\/strong><\/p>\n<ol>\n<li>This is\u00a0 quantitative data.<\/li>\n<li>This is quantitative data.<\/li>\n<li>This is qualitative data.<\/li>\n<\/ol>\n<\/div>\n<\/div>\n<div class=\"textbox textbox--exercises\">\n<header class=\"textbox__header\">\n<p class=\"textbox__title\">TRY IT 1<\/p>\n<\/header>\n<div class=\"textbox__content\">\n<p>Determine the correct data type (quantitative or qualitative).<\/p>\n<ol type=\"a\">\n<li>\u00a0the number of pairs of shoes you own<\/li>\n<li>\u00a0the colour of vehicle you drive<\/li>\n<li>the distance it is from your home to the nearest grocery store<\/li>\n<li>the number of classes you take per school year.<\/li>\n<li>the model of calculator you use<\/li>\n<li>weights of sumo wrestlers<\/li>\n<li>total number of correct answers on a quiz<\/li>\n<li>IQ scores<\/li>\n<\/ol>\n<details>\n<summary>Show answer<\/summary>\n<p>Items a, c, d, f, g and h are\u00a0 quantitative; items b and e are qualitative.<\/p>\n<\/details>\n<\/div>\n<\/div>\n<p>It is often possible to assign both qualitative and quantitative measures to one set of data.<\/p>\n<div class=\"textbox textbox--examples\">\n<header class=\"textbox__header\">\n<p class=\"textbox__title\">EXAMPLE 2<\/p>\n<\/header>\n<div class=\"textbox__content\">\n<p>You go to the supermarket and purchase three cans of soup (350 ml tomato, 400 ml lentil, and 250 ml chicken noodle), four different kinds of vegetables (broccoli, cauliflower, spinach, and carrots), and two containers pf ice cream (pistachio ice cream and vanilla ice cream).<br \/>\nName the data sets that are qualitative.<\/p>\n<p><strong>Solution<\/strong><\/p>\n<p>The types of soups, vegetables and desserts are qualitative data because they are categorical. They are not measured or counted.<\/p>\n<\/div>\n<\/div>\n<div class=\"textbox textbox--exercises\">\n<header class=\"textbox__header\">\n<p class=\"textbox__title\">TRY IT 2<\/p>\n<\/header>\n<div class=\"textbox__content\">\n<p>You go to the supermarket and purchase three cans of soup (350 ml tomato, 400 ml lentil, and 250 ml chicken noodle),\u00a0 four different kinds of vegetables (broccoli, cauliflower, spinach, and carrots), and two containers pf ice cream (pistachio ice cream and vanilla ice cream).<br \/>\nName the data sets that are quantitative.<\/p>\n<details>\n<summary>Show answer<\/summary>\n<p>The three cans of soup, four kinds of vegetables and two ice creams are quantitative data because you count them. The weights of the soups are quantitative because you measure weights as precisely as possible.<\/p>\n<\/details>\n<\/div>\n<\/div>\n<h1>Sampling<\/h1>\n<p><span style=\"color: #000000;\">Gathering information about an entire population often costs too much or is virtually impossible. Instead, we use a sample of the population. A sample should have the same characteristics as the population it is representing. There are several different methods of random sampling. This section will describe four of the most common methods.\u00a0 In each form of random sampling, each member of a population initially has an equal chance of being selected for the sample.<\/span><\/p>\n<h1><span style=\"color: #000000;\">Simple Random Sampling<\/span><\/h1>\n<p><span style=\"color: #000000;\">The easiest method to describe is called a simple random sample. Any group of &#8216;n&#8217; individuals is equally likely to be chosen as any other group of &#8216;n&#8217; individuals if the simple random sampling technique is used. In other words, each sample of the same size has an equal chance of being selected. <\/span><\/p>\n<p><span style=\"color: #000000;\">For example, suppose Lisa wants to form a four-person study group (herself and three other people) from her pre-calculus class, which has 31 members not including Lisa. To choose a simple random sample of size three from the othe<\/span><span style=\"color: #000000;\">r members of her class, Lisa could put all 31 names in a hat, shake the hat, close her eyes, and pick out three names. An alternative is for Lisa to alphabetically list the last names of the members of her class and number each with a two-digit number\u00a0 01, 02, 03, 04, 05, 06,&#8230;31.<span style=\"color: #001000;\">\u00a0\u00a0<\/span><\/span><span style=\"font-size: 14pt;\">Lisa can use a table of random numbers (found in many statistics books) a calculator, or a computer to generate random numbers.<\/span><\/p>\n<div class=\"textbox textbox--examples\">\n<header class=\"textbox__header\">\n<p class=\"textbox__title\">EXAMPLE 3<\/p>\n<\/header>\n<div class=\"textbox__content\">\n<p>How can Lisa determine three group mates from a numbered list of 31 students?<\/p>\n<p><strong>Solution<\/strong><\/p>\n<p>Lisa can generate random numbers from a calculator.<\/p>\n<p>The calculator generates the first seven random numbers as follows:\u00a0 0.943\u00a0 0.230\u00a0 0.046\u00a0 0.514\u00a0 0.405\u00a0 0.733\u00a0 \u00a00.983\u00a0 \u00a0Lisa reads two-digit groups until she has chosen three class members. Each random number may only contribute one class member.<\/p>\n<p>The first random number\u00a0 0.943 is read as the numbers 94 and 43. Neither of these\u00a0 corresponds to the students&#8217; assigned numbers (01 to 31).<\/p>\n<p>The random number 0.230 is read as 23\u00a0 and 30. Although both of these numbers corresponds to a student, only the first number, 23, will be used. The first student will be number 23.<\/p>\n<p>The random number 0.046 is read as 04 and 46\u00a0 which corresponds to student 04. The second student will be student number 4.<\/p>\n<p>The third student will correspond to the number 14 which is read from the random number 0.514 (since there is no student numbered 51).<\/p>\n<p>The three names that correspond to the two-digit numbers 23, 04 and 14 will form Lisa&#8217;s group. If she needed to, Lisa could have generated more random numbers.<\/p>\n<\/div>\n<\/div>\n<div class=\"textbox textbox--exercises\">\n<header class=\"textbox__header\">\n<p class=\"textbox__title\">TRY IT 3<\/p>\n<\/header>\n<div class=\"textbox__content\">\n<p>A fitness studio plans to purchase new equipment and wants to conduct a survey of\u00a0 its membership. There are over 700 members and the studio wishes to survey only a portion of this membership. Upon purchasing a membership, every\u00a0 member has been assigned a 3 digit membership number.\u00a0 Decribe how the studio can use the membership numbers to select a <strong>simple random sample<\/strong> of 80 members.<\/p>\n<details>\n<summary>Show answer<\/summary>\n<p>A random number generator is used to generate a list of three digit numbers. Each random number that is generated will be compared with the membership numbers. If the number has been assignd to a member then that member will be one of the survey group. If the random number has not been assigned then the next random number is considered until 80 members have been selected.<\/p>\n<\/details>\n<\/div>\n<\/div>\n<h1><strong>Systematic Sampling<\/strong><\/h1>\n<p>Systematic sampling is where the first\u00a0 sample member from a larger population is selected according to a random starting point. Additional sample members are then selected based on a fixed interval. The interval is calculated by dividing the population size by the desired sample size. If the population consists of 500 members and the desired sample size is 50, then the interval would be 500\/50 = 10.\u00a0 Every tenth member of the population would be part of the sample.<\/p>\n<div class=\"textbox textbox--examples\">\n<header class=\"textbox__header\">\n<p class=\"textbox__title\">EXAMPLE 4<\/p>\n<\/header>\n<div class=\"textbox__content\">\n<p>A high school counsellor is conducting a survey of\u00a0 the graduating class which consists of 1243 students. Describe how the counsellor can select a systematic sample of 50 students.<\/p>\n<p><strong>Solution<\/strong><\/p>\n<p>The counsellor can interview 50 students. The interval is calculated as 1243 students\/50 = 24.86 which rounds up to 25. This determines the interval increment as 25 so every 25th student will be in the sample.<\/p>\n<p>To obtain the sample, the counsellor accesses the alphabetical list of graduates and generates a random number. Suppose the number is 03. The counsellor will interview the 3rd student on the list followed by every 25th student on the list: This will yield a sample of student 3, 28, 53, 78, and so on until 50 names have been chosen.<\/p>\n<\/div>\n<\/div>\n<div class=\"textbox textbox--exercises\">\n<header class=\"textbox__header\">\n<p class=\"textbox__title\">TRY IT 4<\/p>\n<\/header>\n<div class=\"textbox__content\">\n<p>A fitness studio plans to purchase new equipment and wants to conduct a survey of\u00a0 its membership. There are over 700 members and the studio wishes to survey only a portion of this membership. Upon purchasing a membership, every\u00a0 member has been assigned a 4 digit membership number.\u00a0 Decribe how the studio can use the membership numbers to select a <strong>systematic sample<\/strong> of 80 members.<\/p>\n<details>\n<summary>Show answer<\/summary>\n<p>Since 80 members are needed for the survey, the total number of members will be divided by 80. Assume there are 724 members, then 724\/80 = 9.05 which rounds to an increment of 9. This determines the increment for the intervals. A list of 3-digit random numbers is generated to determine the first member in the survey group and every 9th member will be included in the survey group.\u00a0 If the first member has a number 546, then every 9th member counting from 546 will be chosen. When the end of the membership list is reached the increments will continue counting from the beginning of the list unil 80 members are selected.<\/p>\n<\/details>\n<\/div>\n<\/div>\n<h1>Cluster Sampling<\/h1>\n<p>To choose a cluster sample, divide the population into clusters (groups) and then randomly select some of the clusters. Every member from each of the selected clusters will be in the cluster sample. This type of sampling works best in populations that can be grouped into distinct groups. In a 50 floor apartment building, each floor could represent a cluster. In a hockey league, each team could be a cluster.<\/p>\n<div class=\"textbox textbox--examples\">\n<header class=\"textbox__header\">\n<p class=\"textbox__title\">EXAMPLE 5<\/p>\n<\/header>\n<div class=\"textbox__content\">\n<p>A textbook publisher plans to conduct a survey of the faculty at a college campus. There are 23 departments at the college. Describe how the publisher can use the departments to select four cluster samples.<\/p>\n<p><strong>Solution<\/strong><\/p>\n<p>Let each department represents one cluster. The publisher numbers the departments from one to twenty-three and randomly\u00a0 selects 4 numbers which determine the four departments. Only these four departments will form the cluster sample and all faculty within the four departments (clusters) will be surveyed.<\/p>\n<\/div>\n<\/div>\n<div class=\"textbox textbox--exercises\">\n<header class=\"textbox__header\">\n<p class=\"textbox__title\">TRY IT 5<\/p>\n<\/header>\n<div class=\"textbox__content\">\n<p>A textbook publisher plans to conduct a survey of the students at a college campus. There are 45 program areas ranging from 18 to 40 students in each program.\u00a0 Decribe how the publisher can use the program areas to select a <strong>cluster sample<\/strong> of at least 100 students.<\/p>\n<details>\n<summary>Show answer<\/summary>\n<p>The publisher numbers the program areas from one to forty-five and generates random numbers. The first random number is used to determine the first program area (cluster). Additional random numbers are assigned to clusters until\u00a0 there are at least 100 students for the survey. Only the students in the selected programs (clusters) will be surveyed.<\/p>\n<\/details>\n<\/div>\n<\/div>\n<p>Cluster sampling can reduce the need for resources and may be more efficient. Disadvantages are that it can introduce biases or it may not represent the total population. In example 5, perhaps the textbook publisher is seeking feedback on its textbooks. If one or more of the chosen clusters does not use textbooks then the results may not be reliable.<\/p>\n<h1>Convenience Sampling<\/h1>\n<p>A type of sampling that is non-random is called convenience sampling. Convenience sampling involves using results that are readily available or convenient.<\/p>\n<div class=\"textbox textbox--examples\">\n<header class=\"textbox__header\">\n<p class=\"textbox__title\">EXAMPLE 6<\/p>\n<\/header>\n<div class=\"textbox__content\">\n<p>A computer software developer seeks to determine which of its new video games are the most popular among females. Describe how the developer can select a convenience sample.<\/p>\n<p><strong>Solution<\/strong><\/p>\n<p>The developer can conduct a marketing study by going to a local electronic gaming store and ask all female shoppers as they enter the store if they will participate in a 3 minute survey on video games.<\/p>\n<\/div>\n<\/div>\n<div class=\"textbox textbox--exercises\">\n<header class=\"textbox__header\">\n<p class=\"textbox__title\">TRY IT 6<\/p>\n<\/header>\n<div class=\"textbox__content\">\n<p>A fitness studio plans to purchase new equipment and wants to conduct a survey of\u00a0 its membership. There are over 700 members and the studio wishes to survey 100 of its members.\u00a0 Decribe how the studio can select a <strong>convenience sample<\/strong> of 80 members.<\/p>\n<details>\n<summary>Show answer<\/summary>\n<p>The studio owner prepares a survey and distributes it to all members who visit the studio over a 3-day period.<\/p>\n<\/details>\n<\/div>\n<\/div>\n<p>This form of sampling may be appealing due to its convenience but the results can be misleading. This type of surveying may be good in some cases but it can also be highly biased (favor certain outcomes) in others.<\/p>\n<div class=\"textbox textbox--examples\">\n<header class=\"textbox__header\">\n<p class=\"textbox__title\">EXAMPLE 7<\/p>\n<\/header>\n<div class=\"textbox__content\">\n<p>A study is done to determine the average tuition that undergraduate students pay per semester. Each student in the following samples is asked how much tuition he or she paid for the Fall semester. What is the type of sampling in each case?\u00a0(simple random, systematic, cluster, or convenience)<\/p>\n<ol type=\"a\">\n<li>A random number generator is used to select a student from the alphabetically numbered email listing of all undergraduate students in the Fall semester. Starting with that student, every 50th student is chosen until 75 students are included in the sample.<\/li>\n<li>A random number generator is used to select 75 student ID numbers.<\/li>\n<li>The freshman, sophomore, junior, and senior years are numbered one, two, three, and four, respectively. A random number generator is used to pick two of those years. All students in those two years are in the sample.<\/li>\n<li>An administrative assistant is asked to stand in front of the library one day and to ask the first 100 undergraduate students he encounters what they paid for tuition in the Fall semester.<\/li>\n<\/ol>\n<p><strong>Solution<\/strong><\/p>\n<ol type=\"a\">\n<li>systematic<\/li>\n<li>simple random<\/li>\n<li>cluster<\/li>\n<li>convenience<\/li>\n<\/ol>\n<\/div>\n<\/div>\n<div class=\"textbox textbox--exercises\">\n<header class=\"textbox__header\">\n<p class=\"textbox__title\">TRY IT 7<\/p>\n<\/header>\n<div class=\"textbox__content\">\n<p>Determine the type of sampling used (simple random, systematic, cluster, or convenience).<\/p>\n<ol type=\"a\">\n<li>A pollster interviews all human resource personnel in five different high tech companies.<\/li>\n<li>A medical researcher interviews every third cancer patient from a list of cancer patients at a local hospital.<\/li>\n<li>A high school counselor uses a computer to generate 50 random numbers and then picks students whose names correspond to the numbers.<\/li>\n<li>A student interviews classmates in his algebra class to determine how many pairs of jeans a student owns, on the average.<\/li>\n<\/ol>\n<details>\n<summary>Show answer<\/summary>\n<ol type=\"a\">\n<li>cluster<\/li>\n<li>systematic<\/li>\n<li>simple random<\/li>\n<li>convenience<\/li>\n<\/ol>\n<\/details>\n<\/div>\n<\/div>\n<h1>Potential Survey Issues<\/h1>\n<p>Users of statistical studies should be aware of the sampling method before accepting the results of the studies. Common problems to be aware of include:<\/p>\n<ol>\n<li>Nonrepresentative samples: A sample must be representative of the population under study. A sample that is not representative of the population is biased. Biased samples that are not representative of the population give results that are inaccurate and not valid. An example of a biased sample would be a survey on violence in sports where only the female students in a coed high school are surveyed.<\/li>\n<li>Self-selected samples: Surveys where responses are voluntary, such as call-in surveys, are often unreliable.<\/li>\n<li>Sample size issues: Samples that are too small may be unreliable. Larger samples are better, if possible. In some situations, having small samples is unavoidable and can still be used to draw conclusions. Examples would include crash testing of cars or medical testing for rare conditions.<\/li>\n<li>Undue influence: collecting data or asking questions in a way that influences the response. An example would be conducting a taste test of two sodas where one is refrigerated and the other is served at room temperature.<\/li>\n<li>Non-response or refusal of a subject to participate: The collected responses may no longer be representative of the population. Often, people with strong positive or negative opinions may answer surveys, which can affect the results. As an example, reviewers on Internet travel sites may not be representative of the entire population.<\/li>\n<li>Misleading use of data: Be aware of improperly displayed graphs, incomplete data, or lack of context.<\/li>\n<\/ol>\n<h1 style=\"text-align: left;\" data-type=\"title\">Key Concepts<\/h1>\n<p>When conducting a survey we can choose from several sampling methods:<\/p>\n<ul>\n<li>Simple random sampling is where a member of the population <span style=\"color: #000000;\">is equally as likely to be chosen as any other member from the population.\u00a0<\/span><\/li>\n<li>Systematic sampling is where the first\u00a0 sample member from a larger population is selected according to a random starting point. Additional sample members are then selected based on a fixed interval.<\/li>\n<li><span style=\"text-align: initial; font-size: 14pt;\">Cluster sampling is where the population is divided\u00a0 into clusters (groups) and then a specific number of clusters is randomly selected. Every member from each of the selected clusters will be in the cluster sample.\u00a0<\/span><\/li>\n<li>\u00a0Convenience sampling is where the selection is made from a part of the population that is easy to access.<\/li>\n<\/ul>\n<h1>Glossary<\/h1>\n<div class=\"textbox shaded\">\n<p><strong>qualitative data<\/strong><\/p>\n<p>are the result of categorizing or describing attributes of a population using our senses such as sight or touch.<\/p>\n<p><strong>quantitative data <\/strong><\/p>\n<p>are the result of counting or measuring a specific attribute of a population.<\/p>\n<\/div>\n<h1>7.3 Exercise Set<\/h1>\n<ol>\n<li>\u00a0Shoppers at a farmer&#8217;s market were surveyed to determine how environmentally and market friendly they were. The survey recorded the <strong>A)<\/strong> type of bag (cloth, plastic, none, wicker, other)\u00a0 <strong>B)<\/strong> the number of bags (0, 1, 2, 3, more than 3)<strong>\u00a0 C)<\/strong> the number of market visits per year\u00a0 <strong>\u00a0D)\u00a0<\/strong> Average amount of money per visit spent at the market\u00a0<strong> E)<\/strong> preferred vendor(s) .\u00a0 Which of A, B, C, D, E are qualitative and which are quantitative?<\/li>\n<li>A census yields a wide variety of data. State whether each of the following questions would provide qualitative or quantitative data.\n<ol type=\"a\">\n<li>What province do you live in?<\/li>\n<li>How many years have you lived at your current address?<\/li>\n<li>What type of dwelling do you live in (house, apartment, condo, mobile home, other)?<\/li>\n<li>How many people live in your home?<\/li>\n<li>How many years languages do you speak?<\/li>\n<li>What languages do you speak?<\/li>\n<li>What is your occupation?<\/li>\n<li>What is your annual salary?<\/li>\n<\/ol>\n<\/li>\n<li>Consider a typical classroom in college or university. Name two types of qualitative data and two types of quantitative data that could be collected. e.g. qualitative &#8211; score from an entrance exam;\u00a0 quantitative &#8211; country of birth<\/li>\n<li>A study is done to determine the food outlet preferences for all students living on campus in the fall semester.\u00a0 Each student in the sample will be asked the same set of 10 questions.\u00a0 Four different sampling techniques are described below. What is the type of sampling in each case? (simple random, systematic, cluster, or convenience).\n<ol type=\"a\">\n<li>There are 8 different student residences on campus. Two residences are randomly selected and every student living in those two residences is surveyed.<\/li>\n<li>The surnames of all students living on campus are arranged alphabetically and numbered from 1 to n (where n is the number of students living on campus).\u00a0 A random number generator is used to determine a number between 1 and 50. This number is matched to a student with the same number.\u00a0 Starting with that student, every 50th student is chosen until the required number of\u00a0 students is chosen for the sample.<\/li>\n<li>One of the food outlets is chosen by drawing one outlet name. Over a four hour period one day, four helpers stop all students entering that food outlet and if they live on campus they are administered the survey.<\/li>\n<li>A computer is used to generate random numbers that have the same format as the students&#8217; ID numbers.\u00a0 Random numbers are generated until 100 random numbers are matched by student number to a student living on campus. These 100 students are contacted and arrangements are made for the interviewer to meet with the student.<\/li>\n<\/ol>\n<\/li>\n<li>State one advantage and one disadvantage for each of systematic sampling,\u00a0 cluster sampling and convenience sampling.<\/li>\n<li>A marketing company wants to determine which is more popular &#8211; its lemonade or a competitor&#8217;s lemonade. The company sets up a booth at a local arena the evening of a Professional Boxing Match. Anyone who visits the booth is asked to choose their favourite lemonade\u00a0 from two unmarked glasses of lemonade. The marketing company&#8217;s lemonade is made onsite and served with ice and a fresh slice of lemon; the competitor&#8217;s lemonade is poured straight from a bottle. All taste testers receive a chance to win a television. Name at least three problems with the methodology used for this marketing company&#8217;s taste test.<\/li>\n<\/ol>\n<h1>Answers<\/h1>\n<ol>\n<li>\u00a0Qualitative is A &amp; E; Quantitative is B, C D<\/li>\n<li>Qualitative is a, c, f, g;\u00a0 Quantitative is b, d, e, h<\/li>\n<li>Answers will vary<\/li>\n<li>\n<ol type=\"a\">\n<li>Cluster<\/li>\n<li>Systematic<\/li>\n<li>Convenience<\/li>\n<li>Simple Random<\/li>\n<\/ol>\n<\/li>\n<li>Answers may vary.\u00a0 Systematic Sampling avoids bias but it involves a commitment in time.\u00a0 Cluster sampling involves less time to determine the sample but it can be biased.\u00a0 Convenience sampling can involve less effort but it may be non representative\u00a0 of the population<\/li>\n<li>\u00a0Non representative sample &#8211;\u00a0 attendees at a boxing match may not be interested in lemonade ;\u00a0 \u00a0Possibly not a big enough sample; Undue Influence &#8211; the two lemonades are served up very differently;\u00a0 \u00a0Not random but instead involves self-selection by the participants (the taste testers must choose to go to the booth); Testers might participate only for the chance to win a TV and may not provide reliable feedback.<\/li>\n<\/ol>\n<h1>Attribution<\/h1>\n<p>This chapter has been adapted from \u201c<span class=\"os-text\" data-type=\"\">Data, Sampling, and Variation in Data and Sampling<\/span>\u201d in Introductory Statistics (OpenStax) by Barbara Illowsky and Susan Dean which is under a CC BY 4.0 Licence. Adapted by Kim Moshenko. See the Copyright page for more information.<\/p>\n","protected":false},"author":125,"menu_order":3,"template":"","meta":{"pb_show_title":"on","pb_short_title":"","pb_subtitle":"","pb_authors":[],"pb_section_license":""},"chapter-type":[],"contributor":[],"license":[],"class_list":["post-1708","chapter","type-chapter","status-publish","hentry"],"part":1671,"_links":{"self":[{"href":"https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-json\/pressbooks\/v2\/chapters\/1708","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-json\/pressbooks\/v2\/chapters"}],"about":[{"href":"https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-json\/wp\/v2\/types\/chapter"}],"author":[{"embeddable":true,"href":"https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-json\/wp\/v2\/users\/125"}],"version-history":[{"count":1,"href":"https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-json\/pressbooks\/v2\/chapters\/1708\/revisions"}],"predecessor-version":[{"id":1709,"href":"https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-json\/pressbooks\/v2\/chapters\/1708\/revisions\/1709"}],"part":[{"href":"https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-json\/pressbooks\/v2\/parts\/1671"}],"metadata":[{"href":"https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-json\/pressbooks\/v2\/chapters\/1708\/metadata\/"}],"wp:attachment":[{"href":"https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-json\/wp\/v2\/media?parent=1708"}],"wp:term":[{"taxonomy":"chapter-type","embeddable":true,"href":"https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-json\/pressbooks\/v2\/chapter-type?post=1708"},{"taxonomy":"contributor","embeddable":true,"href":"https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-json\/wp\/v2\/contributor?post=1708"},{"taxonomy":"license","embeddable":true,"href":"https:\/\/opentextbc.ca\/businesstechnicalmath\/wp-json\/wp\/v2\/license?post=1708"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}