MATH220 Test 1 Solutions Fall 2014 1. (12 pts) True or False: Put a T or F in front of each number. T. 1. Both stratified sample and cluster sample divide the population into groups. F. 2. In a cluster sample, the population is divided into groups, and a random sample from each group is drawn. F. 3. A sample of convenience is never acceptable. F. 4. A confounder makes it easier to draw conclusions from a study. F. 5. In an observational study, subjects are assigned to treatment groups at random. F. 6. Observational studies are generally more reliable than experiments. F.7. If every individual in a sample has of an equal chance of being selected, then this must be a simple random sample. T. 8. In a case-control study, the outcome has occurred before the subjects are sampled. T. 9. A nominal variable is a qualitative variable with no natural ordering. T. 10. A person’s height is an example of a continuous variable. T. 11. A double blind experiment means neither the investigator nor the subjects know who has been assigned to which treatment. T. 12. In a randomize experiment, the treatment groups are not too different from each other except that they receive different treatments. 2. (14 pts) Find Q1, Q2, Q3, IQR and use quartiles and IQR to check potential outliers for this data set. 70, 10, 20, 30, 40, 50, 60, 70, 80, 90, 110, 205. The ordered values are: 10, 20, 30, | 40, 50, 60, 1 || 70, 70, 80, | 90, 110, 205. Q1 = 35 Q2 = 65 Q3 = 85 IQR = Q3 − Q1 = 50 Lower outlier boundary = Q1 − 1.5 ∗ IQR = 35 − 1.5 ∗ 50 = −40 Upper outlier boundary = Q5 + 1.5 ∗ IQR = 85 + 1.5 ∗ 50 = 160. Outliers are 205 since 205 > 160. No values are smaller than the lower outlier boundary. 3. (16 pts) The table shows the frequency of primary food choices of 200 alligators captured in Florida lakes. Primary food choice frequency Fish Invertebrate Other Reptile Bird 86 58 30 16 10 relative frequency 0.43 0.28 0.15 0.08 0.05 (a) Put a column of relative frequency (proportion or percent) in the table. (b) Make a bar chart for the variable Primary food choice. Put the relative frequency on the vertical axis. Omitted. (c) What is the mode for primary food choice? The mode is Fish which has the higher frequency among all the categories. (d) In addition to a bar chart, what chart can be used as well to display this variable? Only name this chart. You do not need to make the chart. A pie chart can also be used to display a categorical variable. 2 4. (20 pts) The table below classifying auto accidents by survival status (S=survival, D=died) and seat belt status of the individuals involved in the accident. Belt Yes No Outcome S 412,368 162,527 D 510 1601 Total 412,878 164,128 Total 574,895 2111 577,006 (a) Estimate P (D), the probability that the individual died in the auto accident, i.e., find the the proportion of the individuals who died in the auto accident among all who had auto accidents. 2111 = 0.0037. P (D) = 577,006 (b) Estimate the probability that the individual wore a seat belt given that he/she survived the accident. P (Y es|S) = 412,368 = 0.7173. 574,895 (c) Estimate the probability that the individual died, given that he/she (1) wore, (2) did not wear a seat belt. Note you need to give two probabilities here. 510 P (D|Y es) = 412878 = 0.0012 1601 P (D|N o) = 164218 = 0.0097. 5. (8 pts) Following are the prices (in dollars) of 4 randomly chosen undergraduate textbooks. Find the variance of this data set. 100, 135, 85, 180. x¯ = 125, s2 = (100−125)2 +(135−125)2 +(85−125)2 +(180−125)2 4−1 3 = 1783.33. 6. (13 pts) In Harrisonburg, 60% of the household subscribe to NY Times, 45% subscribe to Washington Post, and 75% subscribe to at least one of the two newspapers. Let N indicate subscribing to NY Times, W indicate subscribing to Washington Post. (a) Find P (N or W). P(N or W)= 0.75. (b) What is the probability that a randomly selected household subscribe to neither of them? P (Neither) = 1-0.75=0.25. (c) Find P( N and W). Note P(N or W)= P(N)+P(W) - P(N and W), so P(N and W)= P(N)+P(W)- P(N or W)= 0.60+0.45-0.75=0.30. (d) What is the probability that a randomly selected household subscribes to exactly one newspaper? P(Exactly one) = P(N or W)- P(N and W)=0.75-30=0.45. 7. (16 pts) It is known that 10% of a population carry a certain disease. A doctor is about to check 4 randomly selected people from this population. (a) What is the probability all the 4 people carry the disease? P (all) = 0.14 = 0.0001. (b) What is the probability that none of the 4 people carry the disease? P (none) = 0.94 = 0.6561. (c) What is the probability that at least one person carry the disease? P(at least one) = 1- P(none) =1-0.6561=0.3439. 4
© Copyright 2024 ExpyDoc