Practice
1.1 Definitions of Statistics, Probability, and Key Terms
Soccer | Basketball | Lacrosse | Total | |
---|---|---|---|---|
Women | 8 | 8 | 4 | 20 |
Men | 4 | 12 | 4 | 20 |
Total | 12 | 20 | 8 | 40 |
Given these data, calculate the marginal distributions of college sports for the people surveyed.
Soccer | Basketball | Lacrosse | Total | |
---|---|---|---|---|
Women | 8 | 8 | 4 | 20 |
Men | 4 | 12 | 4 | 20 |
Total | 12 | 20 | 8 | 40 |
Given these data, calculate the conditional distributions for the subpopulation of women who play college sports.
Use the following information to answer the next five exercises:
Studies are often done by pharmaceutical companies to determine the effectiveness of a treatment program. Suppose that a new viral antibody drug is currently under study. It is given to patients once the virus's symptoms have revealed themselves. Of interest is the average (mean) length of time in months patients live once they start the treatment. Two researchers each follow a different set of 40 patients with the viral disease from the start of treatment until their deaths. The following data (in months) are collected:
Researcher A3: 4, 11, 15, 16, 17, 22, 44, 37, 16, 14, 24, 25, 15, 26, 27, 33, 29, 35, 44, 13, 21, 22, 10, 12, 8, 40, 32, 26, 27, 31, 34, 29, 17, 8, 24, 18, 47, 33, 34.
Researcher B3: 14, 11, 5, 16, 17, 28, 41, 31, 18, 14, 14, 26, 25, 21, 22, 31, 2, 35, 44, 23, 21, 21, 16, 12, 18, 41, 22, 16, 25, 33, 34, 29, 13, 18, 24, 23, 42, 33, 29.
Determine what the key terms refer to in the example for Researcher A.
1.2 Data, Sampling, and Variation in Data and Sampling
a. qualitative (categorical), b. quantitative discrete, c. quantitative continuous
Use the following information to answer the next four exercises:
A study was done to determine the age, number of times per week, and the duration (amount of time) of residents using a local park in San Antonio, Texas. The first house in the neighborhood around the park was selected randomly, and then the resident of every eighth house in the neighborhood around the park was interviewed.
a. simple random, b. systematic, c. stratified, d. cluster
a. qualitative (categorical), b. quantitative discrete, c. quantitative continuous
a. qualitative (categorical), b. quantitative discrete, c. quantitative continuous
Year | Total Number of Deaths |
---|---|
2000 | 231 |
2001 | 21,357 |
2002 | 11,685 |
2003 | 33,819 |
2004 | 228,802 |
2005 | 88,003 |
2006 | 6,605 |
2007 | 712 |
2008 | 88,011 |
2009 | 1,790 |
2010 | 320,120 |
2011 | 21,953 |
2012 | 768 |
Total | 823,856 |
Use Table 1.30 to answer the following questions:
- What is the proportion of deaths between 2007–2012?
- What percent of deaths occurred before 2001?
- What is the percent of deaths that occurred in 2003 or after 2010?
- What is the fraction of deaths that happened before 2012?
- What kind of data is the number of deaths?
- Earthquakes are quantified according to the amount of energy they produce (examples are 2.1, 5.0, 6.7). What type of data is that?
- What contributed to the large number of deaths in 2010? In 2004? Explain.
- If you were asked to present these data in an oral presentation, what type of graph would you choose to present and why? Explain what features you would point out on the graph during your presentation.
For the following four exercises, determine the type of sampling used (simple random, stratified, systematic, cluster, or convenience):
Use the following information to answer the next seven exercises:
Studies are often done by pharmaceutical companies to determine the effectiveness of a treatment program. Suppose that a new viral antibody drug is currently under study. It is given to patients once the virus's symptoms have revealed themselves. Of interest is the average (mean) length of time in months patients live once starting the treatment. Two researchers each follow a different set of 40 patients with the viral disease from the start of treatment until their deaths. The following data (in months) are collected:
Researcher A: 3, 4, 11, 15, 16, 17, 22, 44, 37, 16, 14, 24, 25, 15, 26, 27, 33, 29, 35, 44, 13, 21, 22, 10, 12, 8, 40, 32, 26, 27, 31, 34, 29, 17, 8, 24, 18, 47, 33, 34.
Researcher B: 3, 14, 11, 5, 16, 17, 28, 41, 31, 18, 14, 14, 26, 25, 21, 22, 31, 2, 35, 44, 23, 21, 21, 16, 12, 18, 41, 22, 16, 25, 33, 34, 29, 13, 18, 24, 23, 42, 33, 29.
Survival Length (in months) | Frequency | Relative Frequency | Cumulative Relative Frequency |
---|---|---|---|
.5–6.5 | |||
6.5–12.5 | |||
12.5–18.5 | |||
18.5–24.5 | |||
24.5–30.5 | |||
30.5–36.5 | |||
36.5–42.5 | |||
42.5–48.5 |
Survival Length (in months) | Frequency | Relative Frequency | Cumulative Relative Frequency |
---|---|---|---|
.5–6.5 | |||
6.5–12.5 | |||
12.5–18.5 | |||
18.5–24.5 | |||
24.5–30.5 | |||
30.5–36.5 | |||
36.5-45.5 |
Use the following data to answer the next five exercises:
Two researchers are gathering data on hours of video games played by school-aged children and young adults. They each randomly sample different groups of 150 students from the same school. They collect the following data:
Hours Played per Week | Frequency | Relative Frequency | Cumulative Relative Frequency |
---|---|---|---|
0–2 | 26 | .17 | .17 |
2–4 | 30 | .20 | .37 |
4–6 | 49 | .33 | .70 |
6–8 | 25 | .17 | .87 |
8–10 | 12 | .08 | .95 |
10–12 | 8 | .05 | 1 |
Hours Played per Week | Frequency | Relative Frequency | Cumulative Relative Frequency |
---|---|---|---|
0–2 | 48 | .32 | .32 |
2–4 | 51 | .34 | .66 |
4–6 | 24 | .16 | .82 |
6–8 | 12 | .08 | .90 |
8–10 | 11 | .07 | .97 |
10–12 | 4 | .03 | 1 |
Use the following data to answer the next five exercises:
A pair of studies was performed to measure the effectiveness of a new software program designed to help stroke patients regain their problem-solving skills. Patients were asked to use the software program twice a day, once in the morning, and once in the evening. The studies observed 200 stroke patients recovering over a period of several weeks. The first study collected the data in Table 1.35. The second study collected the data in Table 1.36.
Group | Showed Improvement | No Improvement | Deterioration |
---|---|---|---|
Used program | 142 | 43 | 15 |
Did not use program | 72 | 110 | 18 |
Group | Showed Improvement | No Improvement | Deterioration |
---|---|---|---|
Used program | 105 | 74 | 19 |
Did not use program | 89 | 99 | 12 |
1.3 Frequency, Frequency Tables, and Levels of Measurement
42. What type of measure scale is being used? Nominal, ordinal, interval or ratio.
- High school soccer players classified by their athletic ability: superior, average, above average
- Baking temperatures for various main dishes: 350, 400, 325, 250, 300
- The colors of crayons in a 24-crayon box
- Social security numbers
- Incomes measured in dollars
- A satisfaction survey of a social website by number: 1 = very satisfied, 2 = somewhat satisfied, 3 = not satisfied
- Preferred TV shows: comedy, drama, science fiction, sports, news
- Time of day on an analog watch
- The distance in miles to the closest grocery store
- The dates 1066, 1492, 1644, 1947, and 1944
- The heights of 21–65-year-old women
- Common letter grades: A, B, C, D, and F
1.4 Experimental Design and Ethics
44. Discuss potential violations of the rule requiring informed consent.
- Inmates in a correctional facility are offered good behavior credit in return for participation in a study.
- A research study is designed to investigate a new children’s allergy medication.
- Participants in a study are told that the new medication being tested is highly promising, but they are not told that only a small portion of participants will receive the new medication. Others will receive placebo treatments and traditional treatments.