Stat 301 - HW 8
Due midnight, Friday, March 17
Remember to
join a HW group if you are submitting together, to put your names in this file
and to include, and to integrate all relevant computer output.
1) Stat
301 students (Coutin & Heffler, 2021) wanted to know whether listening to
up-tempo music causes college students to tend to type faster. To collect their
data (number of words typed correctly in one minute), the students planned to
use the 60-second Easy-Text typing test (TypingTest.com).
They recruited 34 Cal Poly students from groups they were associated with on
campus (e.g., athletic teams, musical groups).
For the up-tempo music they selected Overture to Candide performed by
the London Symphony Orchestra.
Let μnomusic
represent the population mean typing speed without the music and μmusic the population mean typing seed with
the music.
(a) State the student researchers’ null and alternative
hypotheses in symbols and in words.
(b) Briefly describe what a “completely randomized design”
(independent samples) would look like. (Be clear how randomization is used.)
(c) Briefly describe/contrast what a matched pairs design
would look like. (Be clear how randomization is used.)
(d)
According to typingpal.com, an average typing speed is 40 words per minute, and
a good targe speed is 65 to 70 words per minute. Suppose we will be impressed if listening to
music increases typing speed by 5 words per minute on average. Also suppose the variability (standard
deviation) of typing speeds is around 14 words per minute. (Note, this corresponds to an effect size of
5/14 = 0.36.) No question here!
(e) Suppose we plan to use a completely randomized design,
randomly assigning 17 students to each treatment. Find the power we would detect a difference
in population mean typing speed in a two-sided test. We had a direction in mind
above, but two-sided is the default in both R and JMP, so let’s start there.
R power.t.test(n=17,delta=5,sd=14,type="two.sample") |
JMP Choose DOE > Design
Diagnostics > Sample Size and Power Choose Two Sample Means Enter 14 as the standard deviation Enter 5 as the difference to detect Enter 34 as the overall sample size (Leave power blank) Press Continue (JMP then fills
in the power) |
Include your output and give a one-sentence interpretation of
the calculated power in the context of this study.
(f) Suppose we plan to use a matched pairs design, asking
each student to take the test twice, once with music and once without,
randomizing which method they use first.
Should the variability in the speed differences be larger or
smaller than the variability in the speeds? Explain your reasoning
(should be clear how you are contrasting the two).
(g) Suppose the variability in the speed
differences is 10 words per minute.
Calculate the power that we would detect a nonzero mean difference in
typing speed in a two-sided test.
R power.t.test(n=17,delta=5,sd=10,type="paired") or power.t.test(n=34,delta=5,sd=10,type="paired") |
JMP Choose DOE > Design
Diagnostics > Sample Size and Power Choose One Sample Mean Enter 10 as the standard deviation Enter 5 as the difference to detect Enter 17 or 34 as the sample size Press Continue |
Include your output. How do the power calculations compare?
The data in TypingMusic.txt shows the speeds (in words per minute) for each
student using music and not using music.
(h) Create a graph of the differences. Include your output
and summarize what you learn. (JMP users, you may want to calculate the
differences in Excel?) Also, calculate
the standard deviations of each group (with and without music) as well as the
standard deviation of the differences (include output). Is the standard
deviation of the differences a lot smaller? (If so, then that says the pairing
was helpful.)
(i) Use R or JMP to carry out the one-sample t-test
on the differences (aka a matched-pairs t-test). Note both R and JMP allow you to use the
“unstacked” format of these data in BaseRunning.txt.
R library(readr) typing <- read_delim("http://www.rossmanchance.com/iscam3/data/TypingMusic.txt",
delim = "\t") t.test(typing$WithMusic, typing$NoMusic,
paired=TRUE) |
JMP To run a matched pairs analysis using
the original two columns of data: Analyze >
Specialized Modelling > Matched Pairs Enter both columns in the Y, Paired
Responses box and press OK. |
Include a screen capture of the results and report the test
statistic and two-sided p-value.
(j) Does the t-test appear to be valid for these
data? You should comment on the validity conditions of the paired t-test.
(Probability plot?)
(k) Regardless of your answer to (j), use R or JMP to
calculate and interpret a one-sample t-confidence interval in
context (including measurement units).
(l) Carry out a sign test on the paired data:
1. How
many of the difference are positive? How many are negative? How many are zero?
2. Consider
the non-zero differences (i.e., throw-away the ties), what proportion of the
non-zero differences are positive?
3. Use
the binomial distribution to determine whether a statistically significant
majority of the differences are positive (define the parameter of interest,
state the hypotheses, and determine the exact binomial p-value – be sure to
include a screen capture of the null distribution showing the p-value).
(m) Does the sign test provide stronger or weaker evidence
that one typing method tends to be faster than
the other? (Be very clear how you are deciding.)
(n) Would a one-sample z-test be appropriate in
(l)? Explain how you are deciding.
(o) Determine, include output, and interpret in context a
95% confidence interval for . This time, consider your answer to (n) in
deciding which interval procedure to use.
Reminder: Course
Evaluations will be due Friday night as well!