Math 37 - Lecture 21
What if we don't know s? (7.1)
RECAP
Confidence Intervals: estimate
z* (standard dev of estimate)
z* from Normal Probability Table
e.g. Means
z*
; Proportions:
z*
Tests of Significance: Assume Ho is true, and calculate a test statistic to see if observation is very rare (compare distance between estimate and hypothesized value dividing by std dev of estimate).
Means: z0=(
- m0)/(
); Proportions: zo = (
- p0)/![]()
Compare p-value (from Table A) to a
Technical assumptions for the validity of the procedure:
Means: Observations are Normal or n large
Proportions: n large (np>10 ), X is Binomial (pop>>sample)
Standard deviation(
)=
Standard Error of a Statistic:
When we dont know the standard deviation exactly, we can estimate it from the data.
1. Std Dev(
) =
, substitute
, SE(
)=![]()
2. Std Dev(
) =
, substitute s, SE(
)=
"standard error of
" (Explains SE MEAN in Minitab output).
The bad news: With means, longer follows a Standard Normal Distribution!
What should be true about the distribution it does follow?
In fact, (
- m)/(
) has a Students t distribution with n-1 degrees of freedom, tn-1 IF
-
-
Properties of Students t distribution:
Calculating p-values from Student's t distribution:
Table D gives you some information about p-value.
Example Is there significant evidence that the gas mileage increase is greater than zero?
Calculating critical values from Students t distribution:
For 95% confidence interval, find t* (Table D)
How does t* compare to z*(95%)?
Why does this make sense?
Example 95% confidence interval for mileage difference:
SUMMARY Inference when s is unknown:
Hypothesis Test:
Confidence Interval:
Use Table D for critical values and p-values.
Technical assumptions of Student t procedures for means:
1. Normality of measurements
Can be hard to check
- past experience (small data sets)
- plots (Eliminate outliers?) (Normal Probability Plot)
or 2. Large n
Use 30/40 for cutoff between large and small sample inference
Usually safe with t procedures when n > 15 (robust)
If n is small check for skewness and outliers.