Collection of notes for various classes I've taken.
A continuous, symmetric distribution used for inference about a population mean when the population standard deviation is unknown and sample size is small. It has heavier tails than the normal distribution, which accounts for extra uncertainty from estimating variance.
For ν > 0 and x ∈ ℝ:
\[f(x) = \frac{\Gamma\!\left(\tfrac{\nu+1}{2}\right)}{\sqrt{\nu\pi}\,\Gamma\!\left(\tfrac{\nu}{2}\right)} \left(1 + \frac{x^2}{\nu}\right)^{-\frac{\nu+1}{2}}\]If Z ~ N(0,1) and U ~ χ²(ν) are independent, then
\[T = \frac{Z}{\sqrt{U/\nu}} \sim t(\nu).\]This is the origin of the t-statistic used in practice.
Commonly used for confidence intervals and hypothesis tests about a population mean μ when σ is unknown.
t-statistic for a sample (X̄ sample mean, s sample standard deviation, n sample size):
\[t = \frac{\overline{X} - \mu}{s / \sqrt{n}} \sim t(n-1)\]Critical values and p-values are obtained from the t-distribution with ν = n − 1 degrees of freedom.
1) (Construct a 95% CI for a mean — show work)
Given: n = 10, x̄ = 5.0, s = 2.0, confidence level = 95%.
Find: 95% CI for μ.
Solution:
Answer: 95% CI ≈ (3.569, 6.431).
2) (Two-sided hypothesis test — show test statistic, p-value, decision)
Problem: Test H0: μ = 4.0 versus Ha: μ ≠ 4.0 using the same sample (n = 10, x̄ = 5.0, s = 2.0). Use α = 0.05.
Solution:
Conclusion: Insufficient evidence to conclude μ ≠ 4.0 (at the 5% level).