Module 8: Interval Estimation

θ is fixed while θ_{hat_n} is a random variable which provides the single best value to estimate θ
θ_hatis unbiased when bias = E(θ_{hat_n) -}θ = 0
θ_hatis consistent whenθ_{hat_n ->}θ
Mean squared error MSE = E(θ_{hat_n -}θ)² = bias(θ_{hat_n}) + V(θ_{hat_n})
If bias -> and se -> as n -> infinity then θ_{hat_n}is consistent
Probability is stronger than samples, probability standard error eventually converges to 0 as n approaches infinity but samples converge to a normal distribution which is not necessarily the same as the population distribution.
We the estimator variability (se) to provide an interval of parameter values that are "supported" by the sample.

A 1 - α confidence interval for a parameter θ is an interval C_n = (a; b) where a = a(X₁, ...X_n) and b = b(X₁, ...X_n) are functions of the data such that: P(θ ∈ C_n) >= 1 - α ; Where θ is the actual population mean. C_n is random and θ is fixed.

The confidence interval (a; b) capture the true mean with ~~probability~~confidence 1 - α. We commonly use 95% confidence intervals which corresponds to α = .05. This does NOT mean there is 1- α chance the parameter falls in the interval. The correct interpretation: If we repeatedly take samples of size n from a fixed and stable population and build a 95% confidence intervals, 95% of these intervals would contain the true unknown parameter.