The standard error (SE) is a statistical measure that quantifies the amount of variation or dispersion of a sample statistic (such as the mean) relative to the true population parameter. It reflects how much a sample statistic is expected to vary due to random sampling error.
Formula for Standard Error
The standard error of the mean (SEM) is commonly calculated as:
SE\(= \frac{\sigma}{\sqrt{n}}\)
Where:
- \(\sigma\) is the standard deviation of the population.
- n is the sample size.
If the population standard deviation \((\sigma)\) is unknown, the sample standard deviation (s) is often used:
SE\(= \frac{s}{\sqrt{n}}\)
Why Is Standard Error Useful?
-
Estimates Precision of the Sample Statistic
The SE provides insight into how precise the sample mean (or another statistic) is as an estimate of the population mean. A smaller SE indicates more reliable estimates.
-
Builds Confidence Intervals
SE is used to calculate confidence intervals, which show the range within which the true population parameter is likely to fall. For example:
Confidence Interval=\(\bar x\) ± z⋅SE
Where \(\bar{x}\) is the sample mean and z is the critical value from the standard normal distribution.
-
Conducts Hypothesis Testing
In statistical tests, the SE is used to determine the significance of sample results by comparing them to a null hypothesis.
-
Accounts for Sample Size
Since SE decreases as the sample size increases (n), it highlights the effect of sample size on the reliability of an estimate. Larger samples yield smaller SEs and more accurate estimates.
-
Evaluates Sampling Variability
It helps quantify how much variability is expected in repeated sampling from the population, providing a measure of the stability of the sample statistic.
Example
Suppose a study measures the average height of 100 individuals and finds a sample mean of 170 cm with a sample standard deviation of 10 cm. The SE of the mean would be:
SE \(= \frac{10}{\sqrt{100}} \)=1cm.
This implies that the sample mean is expected to vary by about 1 cm around the true population mean due to random sampling variability.