What does the central limit theorem state when the sample size is increased?

What does the central limit theorem state when the sample size is increased?

Mohammad Masum, PhD

Follow

Feb 26

·

5 min read

·

Member-only

Effect of Sample Size in Central Limit Theorem

What Central Limit Theorem Tells You???

Fundamentals of Statistics

What does the central limit theorem state when the sample size is increased?

Figure \(\PageIndex{3}\)

Figure \(\PageIndex{4}\) is a uniform distribution which, a bit amazingly, quickly approached the normal distribution even with only a sample of 10.

What does the central limit theorem state when the sample size is increased?

Figure \(\PageIndex{4}\)

Figure \(\PageIndex{5}\) is a skewed distribution. This last one could be an exponential, geometric, or binomial with a small probability of success creating the skew in the distribution. For skewed distributions our intuition would say that this will take larger sample sizes to move to a normal distribution and indeed that is what we observe from the simulation. Nevertheless, at a sample size of 50, not considered a very large sample, the distribution of sample means has very decidedly gained the shape of the normal distribution.

What does the central limit theorem state when the sample size is increased?
Figure \(\PageIndex{5}\)

The Central Limit Theorem provides more than the proof that the sampling distribution of means is normally distributed. It also provides us with the mean and standard deviation of this distribution. Further, as discussed above, the expected value of the mean, \(\mu_{\overline{x}}\), is equal to the mean of the population of the original data which is what we are interested in estimating from the sample we took. We have already inserted this conclusion of the Central Limit Theorem into the formula we use for standardizing from the sampling distribution to the standard normal distribution. And finally, the Central Limit Theorem has also provided the standard deviation of the sampling distribution, \(\sigma_{\overline{x}}=\frac{\sigma}{\sqrt{n}}\), and this is critical to have to calculate probabilities of values of the new random variable, \(\overline x\).

Figure \(\PageIndex{6}\) shows a sampling distribution. The mean has been marked on the horizontal axis of the \(\overline X\)'s and the standard deviation has been written to the right above the distribution. Notice that the standard deviation of the sampling distribution is the original standard deviation of the population, divided by the sample size. We have already seen that as the sample size increases the sampling distribution becomes closer and closer to the normal distribution. As this happens, the standard deviation of the sampling distribution changes in another way; the standard deviation decreases as \(n\) increases. At very very large \(n\), the standard deviation of the sampling distribution becomes very small and at infinity it collapses on top of the population mean. This is what it means that the expected value of \(\mu_{\overline{x}}\) is the population mean, \(\mu\).

What does the central limit theorem state when the sample size is increased?

Figure \(\PageIndex{6}\)

At non-extreme values of \(n\), this relationship between the standard deviation of the sampling distribution and the sample size plays a very important part in our ability to estimate the parameters we are interested in.

Figure \(\PageIndex{7}\) shows three sampling distributions. The only change that was made is the sample size that was used to get the sample means for each distribution. As the sample size increases, \(n\) goes from 10 to 30 to 50, the standard deviations of the respective sampling distributions decrease because the sample size is in the denominator of the standard deviations of the sampling distributions.

What does the central limit theorem state when the sample size is increased?

Figure \(\PageIndex{7}\)

The implications for this are very important. Figure \(\PageIndex{8}\) shows the effect of the sample size on the confidence we will have in our estimates. These are two sampling distributions from the same population. One sampling distribution was created with samples of size 10 and the other with samples of size 50. All other things constant, the sampling distribution with sample size 50 has a smaller standard deviation that causes the graph to be higher and narrower. The important effect of this is that for the same probability of one standard deviation from the mean, this distribution covers much less of a range of possible values than the other distribution. One standard deviation is marked on the \(\overline X\) axis for each distribution. This is shown by the two arrows that are plus or minus one standard deviation for each distribution. If the probability that the true mean is one standard deviation away from the mean, then for the sampling distribution with the smaller sample size, the possible range of values is much greater. A simple question is, would you rather have a sample mean from the narrow, tight distribution, or the flat, wide distribution as the estimate of the population mean? Your answer tells us why people intuitively will always choose data from a large sample rather than a small sample. The sample mean they are getting is coming from a more compact distribution. This concept will be the foundation for what will be called level of confidence in the next unit.

What does the central limit theorem state when the sample size is increased?

Figure \(\PageIndex{8}\)


This page titled 7.2: Using the Central Limit Theorem is shared under a CC BY 4.0 license and was authored, remixed, and/or curated by OpenStax via source content that was edited to the style and standards of the LibreTexts platform; a detailed edit history is available upon request.

What does central limit theorem say about sample size?

The central limit theorem says that the sampling distribution of the mean will always follow a normal distribution when the sample size is sufficiently large. This sampling distribution of the mean isn't normally distributed because its sample size isn't sufficiently large.

What happens when the sample size increases?

As the sample sizes increase, the variability of each sampling distribution decreases so that they become increasingly more leptokurtic. The range of the sampling distribution is smaller than the range of the original population.

What does the central limit theorem tells us?

The central limit theorem states that if you have a population with mean μ and standard deviation σ and take sufficiently large random samples from the population with replacement , then the distribution of the sample means will be approximately normally distributed.