Sampling Distribution
Suppose we have a finite population and we draw all possible simple random samples of size without replacement or with replacement. For each sample we calculate a statistic (sample mean
or proportion
, etc.). All possible values of the statistic make a probability distribution which is called the sampling distribution. The number of all possible samples is usually very large and obviously the number of statistics (any function of the sample) will be equal to the number of samples if one and only one statistic is calculated from each sample. In fact, in practical situations, the sampling distribution has a very large number of values. The shape of the sampling distribution depends upon the size of the sample, the nature of the population and the statistic which is calculated from all possible simple random samples. Some of the most well-known sampling distributions are:
(1) Binomial distribution
(2) Normal distribution
(3) t-distribution
(4) Chi-square distribution
(5) F-distribution
These distributions are called derived distributions because they are derived from all possible samples.
Standard Error
The standard deviation of a statistic is called the standard error of that statistic. If the statistic is , the standard deviation of all possible values of
is called the standard error of
, which may be written as S.E.
or
. Similarly, if the sample statistic is proportion
, the standard deviation of all possible values of
is called the standard error of
and is denoted by
or S.E.
.
Sampling Distribution of
The probability distribution of all possible values of calculated from all possible simple random samples is called the sampling distribution of
. In brief, we shall call it the distribution of
. The mean of this distribution is called the expected value of
and is written as
or
. The standard deviation (standard error) of this distribution is denoted by S.E.
or
and the variance of
is denoted by
or
. The distribution of
has some important properties:
- One important property of the distribution of
is that it is a normal distribution when the size of the sample is large. When the sample size
is more than
, we call it a large sample size. The shape of the population distribution does not matter. The population may be normal or non-normal, the distribution of
is normal for
, but this is true when the number of samples is very large. As the distribution of random variable
is normal,
can be transformed into a standard normal variable
where
. The distribution of
has a t-distribution when the population is normal and
. Diagram (a) shows the normal distribution and diagram (b) shows the t-distribution.


-
The mean of the distribution of
is equal to the mean of the population. Thus
(population mean). This relation is true for small as well as large sample sizes in sampling without replacement and with replacement.
-
The standard error (standard deviation) of
is related to the standard deviation of the population
through the relations:
S.E.
This is true when population is infinite, which meansis very large or the sampling is done with replacement from a finite or infinite population.
S.E.
This is true when sampling is without replacement from a finite population. The above two equations betweenand
are true both for small as well as large sample sizes.