Note: Solve the assignment step by step and explain each step in clear English. Better explanations earn better marks.

SAMPLE DATA 1

31 38 36 31 33 35 32 35 30 32 38 33 37 31 39 37 39 30 34 38 31 30 30 39 35 35 37 39 36 30 35 38 39 38 35

33 38 36 36 33 33 35 31 30 30 37 32 30 30 33

Using sample data 1 above, do the following:

(a) Create an appropriate frequency distribution containing 5 classes.

(b) Create a relative frequency distribution of the above frequency distribution.

(c) Create a cumulative frequency distribution of the above frequency distribution.

(c) Argue whether the distribution is normal or not. If it is not normal, state what kind of distribution it is and why.

(d) Find the mean of the frequency distribution.

(e) Find the variance and standard deviation of the sample data.

(f) Check whether 95% of the sample data lies within 2 standard deviations of the mean, and argue why this does or does not hold. Then use Chebyshev's theorem and show that it holds for k = 2 on your dataset.
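The calculations for sample data 1 can be sketched in Python as a way to check hand-worked answers. This is a minimal sketch under stated assumptions, not a model answer: the helper `frequency_table` is a hypothetical name, the classes are assumed to be equal-width integer classes starting at the minimum value, and the mean of the frequency distribution is assumed to use class midpoints. Other class limits are equally valid and would give different frequencies.

```python
import itertools
import statistics

# Sample data 1, copied from the assignment (50 values).
data1 = [
    31, 38, 36, 31, 33, 35, 32, 35, 30, 32, 38, 33, 37, 31, 39, 37, 39, 30,
    34, 38, 31, 30, 30, 39, 35, 35, 37, 39, 36, 30, 35, 38, 39, 38, 35,
    33, 38, 36, 36, 33, 33, 35, 31, 30, 30, 37, 32, 30, 30, 33,
]


def frequency_table(values, num_classes=5):
    """Return (lower limit, upper limit, frequency) rows.

    Assumption: integer data and equal-width classes starting at the minimum.
    """
    low, high = min(values), max(values)
    # Class width = number of distinct integer slots / classes, rounded up.
    width = -(-(high - low + 1) // num_classes)
    rows = []
    for i in range(num_classes):
        lower = low + i * width
        upper = lower + width - 1
        freq = sum(lower <= v <= upper for v in values)
        rows.append((lower, upper, freq))
    return rows


table = frequency_table(data1)
n = len(data1)
freqs = [f for _, _, f in table]
rel_freqs = [f / n for f in freqs]               # relative frequencies
cum_freqs = list(itertools.accumulate(freqs))    # cumulative frequencies

# Mean of the frequency distribution, using class midpoints (an assumption).
grouped_mean = sum(f * (lo + hi) / 2 for lo, hi, f in table) / n

# Sample variance and standard deviation (divisor n - 1) of the raw data.
mean = statistics.mean(data1)
variance = statistics.variance(data1)
sd = statistics.stdev(data1)

# Share of the data within 2 standard deviations of the mean.
# Chebyshev's theorem guarantees at least 1 - 1/k**2 = 0.75 for k = 2.
within_2sd = sum(abs(v - mean) <= 2 * sd for v in data1) / n

for (lo, hi, f), rel, cum in zip(table, rel_freqs, cum_freqs):
    print(f"{lo}-{hi}: frequency={f}, relative={rel:.2f}, cumulative={cum}")
print(f"grouped mean={grouped_mean:.2f}, mean={mean:.2f}")
print(f"variance={variance:.2f}, sd={sd:.2f}, within 2 sd={within_2sd:.0%}")
```

The printed table still needs a written explanation for each step, since the note at the top asks for reasoning in words rather than output alone.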

SAMPLE DATA 2

Generate 50 normally distributed data points using the website https://www.socscistatistics.com/utilities/normaldistribution/default.aspx with a population mean of 79 and a standard deviation of 4. Write the data on the answer sheet.

(g) Use the range rule of thumb to find the limits separating values that are significantly low or significantly high. Show if the range rule of thumb for normal distribution holds for 2 standard deviations.

(h) Calculate and show which of the two data sets, sample data 1 or sample data 2, varies more.

(i) Take the first two data points from the second dataset and show which of the two values is more extreme within the dataset, and why.

(j) Take the last data point in the second dataset and state whether the value is significantly low, significantly high, or usual.

(k) Draw a boxplot for the first data set that also shows the range for outliers.
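Tasks (g) to (k) can be checked with a short Python sketch. Several things here are assumptions rather than part of the assignment: `data2` is a seeded pseudo-random stand-in that only lets the code run and must be replaced by the 50 values generated on the website; the function names are hypothetical; "varies more" is assumed to be judged by the coefficient of variation; "more extreme" is assumed to be judged by the absolute z-score; and the outlier limits are assumed to be the usual 1.5 x IQR fences. Quartile conventions differ between textbooks, so the quartiles may not match a hand method exactly.

```python
import random
import statistics

# Sample data 1, copied from the assignment (50 values).
data1 = [
    31, 38, 36, 31, 33, 35, 32, 35, 30, 32, 38, 33, 37, 31, 39, 37, 39, 30,
    34, 38, 31, 30, 30, 39, 35, 35, 37, 39, 36, 30, 35, 38, 39, 38, 35,
    33, 38, 36, 36, 33, 33, 35, 31, 30, 30, 37, 32, 30, 30, 33,
]

# ASSUMPTION: stand-in for sample data 2. Replace with your own 50 values
# generated on the website (population mean 79, standard deviation 4).
rng = random.Random(0)
data2 = [round(rng.gauss(79, 4), 2) for _ in range(50)]


def range_rule_limits(values):
    """(g) Limits mean - 2s and mean + 2s from the range rule of thumb."""
    mean = statistics.mean(values)
    s = statistics.stdev(values)
    return mean - 2 * s, mean + 2 * s


def z_score(x, values):
    """(i) Number of sample standard deviations that x lies from the mean."""
    return (x - statistics.mean(values)) / statistics.stdev(values)


def classify(x, values):
    """(j) Label x using the range rule of thumb limits."""
    low, high = range_rule_limits(values)
    if x <= low:
        return "significantly low"
    if x >= high:
        return "significantly high"
    return "usual"


def coefficient_of_variation(values):
    """(h) Standard deviation as a percentage of the mean."""
    return statistics.stdev(values) / statistics.mean(values) * 100


def box_summary(values):
    """(k) Five-number summary plus 1.5 * IQR outlier fences."""
    q1, q2, q3 = statistics.quantiles(values, n=4)
    iqr = q3 - q1
    return {
        "min": min(values), "q1": q1, "median": q2, "q3": q3,
        "max": max(values),
        "lower_fence": q1 - 1.5 * iqr, "upper_fence": q3 + 1.5 * iqr,
    }


low, high = range_rule_limits(data2)
print(f"(g) significantly low <= {low:.2f}, significantly high >= {high:.2f}")
# The rule of thumb also estimates s as range / 4; compare with the actual s.
print(f"(g) range/4 = {(max(data2) - min(data2)) / 4:.2f}, "
      f"s = {statistics.stdev(data2):.2f}")
print(f"(h) CV data 1 = {coefficient_of_variation(data1):.2f}%, "
      f"CV data 2 = {coefficient_of_variation(data2):.2f}%")
first, second = data2[0], data2[1]
print(f"(i) z({first}) = {z_score(first, data2):.2f}, "
      f"z({second}) = {z_score(second, data2):.2f}")
print(f"(j) {data2[-1]} is {classify(data2[-1], data2)}")
print(f"(k) {box_summary(data1)}")
```

The coefficient of variation is used for (h) because the two data sets have different means, so comparing raw standard deviations alone could mislead.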

Suppose that your population consists only of these 4 numbers:

Population:

10 31 14 48

(l) Generate all possible samples of size 2 from your population, with replacement (note that, for example, 5,6 and 6,5 count as two different samples, and 5,5 is also a sample).

Then prove the following by showing all calculations:

(m) Sample mean is an unbiased estimator of the population mean.

(n) Sample variance is an unbiased estimator of the population variance.

(o) Sample standard deviation is a biased estimator of the population standard deviation.
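The enumeration in (l) and the three claims in (m) to (o) can be verified numerically with the sketch below. It assumes the population 10, 31, 14, 48 given above, treats the population variance as the one with divisor N, and uses the sample variance with divisor n - 1. It is a numerical check only; the written proof still has to show each calculation.

```python
import itertools
import math
import statistics

# Population from the assignment.
population = [10, 31, 14, 48]

# (l) All ordered samples of size 2, drawn with replacement: 4 * 4 = 16.
samples = list(itertools.product(population, repeat=2))

# Population parameters (variance and standard deviation use divisor N).
pop_mean = statistics.mean(population)
pop_var = statistics.pvariance(population)
pop_sd = statistics.pstdev(population)

# Average each sample statistic over all 16 equally likely samples.
mean_of_means = statistics.mean(statistics.mean(s) for s in samples)
mean_of_vars = statistics.mean(statistics.variance(s) for s in samples)
mean_of_sds = statistics.mean(statistics.stdev(s) for s in samples)

print(f"number of samples = {len(samples)}")
# (m) Unbiased: the two values agree.
print(f"(m) mean of sample means = {mean_of_means}, mu = {pop_mean}")
# (n) Unbiased: the two values agree.
print(f"(n) mean of sample variances = {mean_of_vars}, sigma^2 = {pop_var}")
# (o) Biased: the average sample standard deviation falls below sigma.
print(f"(o) mean of sample sds = {mean_of_sds:.4f}, sigma = {pop_sd:.4f}")
```

An estimator is unbiased when the average of its values over all possible samples equals the population parameter, which is why each statistic is averaged over the full set of 16 samples.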

22 29 23 25 23 23 25 22 20 22 20 20 27 26 25 29 23 23 20 20 26 27 22 21 26 27 22 22 28 28 25 20 22 21 28

25 20 29 25 22 27 21 25 21 25 20 21 20 29 27

30 26 26 27

