Welcome to Naing Wai Yan Tun's Maths | အတွေးအတွက် မျှတစေရမယ် ကျနလှပစေရမယ် | HOME | GRADE 12 | GRADE 11 | GRADE 10 | ABOUT US |
Welcome to Naing Wai Yan Tun's Maths | အတွေးအတွက် မျှတစေရမယ် ကျနလှပစေရမယ် | HOME | GRADE 12 | GRADE 11 | GRADE 10 | ABOUT US |

Naing Wai Yan Tun's Maths

အတွေးအတွက် မျှတစေရမယ် ကျနလှပစေရမယ်

Measure of Variation

Chapter 6 Statistics - Section 6.1 Measure of Variation

6.1 Measure of Variation

Measures of centre of a data set

These measures include mean, median, and mode.

For a set of data, the mean is defined by the sum of the values, divided by the number of values.

$ \bar{x} = \frac{x_1 + x_2 + ... + x_n}{n} = \frac{1}{n}\sum_{i=1}^{n} x_i $

The median is the middle point of the data arranged in ascending order.

The mode is the value that occurs most often in a set of data.

The simplest measure of spread is the range R which is the difference between the largest value and the smallest value.

$ R = \text{largest value} - \text{smallest value} $

The data variation is on the difference of each data value from the mean. This difference is called a deviation, in symbol $x_i - \bar{x}$.

The variance $\sigma^2$ is the mean of squares of the deviations.

$ \sigma^2 = \frac{1}{n}\sum_{i=1}^{n} (x_i - \bar{x})^2 $

The standard deviation $\sigma$ is the square root of the variance.

$ \sigma = \sqrt{\text{variance}} $

We can compute the similar formula for the variance as follow:

$ \sigma^2 = \frac{\sum x^2}{n} - (\bar{x})^2 $

Quartiles $Q_1, Q_2$ and $Q_3$ divide the data arranged in ascending order into four equal groups. Here, $Q_2$ is the median of the whole data set. $Q_1$ and $Q_3$ are called the lower quartile and the upper quartile. The interquartile range is a measure of variability.

$ \text{interquartile range} = \text{upper quartile} - \text{lower quartile} = Q_3 - Q_1 $

From the frequency table, we can find the mean:

$ \bar{x} = \frac{\sum_{i=1}^{n} f_i x_i}{\sum_{i=1}^{n} f_i} $

We have the similar formula for the variance of group data as follow:

$ \sigma^2 = \frac{\sum f_i x_i^2}{\sum f_i} - (\bar{x})^2 $

Examples

Example 1

The marks scored in a test by seven students are 3, 4, 5, 2, 8, 8, 5. Find the mean and standard deviation.

Step 1: Calculate the Mean ($\bar{x}$)

$ \bar{x} = \frac{3+4+5+2+8+8+5}{7} = \frac{35}{7} = 5 $

Step 2: Calculate the Variance ($\sigma^2$) using Computational Formula

$\sum x^2 = 3^2+4^2+5^2+2^2+8^2+8^2+5^2 = 9+16+25+4+64+64+25 = 207$

$\sigma^2 = \frac{\sum x^2}{n} - (\bar{x})^2 = \frac{207}{7} - (5)^2 = 29.5714 - 25 = 4.5714$

Step 3: Calculate the Standard Deviation ($\sigma$)

$ \sigma = \sqrt{4.5714} = 2.1381 $

Example 2

Ten students are weighted (w kg). The summary data for the weights are $\sum w = 440$, $\sum w^2 = 19490$. Find the mean and standard deviation of the students' weight.

Step 1: Calculate the Mean ($\bar{w}$)

$ \bar{w} = \frac{\sum w}{n} = \frac{440}{10} = 44 $

Step 2: Calculate the Variance ($\sigma^2$)

$ \sigma^2 = \frac{\sum w^2}{n} - (\bar{w})^2 = \frac{19490}{10} - (44)^2 = 1949 - 1936 = 13 $

Step 3: Calculate the Standard Deviation ($\sigma$)

$ \sigma = \sqrt{13} = 3.6056 $

Example 3

Find the median and interquartile range for each of the set of data below.

(a) 7, 9, 4, 6, 3, 2, 8, 1, 10, 15, 11

Step 1: Arrange data in ascending order.

1, 2, 3, 4, 6, 7, 8, 9, 10, 11, 15

Step 2: Find the Median ($Q_2$).

Median ($Q_2$) = 7 (the 6th value)

Step 3: Find the Lower Quartile ($Q_1$) and Upper Quartile ($Q_3$).

Lower half: [1, 2, 3, 4, 6]. $Q_1 = 3$.

Upper half: [8, 9, 10, 11, 15]. $Q_3 = 10$.

Step 4: Calculate the Interquartile Range (IQR).

IQR = $Q_3 - Q_1 = 10 - 3 = 7$.

(b) 6, 5, 1, 14, 3, 7, 8, 2, 13

Step 1: Arrange data in ascending order.

1, 2, 3, 5, 6, 7, 8, 13, 14

Step 2: Find the Median ($Q_2$).

Median ($Q_2$) = 6 (the 5th value)

Step 3: Find $Q_1$ and $Q_3$.

Lower half: [1, 2, 3, 5]. $Q_1 = \frac{2+3}{2} = 2.5$.

Upper half: [7, 8, 13, 14]. $Q_3 = \frac{8+13}{2} = 10.5$.

Step 4: Calculate the IQR.

IQR = $Q_3 - Q_1 = 10.5 - 2.5 = 8$.

(c) 2, 15, 13, 6, 5, 12, 50, 22, 18, 52

Step 1: Arrange data in ascending order.

2, 5, 6, 12, 13, 15, 18, 22, 50, 52

Step 2: Find the Median ($Q_2$).

Median ($Q_2$) = $\frac{13+15}{2} = 14$.

Step 3: Find $Q_1$ and $Q_3$.

Lower half: [2, 5, 6, 12, 13]. $Q_1 = 6$.

Upper half: [15, 18, 22, 50, 52]. $Q_3 = 22$.

Step 4: Calculate the IQR.

IQR = $Q_3 - Q_1 = 22 - 6 = 16$.

(d) 5, 3, 5, 7, 8, 2, 4, 1, 2, 2, 3, 6

Step 1: Arrange data in ascending order.

1, 2, 2, 2, 3, 3, 4, 5, 5, 6, 7, 8

Step 2: Find the Median ($Q_2$).

Median ($Q_2$) = $\frac{3+4}{2} = 3.5$.

Step 3: Find $Q_1$ and $Q_3$.

Lower half: [1, 2, 2, 2, 3, 3]. $Q_1 = \frac{2+2}{2} = 2$.

Upper half: [4, 5, 5, 6, 7, 8]. $Q_3 = \frac{5+6}{2} = 5.5$.

Step 4: Calculate the IQR.

IQR = $Q_3 - Q_1 = 5.5 - 2 = 3.5$.

Example 4

Find the mean and standard deviation for the following frequency table:

x56789
Frequency(f)99101210

Step 1: Construct the calculation table

$x$$f$$f \cdot x$$x^2$$f \cdot x^2$
594525225
695436324
7107049490
8129664768
9109081810
$\sum f = 50$$\sum fx = 355$$\sum fx^2 = 2617$

Step 2: Calculate Mean, Variance, and Standard Deviation

Mean: $\bar{x} = \frac{\sum fx}{\sum f} = \frac{355}{50} = 7.1000$

Variance: $\sigma^2 = \frac{\sum fx^2}{\sum f} - (\bar{x})^2 = \frac{2617}{50} - (7.1)^2 = 52.34 - 50.41 = 1.9300$

Standard Deviation: $\sigma = \sqrt{1.93} = 1.3892$

Example 5

A man recorded the length, in minutes, of each telephone call he made for a month. Find the mean and standard deviation for the data.

Length (minute)(0, 10](10, 20](20, 30](30, 40](40, 50]
Frequency (f)41551210

Step 1: Construct the calculation table with midpoints

LengthMidpoint (x)f$f \cdot x$$x^2$$f \cdot x^2$
(0, 10]542025100
(10, 20]15152252253375
(20, 30]2551256253125
(30, 40]3512420122514700
(40, 50]4510450202520250
$\sum f = 46$$\sum fx = 1240$$\sum fx^2 = 41150$

Step 2: Calculate Mean, Variance, and Standard Deviation

Mean: $\bar{x} = \frac{\sum fx}{\sum f} = \frac{1240}{46} = 26.9565$

Variance: $\sigma^2 = \frac{\sum fx^2}{\sum f} - (\bar{x})^2 = \frac{41150}{46} - (\frac{1240}{46})^2 = 176.6068$

Standard Deviation: $\sigma = \sqrt{176.6068} = 13.2893$

Exercises 6.1

1. Find the median and the interquartile range for the following sets of data:

(a) 16, 18, 22, 19, 3, 21, 17, 20

Sorted Data: 3, 16, 17, 18, 19, 20, 21, 22

Median ($Q_2$) = $\frac{18+19}{2} = 18.5$.

Lower half: [3, 16, 17, 18]. $Q_1 = \frac{16+17}{2} = 16.5000$.

Upper half: [19, 20, 21, 22]. $Q_3 = \frac{20+21}{2} = 20.5$.

IQR = $20.5 - 16.5 = 4$.

(b) 33, 38, 43, 30, 29, 40, 51

Sorted Data: 29, 30, 33, 38, 40, 43, 51

Median ($Q_2$) = 38 (the 4th value).

Lower half: [29, 30, 33]. $Q_1 = 30$.

Upper half: [40, 43, 51]. $Q_3 = 43$.

IQR = $43 - 30 = 13$.

(c) 24, 32, 54, 31, 16, 18, 19, 14, 17, 20

Sorted Data: 14, 16, 17, 18, 19, 20, 24, 31, 32, 54

Median ($Q_2$) = $\frac{19+20}{2} = 19.5$.

Lower half: [14, 16, 17, 18, 19]. $Q_1 = 17$.

Upper half: [20, 24, 31, 32, 54]. $Q_3 = 31$.

IQR = $31 - 17 = 14$.

(d) 14, 16, 27, 18, 13, 19, 36, 15, 20

Sorted Data: 13, 14, 15, 16, 18, 19, 20, 27, 36

Median ($Q_2$) = 18 (the 5th value).

Lower half: [13, 14, 15, 16]. $Q_1 = \frac{14+15}{2} = 14.5$.

Upper half: [19, 20, 27, 36]. $Q_3 = \frac{20+27}{2} = 23.5$.

IQR = $23.5 - 14.5 = 9$.

2. Fifteen students do a mathematics test. Their marks are as follows: 7, 4, 9, 7, 6, 10, 12, 11, 3, 8, 5, 9, 8, 7, 3. Find the median and the interquartile range.

Step 1: Arrange data in ascending order.

3, 3, 4, 5, 6, 7, 7, 7, 8, 8, 9, 9, 10, 11, 12

Step 2: Find the Median ($Q_2$).

Median ($Q_2$) = 7 (the 8th value)

Step 3: Find $Q_1$ and $Q_3$.

Lower half: [3, 3, 4, 5, 6, 7, 7]. $Q_1 = 5$.

Upper half: [8, 8, 9, 9, 10, 11, 12]. $Q_3 = 9$.

Step 4: Calculate the IQR.

IQR = $Q_3 - Q_1 = 9 - 5 = 4$.

3. Find the mean, variance and the standard deviation of the following data sets.

(a) 1, 2, 3, 4, 5, 6, 7

Mean: $\bar{x} = \frac{1+2+3+4+5+6+7}{7} = \frac{28}{7} = 4.0000$

$\sum x^2 = 1^2+2^2+3^2+4^2+5^2+6^2+7^2 = 1+4+9+16+25+36+49=140$

Variance: $\sigma^2 = \frac{140}{7} - (4)^2 = 20 - 16 = 4$

Standard Deviation: $\sigma = \sqrt{4} = 2$

(b) 5, 6, 7, 3, 2, 9, 11

Mean: $\bar{x} = \frac{5+6+7+3+2+9+11}{7} = \frac{43}{7} \approx 6.1429$

$\sum x^2 = 5^2+6^2+7^2+3^2+2^2+9^2+11^2 = 25+36+49+9+4+81+121=325$

Variance: $\sigma^2 = \frac{325}{7} - (\frac{43}{7})^2 \approx 46.4286 - 37.7347 = 8.6939$

Standard Deviation: $\sigma = \sqrt{8.6939} = 2.9485$

(c) 4, 3, 2, 7, 0, 9

Mean: $\bar{x} = \frac{4+3+2+7+0+9}{6} = \frac{25}{6} \approx 4.1667$

$\sum x^2 = 4^2+3^2+2^2+7^2+0^2+9^2 = 16+9+4+49+0+81=159$

Variance: $\sigma^2 = \frac{159}{6} - (\frac{25}{6})^2 \approx 26.5 - 17.3611 = 9.1389$

Standard Deviation: $\sigma = \sqrt{9.1389} = 3.0231$

(d) 5, 5, 5, 4, 2, 7, 7, 7, 7

Mean: $\bar{x} = \frac{5(3)+4+2+7(4)}{9} = \frac{15+4+2+28}{9} = \frac{49}{9} \approx 5.4444$

$\sum x^2 = 5^2(3)+4^2+2^2+7^2(4) = 25(3)+16+4+49(4) = 75+16+4+196 = 291$

Variance: $\sigma^2 = \frac{291}{9} - (\frac{49}{9})^2 \approx 32.3333 - 29.6420 = 2.6913$

Standard Deviation: $\sigma = \sqrt{2.6913} = 1.6405$

(e) 1, 3, 5, 5, 7, 9, 10, 15, 17

Mean: $\bar{x} = \frac{1+3+5+5+7+9+10+15+17}{9} = \frac{72}{9} = 8$

$\sum x^2 = 1^2+3^2+5^2+5^2+7^2+9^2+10^2+15^2+17^2 = 1+9+25+25+49+81+100+225+289=804$

Variance: $\sigma^2 = \frac{804}{9} - (8)^2 = 89.3333 - 64 = 25.3333$

Standard Deviation: $\sigma = \sqrt{25.3333} = 5.0332$

4. The numbers of errors, x, on each of 200 pages of typescript was monitored. The results when summarized showed that $\sum x = 1000, \sum x^2 = 5500$. Calculate the mean and the standard deviation of the number of errors per page.

Step 1: Calculate the Mean ($\bar{x}$)

$\bar{x} = \frac{\sum x}{n} = \frac{1000}{200} = 5$

Step 2: Calculate the Variance ($\sigma^2$)

$\sigma^2 = \frac{\sum x^2}{n} - (\bar{x})^2 = \frac{5500}{200} - (5)^2 = 27.5 - 25 = 2.5$

Step 3: Calculate the Standard Deviation ($\sigma$)

$\sigma = \sqrt{2.5} = 1.5811$

5. In a student group, a record was kept for the number of absent days each student had over one particular term. Calculate the mean and standard deviation for these data shown in the table below.

Number of absent days (x)01234
Number of students (f)15102082

Step 1: Construct the calculation table

$x$$f$$f \cdot x$$x^2$$f \cdot x^2$
015000
11010110
22040480
3824972
4281632
$\sum f = 55$$\sum fx = 82$$\sum fx^2 = 194$

Step 2: Calculate Mean, Variance, and Standard Deviation

Mean: $\bar{x} = \frac{\sum fx}{\sum f} = \frac{82}{55} = 1.4909$

Variance: $\sigma^2 = \frac{\sum fx^2}{\sum f} - (\bar{x})^2 = \frac{194}{55} - (1.4909)^2 = 3.5273 - 2.2228 = 1.3045$

Standard Deviation: $\sigma = \sqrt{1.3045} = 1.1421$

6. A moth trap was set every night for five weeks. The number of moths caught in the trap was recorded. Calculate the mean and standard deviation for these data shown in the table below.

Number of moths (x)7891011
Frequency (f)38959

Step 1: Construct the calculation table

$x$$f$$f \cdot x$$x^2$$f \cdot x^2$
732149147
886464512
998181729
10550100500
119991211089
$ \sum f = 34$$\sum fx = 315$$\sum fx^2=2977$

Step 2: Calculate Mean, Variance, and Standard Deviation

Mean: $\bar{x} = \frac{315}{34} = 9.2647$

Variance: $\sigma^2 = \frac{2977}{34} - (9.2647)^2 = 87.5588 - 85.8259 = 1.7329$

Standard Deviation: $\sigma = \sqrt{1.7329} = 1.3164$

7. The lifetimes of 80 batteries, to the nearest hour (h), is shown in the following table. Calculate the mean and standard deviation for these data.

Lifetime (h)(6, 10](10, 15](15, 20](20, 25](25, 30]
Number of batteries128153015

Step 1: Construct the calculation table

Midpoint(x)f$f \cdot x$$x^2$$f \cdot x^2$
8129664768
12.58100156.251250
17.515262.5306.254593.75
22.530675506.2515187.5
27.515412.5756.2511343.75
$\sum f=80$$\sum fx=1546$$\sum fx^2=33143$

Step 2: Calculate Mean, Variance, and Standard Deviation

Mean: $\bar{x} = \frac{1546}{80} = 19.3250$

Variance: $\sigma^2 = \frac{33143}{80} - (19.3250)^2 = 414.2875 - 373.4556 = 40.8319$

Standard Deviation: $\sigma = \sqrt{40.8319} = 6.39$

8. The table summarizes the distances travelled by 150 students to college each day. Calculate the mean and standard deviation for these data.

Distance (s)(0, 2](2, 4](4, 6](6, 8](8, 10](10, 12]
Number of students82255351812

Step 1: Construct the calculation table

Distance (s)Midpoint(x)f$f \cdot x$$x^2$$f \cdot x^2$
(0, 2]18818
(2, 4]322669198
(4, 6]555275251375
(6, 8]735245491715
(8, 10]918162811458
(10, 12]11121321211452
$\sum f=150$$\sum fx=888$$\sum fx^2=6206$

Step 2: Calculate Mean, Variance, and Standard Deviation

Mean: $\bar{x} = \frac{\sum fx}{\sum f} = \frac{888}{150} = 5.92$

Variance: $\sigma^2 = \frac{\sum fx^2}{\sum f} - (\bar{x})^2 = \frac{6206}{150} - (5.92)^2 = 41.3733 - 35.0464 = 6.3269$

Standard Deviation: $\sigma = \sqrt{6.3269} = 2.5153$