0% found this document useful (0 votes)
622 views

Module 2 Grouped Data

This document discusses two methods for calculating the mean and calculating the median for grouped data: 1) The midpoint method calculates the mean by taking the midpoint of each class interval, multiplying it by its frequency, summing the products, and dividing by the total frequency. 2) The unit deviation method calculates the mean by choosing an assumed mean, constructing a unit deviation column, multiplying frequencies by deviations, summing, and using a formula to calculate the mean. It also explains that the median is the middlemost value that divides the data into two equal parts and discusses calculating the median through interpolation by finding the median class and approximating the position within the class.

Uploaded by

Angel Pasahol
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
622 views

Module 2 Grouped Data

This document discusses two methods for calculating the mean and calculating the median for grouped data: 1) The midpoint method calculates the mean by taking the midpoint of each class interval, multiplying it by its frequency, summing the products, and dividing by the total frequency. 2) The unit deviation method calculates the mean by choosing an assumed mean, constructing a unit deviation column, multiplying frequencies by deviations, summing, and using a formula to calculate the mean. It also explains that the median is the middlemost value that divides the data into two equal parts and discusses calculating the median through interpolation by finding the median class and approximating the position within the class.

Uploaded by

Angel Pasahol
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 6

MEAN FOR GROUPED DATA

To compute the value of the mean of a data presented in a frequency distribution we


shall consider two methods.

1. Midpoint Method
2. Unit Deviation Method

In using the Midpoint Method, the midpoint of each class interval is taken as the representative
of each class. These midpoint are multiplied by their corresponding frequencies. The products
are added and the sum is divided by the total number of frequencies. The value obtained is
considered the mean of the grouped data. The formula is


̅

Where f – represent the frequency of each data


x – the midpoint of each class
n – the total number of frequencies or sample size.

To be able to apply, we shall follow the steps below.

1. Get the midpoint of each class


2. Multiply each midpoint by its corresponding frequency
3. Get the sum of the products
4. divide the sum obtained by the total number of frequency. The result shall be rounded
off to two decimal places.

Example: Consider the frequency distribution of the examination scores of the sixty students in a
statistics class. Compute the value of the mean.

Solution:

Step 1. Get the midpoint of each class.


Step 2. Multiply each midpoint by its corresponding frequency.
Step 3. Get the sum of the products in step 2.

f
Classes x (midpoint) fx
(frequency)
11-22 3 16.5 49.5
23-34 5 28.5 142.5
35-46 11 40.5 445.5
47-58 19 52.5 997.5
59-70 14 64.5 903
71-82 6 76.5 459
83-94 2 88.5 177
n = 60 ∑ == 3174

Step 4. Divide the result in step 3 by the sample size. The result is the mean of the distribution.
Hence


̅

Final Module 2 – Mathematics in the Modern World 1


Seatwork/: Consider the frequency distribution of the ages of 75 mayors. Compute the mean of
the age of the mayor.

f
Classes x (midpoint) fx
(frequency)
25-30 3 27.5 82.5
31-36 6 33.5 201
37-42 11 39.5 434.5
43-48 27 40.5 1093.5
49-54 16 48.5 776
55-60 7 57.5 402.5
61-66 4 63.5 254
67-72 1 69.5 69.5
75 3313.5

The alternative method of computing the value of the mean for grouped data is the Unit
Deviation Method. Instead of using midpoints, this method uses unit deviation. This method is
usually implemented by considering an arbitrary point as the initial step in approximating the
value of the mean. This point is the midpoint of any class interval. For conventional purposes,
however, the midpoint of the class interval with the highest frequency will be the arbitrary value
and shall be called the assumed mean. The interval containing the assumed mean shall be
referred as to the mean class.

The next step is done by constructing the unit deviation column. This step involves
assigning a deviation of 0 to the assumed class mean and the other class marks with successive
integers. Form example, if the distribution has nine classes and the fifth class interval is the
assumed class mean, then the entries in the unit deviation column shall be -4, -3, -2, -1, 0, 1, 2, 3,
4. However, if assumed class mean is the 4th class interval, then the entries in the unit deviation
column will be -3, -2, -1, 0, 1, 2, 3, 4 and 5 respectively. The unit deviation are usually represented
by d.

The third step is implemented by multiplying the frequencies by their corresponding unit
deviations. The products are added and the sum is divided by the sample size. The result is then
multiplied by the size of the class interval.

Finally, the value of the mean is determined by adding the product to the assumed
mean.

The formula will be as follows.



̅ ( )

Where ̅ - represents the assumed mean


F – the frequency of each class
d – the unit deviation
c – the size of the class interval
n – the sample size

To be able to apply, we shall follow the step below.

1. Choose an assumed mean by getting the midpoint of any interval


2. Construct the unit deviation column.
3. Multiply the frequencies by their corresponding unit deviations. Add the products.
4. divide the sum in step 3 by the sample size
5. Multiply the result in step 4 by the size of the class interval.
6. Add the value obtained in step 5 to the assumed mean. The obtained result which is the
mean should be rounded off to 2 decimal places.

Final Module 2 – Mathematics in the Modern World 2


Example: Compute the value of the mean using the unit deviation methods

Step 1 Choose an assumed mean.


An assumed mean may be the midpoint of the class interval 47-58
Step 2. Construct the unit deviation column.
Step 3. Multiply the frequencies by their corresponding unit deviations. Add the products.

Classes F d Fd
11-22 3 -3 -9
23-34 5 -2 -10
35-46 11 -1 -11 -30
47-58 19 0 0
59-70 14 1 14
71-82 6 2 12
83-94 2 3 6 32
60 2

Step 4, 5, 6. We will now apply in formula.


̅ ( )

̅ ( )

Seatwork: Using the unit deviation method, compute the mean age of the 75 mayor.

Classes F D Fd
25-30 3
31-36 5
37-42 11
43-48 27
49-54 16
55-60 7
61-66 4
67-72 1

MEDIAN

In the process of computing the mean, we observed that ll the values are taken into
consideration. Thus, if a distribution contains extreme values, then the value of the mean usually
pulled either to the right or to the left depending on the position of these extreme values.

We shall now consider a measure of central tendency that does not take into
consideration all the values in the distribution. This measure, called the median is a positional
measure defined as the middlemost value in the distribution. Hence, this value divides a given
set of data into two equal parts.

MEDIAN FOR GROUPED DATA

Just like the mean, the computation of the value of the median is done through
interpolation. The procedure requires the construction of the less than cumulative frequency
column (<cumf)

Final Module 2 – Mathematics in the Modern World 3


The first step in finding the value of the median is to divide the total number of the
frequency by 2. This is consistent with the definition of the median. The value shall be used to
determine the cumulative frequency before the median class denoted by cumf b. Cumfb refers
to the highest value under the <cumf that is less than . The median class refers to the interval
that contains the median, that is, where the value is located. Hence, among the entries
under the <cumf column which are greater the , the smallest shall be the frequency of the
median class. If a distribution contains an interval where the cumulative frequency is exactly ,
then the upper boundary of that class will be the median and no interpolation is needed.

After identifying the median class, we shall approximate the position of the median
within the median class. This approximation shall be done by subtracting the value of cumf,
from . Then the difference is divided by the frequency of the median class times the size of the
class interval. The result is then added to the lower boundary of the median class to get the
median of the distribution.

The computing formula for grouped data is given below.

̃ ( )

where xlb - refers to the lower boundary of the median class.


cumfb - the cumulative frequency before the median class
fm - the frequency of the median class

To be able to apply, we shall follow the step below.

1. Get of the total number of the value.


2. Determine the value of cumfb
3. Determine the median class.
4. Determine the lower boundary and the frequency of the median class and the size of
the class interval.
5. Substitute the value obtained in step 1 – 4. Round off the final result to two decimal
places.

Example 3. Compute the value of the median of the examination scores of the students in
Statistics.

Solution: We shall first construct the less than cumulative frequency column. Using the steps
indicated, we have.
Classes F <cumf
11-22 3 3
23-34 5 8
35-46 11 19 cumfb
47-58 19 fm 38 median
59-70 14 52
71-82 6 58
83-94 2 60

Steps:

1. n/2 = 60/2 = 30
5. ̃ ( )
2. cumfb = 19

̃ ( )
3. median class 47 – 58

̃
4. xlb = 46.5 ; fm = 19; c = 12

Final Module 2 – Mathematics in the Modern World 4


Seatwork:

A researcher is conduction an investigation regarding the income of the alumni of a


certain university 5 years after the graduation. The monthly incomes of the 200 respondents
were taken and are presented below.

Determine the median of the monthly income of the 200 respondents.

Classes F < cum f


3,500 – 4,999 6
5,000 – 6,499 23
6,500 – 7,999 36
8,000 – 9,499 40
9,500 – 10,999 59
11,000 – 12,499 20
12,500 – 13,999 8
14,000 – 15,499 6
15,500 – 16,999 2
200

MODE FOR GROUPED DATA

In the computation of the value of the mode for grouped data, it is necessary to identify
the class interval that contains the mode. This interval, called modal class, containing the
highest frequency in the distribution.

The next step after getting the modal class is to determine the mode within the class. This
value may be approximately by getting the differences of the frequency of the modal class to
the frequency before and to the frequency after the modal class. If we let d 1be the difference
of the frequency of the modal class and the frequency of the interval preceding the modal
class and d2 be the difference of the frequency of the modal class, then the mode within the
class shall be approximately using the expression.

( )

If this expression is added to the lower boundary of the modal class, then we can come
up with the computing formula for the value of the mode for grouped data. The formula is

̂ ( )

To be able to apply, we shall consider the following steps.

1. Determine the modal class


2. Get the vale of d1
3. Get the value of d2
4. Get the lower boundary of the modal class.
5. Apply the formula by substituting the value obtained in the preceding steps.

Final Module 2 – Mathematics in the Modern World 5


Example 2. consider the frequency of the examination scores of sixty (60) students. Compute
the mode of that distribution

Solution: The frequency distribution of the data is reproduced below. To compute the mode, we
have

Classes f
11-22 3
23-34 5
35-46 11
47-58 19 modal class
59-70 14
71-82 6
83-94 2

To get the value of d1 and d2 we have

d1 = 19 – 11 = 8
d2 = 19 – 14 = 5

Substitute these values, we have

̂ ( )

̂ ( )

Final Module 2 – Mathematics in the Modern World 6

You might also like