Organizing and Displaying of Data
Zafar Iqbal (BSN,MPH,MSN*)
Lecturer(INS-KMU)
Presentation of Data
Statistical data are generally presented by:
Tables
- Frequency table
- Cross tabulation
Graphs
- Qualitative data
- Quantitative data
Classification, Summarization &
Organization of Qualitative Data
________________________________________________________________
Response Frequency Relative
Frequency_
Tally Number of
Marks Patients Proportion
Never II 2 2/20=0.10
Few Times III 3 3/20=0.15
Often IIIIIIII 9 9/20=0.45
Always IIIII 6 6/20=0.30
_________________________________________________________________
Total 20 1.00
Note: Relative frequency can also be presented in times of percentage by
multiplying 100.
3
Classification, Summarization & Organization of
Quantitative Data
Number of Cumulative
Living children Tally Frequency Frequency
0 II 2 2
1 III 3 5
2 IIII 5 10
3 IIII 5 15
4 IIIII 6 21
5 IIII 4 25
6 II 2 27
7 II 2 29
8 I 1 30
_______________________________________________
Total 30
4
Frequency Distribution Table
• A frequency distribution is a table that includes a set of
intervals and displays the number of measurement in each
interval.
• A Tabular summary of a set of data showing the frequency (or
number) of items in each of several non overlapping (with each
data value belonging to one and only one group) groups
Definitions
Class
• One of the categories in which qualitative data can be classified.
• A range of value established to divide quantitative data in to
classes.
Class Frequency
• Number of observations in a data set falling into a particular class.
Tally Marks
• Used to count the number of data items associated with each group.
Conti….
Class Relative Frequency
• Class frequency divided by the total number of observations in the data set.
Relative Frequency = Frequency
Total observations
Cumulative Frequency
• Number of observation in a data set falling below or above particular class inclusive of that particular
class
Relative Cumulative Frequency
• Cumulative frequency divided by the total number of observations in the data set.
Relative cumulative Frequency = Cumulative Frequency
Total observation
Example 1
• A group of Post RN BScN students at the Aga Khan
University School of Nursing conducted a baseline
sample survey at a community for the Emergency
obstetric care project. As the baseline information, the
students also asked about the number of living children
per women (15-49 years). The following data has been
collected based on a random sample of n=30 woman.
2,2,5,3,0,1,3,2,3,4,1,3,4,5,7,
3,2,4,1,0,5,8,6,5,4, 2,4,4,7,6
• Arrange the data in an ascending order
• Find the range of the data;
Range = Maximum – Minimum
8
Frequency Distribution Table
Class Frequency Relative Cumulative Cumulative Rel.
(Number of Frequency Frequency Frequency
Living
Children)
0 2 2/30=0.067 2 2/30 = 0.067
1 3 3/30=0.10 5 5/30= 0.167
2 5 5/30=0.167 10 10/30 = 0.33
3 5 5/30=0.167 15 15/30=0.50
4 6 6/30=0.20 21 21/30=0.70
5 4 4/30=0.133 25 25/30=0.833
6 2 2/30=0.067 27 27/30=0.90
7 2 2/30=0.067 29 29/30 =0.967
8 1 1/30=0.03 30 30/30=1.0
9
Example 2
Suppose we need to construct a similar frequency table
for age of Diabetic patients coming to a clinic. The
following data has been collected based on random
sample of n = 30.
The measurements are:
42,38,51,53,40,68,62,36,32,45,51,67,53,59,47,63,52,64,61,43,
56,58,66,54,56,52,40,55,71,69
Arrange the data in ascending order
32,36,38,40,40,42,43,45,47,51,51,52,52,53,53,54,55,56,56,58,
59,61,62,63,64,66,67,68,69,71
Find the range of the data Range = Max –Min
Range = 71-32 = 39
10
Decide the number of classes
The number of classes or intervals depends on the
number of observations but in general should
range from 5 to 15.
Calculate width of classes
Approximate Class Width = Largest data value – Smallest data
value
Number of Classes
Width = range / no. of classes
Width = 39/8 = 4.8 = 5
11
Frequency Distribution Table
Class Frequency Relative Cumulative Cumulative Rel.
(Number of Frequency Frequency Frequency
Living
Children)
32-37 2 2/30=0.067 2 2/30 = 0.067
37-42 3 3/30=0.100 5 5/30= 0.167
42-47 3 3/30=0.100 8 8/30 = 0.267
47-52 3 3/30=0.100 11 11/30=0.367
52-57 8 8/30=0.267 19 19/30=0.633
57-62 3 3/30=0.100 22 22/30=0.733
62-67 4 4/30=0.134 26 26/30=0.867
67-72 4 4/30=0.134 30 30/30 =1.00
Total 30
12
Cross tabulation (or crosstabs for short)
• It is a statistical process that summarizes categorical data to
create a contingency table
• They provide a basic picture of the interrelation between two
variables and can help find interactions between them.
Sample # Gender Handedness
1 Female Right-handed
2 Male Left-handed
3 Female Right-handed
4 Male Right-handed
5 Male Left-handed
6 Male Right-handed
7 Female Right-handed
8 Female Left-handed
9 Male Right-handed
10 Female Right-handed
14
Cross-Tabulation
• Contingency table
Left handed Right handed total
Males 2 3 5
Females 1 4 5
total 3 7 10
16
Graphical representation of data
• It gives the reader a nice overview of the essential features of
the data. Graphs are designed to give an intuitive feeling of the
data at a glace.
• Therefore graphs:
• Should be self explanatory
• Must have title
• Must have labeled axis
• Should mention unit of observation
• Should be simple & clean
17
Graphical Presentation
• Graphs often make it easier to see certain characteristics and
trends in a set of data.
• Graphs for quantitative data.
• Histogram
• Frequency Polygon
• Stem and Leaf
• Graphs for qualitative data.
• Bar Chart
• Pie Chart
18
Histogram
• Most common graph
• Graphical representation of frequency table
• Consists of a number of bars placed side by side.
• The width of each bar indicates the interval size.
• The height of each bar indicates the frequency of the interval.
• There are no gaps between adjacent bars.
• Continuous nature of quantitative data.
19
Shapes of Histograms
20
Frequency polygon
• Uses the same axis as histogram
• Constructed by making a point (at the same height as the
histogram’s bar) at the mid point of the class interval
• These points care connected by a straight lines.
• Superior to histograms in providing means of comparing
two frequency distribution.
• They should only be used to graph continuous
quantitative data and never discrete or qualitative data.
21
Advantages of polygons:
• The frequency polygon is simpler than its histogram counterpart.
• It sketches an outline of the data more clearly.
• The polygon becomes increasingly smooth and curve like as we
increase the number of classes and the number of observations.
Cumulative
frequency polygon
•Horizontal axis is the same as that is used for
histogram, the vertical scale indicates
cumulative frequency or cumulative relative
frequency
•To construct place a point at the upper class
boundary of each class interval. Each point
represents cumulative relative frequency. The
graph is completed by joining the points.
23
Tree and leaf or stem and leaf
• Used to show directly quantitative data.
• We organize data determining the number of divisions
(5-15).
• We plot a vertical line and put the first digit of category
to the left of the line (tree) and the second digit to the
right of the vertical line (leafs).
Bar chart
• Useful for displaying nominal or ordinal data
• Bar graphs have spaces between adjacent bars to represent distinct
categories
• Categories are represented along the horizontal axis.
• The height of each bar is equal to the frequency of item for that category.
26
MULTIPLE BAR CHART (VERTICAL)
60
50
ASCITES
40
30
Ascites
20
Yes
10 No
Male Female
GENDER
27
BAR CHART (HORIZONTAL)
Figure 2.2 Bar graph showing the number of students of each category 28
SLIDING BAR CHART
29
Pie chart
• Common devise for displaying data arranged in categories
• A circle is divided into wedges that corresponds to the
percentage frequencies of the distribution
• Useful for variables with small number of categories
30
Conti…
There are some other graphs that are used for presentation of
Quantitative data:
• Dot Plot
• Scattered Dot Plot
• Box Plot
DOT PLOT
A
A
15
Frequency A
10
A A
A A A A
5
A A
A A
A A A A A A
A A A A AA A A A A A A A A A AA A A
0
20 40 60 80
Age
32
SCATTER PLOT
80
70
60
50
40
Age
30
40 50 60 70 80 90
Weight
33
Thank you