DATA COLLECTION
HOW TO ORGANIZE THE DATA
1) POPULATION – SAMPLE
2) SIMPLE SERIES – CLASSIFIED SERIES – GROUPED DATA
DEFINITION NOTATION
Size Population -> N
Sample -> n
Mean Population -> 𝜇 (mü)
Sample -> 𝑥̅ (x bar)
Variance Population -> 𝜎 2 (sigma squared)
Sample -> 𝑠 2
Standard deviation Population -> 𝜎 (sigma)
Sample -> s
Proportion (ratio) Population -> 𝜋
Sample -> 𝑝̅ (p bar)
Simple Series:
x
10
13
12
35
30
25
20
Classified Series:
X FREQUENCY
0 10
3 15
6 5
8 10
n=40
Grouped Data:
Groups FREQUENCY
0–5 10
5 – 10 15
10 -15 5
15- 20 10
20 - 25 35
N=75
Descriptive Statistics:
1) Measures of Central Tendency (Merkezi Eğilim Ölçüleri)
2) Measures of Dispersion (Değişkenlik Ölçüleri)
3) Ratios (Oranlar)
1) Measures of Central Tendency:
a) Mode (Mod): Mode of a set of observations is the value that occurs most frequently
Simple Series: n=7
X
10
13
13
10
13
10
Find the mode! (Sample-Simple)
Mode = 10 and 13
Classified Series: n=40
X FREQUENCY
0 12
3 12
6 12
8 12
Find the mode! (Sample – Classified)
Mode= No mode
Grouped Data: n=75
Groups FREQUENCY
0–5 9
5 – 10 10
10 -15 5
15- 20 6
20 - 25 15
Calculate the mode! (Sample – Grouped Data)
∆1
Mode = 𝐿+( ).𝑐
∆1 +∆2
L: Lowest limit value of the mode group
∆1 : The difference between the frequencies of the mode group and the previous group
∆2 : The difference between the frequencies of the mode group and the later group
C : The group width
∆1 15−6
Mode = 𝐿+( ) . 𝑐 = 20 + ( .5 = 21.87
∆1 +∆2 15−6)+(15−0)
b) Median (Medyan): Median of a set of observations is the middle one if the number of
observations is odd and the average of the middle pair if their number is even when these
observations are arranged in increasing order.
Simple Series: n=6
X
10
13
13
10
13
10
X
10
10
10
13
13
13
Find the median! (Sample – Simple)
10+13
Median = 2
= 11.5
X
10
12
15
10
13
9
X
6
9
10
10
12
13
15
N=7
Find the median! (Population – simple)
Median = 10
Classified Series: n=18
X FREQUENCY
0 4
6 3
3 5
8 6
X FREQUENCY
0 4
3 5
6 3
8 6
X
0
0
0
0
3
3
3
3
3
6
6
6
8
8
8
8
8
8
Find the median! (Sample – Classified)
3+6
Median = 2
= 4.5
Classified Series: N=100
X FREQUENCY
5 35
3 10
7 25
10 30
Find the median! (Population – Classified)
X FREQUENCY
3 10
5 35
7 25
10 30
7+7
Median= 2
=7
Classified Series: N=61
X FREQUENCY
15 13
25 12
30 24
50 12
Find the median! (Population – Classified)
Median = 31. Observation = 30
Grouped Data: n=50
Groups FREQUENCY
0–5 9
5 – 10 10
10 -15 10
15- 20 6
20 - 25 15
. 9 observations
. 10 observations
. 25 th
. 26 th
Calculate the median! (Sample – Grouped)
𝑛 𝑐 50 5
Median = 𝐿 + [2 − (∑ 𝑓)𝑚 ]. 𝑓 = 10 + [ 2 − 19] . 10 = 13
𝑚𝑒𝑑𝑖𝑎𝑛
L: Lowest limit value of the median group
C : The group width
𝑓𝑚𝑒𝑑𝑖𝑎𝑛 : the frequency of the median group
(∑ 𝑓)𝑚 : The sum of the frequencies up to median group
Grouped Data: N=80
Groups FREQUENCY
0 – 10 10
10 – 20 5
20 – 30 15
30 – 40 30
40 – 50 20
Find the median! (Population – Grouped)
𝑁 𝑐 80 10
Median = 𝐿 + [ 2 − (∑ 𝑓)𝑚 ]. 𝑓 = 30 + [ 2 − 30] . 30 =
𝑚𝑒𝑑𝑖𝑎𝑛
33.33
40th and 41. Observations
c) Mean (Aritmetik Ortalama): Mean of a set of numerical observations is the sum of the set
divided by the number of observations, that is, their average.
X
10
12
15
10
13
9
N=6
Calculate the mean! (Population – Simple)
∑𝑁
𝑖=1 𝑥𝑖 10 + 12 + 15 + 10 + 13 + 9
𝜇= = = 11.5
𝑁 6
n=6
Calculate the mean! (Sample – Simple)
∑𝑛𝑖=1 𝑥𝑖 10 + 12 + 15 + 10 + 13 + 9
𝑥̅ = = = 11.5
𝑛 6
X FREQUENCY
15 13
25 12
30 24
50 12
N=61
Calculate the mean! (Population – Classified)
∑𝑘𝑖=1 𝑥𝑖 . 𝑓𝑖
𝜇=
∑𝑘𝑖=1 𝑓𝑖
15 + 15 + 15 + ⋯ + 25 + 25 + 25 + ⋯ + 30 + 30 + 30 + ⋯ + 50 + 50 + 50 + ⋯
=
61
15.13 + 25.12 + 30.24 + 50.12
= = 29.75
61
n=61
Calculate the mean! (Sample – Classified)
∑𝑘𝑖=1 𝑥𝑖 . 𝑓𝑖 15.13 + 25.12 + 30.24 + 50.12
𝑥̅ = = = 29.75
∑𝑘𝑖=1 𝑓𝑖 61
MIDPOINTS (M) Groups FREQUENCY
12.5 10– 15 10
17.5 15 – 20 5
22.5 20 – 25 15
27.5 25 – 30 30
32.5 30 – 35 20
Variable= years spent in work
N=80
Calculate the mean! (Population – Grouped data)
∑𝑘𝑖=1 𝑚𝑖 . 𝑓𝑖 5.10 + 15.5 + 25.15 + 35.30 + 45.20
𝜇= = = 30.62
∑𝑘𝑖=1 𝑓𝑖 80
n=80
Calculate the mean! (Sample – Grouped Data)
∑𝑘𝑖=1 𝑚𝑖 . 𝑓𝑖 5.10 + 15.5 + 25.15 + 35.30 + 45.20
𝑥̅ = = = 30.62
∑𝑘𝑖=1 𝑓𝑖 80
Variable -> Years spent in working
Sample 1: n=5
9, 10, 11, 8, 12 -> 8 9 10 11 12
∑𝑛𝑖=1 𝑥𝑖 9 + 10 + 11 + 8 + 12
𝑥̅ = = = 10
𝑛 5
Median = 10
Sample 2: n=5
7, 8, 10, 12, 13
∑𝑛𝑖=1 𝑥𝑖 7 + 8 + 10 + 12 + 13
𝑥̅ = = = 10
𝑛 5
Median = 10
10 10
2) Measures of Dispersion:
a) Range: Range of a set of observations is the difference between the largest and
smallest observations.
Sample 1 -> Range= X max- X min= 12-8=4 years
Sample 2 -> Range= X max- X min= 13-7=6 years
X FREQUENCY
15 13
25 12
30 24
50 12
N=61 (n=61)
Calculate the range! (Population (sample) – Classified)
Range = X max – X min = 50 – 15 = 35
MIDPOINTS Groups FREQUENCY
5 0 – 10 10
10 – 20 5
20 – 30 15
30 – 40 30
45 40 – 50 20
n=80
Calculate the range! (Sample – Grouped Data)
Range = X max – X min = 45 – 5 = 40
Sample 2: n=5
7, 8, 10, 12, 13
∑𝑛𝑖=1 𝑥𝑖 7 + 8 + 10 + 12 + 13
𝑥̅ = = = 10
𝑛 5
(7 − 10)2 = 9
(8 − 10)2 = 4
(10 − 10)2 = 0
(12 − 10)2 = 4
(13 − 10)2 = 9
SUM = 26/5=5.2 -> variance
√𝟓. 𝟐 = 𝟐. 𝟐𝟖 -> standard deviation
Sample 1: n=5
9, 10, 11, 8, 12
(9 − 10)2 = 1
(10 − 10)2 = 0
(11 − 10)2 = 1
(8 − 10)2 = 4
(12 − 10)2 = 4
SUM = 10/5=2 -> variance
√𝟐 = 𝟏. 𝟒𝟏 -> standard deviation
Population 1 -> N= 10 Sum = 15 / 10 = 1.5 on average per unit
Population 2 -> N= 20 Sum = 20 / 20 = 1 on average per unit
b) Mean Absolute Deviation:
X
10
12
15
10
13
9
𝜇 = 11.5
N=6
Calculate the mean absolute deviation! (Population – Simple)
∑𝑁𝑖=1|𝑥𝑖 − 𝜇|
𝑀𝐴𝐷 =
𝑁
|10 − 11.5| + |12 − 11.5| + |15 − 11.5| + |10 − 11.5| + |13 − 11.5| + |9 − 11.5|
=
6
= 1.83
X
10
12
15
10
13
9
𝑥̅ = 11.5
n=6
Calculate the mean absolute deviation! (Sample – Simple)
∑𝑛𝑖=1|𝑥𝑖 − 𝑥̅ |
𝑀𝐴𝐷 =
𝑛
|10 − 11.5| + |12 − 11.5| + |15 − 11.5| + |10 − 11.5| + |13 − 11.5| + |9 − 11.5|
=
6
= 1.83
X FREQUENCY
15 13
25 12
30 24
50 12
N=61
Calculate the mean absolute deviation! (Population – Classified)
𝜇 = 29.75
∑𝑘𝑖=1|𝑥𝑖 − 𝜇|. 𝑓𝑖
𝑀𝐴𝐷 =
∑𝑘𝑖=1 𝑓𝑖
|15 − 29.75|. 13 + |25 − 29.75|. 12 + |30 − 29.75|. 24 + |50 − 29.75|. 12
=
61
= 8.15
X FREQUENCY
15 13
25 12
30 24
50 12
n=61
Calculate the mean absolute deviation! (Sample – Classified)
𝑥̅ = 29.75
∑𝑘𝑖=1|𝑥𝑖 − 𝑥̅ |. 𝑓𝑖
𝑀𝐴𝐷 =
∑𝑘𝑖=1 𝑓𝑖
|15 − 29.75|. 13 + |25 − 29.75|. 12 + |30 − 29.75|. 24 + |50 − 29.75|. 12
=
61
= 8.15
|15 − 29.75| = (29.75 − 15)
MIDPOINTS Groups FREQUENCY
5 0 – 10 10
15 10 – 20 5
25 20 – 30 15
35 30 – 40 30
45 40 – 50 20
N=80
Calculate the mean absolute deviation! (Population – Grouped Data)
𝜇 = 30.62
∑𝑘𝑖=1|𝑚𝑖 − 𝜇|. 𝑓𝑖
𝑀𝐴𝐷 =
∑𝑘𝑖=1 𝑓𝑖
|5 − 30.62|. 10 + |15 − 30.62|. 5 + |25 − 30.62|. 15 + |35 − 30.62|. 30 +
|45 − 30.62|. 20
=
80
= 10.47
MIDPOINTS Groups FREQUENCY
5 0 – 10 10
15 10 – 20 5
25 20 – 30 15
35 30 – 40 30
45 40 – 50 20
n=80
Calculate the mean absolute deviation! (Sample – Grouped Data)
𝑥̅ = 30.62
∑𝑘𝑖=1|𝑚𝑖 − 𝑥̅ |. 𝑓𝑖
𝑀𝐴𝐷 =
∑𝑘𝑖=1 𝑓𝑖
|5 − 30.62|. 10 + |15 − 30.62|. 5 + |25 − 30.62|. 15 + |35 − 30.62|. 30 +
|45 − 30.62|. 20
=
80
= 10.47
c) Variance and Standard Deviation:
X
10
12
15
10
13
9
𝜇 = 11.5
N=6
Calculate the variance and standard deviation! (Population – Simple)
(10−11.5)2 +(12−11.5)2 +(15−11.5)2 +(10−11.5)2 +(13−11.5)2
∑𝑁
𝑖=1(𝑥𝑖 −𝜇)
2
√ +(9−11.5)2
𝜎=√ = =
𝑁 6
2.06 -> standard deviation
𝜎 2 = 2.062 = 4.25 -> variance
X
10
12
15
10
13
9
𝑥̅ = 11.5
n=6
Calculate the variance and standard deviation! (Sample – Simple)
(10−11.5)2 +(12−11.5)2 +(15−11.5)2 +(10−11.5)2 +(13−11.5)2
∑𝑛
𝑖=1(𝑥𝑖 −𝑥̅ )
2
√ +(9−11.5)2
𝑠=√ = =
𝑛−1 6−1
2.25 -> standard deviation
𝑠 2 = 2.252 = 5.1 -> variance
X FREQUENCY
15 13
25 12
30 24
50 12
N=61
Calculate the variance and standard deviation! (Population – Classified)
𝜇 = 29.75
∑𝑘 2
𝑖=1(𝑥𝑖 −𝜇) .𝑓𝑖
𝜎=√ ∑𝑘
=
𝑖=1 𝑓𝑖
(15−29.75)2 .13+(25−29.75)2 .12+(30−29.75)2 .24+(50−29.75)2 .12
√ = 36.56 -> standard
6
deviation
𝜎 2 = 36.562 = 1336.88 -> variance
X FREQUENCY
15 13
25 12
30 24
50 12
n=61
Calculate the variance and standard deviation! (Sample – Classified)
𝑥̅ = 29.75
∑𝑘𝑖=1(𝑥𝑖 − 𝑥̅ )2 . 𝑓𝑖
𝑠=√
∑𝑘𝑖=1 𝑓𝑖 − 1
(15 − 29.75)2 . 13 + (25 − 29.75)2 . 12 + (30 − 29.75)2 . 24 + (50 − 29.75)2 . 12
√
=
6−1
= 40.05
𝑠 2 = 40.052 = 1604 -> variance
MIDPOINTS Groups FREQUENCY
5 0 – 10 10
15 10 – 20 5
25 20 – 30 15
35 30 – 40 30
45 40 – 50 20
N=80
Calculate the variance and standard deviation! (Population – Grouped Data)
𝜇 = 30.62
∑𝑘𝑖=1(𝑚𝑖 − 𝜇)2 . 𝑓𝑖
𝜎=√
∑𝑘𝑖=1 𝑓𝑖
(5 − 30.62)2 . 10 + (15 − 30.62)2 . 5 + (25 − 30.62)2 . 15 + (35 − 30.62)2 . 30
√ +(45 − 30.62)2 . 20
=
80
= 12.73
𝜎 2 = 12.732 = 162.10 -> variance
MIDPOINTS Groups FREQUENCY
5 0 – 10 10
15 10 – 20 5
25 20 – 30 15
35 30 – 40 30
45 40 – 50 20
n=80
Calculate the variance and standard deviation! (Sample – Grouped Data)
𝑥̅ = 30.62
∑𝑘𝑖=1(𝑚𝑖 − 𝑥̅ )2 . 𝑓𝑖
𝑠=√
∑𝑘𝑖=1 𝑓𝑖 − 1
(5 − 30.62)2 . 10 + (15 − 30.62)2 . 5 + (25 − 30.62)2 . 15 + (35 − 30.62)2 . 30
√ +(45 − 30.62)2 . 20
=
80 − 1
= 12.81
𝑠 2 = 12.812 = 164.16 -> variance