
CHAPTER - SIX

THE CHI-SQUARE DISTRIBUTION


OBJECTIVES:
The aim of this unit is to provide the learner with the basic applications of the Chi-Square
distribution in the analysis of frequencies.

At the end of the unit, the reader is expected to:


 Perform the Chi-Square test of association (independence)

 Conduct the Chi-Square test of goodness of fit to verify whether a distribution fitted to a
set of observations is appropriate or not.
1.1 INTRODUCTION

Most research problems call for determining whether an association or interdependence exists
between two or more variables. To this end, statistical methods help in measuring the
relationship between variables. One such method is the Chi-Square test of independence
(association). This method is applied when we have two variables, and it is used to detect the
existence or non-existence of association between them. It should be noted, however, that the
Chi-Square test of association does not measure the degree of association or relationship. The
Chi-Square distribution is also used in evaluating the goodness of fit of a distribution to a
given set of data. This kind of test is referred to as the Chi-Square test of goodness of fit,
and it is of great importance in statistics.

1.2 THE CHI-SQUARE DISTRIBUTION

The chi-square (χ²) distribution is obtained from the values of the ratio of the sample variance
to the population variance multiplied by the degrees of freedom. That is, when the population is
normally distributed with population variance σ², the quantity (n − 1)S²/σ² follows a chi-square
distribution with n − 1 degrees of freedom.
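As an illustrative sketch (not part of the textbook derivation, and assuming NumPy is available), this claim can be checked by simulation: the ratio (n − 1)S²/σ² computed from many normal samples should behave like a chi-square variable with n − 1 degrees of freedom, whose mean is n − 1 and whose variance is 2(n − 1).

```python
import numpy as np

rng = np.random.default_rng(0)
n, sigma2 = 10, 4.0

# Draw many samples of size n from a normal population with variance sigma^2
samples = rng.normal(loc=5.0, scale=np.sqrt(sigma2), size=(100_000, n))

# Form the ratio (n - 1) * S^2 / sigma^2 for each sample
ratios = (n - 1) * samples.var(axis=1, ddof=1) / sigma2

# A chi-square variable with n - 1 = 9 degrees of freedom has
# mean 9 and variance 18; the simulated values should be close.
print(round(ratios.mean(), 1), round(ratios.var(), 1))
```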

Properties of the Chi-Square

 Chi-square is non-negative. It is the ratio of two non-negative values, and therefore must be
non-negative itself.
 Chi-square is non-symmetric.
 There are many different chi-square distributions, one for each number of degrees of freedom.
 The number of degrees of freedom when working with a single population variance is n − 1.

The Chi-Square distribution will be used in investigating whether the expected frequencies are
significantly different from the observed frequencies obtained from the sample.
There are three cases in which we apply this kind of test.
i) Test of goodness of fit: testing whether a given model is acceptable or not.
ii) Chi-Square test of independence: testing whether two attributes are associated or not.
iii) Chi-Square test of homogeneity: testing whether several populations are homogeneous with
respect to a certain classification.
THE CHI-SQUARE (χ²) TEST:
In most statistical tests, our decisions are based on the assumption that the population is
normally distributed. When this assumption about the population cannot be made, it is necessary
to use the chi-square (χ²) test. This test is suitable for the nominal or ordinal scale of
measurement. The nominal scale of measurement deals with data that can only be classified into
categories, such as male and female, or freshmen, juniors and seniors, and so on. There is no
particular order to these groupings, and they are mutually exclusive, so that an item in one
category is not included in another category. The ordinal scale of measurement assigns different
ranks to the categories: one category may be superior in standing, another may be good or fair,
and so on. The χ² test is used for analyzing qualitative variables such as opinions of persons,
religious affiliations, smoking habits, etc. It deals with judgments about proportions of two or
more populations.

Properties of the chi-square distribution

1- It involves squared observations and hence is always positive, i.e., greater than or equal to
zero.
2- The distribution is not symmetrical. It is skewed to the right, so its skewness is positive.
However, as the number of degrees of freedom increases, the chi-square distribution approaches a
symmetrical distribution.
3- Similar to the t-distribution, there is a family of chi-square distributions: a particular
distribution for each number of degrees of freedom.
The number of degrees of freedom of the χ²-distribution is determined by the number of categories
in which the various attributes of the sample are placed: if there are k categories, the number
of degrees of freedom (df) is (k − 1). For two or more independent classifications (as in a
contingency table), the df is (k − 1)(r − 1), where r is the number of rows and k the number of
columns. For example, if a sample of 100 students were categorized as freshmen, sophomores,
juniors and seniors, then there are four categories and k is 4.

So the degrees of freedom df = k − 1 = 3.


The following illustration shows the family of χ² curves with varying degrees of freedom; it can
be seen that as the number of degrees of freedom increases, the χ² distribution approaches the
normal curve.

The χ² test is used to test whether there is a significant difference between the observed number
of responses in each category and the expected number of responses for that category under the
assumption of the null hypothesis. In other words, the objective is to find how well the
distribution of observed frequencies (fo) fits the distribution of expected frequencies (fe).
Hence this test is also called the goodness-of-fit test.

Example:
Find the critical value of χ² from the table of the χ²-distribution if the level of significance
α is 0.05 and the degrees of freedom is 2.

Answer: χ²0.05(2) = 5.991
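The same critical value can also be obtained programmatically; the sketch below uses SciPy's chi2 distribution object (an assumption that SciPy is available) rather than a printed table.

```python
from scipy.stats import chi2

alpha = 0.05   # level of significance
df = 2         # degrees of freedom

# The critical value leaves an area of alpha in the right tail,
# so it is the (1 - alpha) quantile of the chi-square distribution.
critical = chi2.ppf(1 - alpha, df)
print(round(critical, 3))  # 5.991
```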

1.3 CHI-SQUARE TEST OF INDEPENDENCE (ASSOCIATION)

The chi-square test of independence is a bivariate statistical technique that is used to detect the
existence of association between two attributes or variables. It is of great interest to know the
existence of association between variables. Some examples of such variables are the following.
 Type of school to which families send their children and income of families.
 Religion and family size.
 Time studied and exam result.
 Demand and price of a commodity.
 Age and number of teeth.
It is one of the most commonly applied statistical techniques in research conducted in different
disciplines.
Suppose that we have two attributes (characteristics) say A and B. We want to test the hypothesis
H0: There is no association between attribute A and attribute B.
Versus the alternative
HA: There is association between attribute A and attribute B.
In the test for independence, the claim is that the row and column variables are independent of
each other. This is the null hypothesis. The test statistic used is the same as in the chi-square
goodness-of-fit test, and the principle behind the test for independence is the same as the
principle behind the goodness-of-fit test. The test for independence is always a right-tail test.
If attribute A has r categories (levels) and attribute B has c categories (levels), then the table in
which the two attributes (variables) are cross classified contains r rows and c columns. The table
has rc cells. This table is usually referred to as rxc (r by c) contingency table.
Now suppose a sample of size n is taken and cross-classified. Let Oij denote the observed
frequency of the ith category (level) of A and the jth category (level) of B. Recall that our
interest is to test the null hypothesis that there is no association between the two attributes A
and B. The test statistic to be used is:

χ² = Σᵢ₌₁ʳ Σⱼ₌₁ᶜ (Oij − eij)² / eij

where eij = n P{Ai ∩ Bj}, and P{Ai ∩ Bj} is the probability of cell (i, j).

If the null hypothesis is true, then

eij = n P{Ai ∩ Bj} = n P(Ai) P(Bj) = n (Oi./n)(O.j/n) = Oi. O.j / n

where Oi. is the total frequency of the ith row and O.j is the total frequency of the jth column.

The above test statistic has a chi-square distribution with (r − 1)(c − 1) degrees of freedom.
The rejection criterion is:

χ² > χ²α[(r − 1)(c − 1)]

The multiplication rule says that if two events are independent, then the probability of both
occurring is the product of the probabilities of each occurring. This is key to working the test
for independence. If you end up rejecting the null hypothesis, then the assumption must have been
wrong, and the row and column variables are dependent. Remember, all hypothesis testing is done
under the assumption that the null hypothesis is true.

Example: To test the hypothesis that color of eye and color of hair are associated, data on color
of eye and color of hair for 6,800 individuals were compiled.

                    Hair
           Fair   Brown   Black   Red
Eye Blue   1768    808     190     47
    Green   946   1387     746     43
    Brown   115    444     288     18
Test whether there is association between the two attributes at 1% level of significance.
Solution:
The hypothesis we want to test is:
H0: There is no association between color of eye and color of hair.
HA: There is association between color of eye and color of hair.

                    Hair
           Fair   Brown   Black   Red    Oi.
Eye Blue   1768    808     190     47    2813
    Green   946   1387     746     43    3122
    Brown   115    444     288     18     865
    O.j    2829   2639    1224    108    O.. = 6800
The values of eij for the different combinations of i and j are calculated using the formula
eij = Oi. O.j / n and presented in the following table.

eij       1         2        3       4
1     1170.29   1091.69   506.34   44.68
2     1298.84   1211.61   561.96   49.58
3      359.87    335.70   155.70   13.74

χ² = Σᵢ₌₁³ Σⱼ₌₁⁴ (Oij − eij)²/eij
   = (1768 − 1170.29)²/1170.29 + (808 − 1091.69)²/1091.69 + ... + (18 − 13.74)²/13.74
   = 1074.43
At the 1% level, the rejection region is χ² > χ²0.01[(3 − 1)(4 − 1)] = 16.81. Since the calculated
value, 1074.43, is greater than the tabulated value, we reject the null hypothesis and conclude
that there is association between the two attributes, eye color and hair color.
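As an illustrative sketch of how this computation might be checked in software (assuming SciPy is available), chi2_contingency carries out exactly the row-total-times-column-total expected-frequency calculation shown above:

```python
from scipy.stats import chi2, chi2_contingency

# Observed eye-color (rows) by hair-color (columns) frequencies
observed = [
    [1768,  808, 190, 47],   # Blue
    [ 946, 1387, 746, 43],   # Green
    [ 115,  444, 288, 18],   # Brown
]

stat, p, df, expected = chi2_contingency(observed)
critical = chi2.ppf(0.99, df)  # 1% level, df = (3-1)(4-1) = 6

print(f"chi-square = {stat:.2f}, df = {df}, critical = {critical:.2f}")
# The statistic (about 1074) far exceeds the critical value (16.81),
# so the null hypothesis of no association is rejected.
```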

Exercise:
Suppose that 500 university students were randomly selected and classified by year and
smoking habit.
Smoking habit
Year Non-Smokers Casual-Smokers Heavy-Smokers
Freshman 90 42 22
Sophomore 65 37 36
Junior 45 28 30
Senior 25 43 37

Test whether the two attributes, year and smoking habit, are related (associated) or not.
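A hedged sketch of how an answer to this exercise could be checked (again assuming SciPy is available); the conclusion is left to the reader:

```python
from scipy.stats import chi2_contingency

# Year (rows) by smoking habit (columns)
observed = [
    [90, 42, 22],   # Freshman
    [65, 37, 36],   # Sophomore
    [45, 28, 30],   # Junior
    [25, 43, 37],   # Senior
]

stat, p, df, expected = chi2_contingency(observed)
print(f"chi-square = {stat:.2f}, df = {df}, p-value = {p:.4f}")
# Compare the statistic with the tabulated value chi-square_alpha(df)
# at your chosen level of significance.
```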

TEST OF ASSOCIATION OF ATTRIBUTES (TEST OF INDEPENDENCE)


In real-life situations, sometimes our interest may be to determine whether two classifications
or variables are dependent or independent. For instance, we may be interested to know whether the
qualification of employees and their salary are dependent or not, or whether advertising expense
and sales of a company are dependent or not. In such cases, we apply tests of independence.

Suppose we have two classifications: classification 1 consisting of r categories and
classification 2 consisting of s categories. Take a random sample and classify each item into one
of the r × s categories, called cells.

Example 20.9
Suppose we are interested to check whether qualification and salary of employees are dependent or
not. Then we may classify qualification into three categories (r = 3): 12 complete, Diploma
holder, and First Degree or higher.

We may also classify salary into three categories (s = 3): less than 200 Birr, 200 up to 499
Birr, and 500 Birr or more. Then we randomly select employees and classify them into one of the
r × s = 3 × 3 = 9 categories (cells). Suppose a random sample of 80 employees is taken and the
following result is obtained.

                          Qualification
Salary             12 complete   Diploma holder   1st degree or higher   Row total
< 200 Birr             10              2                   0                12
200 – 499 Birr         16             20                   2                38
500 Birr or more        6              2                  22                30
Column Total           32             24                  24                80

Notation: Ors = observed frequency of the rth row and sth column.
          Ers = expected frequency of the rth row and sth column.
Ers is computed as:

Ers = (rth row total) × (sth column total) / overall total

Consider the above example:

O11 = 10 and E11 = (1st row total) × (1st column total) / overall total = (12 × 32)/80 = 4.8
O12 = 2  and E12 = (1st row total) × (2nd column total) / overall total = (12 × 24)/80 = 3.6
O13 = 0  and E13 = (1st row total) × (3rd column total) / overall total = (12 × 24)/80 = 3.6
⋮
O33 = 22 and E33 = (3rd row total) × (3rd column total) / overall total = (30 × 24)/80 = 9
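The expected-frequency formula above is just an outer product of the marginal totals divided by the overall total; a minimal sketch of computing every cell at once (assuming NumPy is available):

```python
import numpy as np

# Observed frequencies: salary (rows) by qualification (columns)
O = np.array([
    [10,  2,  0],   # < 200 Birr
    [16, 20,  2],   # 200 - 499 Birr
    [ 6,  2, 22],   # 500 Birr or more
])

row_totals = O.sum(axis=1)   # [12, 38, 30]
col_totals = O.sum(axis=0)   # [32, 24, 24]
n = O.sum()                  # 80

# E_rs = (row total * column total) / overall total, for every cell at once
E = np.outer(row_totals, col_totals) / n
print(E[0, 0], E[0, 1], E[2, 2])  # 4.8 3.6 9.0
```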

In tests of independence, the null and alternative hypotheses are of the form:
HO : The two classifications are independent
H1 : The two classifications are dependent.
The null hypothesis can also be written as “ There is no association between the two
classifications”.

The test statistic used to test the hypothesis of independence is called a chi-square test.
A chi-square distribution, denoted by χ², is a continuous probability distribution. Unlike the
normal distribution, the chi-square distribution is asymmetric (not symmetric); it is a
positively (right) skewed distribution and cannot assume a negative value. The χ² values for a
given level of significance (α) and number of degrees of freedom (d.f.) can be read from the
χ²-distribution table.

Notation: χ²α denotes the value of χ² for which the area to its right is α, with a given number
of degrees of freedom.
This is displayed below.

For example, to find χ²0.05(28), look up the value α = 0.05 and, under this value of α, look for
the number of degrees of freedom (d.f.), which is equal to 28, in the chi-square distribution
table. From the chi-square table, this value is 41.337. Similarly, χ²0.01(15) = 30.578.

To test for independence, the critical value is:

χ²α[(r − 1)(s − 1)], i.e., d.f. = (r − 1)(s − 1), where r is the number of rows and s is the
number of columns. To accept or reject the null hypothesis, compare χ²cal with this critical
value χ²α[(r − 1)(s − 1)] (the tabulated value).
The test criterion is to reject HO if:

χ²cal > χ²α[(r − 1)(s − 1)]

Example 20.10
Look at the previous example about qualification and salary.
Test if there is a relationship between qualification and salary at the 5 percent level of
significance.

Solution: -
HO : Qualification and salary are independent
H1 : Qualification and salary are dependent
α = 0.05
Computing for the expected frequencies, we have the following table.

                          Qualification
Salary             12 complete   Diploma holder   1st degree or higher   Row total
< 200 Birr          10 (4.8)        2 (3.6)            0 (3.6)              12
200 – 499 Birr      16 (15.2)      20 (11.4)           2 (11.4)             38
500 Birr or more     6 (12)         2 (9)             22 (9)                30
Column total           32             24                  24                80

The values in brackets are the expected frequencies.


The test statistic is:

χ²cal = Σ (Ors − Ers)² / Ers,   where Ors is the observed and Ers the expected frequency

      = (10 − 4.8)²/4.8 + (2 − 3.6)²/3.6 + (0 − 3.6)²/3.6 + ... + (22 − 9)²/9
      = 51.44

α = 0.05 and d.f. = (r − 1)(s − 1) = (3 − 1)(3 − 1) = 2 × 2 = 4

The critical value is:

χ²α[(r − 1)(s − 1)] = χ²0.05(4) = 9.488 (from the chi-square table)

As χ²cal > χ²[(r − 1)(s − 1)] tabulated, i.e., 51.44 > 9.488, HO is rejected and H1 is accepted;
i.e., salary of employees and their qualification are dependent, that is, they are associated.
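Again as an illustrative sketch (assuming SciPy is available), the whole of this example collapses into one library call:

```python
from scipy.stats import chi2, chi2_contingency

# Salary (rows) by qualification (columns), from Example 20.9
observed = [
    [10,  2,  0],
    [16, 20,  2],
    [ 6,  2, 22],
]

stat, p, df, expected = chi2_contingency(observed)
critical = chi2.ppf(0.95, df)   # alpha = 0.05, df = (3-1)(3-1) = 4

print(f"chi-square = {stat:.2f} vs critical = {critical:.3f}")
# The statistic (about 51.4) exceeds the critical value 9.488,
# so qualification and salary are judged dependent.
```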

Self-Review 9.3
An electronics company wants to check if advertisement has a significant effect on the number of
TV sets that are sold within six months of production. A random sample of 600 TV sets reveals the
following results.

                        Number of TV sets sold   Number of TV sets not
                        within 6 months          sold within 6 months
Before advertisement          150                       150
After advertisement           165                       135

Is the effect of advertisement significant? Use α = 0.05.

1.4 CHI-SQUARE TEST OF GOODNESS OF FIT

The idea behind the chi-square goodness-of-fit test is to see if the sample comes from a
population with the claimed distribution. Another way of looking at it is to ask whether the
frequency distribution fits a specific pattern. Here we want to test whether observed data (a
frequency distribution) are sufficiently close to a theoretical (fitted) distribution.
Suppose that the data are classified into k classes (one-way classification). Let us designate
the expected and observed frequencies of the ith class by ei and Oi, respectively. Expected
frequencies are calculated based on the proposed distribution. Observed frequencies, on the other
hand, are those obtained by observation; they are the sample frequencies.
The test statistic for the goodness-of-fit test is:

χ² = Σᵢ₌₁ᵏ (Oi − ei)² / ei

The test statistic has a chi-square distribution when the following assumptions are met:

 The data are obtained from a random sample.
 The expected frequency of each category must be at least 5, so that the chi-square
approximation to the distribution of the test statistic is adequate.

If the above assumptions are satisfied, the test statistic will have a chi-square distribution
with (k − 1) degrees of freedom if no parameter is estimated in the process.

The idea is that if the observed frequencies are really close to the claimed (expected)
frequencies, then the squared deviations will be small. If the sum of these weighted squared
deviations is small, the observed frequencies are close to the expected frequencies and there is
no reason to reject the claim that the sample came from that distribution. Only when the sum is
large do we have reason to question the distribution. In other words, we reject the null
hypothesis when the value of the calculated test statistic is very large. Therefore, the
chi-square goodness-of-fit test is always a right-tail test.

The rejection region is χ² > χ²α(k − 1).

Example: In an experiment on pea breeding, the following frequencies of seeds were obtained:
316 round and yellow, 102 wrinkled and yellow, 109 round and green, and 33 wrinkled and green.
Theory predicts that the frequencies should be in the proportions 9:3:3:1. Apply the chi-square
goodness-of-fit test to examine the correspondence between theory and practice.

Solution

The hypothesis we want to test is

H0: The proportion of the frequencies in the four classes is 9:3:3:1

HA: Not H0

The test statistic is χ² = Σᵢ₌₁ᵏ (Oi − ei)² / ei.

In the given problem there are four classes. These are:

i=1 ….round and yellow

i=2…..wrinkled and yellow

i=3…..round and green

i=4…..wrinkled and green

The expected frequencies are ei = nPi, where n is the sample size and Pi is the probability of
the ith class. Here n = 316 + 102 + 109 + 33 = 560.

Class   Observed (Oi)   Pi     Expected (ei = nPi)   (Oi − ei)²/ei
1           316         9/16          315                0.0032
2           102         3/16          105                0.0857
3           109         3/16          105                0.1524
4            33         1/16           35                0.1143
Total       560          1            560                0.3556

Thus the calculated test statistic becomes χ² = 0.3556.

Since the calculated value is less than the tabulated value at 5% (χ² = 0.3556 < χ²0.05(3) =
7.81), we accept the null hypothesis and conclude that the observed and expected frequencies are
close to one another.
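A sketch of the same test in software (assuming SciPy is available); scipy.stats.chisquare computes exactly the Σ(Oi − ei)²/ei statistic:

```python
from scipy.stats import chisquare

observed = [316, 102, 109, 33]
n = sum(observed)                        # 560
ratios = [9, 3, 3, 1]                    # theoretical 9:3:3:1 proportions
expected = [n * r / 16 for r in ratios]  # [315, 105, 105, 35]

stat, p = chisquare(f_obs=observed, f_exp=expected)
print(f"chi-square = {stat:.4f}, p-value = {p:.3f}")
# The statistic is far below the 5% critical value 7.81 (df = 3),
# so the 9:3:3:1 theory is not rejected.
```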

Example 2: Fit a normal distribution to the following frequency distribution and test whether the
fit is good.

Class Frequency
1-4 13
5-8 18
9-12 6
13-16 10
17-20 16

Solution:

The hypothesis we want to test is that the observations come from a normal population. The test
statistic is χ² = Σᵢ₌₁ᵏ (Oi − ei)² / ei.

The ei's are the expected frequencies, which will be computed using the fitted normal curve.
The estimators of μ and σ² are

X̄ = Σᵢ₌₁ᵏ fi xi / n   and   S² = Σᵢ₌₁ᵏ fi (xi − X̄)² / (n − 1)

where xi is the class mark of the ith class.

Class marks (xi)   Frequency (fi)
      2.5               13
      6.5               18
     10.5                6
     14.5               10
     18.5               16
    Total               63

X̄ = Σᵢ₌₁⁵ fi xi / n = 653.5/63 = 10.37   and   S² = Σᵢ₌₁⁵ fi (xi − X̄)²/(n − 1) = 37.15, so
S = 6.1.

In computing the expected frequencies we use the class boundaries, since the variable is a
continuous random variable. Note that if X is assumed to have a normal distribution, then
Z = (X − μ)/σ ≈ (X − X̄)/S will have the standard normal distribution.

Clearly, ei = nPi, and

P1 = P{X < 4.5} = P{Z < (4.5 − 10.37)/6.1} = P{Z < −0.96} = 0.5 − P{0 < Z < 0.96} = 0.1685

P2 = P{4.5 < X < 8.5} = P{−0.96 < Z < −0.31} = P{0 < Z < 0.96} − P{0 < Z < 0.31}
   = 0.3315 − 0.1217 = 0.2098

P3 = P{8.5 < X < 12.5} = P{−0.31 < Z < 0.35} = P{0 < Z < 0.31} + P{0 < Z < 0.35}
   = 0.1217 + 0.1368 = 0.2585

P4 = P{12.5 < X < 16.5} = P{0.35 < Z < 1.01} = P{0 < Z < 1.01} − P{0 < Z < 0.35}
   = 0.3438 − 0.1368 = 0.2070

P5 = P{X > 16.5} = 1 − Σᵢ₌₁⁴ Pi = 1 − 0.1685 − 0.2098 − 0.2585 − 0.2070 = 0.1562

The above results may be summarized in the following table so as to facilitate the remaining
calculation.

Class boundaries   Oi     Pi      ei = nPi   (Oi − ei)²/ei
    < 4.5          13   0.1685     10.6         0.536
   4.5 – 8.5       18   0.2098     13.2         1.731
   8.5 – 12.5       6   0.2585     16.3         6.496
  12.5 – 16.5      10   0.2070     13.0         0.709
    > 16.5         16   0.1562      9.8         3.855
    Total          63     1          63        13.33
The calculated value is 13.33. The tabulated value is χ²0.01(5 − 1 − 2) = χ²0.01(2) = 9.21.
Since the calculated value is greater than the tabulated value, we reject the null hypothesis
that the observations come from a normal distribution. Accordingly, we conclude that the fit is
not good; in other words, the fitted curve does not describe the given frequency distribution.
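The whole fitting procedure can be sketched as follows (assuming SciPy and NumPy are available; exact z-values are used instead of rounded table values, so the statistic differs slightly from 13.33):

```python
import numpy as np
from scipy.stats import norm, chi2

marks = np.array([2.5, 6.5, 10.5, 14.5, 18.5])   # class marks
freq = np.array([13, 18, 6, 10, 16])             # observed frequencies
n = freq.sum()                                   # 63

# Estimate the normal parameters from the grouped data
mean = (freq * marks).sum() / n                      # about 10.37
var = (freq * (marks - mean) ** 2).sum() / (n - 1)   # about 37.15
s = np.sqrt(var)

# Class boundaries; the outer classes are open-ended
bounds = np.array([-np.inf, 4.5, 8.5, 12.5, 16.5, np.inf])
cdf = norm.cdf(bounds, loc=mean, scale=s)
p = np.diff(cdf)          # cell probabilities P1..P5
e = n * p                 # expected frequencies

stat = ((freq - e) ** 2 / e).sum()
critical = chi2.ppf(0.99, df=5 - 1 - 2)   # two parameters were estimated
print(f"chi-square = {stat:.2f}, critical = {critical:.2f}")
# The statistic (about 13) exceeds the critical value,
# so the normal fit is rejected.
```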
