Data Analysis: Florenda F. Cabatit RN MA Facilitator

This document provides an overview of data analysis and statistics. It discusses data analysis as the process of rendering information meaningful and testing hypotheses using research data. It also describes quantitative analysis and statistical analysis, noting that statistics is used for numerical analysis and describing phenomena. The document outlines descriptive and inferential statistics, levels of measurement, considerations in statistical methods choice, and key statistical concepts like central tendency, variability, hypothesis testing, and types of errors. It provides examples of statistical tools that correspond to different levels and types of data analysis.

Uploaded by

malyn1218

Available Formats

Download as PPT, PDF, TXT or read online on Scribd

Download as ppt, pdf, or txt

0% found this document useful (0 votes)

54 views44 pages

Data Analysis: Florenda F. Cabatit RN MA Facilitator

Uploaded by

malyn1218

Available Formats

Download as PPT, PDF, TXT or read online on Scribd

Download as ppt, pdf, or txt

You are on page 1/ 44

Data Analysis

Florenda F. Cabatit RN MA
Facilitator
DATA ANALYSIS

Data analysis is the process by which

information is rendered meaningful
and intelligible (Polit and Hungler,
1995).
It is the systematic organization and
synthesis of research data and the
testing of research hypotheses using
those data (2004).
Statistical Analysis
Quantitative analysis deals with
numerical analysis of information.
It is the manipulation of numeric data
through statistical procedures for the
purpose of describing phenomena or
assessing the magnitude and reliability
of relationships among them.
Statistics is the scientific method used in
quantitative analysis.
Statistics
Statistics helps to:
 Organize data
 Summarize data
 Evaluate data
 Present data in an easily
understood form.
Statistics
Two branches of Statistics:
 Descriptive statistics -
statistics used to describe and
summarize data
 Inferential Statistics –
statistics that permit inferences
on whether relationships
observed in a sample are likely
to occur in the larger population.
Considerations in the
choice of appropriate
statistical methods
 The purpose of the research
 The level of measurement of the
variables
 The number of groups/variables
involved
 The type of groups being studied
Levels of Measurement
 Nominal - the lowest level
- involves assigning numbers to classify
characteristics into categories
- numeric codes assigned in nominal
measurement do not convey quantitative
information.
- the numbers are merely symbols that
represent different values.
- categories must be mutually exclusive
and collectively exhaustive.
Ordinal Measurement
 This involves sorting objects on the basis
of their relative standing or ranking on an
attribute.
 The numbers are not arbitrary-they signify
incremental values but does not however,
tell anything about how much greater one
level is than another.
Interval Measurement

 A measurement in which
an attribute of a variable
is rank ordered on a scale
that has equal distances
between points on that
scale.
Ratio Scale
 A quantitative measurement in which intervals
are equal and there is a true zero point.
 The highest level of measurement
 All arithmetic operations are permissible with
this measurement (add, subtract, multiply, and
divide numbers on this scale).
Descriptive Statistics
Three characteristics to fully
describe a set of data:
• shape of the distribution
values
• central tendency
• Variability
Review of Descriptive
Stats.
 Descriptive Statistics are used to present
quantitative descriptions in a manageable
form.
 This method works by reducing lots of data
into a simpler summary.
 Example:
 370 Centigrade as average adult body
temperature
 SU’s quality-point system
Univariate Analysis
 This is the examination across cases of one
variable at a time.
 Frequency distributions are used to group
data.
 One may set up margins that allow us to
group cases into categories.
 Examples include
 Age categories
 Price categories
 Temperature categories.
Distributions
Two ways to describe a univariate
distribution
 A table
 A graph (histogram, bar chart)
Distributions (con’t)

 Distributions may also be displayed

using percentages.
 For example, one could use
percentages to describe the following:
 Percentage of people under the
poverty level
 Over a certain age
 Over a certain score on a
standardized test
Distributions (cont.)

A Frequency Distribution Table

Category Percent
Under 35 9%
36-45 21
46-55 45
56-65 19
66+ 6
Distributions (cont.)
A Histogram

45
40
35
30
25
20
Percent
15
10
5
0
36-45

46-55
Under

56-65

66+
35
Central Tendency
 An estimate of the “center” of a
distribution
 Three different types of
estimates:
 Mean
 Median
 Mode
Mean
 The most commonly used method of
describing central tendency.
 One basically totals all the results
and then divides by the number of
units or “n” of the sample.
 Example: The NCM 104 Quiz mean
was determined by the sum of all the
scores divided by the number of
students taking the exam.
Median
 The median is the score found at the
exact middle of the set.
 One must list all scores in numerical
order and then locate the score in
the center of the sample.
 Example: If there are 500 scores in
the list, score #250 would be the
median.
 This is useful in weeding out outliers.
Mode
 The mode is the most repeated score
in the set of results.
 Lets take the set of scores:
15,20,21,20,36,15, 25,15
 Again we first line up the scores
 15,15,15,20,20,21,25,36
 15 is the most repeated score and is
therefore labeled the mode.
Central Tendency
 If the distribution is normal (i.e., bell-
shaped), the mean, median and mode
are all equal.
 In our analyses, we’ll use the mean.
Dispersion
 Two estimates types:
 Range
 Standard deviation
 Standard deviation is more
accurate/detailed because an outlier can
greatly extend the range.
Range
 The range is used to identify the
highest and lowest scores.
 Lets take the set of
scores:15,20,21,20,36,15, 25,15.
 The range would be 15-36. This
identifies the fact that 21 points
separates the highest to the lowest
score.
Standard Deviation
 The standard deviation is a
value that shows the relation
that individual scores have to
the mean of the sample.
 If scores are said to be
standardized to a normal curve,
there are several statistical
manipulations that can be
performed to analyze the data
set.
Standard Dev. (con’t)
 Assumptions may be made about
the percentage of scores as they
deviate from the mean.
 If scores are normally distributed,
one can assume that
approximately 69% of the scores in
the sample fall within one standard
deviation of the mean.
Approximately 95% of the scores
would then fall within two standard
deviations of the mean.
Standard Dev. (con’t)
 The standard deviation calculates
the square root of the sum of the
squared deviations from the mean of
all the scores, divided by the number
of scores.
 This process accounts for both
positive and negative deviations
from the mean.
RESEARCH QUESTION: DESCRIBE

LEVEL TYPE OF DESCRIPTION STATISTICAL TOOL

Frequency distribution
Distribution Contingency Table
NOMINAL
Central Tendency
Mode

Distribution Frequency Distribution

ORDINAL Contingency Table
Scatterpoint

Central Tendency
Mode, Median

Frequency Distribution
Distribution Contingency Table
Scatterpoint
RATIO/INTERVAL
Central Tendency
Mode, Median, Mean

Variability
Range, Variance,
Standard Deviation
Inferential
statistics
 Based on the law of probability
 It provides a means for drawing
conclusions about a population,
given data from a sample
 It estimates population parameters
from sample statistics
Inferential
Statistics
Statistical Inference consists of two
techniques:
2.Estimation of parameters
3.Hypothesis testing
Hypothesis Testing
Statistical hypothesis testing provides
objective criteria for deciding whether
hypotheses are supported by empirical
evidence.
 It is a process of disproof or rejection.
 Researchers seek to reject the null
hypothesis through various statistical
tests.
 Hypothesis testing uses samples to draw
conclusions about relationships within the
population.
Type I and Type II
Errors
Type I Error - researchers make a type I
error when a true null hypothesis is
rejected.

Type II Error – researchers make a type II

error when a false null hypothesis is
accepted
Level of Significance
This refers to the risk of making a type
I error in a statistical analysis.
The value selected beforehand
signifies the risk or the probability of
rejecting of rejecting a true null
hypothesis.
The two most frequently used
significance levels (referred to as alpha or
α) are:
.05
.01
Level of Significance
 With .05 significance level, we are
accepting the risk that out of 100 samples
drawn from a population, a true null
hypothesis would be rejected only 5 times.

 With a .01 level of significance, the risk of

a type I error is lower: in only 1 sample out
of 100 would we erroneously reject the
null hypothesis.
Critical Region
This refers to the area in the sampling
distribution representing values that
are “improbable” if the null hypothesis
is true.
It is defined by the level of significance
Statistical Tests
Two-tailed test- this means that both ends
or tails of the sampling distribution are
used to determine improbable values.

In one-tailed tests, the critical region of

improbable values is entirely in one tail
of the distribution-the tail corresponding
to the direction of the hypothesis
An example of Critical Regions of a two
-tailed test
Types of Statistical
Tests
Parametric Tests – a class of
inferential statistical tests that
involve:
a. Assumptions about the
distribution of the variables
b. The estimation of a parameter
c. The use of interval or ratio
measures.
Statistical Tests

Non-parametric Tests –statistical

tests that do not estimate parameters
- also called distribution-free statistics.
Steps in Hypothesis
1.testing
State the alternative hypothesis
2. State the null hypothesis
3. Establish the level of significance
4. Select a one-tailed or two-tailed test
5. Compute a test statistic
6. Calculate the degrees of freedom
7. Obtain a tabled value for the statistical
test
8. Compare the test statistic with the
tabled value.
The Decision Matrix
In reality Null true Null false
Alternative false Alternative true
In reality... In reality...
What • There is no real program effect • There is a real program effect
• There is no difference, gain
we conclude • Our theory is wrong
•
•
There is a difference, gain
Our theory is correct

Accept null 1-α β

Reject alternative THE CONFIDENCE LEVEL TYPE II ERROR
We say...
The odds of saying there is no The odds of saying there is no
• There is no real program effect or gain when in fact there effect or gain when in fact there
is none is one
effect
• There is no difference, # of times out of 100 when # of times out of 100 when
gain there is no effect, we’ll say there is an effect, we’ll say
• Our theory is wrong there is none there is none
Reject null α 1-β
Accept alternative TYPE I ERROR POWER
We say... The odds of saying there is an The odds of saying there is an
effect or gain when in fact there effect or gain when in fact there
• There is a real program is none is one
effect
• There is a difference, gain # of times out of 100 when # of times out of 100 when
• Our theory is correct there is no effect, we’ll say there is an effect, we’ll say
there is one there is one
Decision Matrix

If you try to increase power, you

increase the chance of winding
up in the bottom row and of
Type I error.

If you try to decrease Type I

errors, you increase the chance
of winding up in the top row and
of Type II error.

Statistics For Dummies
From Everand
Statistics For Dummies
Deborah J. Rumsey
4/5 (27)
Statistics: a QuickStudy Laminated Reference Guide
From Everand
Statistics: a QuickStudy Laminated Reference Guide
BarCharts Publishing, Inc.
No ratings yet
Assessing The Skull and Face-Word
100% (1)
Assessing The Skull and Face-Word
3 pages
CRP Phase 4-Analyzing and Interpreting Quantitative Data
No ratings yet
CRP Phase 4-Analyzing and Interpreting Quantitative Data
24 pages
Sampling Distribution
No ratings yet
Sampling Distribution
19 pages
Statistics: An Introduction and Overview
No ratings yet
Statistics: An Introduction and Overview
51 pages
Main Title: Planning Data Analysis Using Statistical Data
100% (1)
Main Title: Planning Data Analysis Using Statistical Data
40 pages
Unit-3 DS Students
No ratings yet
Unit-3 DS Students
35 pages
Powerpoint Presentation On: "Frequency
100% (2)
Powerpoint Presentation On: "Frequency
36 pages
Stat-Reviewer Notes
No ratings yet
Stat-Reviewer Notes
25 pages
ADS EXP 1
No ratings yet
ADS EXP 1
13 pages
Descriptive Statistics
No ratings yet
Descriptive Statistics
19 pages
Statistics For Datacience
100% (1)
Statistics For Datacience
7 pages
Marketing Ii: Facultad de Economía y Negocios Universidad de Chile
No ratings yet
Marketing Ii: Facultad de Economía y Negocios Universidad de Chile
18 pages
BRM Unit V
No ratings yet
BRM Unit V
99 pages
Descriptive Statistics
No ratings yet
Descriptive Statistics
4 pages
FXGFHGJHKJLK
No ratings yet
FXGFHGJHKJLK
19 pages
Introduction To Statistical Analysis
No ratings yet
Introduction To Statistical Analysis
24 pages
Norms
No ratings yet
Norms
36 pages
Quantitative Data Analysis
No ratings yet
Quantitative Data Analysis
31 pages
E Book - Unit 4
No ratings yet
E Book - Unit 4
12 pages
Statistical Tools Needed IN Analyzing Test Results: Prof. Yonardo Agustin Gabuyo
No ratings yet
Statistical Tools Needed IN Analyzing Test Results: Prof. Yonardo Agustin Gabuyo
110 pages
Data Analysis
100% (1)
Data Analysis
34 pages
Chapter 2 - : Measures of Variability
No ratings yet
Chapter 2 - : Measures of Variability
6 pages
STAT100 - Full Course Notes
No ratings yet
STAT100 - Full Course Notes
27 pages
lecture-1
No ratings yet
lecture-1
72 pages
Reviewer Part 1
No ratings yet
Reviewer Part 1
9 pages
Statistics,2
No ratings yet
Statistics,2
33 pages
Biostatistics Notes-numbered
No ratings yet
Biostatistics Notes-numbered
21 pages
Biostatistics 140127003954 Phpapp02
No ratings yet
Biostatistics 140127003954 Phpapp02
47 pages
Lecture 3-Basic Statistics
No ratings yet
Lecture 3-Basic Statistics
49 pages
RMPA Chapter 6
No ratings yet
RMPA Chapter 6
23 pages
RSU - Statistics - Lecture 3 - Final - myRSU
No ratings yet
RSU - Statistics - Lecture 3 - Final - myRSU
34 pages
Brand Loyalty: SERVICE. We Have
No ratings yet
Brand Loyalty: SERVICE. We Have
38 pages
Chapter 3 Io Psych PT - 031305
No ratings yet
Chapter 3 Io Psych PT - 031305
6 pages
Module 3 Descriptive Statistics Final
100% (1)
Module 3 Descriptive Statistics Final
15 pages
Statistics SS2020
No ratings yet
Statistics SS2020
12 pages
Health Statistics: Principles of Secondary Data Analysis
No ratings yet
Health Statistics: Principles of Secondary Data Analysis
61 pages
Module 10 Introduction To Data and Statistics
No ratings yet
Module 10 Introduction To Data and Statistics
63 pages
Unit 1 - Business Statistics & Analytics
No ratings yet
Unit 1 - Business Statistics & Analytics
25 pages
Unit 3 Summarising Data - Averages and Dispersion
No ratings yet
Unit 3 Summarising Data - Averages and Dispersion
22 pages
Basics For Understanding
No ratings yet
Basics For Understanding
8 pages
6.descriptve PPHD
No ratings yet
6.descriptve PPHD
70 pages
PG Descriptive and Inferential Statistic 2024
No ratings yet
PG Descriptive and Inferential Statistic 2024
51 pages
Lesson 4-Analysis-Interpretation-Descriptive Statistics
No ratings yet
Lesson 4-Analysis-Interpretation-Descriptive Statistics
25 pages
What Are Measures of Central Tendency
No ratings yet
What Are Measures of Central Tendency
5 pages
Summary of Frequency Distribution, Cross Tabulation and Hypothesis Testing
No ratings yet
Summary of Frequency Distribution, Cross Tabulation and Hypothesis Testing
3 pages
Statistics Intro: Univariate Analysis Central Tendency Dispersion
No ratings yet
Statistics Intro: Univariate Analysis Central Tendency Dispersion
21 pages
Identifying Types of Variables
No ratings yet
Identifying Types of Variables
5 pages
Summary of The Introduction To Stats
No ratings yet
Summary of The Introduction To Stats
7 pages
Bea 159 Ads 1
No ratings yet
Bea 159 Ads 1
6 pages
DSBDL Asg 3 Write Up
No ratings yet
DSBDL Asg 3 Write Up
6 pages
MTH1310 - Statistics
No ratings yet
MTH1310 - Statistics
34 pages
1 Descriptive Statistics - Unlocked
No ratings yet
1 Descriptive Statistics - Unlocked
18 pages
A Brief (Very Brief) Overview of Biostatistics: Jody Kreiman, PHD Bureau of Glottal Affairs
No ratings yet
A Brief (Very Brief) Overview of Biostatistics: Jody Kreiman, PHD Bureau of Glottal Affairs
56 pages
Descriptive Stats
No ratings yet
Descriptive Stats
50 pages
Descriptive Statistics
No ratings yet
Descriptive Statistics
41 pages
Psychology's+Scientific+Method 4 (1)
No ratings yet
Psychology's+Scientific+Method 4 (1)
42 pages
Statistical Foundations for Psychology
From Everand
Statistical Foundations for Psychology
James C. Ware
No ratings yet
Descriptive Statistics: Six Sigma Thinking, #3
From Everand
Descriptive Statistics: Six Sigma Thinking, #3
Sumeet Savant
No ratings yet
Overview Of Bayesian Approach To Statistical Methods: Software
From Everand
Overview Of Bayesian Approach To Statistical Methods: Software
Vinaitheerthan Renganathan
No ratings yet
Accident Triangle
No ratings yet
Accident Triangle
1 page
Generic Risk Assessment Register
No ratings yet
Generic Risk Assessment Register
28 pages
Cse New 1
No ratings yet
Cse New 1
1 page
Safety Alet Grinding
No ratings yet
Safety Alet Grinding
1 page
Power Analysis
100% (1)
Power Analysis
23 pages
Experimental Research Designs
No ratings yet
Experimental Research Designs
4 pages
Power and Politics
100% (2)
Power and Politics
18 pages
Diesel Storage Tank Operational Guidelines
No ratings yet
Diesel Storage Tank Operational Guidelines
8 pages
Reliability of Measurement Tools
No ratings yet
Reliability of Measurement Tools
6 pages
Problem Formulation in Applied Social Research
No ratings yet
Problem Formulation in Applied Social Research
6 pages
Probability Sampling
No ratings yet
Probability Sampling
24 pages
Methods of Data Collection
No ratings yet
Methods of Data Collection
44 pages
Non Probability Sampling
No ratings yet
Non Probability Sampling
11 pages
Data Collection Methods
50% (2)
Data Collection Methods
30 pages
Data Analysis: Florenda F. Cabatit RN MA Facilitator
No ratings yet
Data Analysis: Florenda F. Cabatit RN MA Facilitator
44 pages
Intro To Validity
No ratings yet
Intro To Validity
27 pages
Deductive Inductive Reasoning
No ratings yet
Deductive Inductive Reasoning
10 pages
Causal Hypothesis
100% (1)
Causal Hypothesis
8 pages
Ethics
No ratings yet
Ethics
10 pages
Data Colection Types
No ratings yet
Data Colection Types
31 pages
Inferential Statistics
100% (2)
Inferential Statistics
16 pages
External Validity
No ratings yet
External Validity
15 pages
Physical Assessment
No ratings yet
Physical Assessment
12 pages
The Communication Process
No ratings yet
The Communication Process
13 pages
Therapeutic Communication Techniques
100% (4)
Therapeutic Communication Techniques
10 pages
Nutrition Powerpoint New
88% (8)
Nutrition Powerpoint New
74 pages
Skin
100% (2)
Skin
34 pages
Phenomenology
100% (1)
Phenomenology
24 pages