Sample Size Determination 03202012

This document discusses sample size determination for clinical trials. It defines key statistical concepts like type I and type II errors, power, and effect size. It explains that sample size depends on these factors as well as the study design and outcome variables. Basic formulas are provided to calculate sample size for comparing two proportions or means between independent groups. Two examples apply these formulas to hypothetical clinical trials evaluating a vitamin supplement to prevent cancer and a special diet to lower cholesterol. The document stresses consulting a statistician and considering feasibility when determining sample size.

Uploaded by

Lotfy Lotfy

Sample Size Determination

Janice Weinberg, ScD

Professor of Biostatistics
Boston University School of Public Health

Outline
Why does this matter? Scientific and
ethical implications
Statistical definitions and notation
Questions that need to be answered prior to
determining sample size
Study design issues affecting sample size
Some basic sample size formulas

Scientific And Ethical Implications

From a scientific perspective:
Can't be sure we've made the right decision regarding the effect of the intervention
However, we want enough subjects enrolled to adequately address the study question and to feel comfortable that we've reached the correct conclusion

From an ethical perspective:

Too few subjects:
Cannot adequately address study question. The
time, discomfort and risk to subjects have served
no purpose.
May conclude no effect of an intervention that is
beneficial. Current and future subjects may not
benefit from new intervention based on current
(inconclusive) study.

Too many subjects:

Too many subjects unnecessarily exposed to
risk. Should enroll only enough patients to
answer study question, to minimize the
discomfort and risk subjects may be exposed
to.

Definitions and Notation

Null hypothesis (H0): No difference between groups

H0: p1 = p2

H0: μ1 = μ2

Alternative hypothesis (HA): There is a difference between groups

HA: p1 ≠ p2

HA: μ1 ≠ μ2

P-Value: Chance of obtaining observed result or one more extreme when groups are equal (under H0)
Test of significance of H0
Based on distribution of a test statistic assuming H0 is true
It is NOT the probability that H0 is true

Definitions and Notation

Δ: Measure of true population difference; must be estimated. Difference of medical importance

Δ = |p1 - p2|

Δ = |μ1 - μ2|

n: Sample size per arm

N: Total sample size (N = 2n for 2 groups with equal allocation)

Type I error: Rejecting H0 when H0 is true

α: The type I error rate. Maximum p-value considered statistically significant
Type II error: Failing to reject H0 when H0 is false
β: The type II error rate
Power (1 - β): Probability of detecting group effect given the size of the effect (Δ) and the sample size of the trial (N)

Decision based on the data | Truth: treatments are equal (H0 true) | Truth: treatments differ (HA true)
Do not reject H0 | O.K. | Type II error
Reject H0 | Type I error | O.K.

The quantities α, β, Δ and N are all interrelated.

Holding all other values constant, what happens to the power of the study if
Δ increases? Power increases
α decreases? Power decreases
N increases? Power increases
variability increases? Power decreases

Note: Typical error rates are α = .05 and β = .1 or .2 (90% or 80% power). Why is α often smaller than β?

SAMPLE SIZE:
How many subjects are needed to assure a given
probability of detecting a statistically significant
effect of a given magnitude if one truly exists?
POWER:
If a limited pool of subjects is available, what is the
likelihood of finding a statistically significant effect of
a given magnitude if one truly exists?

Before We Can Determine Sample Size We Need To Answer The Following:

1. What is the main purpose of the study?
2. What is the primary outcome measure? Is it a continuous or dichotomous outcome?
3. How will the data be analyzed to detect a group difference?
4. How small a difference is clinically important to detect?
5. How much variability is in our population?
6. What are the desired α and β?
7. What is the sample size allocation ratio?
8. What is the anticipated drop out rate?

Example 1: Does the ingestion of large doses of vitamin A in tablet form prevent breast cancer?
Suppose we know from Connecticut tumor-registry data that the incidence rate of breast cancer over a 1-year period for women aged 45-49 is 150 cases per 100,000
Women randomized to Vitamin A vs. placebo

Example 1 continued
Group 1: Control group given placebo pills by mail.
Expected to have same disease rate as registry (150
cases per 100,000)
Group 2: Intervention group given vitamin A tablets by mail.
Expected to have 20% reduction in risk (120 cases per
100,000)
Want to compare incidence of breast cancer over 1 year
Planned statistical analysis: Chi-square test to compare two proportions from independent samples

H0: p1 = p2 vs. HA: p1 ≠ p2

Example 2: Does a special diet help to reduce cholesterol levels?
Suppose an investigator wishes to determine sample size to detect a 10 mg/dl difference in cholesterol level in a diet intervention group compared to a control (no diet) group
Subjects with baseline total cholesterol of at least 300 mg/dl randomized

Example 2 continued
Group 1: A six week diet intervention
Group 2: No changes in diet
Investigator wants to compare total cholesterol at the end of the six week study
Planned statistical analysis: two sample t-test (for independent samples)

H0: μ1 = μ2 vs. HA: μ1 ≠ μ2

Some Basic Sample Size Formulas

To Compare Two Proportions From Independent Samples: H0: p1 = p2
1. α level
2. β level (1 - power)
3. Expected population proportions (p1, p2)

Some Basic Sample Size Formulas

To Compare Two Means From Independent Samples: H0: μ1 = μ2
1. α level
2. β level (1 - power)
3. Expected population difference (Δ = |μ1 - μ2|)
4. Expected population standard deviation (σ1, σ2)

The Standard Normal Distribution

N(0,1) refers to standard normal (mean 0 and variance 1)

prob[N(0,1) > z1-α/2] = α/2

prob[N(0,1) > z1-β] = β
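The cutoffs defined by these two tail probabilities can be looked up in a table or computed. The following is a minimal sketch using Python's standard-library `statistics.NormalDist`; the helper name `upper_tail_cutoff` is an illustrative assumption, not part of the original slides.

```python
from statistics import NormalDist

# Standard normal distribution N(0,1): mean 0, variance 1.
std_normal = NormalDist(mu=0.0, sigma=1.0)

def upper_tail_cutoff(tail_prob):
    """Return z such that prob[N(0,1) > z] equals tail_prob."""
    return std_normal.inv_cdf(1.0 - tail_prob)

z_alpha = upper_tail_cutoff(0.05 / 2)  # two-sided alpha = 0.05
z_beta_80 = upper_tail_cutoff(0.20)    # beta = 0.20 (80% power)
z_beta_90 = upper_tail_cutoff(0.10)    # beta = 0.10 (90% power)

print(round(z_alpha, 2), round(z_beta_80, 2), round(z_beta_90, 2))
# prints: 1.96 0.84 1.28
```

Rounded to two decimals, these are the familiar table values used in hand calculations.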

Dichotomous Outcome (2 Independent Samples)

Test H0: p1 = p2 vs. HA: p1 ≠ p2
Assuming two-sided alternative and equal allocation

$$
n \text{ per group} = \frac{\left[ z_{1-\alpha/2}\sqrt{2\bar{p}\bar{q}} + z_{1-\beta}\sqrt{p_1 q_1 + p_2 q_2} \right]^2}{\Delta^2}
$$

p1, p2 = projected true probabilities of success in the two groups
q1 = 1 - p1, q2 = 1 - p2
Δ = p1 - p2
p̄ = (p1 + p2)/2, q̄ = 1 - p̄
z1-α/2 is the N(0,1) cutoff corresponding to α
z1-β is the N(0,1) cutoff corresponding to β
***Always Round Up To Nearest Integer!
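As a rough sketch, the per-group formula for two proportions (pooled term weighted by the α cutoff, unpooled term weighted by the β cutoff, divided by the squared difference) might be coded as follows. The function name and the numbers in the sample call are hypothetical and chosen only for illustration.

```python
from math import ceil, sqrt

def n_per_group_two_proportions(p1, p2, z_alpha, z_beta):
    """Per-group sample size for a two-sided comparison of two
    independent proportions with equal allocation.

    z_alpha is the N(0,1) cutoff for alpha/2, z_beta the cutoff for beta.
    """
    q1, q2 = 1.0 - p1, 1.0 - p2
    p_bar = (p1 + p2) / 2.0          # average of the two proportions
    q_bar = 1.0 - p_bar
    delta = abs(p1 - p2)             # difference to detect
    numerator = (z_alpha * sqrt(2.0 * p_bar * q_bar)
                 + z_beta * sqrt(p1 * q1 + p2 * q2)) ** 2
    return ceil(numerator / delta ** 2)  # always round up

# Hypothetical inputs: 30% vs. 20% success, alpha = 0.05, 80% power.
print(n_per_group_two_proportions(0.30, 0.20, z_alpha=1.96, z_beta=0.84))
```

Passing the cutoffs in explicitly keeps the sketch close to a hand calculation with table values; a fuller version could derive them from α and the desired power.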

Dichotomous Outcome (2 Independent Samples)

$$
\text{Power} = \Phi\left( \frac{\Delta\sqrt{n} - z_{1-\alpha/2}\sqrt{2\bar{p}\bar{q}}}{\sqrt{p_1 q_1 + p_2 q_2}} \right)
$$

where Φ is the probability from a standard normal distribution
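When the number of subjects per group is fixed, the same ingredients give an approximate power. The sketch below assumes Φ is the standard normal cumulative distribution function and uses the vitamin A rates from Example 1 (150 vs. 120 per 100,000); the function name is an illustrative assumption.

```python
from math import sqrt
from statistics import NormalDist

def power_two_proportions(p1, p2, n, z_alpha=1.96):
    """Approximate power of a two-sided test comparing two independent
    proportions with n subjects per group."""
    q1, q2 = 1.0 - p1, 1.0 - p2
    p_bar = (p1 + p2) / 2.0
    q_bar = 1.0 - p_bar
    delta = abs(p1 - p2)
    z = (delta * sqrt(n) - z_alpha * sqrt(2.0 * p_bar * q_bar)) / sqrt(p1 * q1 + p2 * q2)
    return NormalDist().cdf(z)  # Phi(z), cumulative standard normal probability

# Vitamin A rates with a few candidate per-group sample sizes.
for n in (50_000, 100_000, 234_882):
    print(n, round(power_two_proportions(0.0015, 0.0012, n), 3))
```

With 234,882 women per group the computed power should come out close to 80%, and it drops quickly for the smaller pools.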

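Continuous Outcome (2 Independent Samples)
Test H0: μ1 = μ2 vs. HA: μ1 ≠ μ2
Two-sided alternative and equal allocation
Assume outcome normally distributed with variances σ1² and σ2² in the two groups:

$$
n \text{ per group} = \frac{(\sigma_1^2 + \sigma_2^2)\,(z_{1-\alpha/2} + z_{1-\beta})^2}{\Delta^2}
$$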
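A minimal sketch of the two-means calculation, assuming the usual form n = (σ1² + σ2²)(z1-α/2 + z1-β)² / Δ² for a two-sided test with equal allocation. The function name and the sample inputs are hypothetical.

```python
from math import ceil

def n_per_group_two_means(delta, sigma1, sigma2, z_alpha, z_beta):
    """Per-group sample size for a two-sided comparison of two
    independent means with equal allocation."""
    variance_sum = sigma1 ** 2 + sigma2 ** 2
    n = variance_sum * (z_alpha + z_beta) ** 2 / delta ** 2
    return ceil(n)  # always round up

# Hypothetical inputs: detect a 5-unit difference, SD 10 in each group,
# alpha = 0.05 (two-sided), 80% power.
print(n_per_group_two_means(delta=5, sigma1=10, sigma2=10, z_alpha=1.96, z_beta=0.84))
```

Because Δ enters squared in the denominator, halving the difference to detect roughly quadruples the required sample size.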

Continuous Outcome (2 Independent Samples)

$$
\text{Power} = \Phi\left( \frac{\Delta\sqrt{n}}{\sqrt{\sigma_1^2 + \sigma_2^2}} - z_{1-\alpha/2} \right)
$$

where Φ is the probability from a standard normal distribution
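The sketch below evaluates this power approximation and then changes one input at a time, mirroring the earlier question of what happens to power when Δ, α, N, or variability changes. It assumes Φ is the standard normal cumulative distribution function; the baseline uses the cholesterol example's values (Δ = 10 mg/dl, σ = 50 mg/dl, n = 525), while the alternative values are hypothetical.

```python
from math import sqrt
from statistics import NormalDist

STD_NORMAL = NormalDist()

def power_two_means(delta, sigma1, sigma2, n, alpha=0.05):
    """Approximate power of a two-sided, two-sample comparison of means
    with n subjects per group."""
    z_alpha = STD_NORMAL.inv_cdf(1.0 - alpha / 2.0)
    z = delta * sqrt(n) / sqrt(sigma1 ** 2 + sigma2 ** 2) - z_alpha
    return STD_NORMAL.cdf(z)

baseline = power_two_means(10, 50, 50, 525)
bigger_delta = power_two_means(15, 50, 50, 525)               # larger effect
smaller_alpha = power_two_means(10, 50, 50, 525, alpha=0.01)  # stricter test
bigger_n = power_two_means(10, 50, 50, 800)                   # more subjects
more_variability = power_two_means(10, 60, 60, 525)           # noisier outcome

for label, value in [("baseline", baseline),
                     ("delta up", bigger_delta),
                     ("alpha down", smaller_alpha),
                     ("n up", bigger_n),
                     ("variability up", more_variability)]:
    print(f"{label:15s} {value:.3f}")
```

Power rises with a larger Δ or a larger n, and falls with a smaller α or more variability.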

Example 1: Does ingestion of large doses of vitamin A prevent breast cancer?
Test H0: p1 = p2 vs. HA: p1 ≠ p2

Assume 2-sided test with α = 0.05 and 80% power

p1 = 150 per 100,000 = .0015

p2 = 120 per 100,000 = .0012 (20% rate reduction)
Δ = p1 - p2 = .0003
z1-α/2 = 1.96, z1-β = .84

n per group = 234,882

Too many to recruit in one year!
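The arithmetic behind this figure can be traced by substituting the rounded cutoffs into the two-proportion formula, with p̄ = (p1 + p2)/2 = .00135 and q̄ = 1 - p̄ = .99865. This is a worked check, not part of the original slides:

$$
\begin{aligned}
n &= \frac{\left[1.96\sqrt{2(0.00135)(0.99865)} + 0.84\sqrt{(0.0015)(0.9985) + (0.0012)(0.9988)}\right]^2}{(0.0003)^2} \\
  &\approx \frac{(0.1017758 + 0.0436178)^2}{9 \times 10^{-8}} \approx \frac{0.02113932}{9 \times 10^{-8}} \approx 234{,}881.3
\end{aligned}
$$

Rounding up gives 234,882 per group, or N = 2n = 469,764 women in total.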

Example 2: Does a special diet help to reduce cholesterol levels?
Test H0: μ1 = μ2 vs. HA: μ1 ≠ μ2
Assume 2-sided test with α = 0.05 and 90% power

Δ = μ1 - μ2 = 10 mg/dl
σ1 = σ2 = 50 mg/dl
z1-α/2 = 1.96, z1-β = 1.28
n per group = 525
Suppose 10% loss to follow-up expected,
adjust n = 525 / 0.9 = 584 per group
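The loss-to-follow-up adjustment divides the required number of completers by the expected retention fraction and rounds up. A small sketch, with a hypothetical helper name:

```python
from math import ceil

def inflate_for_dropout(n_required, dropout_rate):
    """Number to enroll per group so that about n_required subjects
    remain after the expected fraction is lost to follow-up."""
    if not 0.0 <= dropout_rate < 1.0:
        raise ValueError("dropout_rate must be in [0, 1)")
    return ceil(n_required / (1.0 - dropout_rate))

print(inflate_for_dropout(525, 0.10))  # prints: 584
```

Dividing by 0.9 rather than multiplying by 1.1 matters: enrolling 578 (525 x 1.1, rounded up) and losing 10% would leave about 520 completers, short of the 525 required.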

These two basic formulas address common settings but are often inappropriate
Other types of outcomes/study designs require
different approaches including:
-Survival or time to event outcomes
-Cross-over trials
-Equivalency trials
-Repeated measures designs
-Clustered randomization

Sample Size Summary

Sample size very sensitive to values of Δ
Large N required for high power to detect small differences
Consider current knowledge and feasibility
Examine a range of values, i.e.:
-for several values of Δ and power, find required sample size
-for several n, find power
Often increase sample size to account for loss to follow-up
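Examining a range of values is easy to script. The sketch below tabulates the per-group sample size for a two-sample comparison of means over a small grid of Δ and power, assuming a common standard deviation of 50 mg/dl as in the cholesterol example; the grid values are hypothetical. It uses exact normal quantiles, so results can differ by a subject or so from hand calculations based on rounded cutoffs such as 1.96 and 1.28.

```python
from math import ceil
from statistics import NormalDist

def n_per_group_two_means(delta, sigma, alpha, power):
    """Per-group n for a two-sided, two-sample comparison of means,
    assuming the same standard deviation sigma in both groups."""
    z_alpha = NormalDist().inv_cdf(1.0 - alpha / 2.0)
    z_beta = NormalDist().inv_cdf(power)
    return ceil(2.0 * sigma ** 2 * (z_alpha + z_beta) ** 2 / delta ** 2)

SIGMA = 50.0                 # mg/dl, from the cholesterol example
DELTAS = [5.0, 10.0, 20.0]   # hypothetical differences to detect
POWERS = [0.80, 0.90]

table = {(d, pw): n_per_group_two_means(d, SIGMA, 0.05, pw)
         for d in DELTAS for pw in POWERS}

print("delta  power  n per group")
for (d, pw), n in table.items():
    print(f"{d:5.0f}  {pw:5.2f}  {n:11d}")
```

Reading down the table shows how strongly the required n depends on Δ: each halving of the difference roughly quadruples the sample size.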

Note: Only the basics of sample size are covered here. It's always a good idea to consult a statistician
