Generating Random Numbers W. Implementation in C

The document discusses random number generation and pseudo-random number generators used in statistical computing. It describes the properties and defects of common random number generators, such as linear congruential generators. It provides examples of generating uniform deviates and rescaling the outputs. It recommends using generators known to produce "good quality" random numbers, such as algorithms by Lehmer and implementations in Numerical Recipes in C. It discusses practical considerations for integer overflow and portable implementations.

Uploaded by

janeThomas

Random Number Generation

Biostatistics 615/815
Lecture 14
Homework 5, Question 1:
Quick Sort Optimization …
[Figure: number of comparisons (thousands) and running time (ms) plotted against M, for M from 0 to 60]
Homework 5, Question 1:
Merge-Sort Optimization
[Figure: number of comparisons (thousands) and running time (ms) plotted against M, for M from 0 to 60]
Homework 5, Question 2:
• Comparison of Hashing Strategies
  • Linear hashing
  • Double hashing
• Interesting aspects:
  • Memory dramatically impacts performance
  • In double hashing, it is important to choose the second hash function carefully:
    • Specifically, it must never return 0, 1, or any multiple of the table size M
Today
• Random Number Generators
  • Key ingredient of statistical computing
• Discuss properties and defects of alternative generators
Some Uses of Random Numbers
• Simulating data
  • Evaluate statistical procedures
  • Evaluate study designs
  • Evaluate program implementations
• Controlling stochastic processes
  • Markov-Chain Monte-Carlo methods
• Selecting questions for exams
Random Numbers and Computers
• Most modern computers do not generate truly random sequences
• Instead, they can be programmed to produce pseudo-random sequences
  • These will behave the same as random sequences for a wide variety of applications
Uniform Deviates

• Fall within a specific interval (usually 0..1)
• Potential outcomes have equal probability
• Usually, one or more of these deviates are used to generate other types of random numbers
C Library Implementation

// RAND_MAX is the largest value returned by rand
// RAND_MAX is 32767 on MS VC++ and on Sun Workstations
// RAND_MAX is 2147483647 on my Linux server
#define RAND_MAX XXXXX

// This function generates a new pseudo-random number
int rand(void);

// This function resets the sequence of
// pseudo-random numbers to be generated by rand
void srand(unsigned int seed);
Example Usage
#include <stdlib.h>
#include <stdio.h>
#include <time.h>   /* declares time() */

int main(void)
{
    int i;

    printf("10 random numbers between 0 and %d\n", RAND_MAX);

    /* Seed the random-number generator with
     * current time so that numbers will be
     * different for every run.
     */
    srand((unsigned) time(NULL));

    /* Display 10 random numbers. */
    for (i = 0; i < 10; i++)
        printf(" %6d\n", rand());

    return 0;
}
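Since rand() returns integers between 0 and RAND_MAX, a uniform deviate has to be obtained by rescaling. The following is a minimal sketch of one common way to do that; the names uniform01 and uniform_int are hypothetical helpers, not part of the C library or of the lecture.

```c
#include <stdlib.h>

/* Map rand() onto the half-open interval [0, 1).
 * Dividing by RAND_MAX + 1.0 (a double) avoids integer overflow
 * and keeps the result strictly below 1. */
double uniform01(void)
{
    return rand() / (RAND_MAX + 1.0);
}

/* Hypothetical helper: an integer in 0 .. n-1 built from uniform01().
 * The outcomes are only approximately equally likely when n does not
 * divide RAND_MAX + 1. */
int uniform_int(int n)
{
    return (int) (uniform01() * n);
}
```

For example, uniform_int(6) + 1 simulates a die roll, with the quality limits of the underlying rand() implementation.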
Unfortunately …
• Many library implementations of rand() are botched
• Referring to an early IBM implementation, a computer consultant said …
  • We guarantee each number is random individually, but we don’t guarantee that more than one of them is random.
Good Advice
• Always use a random number generator that is known to produce “good quality” random numbers
• “Strange looking, apparently unpredictable sequences are not enough”
  • Park and Miller (1988) in Communications of the ACM provide several examples
Lehmer’s (1951) Algorithm
• Multiplicative linear congruential generator
  • $I_{j+1} = a I_j \bmod m$
• Where
  • $I_j$ is the jth number in the sequence
  • $m$ is a large prime integer
  • $a$ is an integer in 2 .. m - 1
Rescaling

• To produce numbers in the interval 0..1:
  • $U_j = I_j / m$
• These will range between $1/m$ and $1 - 1/m$
Example 1
• $I_{j+1} = 6 I_j \bmod 13$
• Produces the sequence:
  • … 1, 6, 10, 8, 9, 2, 12, 7, 3, 5, 4, 11, 1, …
• Which includes all values 1 .. m-1 before repeating itself
Example 2
• $I_{j+1} = 7 I_j \bmod 13$
• Produces the sequence:
  • … 1, 7, 10, 5, 9, 11, 12, 6, 3, 8, 4, 2, 1 …
• This sequence still has a full period, but looks a little less “random” …
Example 3
• $I_{j+1} = 5 I_j \bmod 13$
• Produces one of the sequences:
  • … 1, 5, 12, 8, 1, …
  • … 2, 10, 11, 3, 2, …
  • … 4, 7, 9, 6, 4, …
• In this case, if m = 13, a = 5 is a very poor choice
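The three examples can be checked mechanically by counting how many steps a sequence takes to return to its starting value. The sketch below assumes a small prime m, so that a * x fits in an int and every starting value lies on a cycle; the function name lcg_period is hypothetical.

```c
/* Length of the cycle of I_{j+1} = a * I_j mod m that starts at `start`.
 * Intended for small m only: a * x must fit in an int, and `start`
 * must lie on a cycle (true for 0 < start < m when m is prime and
 * a is not a multiple of m). */
int lcg_period(int a, int m, int start)
{
    int x = start;
    int steps = 0;

    do {
        x = (a * x) % m;
        steps++;
    } while (x != start);

    return steps;
}
```

With m = 13, lcg_period returns 12 for a = 6 and a = 7 (full period) but only 4 for a = 5, whichever of the three cycles the start value falls in.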
Practical Values for a and m
• Do not choose your own (dangerous!)
• Rely on values that are known to work.
• Good sources:
  • Numerical Recipes in C
  • Park and Miller (1988) Communications of the ACM
• We will use a = 16807 and m = 2147483647
A Random Number Generator
/* This implementation will not work in
 * many systems, due to integer overflows
 */

static int seed = 1;

double Random()
{
    int a = 16807;
    int m = 2147483647;   /* 2^31 - 1 */

    seed = (a * seed) % m;
    return seed / (double) m;
}

/* If this is working properly, starting with seed = 1,
 * the 10,000th call produces seed = 1043618065
 */
A Random Number Generator
/* This implementation will only work with newer compilers that
 * support 64-bit integer variables of type long long
 */

static long long seed = 1;

double Random()
{
    long long a = 16807;
    long long m = 2147483647;   /* 2^31 - 1 */

    seed = (a * seed) % m;
    return seed / (double) m;
}

/* If this is working properly, starting with seed = 1,
 * the 10,000th call produces seed = 1043618065
 */
Practical Computation

• Many systems will not represent integers larger than 2^32
• We need a practical calculation where:
  • Results cover nearly all possible integers
  • Intermediate values do not exceed 2^32
The Solution
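• Let $m = aq + r$
• Where
  • $q = \lfloor m / a \rfloor$
  • $r = m \bmod a$
  • $r < q$
• Then

$$
a I_j \bmod m =
\begin{cases}
a (I_j \bmod q) - r \lfloor I_j / q \rfloor & \text{if this is} \ge 0 \\
a (I_j \bmod q) - r \lfloor I_j / q \rfloor + m & \text{otherwise}
\end{cases}
$$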
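The decomposition can be compared directly against a 64-bit computation. This is a sketch under the stated conditions (0 < x < m and r < q); the names schrage_mulmod and direct_mulmod are hypothetical, and the long long reference is there only for checking.

```c
/* Computes (a * x) % m without ever forming the product a * x.
 * Requires 0 < x < m and r < q, where q = m / a and r = m % a.
 * Both a * (x % q) and r * (x / q) stay below m, so the
 * difference fits in a 32-bit int. */
int schrage_mulmod(int a, int x, int m)
{
    int q = m / a;
    int r = m % a;
    int result = a * (x % q) - r * (x / q);

    return result < 0 ? result + m : result;
}

/* Reference using 64-bit arithmetic, for comparison only */
int direct_mulmod(int a, int x, int m)
{
    return (int) (((long long) a * x) % m);
}
```

For a = 16807 and m = 2147483647 this gives q = 127773 and r = 2836, so the condition r < q holds.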
Random Number Generator:
A Portable Implementation
#define RAND_A 16807
#define RAND_M 2147483647
#define RAND_Q 127773
#define RAND_R 2836
#define RAND_SCALE (1.0 / RAND_M)

static int seed = 1;

double Random()
{
    /* k = seed / q, so that (seed - k * RAND_Q) = seed mod q */
    int k = seed / RAND_Q;

    seed = RAND_A * (seed - k * RAND_Q) - k * RAND_R;

    if (seed < 0) seed += RAND_M;

    return seed * (double) RAND_SCALE;
}
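The check value quoted in the earlier comments (seed = 1043618065 after 10,000 calls, starting from seed = 1) can be used to confirm that a portable implementation is correct. A minimal sketch follows; NextSeed and CheckGenerator are hypothetical names, and the arithmetic is the same as in Random() above.

```c
#define RAND_A 16807
#define RAND_M 2147483647
#define RAND_Q 127773
#define RAND_R 2836

static int seed = 1;

/* Advance the generator once and return the raw integer state */
int NextSeed(void)
{
    int k = seed / RAND_Q;

    seed = RAND_A * (seed - k * RAND_Q) - k * RAND_R;
    if (seed < 0) seed += RAND_M;

    return seed;
}

/* Returns 1 if 10,000 steps starting from seed = 1 reach 1043618065 */
int CheckGenerator(void)
{
    int i;

    seed = 1;
    for (i = 0; i < 10000; i++)
        NextSeed();

    return seed == 1043618065;
}
```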
Reliable Generator
• Fast
• Some slight improvements possible:
  • Use a = 48271 (q = 44488 and r = 3399)
  • Use a = 69621 (q = 30845 and r = 23902)
• Still has some subtle weaknesses …
  • E.g. whenever a value < 10^-6 occurs, it will be followed by a value < 0.017, which is 10^-6 * RAND_A
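The weakness is easy to see numerically: a deviate below 10^-6 corresponds to an integer state below about 2147, and multiplying such a state by 16807 never reaches m, so no wrap-around occurs. The sketch below uses 64-bit arithmetic for simplicity; next_state, to_uniform and largest_successor are hypothetical names.

```c
/* One step of the a = 16807, m = 2^31 - 1 generator in 64-bit arithmetic */
long long next_state(long long state)
{
    return (16807LL * state) % 2147483647LL;
}

/* Rescale an integer state to a deviate in (0, 1) */
double to_uniform(long long state)
{
    return state / 2147483647.0;
}

/* Largest deviate that can follow any deviate smaller than `limit` */
double largest_successor(double limit)
{
    long long state;
    double worst = 0.0;

    for (state = 1; to_uniform(state) < limit; state++) {
        double u = to_uniform(next_state(state));
        if (u > worst) worst = u;
    }
    return worst;
}
```

For instance, state 2000 gives a deviate of about 9.3 * 10^-7, and its successor is 33614000 / 2147483647, which is about 0.0157.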
Further Improvements
• Shuffle Output.
  • Generate two sequences, and use one to permute the output of the other.
• Sum Two Sequences.
  • Generate two sequences, and return the sum of the two (modulo the period of either).
Example: Shuffling (Part I)
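// Define RAND_A, RAND_M, RAND_Q, RAND_R and the static variable seed as before
#define RAND_TBL 32
#define RAND_DIV (1 + (RAND_M - 1) / RAND_TBL)

static int random_next = 0;
static int random_tbl[RAND_TBL];

void SetupRandomNumbers(int new_seed)
{
    int j;

    /* Store the seed in the generator state used by Random();
     * zero is not allowed, because the sequence would stay at zero */
    seed = (new_seed == 0) ? 1 : new_seed;

    /* Fill the shuffle table with successive values */
    for (j = RAND_TBL - 1; j >= 0; j--)
    {
        int k = seed / RAND_Q;
        seed = RAND_A * (seed - k * RAND_Q) - k * RAND_R;
        if (seed < 0) seed += RAND_M;
        random_tbl[j] = seed;
    }

    random_next = random_tbl[0];
}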
Example: Shuffling (Part II)
double Random()
{
    // Generate the next number in the sequence
    int k = seed / RAND_Q, index;
    seed = RAND_A * (seed - k * RAND_Q) - k * RAND_R;
    if (seed < 0) seed += RAND_M;

    // Swap it for a previously generated number
    index = random_next / RAND_DIV;
    random_next = random_tbl[index];
    random_tbl[index] = seed;

    // And return the shuffled result …
    return random_next * (double) RAND_SCALE;
}
Shuffling …
• Shuffling improves things, however …
• Requires additional storage …
• If an extremely small value occurs (e.g. < 10^-6) it will be slightly correlated with other nearby extreme values.
Summing Two Sequences (I)
#define RAND_A1 40014
#define RAND_M1 2147483563
#define RAND_Q1 53668
#define RAND_R1 12211

#define RAND_A2 40692
#define RAND_M2 2147483399
#define RAND_Q2 52774   /* RAND_M2 / RAND_A2 */
#define RAND_R2 3791    /* RAND_M2 % RAND_A2 */

#define RAND_SCALE1 (1.0 / RAND_M1)
Summing Two Sequences (II)
static int seed1 = 1, seed2 = 1;

double Random()
{
    int k, result;

    /* Advance the first sequence */
    k = seed1 / RAND_Q1;
    seed1 = RAND_A1 * (seed1 - k * RAND_Q1) - k * RAND_R1;
    if (seed1 < 0) seed1 += RAND_M1;

    /* Advance the second sequence */
    k = seed2 / RAND_Q2;
    seed2 = RAND_A2 * (seed2 - k * RAND_Q2) - k * RAND_R2;
    if (seed2 < 0) seed2 += RAND_M2;

    /* Combine the two, keeping the result in 1 .. RAND_M1 - 1 */
    result = seed1 - seed2;
    if (result < 1) result += RAND_M1 - 1;

    return result * (double) RAND_SCALE1;
}
Summing Two Sequences
• If the sequences are uncorrelated, we can do no harm:
  • If the original sequence is “random”, summing a second sequence will preserve the original randomness
• In the ideal case, the period of the combined sequence will be the least common multiple of the individual periods
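As a rough illustration of the ideal case, the least common multiple can be computed for the two generators used above. This sketch assumes each generator attains its maximal period m - 1 (an assumption, not something shown in the slides); gcd_ll and lcm_ll are hypothetical helper names.

```c
/* Greatest common divisor by Euclid's algorithm */
long long gcd_ll(long long a, long long b)
{
    while (b != 0) {
        long long t = a % b;
        a = b;
        b = t;
    }
    return a;
}

/* Least common multiple; dividing first keeps the intermediate small */
long long lcm_ll(long long a, long long b)
{
    return (a / gcd_ll(a, b)) * b;
}
```

With periods 2147483562 and 2147483398 the greatest common divisor is 2, so the combined period would be about 2.3 * 10^18, roughly a billion times longer than either sequence alone.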
Summing More Sequences
• It is possible to sum more sequences to increase randomness
• One example is the Wichmann-Hill random number generator, where:
  • A1 = 171, M1 = 30269
  • A2 = 172, M2 = 30307
  • A3 = 170, M3 = 30323
• Values for each sequence are:
  • Scaled to the interval (0,1)
  • Summed
  • Integer part of sum is discarded
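The three steps translate almost line for line into C. This is a minimal sketch using the multipliers and moduli listed above; the function and variable names are hypothetical, and the starting states of 1 are an arbitrary choice (each state must start in 1 .. M - 1).

```c
/* States of the three component generators (hypothetical names) */
static int wh_s1 = 1, wh_s2 = 1, wh_s3 = 1;

double WichmannHill(void)
{
    double sum;

    /* The moduli are small, so the products fit easily in an int */
    wh_s1 = (171 * wh_s1) % 30269;
    wh_s2 = (172 * wh_s2) % 30307;
    wh_s3 = (170 * wh_s3) % 30323;

    /* Scale each value to (0,1) and add them up */
    sum = wh_s1 / 30269.0 + wh_s2 / 30307.0 + wh_s3 / 30323.0;

    /* Discard the integer part of the sum */
    return sum - (int) sum;
}
```

Because the moduli are so small, this generator needs no special tricks to avoid overflow, even with 32-bit integers.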
So far …
• Uniformly distributed random numbers
  • Using Lehmer’s algorithm
  • Work well for carefully selected parameters
• “Randomness” can be improved:
  • Through shuffling
  • Summing two sequences
  • Or both (see Numerical Recipes for an example)
Random Numbers in R
• In R, multiple generators are supported
• To select a specific sequence use:
  • RNGkind() -- select algorithm
  • RNGversion() -- mimics older R versions
  • set.seed() -- selects specific sequence
• Use help(RNGkind) for details

Random Numbers in R
• Many custom functions:
  • runif(n, min = 0, max = 1)
  • rnorm(n, mean = 0, sd = 1)
  • rt(n, df)
  • rchisq(n, df, ncp = 0)
  • rf(n, df1, df2)
  • rexp(n, rate = 1)
  • rgamma(n, shape, rate = 1)
Sampling from Arbitrary Distributions
• The general approach for sampling from an arbitrary distribution is to:
• Define
  • Cumulative distribution function $F(x)$
  • Inverse cumulative distribution function $F^{-1}(u)$
• Sample $u \sim U(0,1)$
• Evaluate $x = F^{-1}(u)$
Example: Exponential Distribution
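• Consider:
  • $f(x) = e^{-x}$
  • $F(x) = 1 - e^{-x}$
  • $F^{-1}(y) = -\ln(1 - y)$

#include <math.h>

double RandomExp()
{
    /* If U is uniform on (0,1) then so is 1 - U,
     * so log(U) can be used in place of log(1 - U) */
    return -log(Random());
}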
Example: Categorical Data
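• To sample from a discrete set of outcomes, use:

/* probs[i] is the probability of outcome i; the values should sum to 1 */
int SampleCategorical(int outcomes, double * probs)
{
    double prob = Random();
    int outcome = 0;

    /* Walk through the outcomes, subtracting each probability,
     * until the remaining value falls within the current outcome */
    while (outcome + 1 < outcomes && prob > probs[outcome])
    {
        prob -= probs[outcome];
        outcome++;
    }

    return outcome;
}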
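A quick way to exercise the sampler is to tally many draws and compare the observed frequencies with the requested probabilities. The sketch below repeats SampleCategorical so that it is self-contained; the 64-bit Random() stand-in, the state variable cat_seed and the helper TallyCategorical are assumptions for illustration.

```c
static long long cat_seed = 1;   /* generator state (hypothetical name) */

/* Stand-in uniform generator: a = 16807, m = 2^31 - 1, 64-bit arithmetic */
double Random(void)
{
    cat_seed = (16807 * cat_seed) % 2147483647;
    return cat_seed / 2147483647.0;
}

int SampleCategorical(int outcomes, double *probs)
{
    double prob = Random();
    int outcome = 0;

    while (outcome + 1 < outcomes && prob > probs[outcome]) {
        prob -= probs[outcome];
        outcome++;
    }
    return outcome;
}

/* Draw `draws` samples and count how often each outcome occurs */
void TallyCategorical(int outcomes, double *probs, int draws, int *counts)
{
    int i;

    for (i = 0; i < outcomes; i++)
        counts[i] = 0;
    for (i = 0; i < draws; i++)
        counts[SampleCategorical(outcomes, probs)]++;
}
```

For probabilities {0.2, 0.5, 0.3} and 100,000 draws, the counts should come out near 20,000, 50,000 and 30,000.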
More Useful Examples
• Numerical Recipes in C has additional examples, including algorithms for sampling from normal and gamma distributions
The Mersenne Twister
• Current gold standard random generator
• Web: www.math.sci.hiroshima-u.ac.jp/~m-mat/MT/emt.html
  • Or Google for “Mersenne Twister”
• Has a very long period ($2^{19937} - 1$)
• Equi-distributed in up to 623 dimensions
Recommended Reading
• Numerical Recipes in C
  • Chapters 7.1 – 7.3
• Park and Miller (1988) “Random Number Generators: Good Ones Are Hard To Find”, Communications of the ACM
Implementation Without Division
• Let a = 16807 and m = 2147483647
• It is actually possible to implement the Park-Miller generator without any divisions
  • Division is 20-40x slower than other operations
• Solution proposed by D. Carta (1990)
A Random Number Generator
/* This implementation is very fast, because there is no division */

static unsigned int seed = 1;

int RandomInt()
{
    // After the calculation below, (hi << 16) + lo = seed * 16807
    unsigned int lo = 16807 * (seed & 0xFFFF);   // Multiply lower 16 bits by 16807
    unsigned int hi = 16807 * (seed >> 16);      // Multiply higher 16 bits by 16807

    // After these lines, seed * 16807 = 2^31 * hi + lo
    lo += (hi & 0x7FFF) << 16;   // Combine lower 15 bits of hi with lo's upper bits
    hi >>= 15;                   // Discard the lower 15 bits of hi

    // value % (2^31 - 1) = (2^31 * hi + lo) % (2^31 - 1)
    //                    = ((2^31 - 1) * hi + hi + lo) % (2^31 - 1)
    //                    = (hi + lo) % (2^31 - 1)
    lo += hi;

    // No division required, since hi + lo is always < 2^32 - 2
    if (lo > 2147483647) lo -= 2147483647;

    return (seed = lo);
}
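The division-free arithmetic can be checked against a direct 64-bit computation. In this sketch the update is rewritten as a pure function of the state so that both versions can be stepped side by side; CartaStep, DirectStep and CountMismatches are hypothetical names, and unsigned int is assumed to be at least 32 bits wide.

```c
/* Carta's division-free update, as a function of the current state */
unsigned int CartaStep(unsigned int state)
{
    unsigned int lo = 16807 * (state & 0xFFFF);
    unsigned int hi = 16807 * (state >> 16);

    lo += (hi & 0x7FFF) << 16;
    hi >>= 15;
    lo += hi;
    if (lo > 2147483647) lo -= 2147483647;

    return lo;
}

/* Reference update using 64-bit multiplication and a real modulo */
unsigned int DirectStep(unsigned int state)
{
    return (unsigned int) ((16807ULL * state) % 2147483647ULL);
}

/* Step both versions from `start` and count how often they disagree */
int CountMismatches(unsigned int start, int steps)
{
    unsigned int state = start;
    int i, mismatches = 0;

    for (i = 0; i < steps; i++) {
        if (CartaStep(state) != DirectStep(state)) mismatches++;
        state = DirectStep(state);
    }
    return mismatches;
}
```

Starting from 1, the two versions should agree at every step, and 10,000 steps should reach the check value 1043618065 quoted earlier.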
