Chapter 3 Probability
Chapter 3 Probability
Chapter 3
Probability
Objectives
After completing this chapter, you will be able to:
Introduction
The world is an uncertain place. Making predictions about something as
seemingly mundane as tomorrow’s weather, for example, is actually quite a difficult
task. Even with the most advanced computers and models of the modern era, weather
forecasters still cannot say with absolute certainty whether it will rain tomorrow. The
best they can do is to report their best estimate of the chance that it will rain tomorrow.
For example, if the forecasters are fairly confident that it will rain tomorrow, they might
say that there is a 90% chance of rain. You have probably heard statements like this
your entire life, but have you ever asked yourself what exactly it means to say that there
is a 90% chance of rain?
Let us consider an even more basic example: tossing a coin. If the coin is fair,
then it is just as likely to come up heads as it is to come up tails. In other words, if we
were to repeatedly toss the coin many times, we would expect about about half of the
tosses to be heads and and half to be tails. In this case, we say that the probability of
getting a head is 1/2 or 0.5.
Note that when we say the probability of a head is 1/2, we are not claiming that
any sequence of coin tosses will consist of exactly 50% heads. If we toss a fair coin ten
times, it would not be surprising to observe 6 heads and 4 tails, or even 3 heads and 7
tails. But as we continue to toss the coin over and over again, we expect the long-run
frequency of heads to get ever closer to 50%. In general, it is important in statistics to
understand the distinction between theoretical and empirical quantities. Here, the true
(theoretical) probability of a head was 1/2, but any realized (empirical) sequence of coin
tosses may have more or less than exactly 50% heads.
Now suppose instead that we were to toss an unusual coin with heads on both of
its faces. Then every time we flip this coin we will observe a head — we say that the
probability of a head is 1. The probability of a tail, on the other hand, is 0. Note that
ENGINEERING DATA ANALYSIS | MODULE 2
there is no way we can further modify the coin to make flipping a head even more likely.
Thus, a probability is always a number between 0 and 1 inclusive.
First Concepts
Terminology
When we later discuss examples that are more complicated than flipping a coin,
it will be useful to have an established vocabulary for working with probabilities. A
probabilistic experiment (such as tossing a coin or rolling a die) has several
components. The sample space is the set of all possible outcomes in the experiment.
We usually denote the sample space by Ω, the Greek capital letter “Omega.” So in a
coin toss experiment, the sample space is
since there are only two possible outcomes: heads (H) or tails (T). Different experiments
have different sample spaces. So if we instead consider an experiment in which we roll
a standard six-sided die, the sample space is
Collections of outcomes in the sample space Ω are called events, and we often
use capital Roman letters to denote these collections. We might be interested in the
event that we roll an even number, for example. If we call this event E, then
for any event A ⊆ Ω. This should make sense given that we’ve already said that a
probability of 0 is assigned to an impossible event, basic probability 7 and there is no
way for something to be less likely than something that is impossible!
The next axiom is that the sum of the probabilities of all the outcomes in Ω must be 1.
We can restate this requirement by the equation
so a = 1/6. In this example, we were able to use the symmetry of the experiment along
with one of the probability axioms to determine the probability of rolling any number.
Once we know the probabilties of the outcomes in an experiment, we can compute the
probability of any event. This is because the probability of an event is the sum of the
probabilities of the outcomes it comprises. In other words, for an event A ⊆ Ω, the
probability of A is
To illustrate this equation, let us find the probability of rolling an even number, an
event which we will denote by E. Since E = {2, 4, 6}, we simply add the probabilities of
these three outcomes to obtain
What is the probability that we get at least one H? Solution. One way to solve this
problem is to add up the probabilities of all outcomes that have at least one H. We
would get
Another way to do this is to find the probability that we don’t flip at least one H,
and subtract that probability from 1. This would give us the probability that we do flip at
least one H. The only outcome in which we don’t flip at least one H is if we flip T both
times. We would then compute
Then to get the complement of this event, i.e. the event where we do flip at least
one H, we subtract the above probability from 1. This gives us
ENGINEERING DATA ANALYSIS | MODULE 4
Wowee! Both methods for solving this problem gave the same answer. Notice that in
the second calculation, we had to sum up fewer probabilities to get the answer. It can
often be the case that computing the probability of the complement of an event and
subtracting that from 1 to find the probability of the original event requires less work.
Tips
• The probability of an event can only be between 0 and 1 and can also be written as a
percentage.
• The probability of event AAA is often written as P(A)P(A)P, left parenthesis, A, right
parenthesis.
• If P(A) > P(B)P(A)>P(B)P, left parenthesis, A, right parenthesis, is greater than, P, left
parenthesis, B, right parenthesis, then event AAA has a higher chance of occurring
than event BBB.
• If P(A) = P(B)P(A)=P(B)P, left parenthesis, A, right parenthesis, equals, P, left
parenthesis, B, right parenthesis, then events AAA and BBB are equally likely to
occur.
Probability of events
Probability = The number of wanted outcomes Thenumber of possible outcomes
Example
What is the probability to get a 6 when you roll a die?
A die has 6 sides, 1 side contain the number 6 that give us 1 wanted outcome in 6
possible outcomes.
Independent events: Two events are independent when the outcome of the first
event does not influence the outcome of the second event.
When we determine the probability of two independent events we multiply the
probability of the first event by the probability of the second event.
P(XandY)=P(X)⋅P(Y)P(XandY)=P(X)⋅P(Y)
To find the probability of an independent event we are using this rule:
Example
If one has three dice what is the probability of getting three 4s?
The probability of getting a 4 on one die is 1/6
The probability of getting 3 4s is:
P(4and4and4)=16⋅16⋅16=1216P(4and4and4)=16⋅16⋅16=1216
ENGINEERING DATA ANALYSIS | MODULE 7
When the outcome affects the second outcome, which is what we called
dependent events.
Dependent events: Two events are dependent when the outcome of the first
event influences the outcome of the second event. The probability of two dependent
events is the product of the probability of X and the probability of Y AFTER X occurs.
P(XandY)=P(X)⋅P(Yafterx)P(XandY)=P(X)⋅P(Yafterx)
Example
What is the probability for you to choose two red cards in a deck of cards?
A deck of cards has 26 black and 26 red cards. The probability of choosing a red card
randomly is:
P(red)=2652=12P(red)=2652=12
The probability of choosing a second red card from the deck is now:
P(red)=2551P(red)=2551
The probability:
P(2red)=12⋅2551=25102P(2red)=12⋅2551=25102
Two events are mutually exclusive when two events cannot happen at the same
time. The probability that one of the mutually exclusive events occur is the sum of their
individual probabilities.
P(XorY)=P(X)+P(Y)P(XorY)=P(X)+P(Y)
An example of two mutually exclusive events is a wheel of fortune. Let's say you win a
bar of chocolate if you end up in a red or a pink field.
Probability
Let denote the conditional probability of given that has already occurred,
then
(9)
(10)
(11)
(12)
(13)
(14)
The relationship
(15)
holds if and are independent events. A very important result states that
(16)
which can be generalized to
(17)
Lecture Notes
Below are listed the terms that usually used in Statistics and Analysis and
definition.
Definitions
Average - The average is a number that is one way to find the typical value of a set of
numbers. You find the average by adding up all the numbers and then dividing the total
by the number of numbers in the set.
Example:
To find the average of the data set (1, 3, 3, 4, 4, 5, 8)
Add all the values together 1+3+3+4+4+5+8 = 28
Then divide by the total number of values 28 ÷ 7 = 4
The average value is 4.
Dependent event - Events are dependent if the occurrence of either event affects the
probability of the occurrence of the other event. In other words, one event depends on
the other.
Extrapolate - Extrapolation is a way to estimate values beyond the known data. You
ENGINEERING DATA ANALYSIS | MODULE 10
can use patterns and graphs to determine other possible data points that were not
actually measured.
Frequency - The frequency is how often an event occurs during a specific amount of
time.
Interpolate - Interpolation is a way to estimate data. When you interpolate you estimate
the data between two known points on a graph. This can be done by drawing a curve or
line between the two points.
Interval - The set of numbers between two other numbers in a data set. It often refers to
a period of time between two events.
Percent - A percent is a special type of fraction where the denominator is 100. It can be
written using the % sign.
Probability - The probability is the chance that an event will or will not occur.
Random - If something is random, then all possible events have an equal chance of
occurring.
Range - The range is the difference between the largest number and the smallest
number in a data set.
Example: The range of the data set (2, 2, 7, 8, 12, 7, 2, 14) is 14 -2 = 12.
Ratio - A ratio is a comparison of two numbers. It can be written a few different ways.
Example: The following are all a way to write the same ratio: 1/2 , 1:2, 1 of 2
Slope - A number that indicates the incline or steepness of a line on a graph. Slope
equals the "rise" over the "run" on a graph. This can also be written as the change in y
over the change in x.
Example: If two points on a line are (x1, y1) and (x2, y2), then the slope = (y2 - y1) ÷
(x2-x1).
ENGINEERING DATA ANALYSIS | MODULE 11
Video Links:
Probability
• https://www.youtube.com/watch?v=yUaI0JriZtY&vl=
en
• https://www.youtube.com/watch?v=KzfWUEJjG18
• https://www.youtube.com/watch?v=uzkc-qNVoOk
References
• https://seeing-theory.brown.edu/doc/basic-
probability.pdf
• https://mathworld.wolfram.com/Probability.html
• https://www.ducksters.com/kidsmath/probability_stat
istics_glossary.php#:~:text=Probability%20%2D%20
The%20probability%20is%20the,number%20in%20
a%20data%20set.
• https://www.khanacademy.org/math/statistics-
probability/probability-library/basic-theoretical-
probability/a/probability-the-
basics#:~:text=Probability%20is%20simply%20how
%20likely,by%20probability%20is%20called%20stat
istics.