© Ben Galili IDC
¡ After grades are published, you have one week to send an email with your appeal (for HW1, the one-week window starts now)
¡ You will get a response email with the appeal's decision
¡ The grade will be updated only in the Excel file (uploaded to the first section in Moodle)
¡ A family of learning algorithms that:
§ Don't build a model of the data (like the tree in Decision Tree)
§ Instead, compare a new instance with the instances seen in training
¡ Time complexity:
§ Fast learning (no learning, really…)
§ Potentially slow classification/prediction (O(n))
¡ Space complexity:
§ Store all training instances (O(n))
¡ Used in both Classification and Regression
¡ How to find the nearest? √
§ We know the possible distance methods & we use X-fold cross validation to choose the best one
¡ Slow query & large space √
§ We are now able to reduce space (drop irrelevant points) & accelerate query time (K-D tree, reducing calculation time)
¡ How to choose k? √
§ We use X-fold cross validation to choose the best one
¡ This assignment has 3 phases:
§ First
▪ Implement a feature scaler
§ Second
▪ Implement the kNN algorithm
▪ Use cross validation in order to find the best hyperparameters (k, p for the distance method, weighted / uniform majority)
§ Third
▪ Examine the influence of the number of folds on the running time of each fold and on the total running time
▪ Implement an efficient-distance kNN and see how it affects the running time
¡ Feature Scaling
§ 1 class: FeatureScaler
§ Should receive an Instances object and return a scaled Instances object
§ We'll use standardization for scaling in this assignment: $x' = \dfrac{x - \mu}{\sigma}$
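A minimal sketch of such a scaler in plain Java (illustrative only: the assignment's FeatureScaler receives and returns a Weka Instances object, while this sketch works on plain double arrays and uses made-up names):

```java
// Minimal standardization sketch: x' = (x - mean) / stdDev per attribute.
public class SimpleScaler {

    // Assumes data[row][col] holds numeric attribute values only.
    public static double[][] standardize(double[][] data) {
        int n = data.length, d = data[0].length;
        double[][] scaled = new double[n][d];
        for (int j = 0; j < d; j++) {
            double mean = 0;
            for (int i = 0; i < n; i++) mean += data[i][j];
            mean /= n;
            double var = 0;
            for (int i = 0; i < n; i++) var += (data[i][j] - mean) * (data[i][j] - mean);
            double std = Math.sqrt(var / n);
            for (int i = 0; i < n; i++) {
                // Guard against zero variance (constant attribute).
                scaled[i][j] = std == 0 ? 0 : (data[i][j] - mean) / std;
            }
        }
        return scaled;
    }
}
```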
¡ Implement kNN
§ 2 classes: MainHW3 & kNN
§ The kNN class is the algorithm object
▪ You need to think about which properties the class needs (hint: think about which parameters the kNN algorithm needs)
§ MainHW3 should find the best combination of k, p (the distance method) and the voting method – it should go over all combinations and select the one with the smallest error using cross validation (see the sketch below)
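A minimal sketch of the kNN idea in plain Java (illustrative: plain arrays instead of Weka's Instances, and the 1/d² weighting for the weighted vote is an assumption, not necessarily the assignment's required definition):

```java
import java.util.Arrays;
import java.util.Comparator;

// Illustrative kNN classifier: l_p distance, k nearest, uniform or weighted vote.
public class SimpleKnn {
    private final double[][] trainX;
    private final int[] trainY;
    private final int k;
    private final double p;         // exponent of the l_p (Minkowski) distance
    private final boolean weighted; // weighted vs. uniform majority

    public SimpleKnn(double[][] trainX, int[] trainY, int k, double p, boolean weighted) {
        this.trainX = trainX; this.trainY = trainY;
        this.k = k; this.p = p; this.weighted = weighted;
    }

    private double distance(double[] a, double[] b) {
        double sum = 0;
        for (int i = 0; i < a.length; i++) sum += Math.pow(Math.abs(a[i] - b[i]), p);
        return Math.pow(sum, 1.0 / p);
    }

    public int predict(double[] x, int numClasses) {
        int n = trainX.length;
        double[] dist = new double[n];
        Integer[] idx = new Integer[n];
        for (int i = 0; i < n; i++) { dist[i] = distance(trainX[i], x); idx[i] = i; }
        // Sort training indices by distance to x and take the k nearest.
        Arrays.sort(idx, Comparator.comparingDouble((Integer i) -> dist[i]));
        double[] votes = new double[numClasses];
        for (int j = 0; j < Math.min(k, n); j++) {
            int i = idx[j];
            double w = weighted ? 1.0 / (dist[i] * dist[i] + 1e-12) : 1.0; // weighted vote: 1/d^2
            votes[trainY[i]] += w;
        }
        int best = 0;
        for (int c = 1; c < numClasses; c++) if (votes[c] > votes[best]) best = c;
        return best;
    }
}
```

MainHW3 would then loop over every combination of k, p and voting method, estimate each combination's error with X-fold cross validation, and keep the combination with the smallest average error.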
¡ Efficient kNN: Efficient Distance Check
§ After you have found the best kNN parameters, implement the efficient kNN
§ You need to implement an efficient distance check:
$$d_p(x_1, x_2) = \left( \sum_{i=1}^{n} \left| x_{1,i} - x_{2,i} \right|^p \right)^{1/p}$$
§ Remember – the goal is to stop iterating once we are
above a desired threshold
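A minimal sketch of such an early-abandoning check in plain Java (illustrative names; the threshold would typically be the distance to the current k-th nearest neighbour):

```java
// Early-abandoning l_p distance: stop summing once the partial sum already
// exceeds threshold^p, since the final distance can only be larger.
public final class EfficientDistance {

    // Returns the l_p distance, or Double.POSITIVE_INFINITY as soon as the
    // partial sum proves the distance exceeds `threshold`.
    public static double lpDistance(double[] a, double[] b, double p, double threshold) {
        double limit = Math.pow(threshold, p);
        double sum = 0;
        for (int i = 0; i < a.length; i++) {
            sum += Math.pow(Math.abs(a[i] - b[i]), p);
            if (sum > limit) return Double.POSITIVE_INFINITY; // cannot be a k-nearest neighbour
        }
        return Math.pow(sum, 1.0 / p);
    }
}
```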
¡ Our previous models didn't use probability calculations (except maybe in the goodness-of-split measure)
¡ The most intuitive algorithm is to return the majority class, or in other words – return the most probable class according to the training set
¡ Today's agenda – probabilistic algorithms – algorithms that use probability techniques in order to predict the class of a new instance
¡ Sample space
¡ A sample space is the set of all possible outcomes:
§ For a coin toss this is the sample space: S = {H, T}
§ For rolling a die this is the sample space:
S = {1, 2, 3, 4, 5, 6}
§ For rolling two dice this is the sample space:
S = {(1,1), (1,2), (1,3), (1,4), …, (6,5), (6,6)}
¡ Events
¡ Any subset of the sample space is called an event
§ For rolling two dice, e.g. the event "the sum is 7":
E = {(1,6), (2,5), (3,4), (4,3), (5,2), (6,1)}
¡ Events
¡ Some basic operations on events:
§ Union
§ Intersection
§ Complement
¡ Random variable
¡ A random variable is a function of the outcome
¡ For example, the sum of two dice (not the two numbers that come up):
§ Let X be a random variable denoting the sum of
two dice rolls:
▪ $P(X = 1) = 0$
▪ $P(X = 2) = P(\{(1,1)\}) = \frac{1}{36}$
▪ $P(X = 4) = P(\{(1,3), (3,1), (2,2)\}) = \frac{3}{36}$
¡ Random variable
¡ We can now define the expected value of a random variable:
§ For a discrete variable:
$$E[X] = \sum_{x} x \, p(x)$$
* where p is the probability mass function (pmf)
§ For a continuous variable:
$$E[X] = \int_{-\infty}^{\infty} x \, f(x) \, dx$$
* where f is the probability density function (pdf)
¡ Random variable
¡ The variance:
$$\sigma^2 = Var(X) = E\big[ (X - E[X])^2 \big]$$
¡ The standard deviation (= square root of the variance):
$$\sigma = \sqrt{Var(X)} = \sqrt{E\big[ (X - E[X])^2 \big]}$$
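As a quick sanity check of these definitions, for a single fair die roll:

$$E[X] = \sum_{x} x\,p(x) = \tfrac{1}{6}(1+2+3+4+5+6) = 3.5$$
$$Var(X) = E[X^2] - (E[X])^2 = \tfrac{91}{6} - 3.5^2 = \tfrac{35}{12} \approx 2.92$$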
¡ $P(A \cup B) = \,?$
$$P(A \cup B) = P(A) + P(B) - P(A \cap B)$$
¡ $P(A \cup B \cup C) = \,?$
$$\begin{aligned}
P(A \cup B \cup C) &= P(A \cup B) + P(C) - P((A \cup B) \cap C) \\
&= P(A) + P(B) - P(A \cap B) + P(C) - P((A \cap C) \cup (B \cap C)) \\
&= P(A) + P(B) - P(A \cap B) + P(C) - P(A \cap C) - P(B \cap C) + P(A \cap B \cap C) \\
&= P(A) + P(B) + P(C) - P(A \cap B) - P(A \cap C) - P(B \cap C) + P(A \cap B \cap C)
\end{aligned}$$
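A small sanity check with one die: let A be "the result is even" and B be "the result is greater than 3":

$$P(A \cup B) = P(A) + P(B) - P(A \cap B) = \tfrac{3}{6} + \tfrac{3}{6} - \tfrac{2}{6} = \tfrac{4}{6}$$

which matches counting the outcomes $\{2, 4, 5, 6\}$ directly.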
¡ As we said before, the simplest way is to ask which class has the higher probability in the training set
¡ What is the probability that you'll pass the exam?
§ We have training data with the previous year's results
§ There are 2 classes: 'Pass' or 'Fail'
§ 'Pass' probability is 90%, and 'Fail' is 10%
§ So everyone here has a 90% chance to pass the exam
§ This uses the prior probability, which in our context is the class distribution in the training set
¡ Conditional probability:
§ $P(A \mid B) = \dfrac{P(A \cap B)}{P(B)}$
§ $P(A \mid B_1) = \,?$
▪ $\dfrac{P(A \cap B_1)}{P(B_1)} = 1$
§ $P(A \mid B_2) = \,?$
▪ $\dfrac{P(A \cap B_2)}{P(B_2)} = 0.75$
§ $P(A \mid B_3) = \,?$
▪ $\dfrac{P(A \cap B_3)}{P(B_3)} = 0$
¡ Example:
§ $P(Pass) = 90\%$
§ $P(Fail) = 10\%$
¡ We also know that:
§ $P(\text{Learned for the test} \mid Pass) = 90\%$
§ $P(\text{Didn't learn} \mid Pass) = 10\%$
§ $P(\text{Learned for the test} \mid Fail) = 5\%$
§ $P(\text{Didn't learn} \mid Fail) = 95\%$
¡ What is the probability that you pass the test if you learned?
§ $P(Pass \cap \text{Learned for the test}) = P(Pass) \times P(\text{Learned for the test} \mid Pass) = 90\% \times 90\% = 81\%$
§ $P(Pass \cap \text{Didn't learn}) = P(Pass) \times P(\text{Didn't learn} \mid Pass) = 90\% \times 10\% = 9\%$
§ $P(Fail \cap \text{Learned for the test}) = P(Fail) \times P(\text{Learned for the test} \mid Fail) = 10\% \times 5\% = 0.5\%$
§ $P(Fail \cap \text{Didn't learn}) = P(Fail) \times P(\text{Didn't learn} \mid Fail) = 10\% \times 95\% = 9.5\%$
§ $P(\text{Learned for the test}) = P(Pass \cap \text{Learned for the test}) + P(Fail \cap \text{Learned for the test}) = 81\% + 0.5\% = 81.5\%$
§ $P(\text{Didn't learn}) = P(Pass \cap \text{Didn't learn}) + P(Fail \cap \text{Didn't learn}) = 9\% + 9.5\% = 18.5\%$
§ $P(Pass \mid \text{Learned for the test}) = \dfrac{P(Pass \cap \text{Learned for the test})}{P(\text{Learned for the test})} = \dfrac{81\%}{81.5\%} \approx 99\%$
§ $P(Fail \mid \text{Learned for the test}) = \dfrac{P(Fail \cap \text{Learned for the test})}{P(\text{Learned for the test})} = \dfrac{0.5\%}{81.5\%} \approx 1\%$
§ $P(Pass \mid \text{Didn't learn}) = \dfrac{P(Pass \cap \text{Didn't learn})}{P(\text{Didn't learn})} = \dfrac{9\%}{18.5\%} \approx 49\%$
§ $P(Fail \mid \text{Didn't learn}) = \dfrac{P(Fail \cap \text{Didn't learn})}{P(\text{Didn't learn})} = \dfrac{9.5\%}{18.5\%} \approx 51\%$
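The calculation above is easy to verify in code. Below is a minimal sketch (plain Java, class and variable names are illustrative) that reproduces the posteriors from the priors and likelihoods given on the previous slides:

```java
// Reproduces the pass/fail posterior calculation from the priors and
// class-conditional (likelihood) probabilities.
public class ExamBayes {
    public static void main(String[] args) {
        double pPass = 0.90, pFail = 0.10;
        double pLearnGivenPass = 0.90, pLearnGivenFail = 0.05;

        // Joint probabilities: P(class AND learned) = P(class) * P(learned | class)
        double jointPassLearn = pPass * pLearnGivenPass;   // 0.81
        double jointFailLearn = pFail * pLearnGivenFail;   // 0.005

        // Evidence: P(learned) = sum of the joints
        double pLearn = jointPassLearn + jointFailLearn;   // 0.815

        // Posteriors via Bayes rule: P(class | learned) = P(class AND learned) / P(learned)
        System.out.printf("P(Pass | learned) = %.3f%n", jointPassLearn / pLearn); // ~0.994
        System.out.printf("P(Fail | learned) = %.3f%n", jointFailLearn / pLearn); // ~0.006
    }
}
```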
¡ Independent events
§ If $P(A \cap B) = P(A)\,P(B)$ then A & B are independent
§ From conditional probability we get:
$$P(A \mid B) = \frac{P(A \cap B)}{P(B)} \;\;\Rightarrow\;\; P(A \cap B) = P(A \mid B)\,P(B)$$
§ If A & B are independent:
$$P(A)\,P(B) = P(A \cap B) = P(A \mid B)\,P(B) \;\;\Rightarrow\;\; P(A) = P(A \mid B)$$
* And also $P(B) = P(B \mid A)$
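For example, with two fair dice, let A be "the first die shows 6" and B be "the second die shows 6":

$$P(A \cap B) = \tfrac{1}{36} = \tfrac{1}{6} \cdot \tfrac{1}{6} = P(A)\,P(B)$$

so the two events are independent, and indeed $P(A \mid B) = \tfrac{1/36}{1/6} = \tfrac{1}{6} = P(A)$.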
¡ The likelihood is the class-conditional probability – the probability of an instance, given the class
§ For an instance x, and 2 possible classes A, B:
[Figure: the two class-conditional densities P(x|A) and P(x|B) plotted over x]
If x = 12, we'll predict B,
because P(x|B) > P(x|A)
¡ If we return to the previous example (fail \ pass), the likelihood is like asking:
§ What is the probability that someone learned for the test, given that he passed the exam
¡ But we wanted to know the probability of passing \ failing the exam given that you learned
¡ So we need a way to go from the likelihood to the posterior probability
¡ Bayes rule:
$$P(A \mid x) = \frac{P(x \mid A)\,P(A)}{P(x)}$$
¡ With this rule we can convert the likelihood to the posterior probability, if we also have the prior probability
¡ A classifier that classifies A if P(A|x) > P(B|x) is a classifier that maximizes the posterior probability – MAP
¡ The classification with MAP depends on both the likelihood and the prior probabilities
¡ So if we want to classify according to MAP:
§ We will classify A if
$$P(A \mid x) = \frac{P(x \mid A)\,P(A)}{P(x)} > \frac{P(x \mid B)\,P(B)}{P(x)} = P(B \mid x)$$
$$P(x \mid A)\,P(A) > P(x \mid B)\,P(B)$$
§ Note that P(x) is removed from both sides' denominators simply because it is the same
¡ This classification rule minimizes the error:
§ If we classify B, then $P(error \mid x) = P(A \mid x)$
§ If we classify A, then $P(error \mid x) = P(B \mid x)$
¡ But we classify B only if $P(B \mid x) > P(A \mid x)$, and therefore the probability of error is minimal:
$$P(error \mid x) = \min\big[\, P(A \mid x),\; P(B \mid x) \,\big]$$
¡ We can define a loss measure for a wrong decision:
§ 0-1 loss (the simplest one):
$$\lambda_{ij} = \lambda(\text{choose } C_i \mid C_j) = \begin{cases} 1, & \text{if } i \neq j \\ 0, & \text{if } i = j \end{cases}$$
¡ After we have defined the loss we can define the risk, which is the expected loss (for k classes):
$$R(\text{choose } C_i \mid x) = \sum_{j=1}^{k} \lambda_{ij}\, P(C_j \mid x) = \sum_{j \neq i} P(C_j \mid x) = 1 - P(C_i \mid x)$$
¡ A classifier that wants to minimize the risk will choose $C_i$ such that:
$$P(C_i \mid x) > P(C_j \mid x) \quad \forall j \neq i$$
¡ We can use Bayes rule even for a multi-class problem:
$$g_i(x) = P(C_i \mid x) = \frac{P(x \mid C_i)\,P(C_i)}{\sum_{j=1}^{k} P(x \mid C_j)\,P(C_j)}$$
¡ The denominator is the same for all $g_i(x)$, so it can be dropped:
$$g_i(x) = P(x \mid C_i)\,P(C_i)$$
¡ In order to make the classification process more efficient we can use ln():
$$g_i(x) = \ln\big( P(x \mid C_i)\,P(C_i) \big) = \ln P(x \mid C_i) + \ln P(C_i)$$
¡ It helps avoid multiplying small numbers (0-1) and deals better with the normal distribution $e^{f(x)}$
¡ We can do this because ln() is monotonically increasing
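A minimal sketch of this log-space decision rule, assuming the log-likelihoods and log-priors are already available (class and method names are illustrative):

```java
// Log-space MAP decision: comparing ln P(x|C_i) + ln P(C_i) avoids
// underflow from multiplying many small probabilities.
public class LogMap {
    // logLikelihoods[i] = ln P(x | C_i), logPriors[i] = ln P(C_i)
    public static int argmaxPosterior(double[] logLikelihoods, double[] logPriors) {
        int best = 0;
        double bestScore = Double.NEGATIVE_INFINITY;
        for (int i = 0; i < logPriors.length; i++) {
            double score = logLikelihoods[i] + logPriors[i]; // g_i(x) in log space
            if (score > bestScore) { bestScore = score; best = i; }
        }
        return best;
    }
}
```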
¡ Going back to the regression task, we can define the hypothesis to be any function $h(x): X \to Y$ that belongs to the hypothesis space $h \in H$
¡ We want to find the most probable hypothesis
¡ This is a conditional probability problem – find the hypothesis that maximizes the posterior probability
$$P(h \mid D) \propto P(D \mid h)\,P(h)$$
¡ We will assume all $h \in H$ have the same prior probability, and we get that the most probable $h$ is found according to maximum likelihood:
$$h_{ML} = \operatorname*{argmax}_{h \in H} P(D \mid h)$$
¡ Assuming the instances are independent:
$$P(D \mid h) = \prod_{i} P(d_i \mid h)$$
¡ If the error has a normal distribution $\varepsilon_i \sim N(0, \sigma)$, then we can say that the probability that $h(x_i) = d_i$ is the same as the probability that $\varepsilon_i = 0$ according to the normal distribution of $\varepsilon_i$
¡ And we get:
$$h_{ML} = \operatorname*{argmax}_{h \in H} P(D \mid h) = \operatorname*{argmax}_{h \in H} \prod_{i} p(d_i \mid h)$$
$$= \operatorname*{argmax}_{h \in H} \prod_{i} \frac{1}{\sqrt{2\pi\sigma^2}}\, e^{-\frac{(d_i - \mu_i)^2}{2\sigma^2}}$$
$$= \operatorname*{argmax}_{h \in H} \prod_{i} \frac{1}{\sqrt{2\pi\sigma^2}}\, e^{-\frac{(d_i - h(x_i))^2}{2\sigma^2}}$$
$$h_{ML} = \operatorname*{argmax}_{h \in H} \prod_{i} \frac{1}{\sqrt{2\pi\sigma^2}}\, e^{-\frac{1}{2}\left(\frac{h(x_i) - d_i}{\sigma}\right)^2}$$
$$h_{ML} = \operatorname*{argmax}_{h \in H} \ln \prod_{i} \frac{1}{\sqrt{2\pi\sigma^2}}\, e^{-\frac{1}{2}\left(\frac{h(x_i) - d_i}{\sigma}\right)^2}$$
$$h_{ML} = \operatorname*{argmax}_{h \in H} \sum_{i} \left[ \ln \frac{1}{\sqrt{2\pi\sigma^2}} - \frac{1}{2}\left(\frac{h(x_i) - d_i}{\sigma}\right)^2 \right]$$
$$= \operatorname*{argmax}_{h \in H} \sum_{i} -\frac{1}{2}\left(\frac{h(x_i) - d_i}{\sigma}\right)^2$$
$$= \operatorname*{argmax}_{h \in H} \sum_{i} -\big( h(x_i) - d_i \big)^2$$
$$= \operatorname*{argmin}_{h \in H} \sum_{i} \big( h(x_i) - d_i \big)^2$$
¡ Prior classifier: $P(A) > P(B)$
¡ ML classifier: $P(x \mid A) > P(x \mid B)$ – assuming $P(A) = P(B)$
¡ MAP classifier:
$$P(A \mid x) = P(x \mid A)\,P(A) > P(x \mid B)\,P(B) = P(B \mid x)$$
* Dropping $P(x)$ from the denominator
¡ Parametric models
§ If we know \ can guess the distribution type, we can estimate the parameters of the distribution
¡ Non-parametric models
§ Histogram (= count…)
§ Naïve Bayes
¡ For each class we will estimate the distribution parameters according to the training dataset
¡ If we're talking about normal distribution parameters, we need to estimate the mean and the variance:
$$\mu = \frac{1}{m} \sum_{i=1}^{m} x_i \qquad\qquad \sigma^2 = \frac{1}{m} \sum_{i=1}^{m} (x_i - \mu)^2$$
¡ Now we can estimate the parameters of each likelihood probability, for each class:
$$\mu_i = \frac{1}{|C_i|} \sum_{x \in C_i} x \qquad\qquad \sigma_i^2 = \frac{1}{|C_i|} \sum_{x \in C_i} (x - \mu_i)^2$$
¡ And then classify according to the largest probability given by the normal distribution formula:
$$P(x \mid C_i) = \frac{1}{\sqrt{2\pi\sigma_i^2}}\; e^{-\frac{(x - \mu_i)^2}{2\sigma_i^2}}$$
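A minimal sketch of this single-attribute Gaussian approach in plain Java (illustrative names; the per-class attribute values and the priors are assumed to be extracted from the training set beforehand):

```java
// One-attribute Gaussian classifier: estimate (mu, sigma^2) per class from
// the training values, then classify by the largest P(x | C_i) * P(C_i)
// using the normal density.
public class GaussianClassSketch {

    // Returns {mean, variance} of the values of one class.
    public static double[] estimate(double[] classValues) {
        double mu = 0;
        for (double v : classValues) mu += v;
        mu /= classValues.length;
        double var = 0;
        for (double v : classValues) var += (v - mu) * (v - mu);
        var /= classValues.length;
        return new double[]{mu, var};
    }

    public static double normalDensity(double x, double mu, double var) {
        return Math.exp(-(x - mu) * (x - mu) / (2 * var)) / Math.sqrt(2 * Math.PI * var);
    }

    // valuesPerClass[i] = training values of the attribute for class i,
    // priors[i] = P(C_i). Returns the MAP class for x.
    public static int classify(double x, double[][] valuesPerClass, double[] priors) {
        int best = 0;
        double bestScore = -1;
        for (int i = 0; i < valuesPerClass.length; i++) {
            double[] params = estimate(valuesPerClass[i]);
            double score = normalDensity(x, params[0], params[1]) * priors[i];
            if (score > bestScore) { bestScore = score; best = i; }
        }
        return best;
    }
}
```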
¡ But this was good only for 1 attribute
¡ What if we have more than 1?
¡ In this case each likelihood probability will be estimated according to a multivariate normal distribution
¡ For this we will need a mean vector (each entry is the mean of one attribute) and the covariance matrix
$$\Sigma = \begin{bmatrix} \sigma_{11} & \sigma_{12} & \cdots & \sigma_{1d} \\ \sigma_{21} & \sigma_{22} & \cdots & \sigma_{2d} \\ \vdots & \vdots & \ddots & \vdots \\ \sigma_{d1} & \sigma_{d2} & \cdots & \sigma_{dd} \end{bmatrix} = \begin{bmatrix} \sigma_{1}^{2} & \sigma_{12} & \cdots & \sigma_{1d} \\ \sigma_{21} & \sigma_{2}^{2} & \cdots & \sigma_{2d} \\ \vdots & \vdots & \ddots & \vdots \\ \sigma_{d1} & \sigma_{d2} & \cdots & \sigma_{d}^{2} \end{bmatrix}$$
* the diagonal entries are the attribute variances
$|\Sigma|$ – the determinant of the covariance matrix
$\Sigma^{-1}$ – the inverse of the covariance matrix
¡ For each attribute we will find the mean and
the variance as before and we will create the
mean vector and the covariance matrix
¡ We will classify according to the multivariate
normal distribution:
$$P(\bar{x} \mid C_i) = \frac{1}{(2\pi)^{d/2}\,|\Sigma|^{1/2}}\; e^{-\frac{1}{2} (\bar{x} - \bar{\mu})^{T} \Sigma^{-1} (\bar{x} - \bar{\mu})}$$
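A minimal sketch of estimating the mean vector and covariance matrix for one class in plain Java (illustrative; evaluating the density itself also needs $|\Sigma|$ and $\Sigma^{-1}$, which would come from a linear-algebra library and are omitted here):

```java
// Estimate the mean vector and covariance matrix from one class's instances.
public class CovarianceSketch {

    public static double[] meanVector(double[][] x) {
        int n = x.length, d = x[0].length;
        double[] mu = new double[d];
        for (double[] row : x)
            for (int j = 0; j < d; j++) mu[j] += row[j] / n;
        return mu;
    }

    public static double[][] covarianceMatrix(double[][] x) {
        int n = x.length, d = x[0].length;
        double[] mu = meanVector(x);
        double[][] sigma = new double[d][d];
        for (double[] row : x)
            for (int j = 0; j < d; j++)
                for (int k = 0; k < d; k++)
                    sigma[j][k] += (row[j] - mu[j]) * (row[k] - mu[k]) / n;
        return sigma;
    }
}
```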
¡ What if we don't know the type of distribution?
¡ We need another way to estimate the probabilities $P(x \mid C_i)$ and $P(C_i)$
¡ The prior probability $P(C_i)$ can be estimated from the class frequencies in the training set
¡ But what about the likelihood?
¡ In order to estimate the likelihood for a given instance we need a huge dataset
¡ If we have d attributes, the number of possible terms in the likelihood $P(x_1, x_2, \ldots, x_d \mid C_j)$ is
$$k \cdot |V_1| \cdot |V_2| \cdots |V_d|$$
* where $|V_i|$ is the number of possible values of attribute $i$ and $k$ is the number of classes
¡ We need a way \ an assumption to overcome this problem
¡ If we assume that all attributes are independent given the class, we will get:
$$P(x_1, x_2, \ldots, x_d \mid C_j) = \prod_{i=1}^{d} P(x_i \mid C_j)$$
¡ And now we can find the MAP class:
$$C_{MAP} = \operatorname*{argmax}_{j} P(C_j) \prod_{i=1}^{d} P(x_i \mid C_j)$$
¡ With this assumption we lower the necessary size of the dataset to
$$k \cdot \sum_{i=1}^{d} |V_i|$$
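A minimal counting-based sketch of Naïve Bayes for categorical attributes in plain Java (illustrative names; the Laplace smoothing of the counts is an added assumption, not something stated on the slides):

```java
// Naive Bayes for categorical attributes: estimate P(C_j) and P(x_i | C_j)
// by counting, then pick argmax_j P(C_j) * prod_i P(x_i | C_j),
// computed in log space to avoid underflow.
public class NaiveBayesSketch {
    // x[r][i] = value index of attribute i in training row r, y[r] = class index.
    public static int classify(int[][] x, int[] y, int numClasses, int[] numValues, int[] query) {
        int n = x.length, d = query.length;
        int bestClass = 0;
        double bestScore = Double.NEGATIVE_INFINITY;
        for (int c = 0; c < numClasses; c++) {
            int classCount = 0;
            for (int r = 0; r < n; r++) if (y[r] == c) classCount++;
            // Log prior, with Laplace smoothing (an assumption added for robustness).
            double score = Math.log((classCount + 1.0) / (n + numClasses));
            for (int i = 0; i < d; i++) {
                int match = 0;
                for (int r = 0; r < n; r++) if (y[r] == c && x[r][i] == query[i]) match++;
                // Log likelihood of attribute i's value given class c, smoothed.
                score += Math.log((match + 1.0) / (classCount + numValues[i]));
            }
            if (score > bestScore) { bestScore = score; bestClass = c; }
        }
        return bestClass;
    }
}
```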