So what we wrote in English in the previous video, we'll now try to convert into math. Again, this is my f1 axis and this is my f2 axis, and this is my f1' axis, the axis along which the variance of my projected points is very high: if I project each of these points onto f1', the variance will be maximal. I want to find f1'. For simplicity, let me call f1' as u1, because that is the notation you'll typically find in most lecture notes and textbooks, so I'll use the same terminology here. That's the first thing. The second thing is, I really don't care about finding the whole line; I only care about finding the direction, because once I know the direction, I can project any point onto it. And we know that we represent directions using unit vectors, so I'll make u1 a unit vector, which simply means its length equals one. So this is what my u1 actually is: the direction along which, if I project each of my points, the variance will be maximal. Let's try to build the math around it.

Let me draw a much more simplified picture here. Let's assume this is my f1, this is my f2, and this is my direction u1; of course, along this direction I can draw a line. Now let's assume I have a point here; call it xi. A point can be represented as a vector. If I project xi onto u1, let's call the projected point xi': so xi' is nothing but the projection of xi onto u1. Now, my given dataset is {xi}, i = 1 to n (let's just ignore the yi's). From it I'm creating a new dataset D' = {xi'}, i = 1 to n, such that each xi' is the projection of xi onto u1. So now let's go and see.
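Geometrically, "the projection of xi onto u1" can be sketched in a few lines of NumPy. The point and the direction below are made up purely for illustration:

```python
import numpy as np

x_i = np.array([3.0, 1.0])        # a made-up point
u1 = np.array([2.0, 1.0])
u1 = u1 / np.linalg.norm(u1)      # normalize: now ||u1|| = 1, a pure direction

# Projection of x_i onto the line through u1.
coeff = u1 @ x_i                  # scalar coordinate of x_i along u1
x_i_proj = coeff * u1             # the projected point x_i'

# The residual is perpendicular to u1; that is what makes this a projection.
residual = x_i - x_i_proj
print(x_i_proj, u1 @ residual)    # second value is ~0
```

The key fact the derivation below uses is exactly this `coeff = u1 @ x_i` scalar.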
So my xi' is the projection of xi onto u1. In the linear algebra series of videos we learned that this is nothing but (u1 . xi) / ||u1||^2; we learned it when we understood what a projection is, what a unit vector is, and ideas like that. Now, u1 is a unit vector, which means ||u1||^2 = 1, which means my xi' is nothing but u1^T xi. This is one way to represent the dot product; it's one of the interpretations of the dot product that we saw in the linear algebra chapter. So each xi' is simply u1^T xi: given any point xi, I can convert it into xi' using u1.

One more thing to remember: let x̄ be the mean vector of the xi's. If I take its dot product with u1, I get u1^T x̄ = x̄', the mean of the projections xi', i = 1 to n. So the mean of the projected dataset is just the projection of the mean. This is important; we'll see how to use it shortly.

So what is our whole task? Let's write it mathematically. Find u1 such that the variance of the points xi projected onto u1, with i going from 1 to n, is maximal. This is the problem we want to solve. So what is this variance? Let me write it carefully.
It is nothing but the variance of {u1^T xi}, i = 1 to n, by the simple definition: I am just replacing xi' with u1^T xi, nothing fancy. Now, what is the formula for variance? It is the average squared dispersion of each of the points from the mean.
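The two ingredients so far, the projection xi' = u1^T xi and the variance of those projections, can be sketched in NumPy. The data and the candidate direction here are made up for illustration:

```python
import numpy as np

# Toy 2-D data: 5 points (rows), 2 features (columns). Made-up values.
X = np.array([[2.0, 1.9],
              [1.0, 1.2],
              [3.0, 2.8],
              [4.0, 4.1],
              [0.0, 0.1]])

# A candidate direction u1, normalized to unit length.
u1 = np.array([1.0, 1.0])
u1 = u1 / np.linalg.norm(u1)           # now ||u1|| = 1

# Project every point: x_i' = u1^T x_i, one scalar per point.
x_proj = X @ u1

# Variance of the projections: average squared deviation from their mean.
var_proj = np.mean((x_proj - x_proj.mean()) ** 2)
print(var_proj)
```

Trying a different unit vector for `u1` gives a different `var_proj`; the whole point of the derivation is to find the `u1` that makes this number as large as possible.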
So let me write it out: the variance is the average over i of (u1^T xi - u1^T x̄)^2. Why did I write this? Because u1^T xi is xi', and u1^T x̄ is the mean of the projections; this is just the simple formula for variance. Now remember your dimensions when you compute u1^T xi: xi is a column vector with d rows and one column, where d is the number of features, and u1^T has one row and d columns. Multiplying them gives you one scalar. So even though u1 and xi are written as vectors, each product u1^T xi is a single number, not a vector. Everything works out.

Now, when we started off, we said our original dataset has been column standardized. What does column standardization mean? It means each column has mean zero, so the mean vector x̄ sits at the origin. And this is where things get interesting: if x̄ = 0, then u1^T x̄ = 0, and that term simply drops out. So the variance of the projections xi' is nothing but the average, over i = 1 to n, of (u1^T xi)^2.

Now let's write the actual mathematical objective that we want to solve. I'll write something called an optimization problem; we'll learn more about optimization, and how to solve optimization problems, a little later, but for now let me just explain what it is. We want to maximize the variance, which here is (1/n) * sum over i of (u1^T xi)^2. The xi's are already given to us as part of the data matrix; what we have to find is u1. So the problem is: find u1 that maximizes (1/n) * sum_i (u1^T xi)^2. Sorry, I'm not able to draw it properly; let me just put a bounding box around it. But there is one constraint: u1 must be a unit vector, which means u1^T u1 = 1, which is nothing but ||u1||^2 = 1.

Let me reread this and explain it step by step; it's always important to understand optimization problems. We want to maximize the variance of the xi'. The xi's are already given to us as part of the data matrix, and we want to find u1, the direction that maximizes that variance. And here we are putting a constraint. The "maximize" part is called the objective of the optimization problem, and the other part is called the constraint: we won't let u1 be just any vector; we want u1 to be a unit vector, because we want it to represent a direction, right?
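A quick numerical illustration of why the constraint is needed: scaling the direction by a factor c scales the objective by c^2, so without the unit-length constraint the objective can be made as large as we like. The data below is made up and already mean-centered for simplicity:

```python
import numpy as np

# Made-up 2-D data, already mean-centered.
X = np.array([[2.0, 1.9], [1.0, 1.2], [-1.5, -1.0], [-1.5, -2.1]])

def objective(u):
    """(1/n) * sum_i (u^T x_i)^2, the variance term we want to maximize."""
    return np.mean((X @ u) ** 2)

u = np.array([1.0, 1.0]) / np.sqrt(2)   # a unit direction

for c in [1.0, 10.0, 100.0]:
    # Scaling u by c scales the objective by c**2: unbounded unless ||u|| = 1.
    print(c, objective(c * u))
```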
See, if I just let u1 be anything, I could make its components huge, say (infinity, infinity) or just very large values, and then this quantity would always be maximal, because multiplying even a small value by a very large value makes it large. So I don't want just any u1; I want u1 to be a unit vector. This whole thing is called an optimization problem. We will learn about solving optimization problems later, after we cover a couple of machine learning techniques, and I promise you, I will teach you how to solve this one.
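Even before learning the general machinery, in 2-D we can attack this optimization problem by brute force, because every 2-D unit vector can be written as (cos θ, sin θ): scan the angle θ and keep the direction with the highest projected variance. A sketch on made-up, mean-centered data:

```python
import numpy as np

# Made-up, mean-centered 2-D data with most spread roughly along y = x.
X = np.array([[2.0, 1.9], [1.0, 1.2], [-1.5, -1.0], [-1.5, -2.1]])

best_var, best_u = -1.0, None
for theta in np.linspace(0.0, np.pi, 1000):       # sweep unit directions
    u = np.array([np.cos(theta), np.sin(theta)])  # ||u|| = 1 by construction
    var = np.mean((X @ u) ** 2)                   # (1/n) * sum_i (u^T x_i)^2
    if var > best_var:
        best_var, best_u = var, u

print(best_u, best_var)   # the direction of maximal projected variance
```

For this toy data the winning direction points roughly along y = x, matching the visual intuition; the general solution, for any number of dimensions, is what the optimization machinery will give us later.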
We will revisit this optimization problem and solve it when we learn optimization. For now, since I don't want to divert and teach you the whole of optimization, I'll just tell you the solution to this problem without telling you how I got there. But you can trust me: when we learn about optimization and some basic calculus (I'll teach you what differentiation really means), we will revisit this and solve it properly. For the first few techniques, what I'll do is explain the mathematical optimization problem we are solving; how to solve it, I'll teach you later. I'm sorry for repeating this so many times, but if I never showed you how to solve it, I wouldn't be doing justice to you, and I really don't want to water down the math; I've told you that multiple times. So this is where we will stop: we will go directly to the solution of this optimization problem, and how we arrive at that solution we will learn in the optimization section.