Exercises for the course Machine Learning 2
Summer semester 2021
Email: [email protected]

Abteilung Maschinelles Lernen
Institut für Softwaretechnik und theoretische Informatik
Fakultät IV, Technische Universität Berlin
Prof. Dr. Klaus-Robert Müller

Exercise Sheet 1
Exercise 1: Symmetries in LLE (25 P)
The Locally Linear Embedding (LLE) method takes as input a collection of data points $\vec{x}_1, \dots, \vec{x}_N \in \mathbb{R}^d$ and embeds them in some low-dimensional space. LLE operates in two steps, with the first step consisting of minimizing the objective
$$
E(w) = \sum_{i=1}^{N} \Big\| \vec{x}_i - \sum_j w_{ij} \vec{x}_j \Big\|^2
$$
where $w$ is a collection of reconstruction weights subject to the constraint $\forall i: \sum_j w_{ij} = 1$, and where $\sum_j$ sums over the $K$ nearest neighbors of the data point $\vec{x}_i$. The solution that minimizes the LLE objective can be shown to be invariant to various transformations of the data.
Show that invariance holds in particular for the following transformations:
(a) Replacement of all $\vec{x}_i$ with $\alpha \vec{x}_i$, for an $\alpha \in \mathbb{R}^+ \setminus \{0\}$,
(b) Replacement of all $\vec{x}_i$ with $\vec{x}_i + \vec{v}$, for a vector $\vec{v} \in \mathbb{R}^d$,
(c) Replacement of all $\vec{x}_i$ with $U \vec{x}_i$, where $U$ is an orthogonal $d \times d$ matrix.
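Note: the objective above is easy to evaluate numerically, which can help build intuition for the invariances before proving them. The following is a minimal Python/NumPy sketch (not part of the exercise); the function name lle_objective and the assumption that the neighbor indices and weights are already given are illustrative choices.

    import numpy as np

    def lle_objective(X, W, neighbors):
        # X: (N, d) data matrix, W: (N, K) reconstruction weights with rows summing to 1,
        # neighbors: (N, K) integer indices of the K nearest neighbors of each point.
        E = 0.0
        for i in range(len(X)):
            reconstruction = W[i] @ X[neighbors[i]]    # sum_j w_ij * x_j
            E += np.sum((X[i] - reconstruction) ** 2)  # || x_i - sum_j w_ij x_j ||^2
        return E

For a fixed W, comparing lle_objective(X, W, neighbors) with, e.g., lle_objective(2 * X, W, neighbors) or lle_objective(X + v, W, neighbors) illustrates how the objective behaves under the transformations in (a) and (b).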
Exercise 2: Closed form for LLE (25 P)
In the following, we would like to show that the optimal weights w have an explicit analytic solution. For
this, we first observe that the objective function can be decomposed as a sum of as many subobjectives as
there are data points:
$$
E(w) = \sum_{i=1}^{N} E_i(w) \qquad \text{with} \qquad E_i(w) = \Big\| \vec{x}_i - \sum_j w_{ij} \vec{x}_j \Big\|^2
$$
Furthermore, because each subobjective depends on different parameters, they can be optimized independently. We consider one such subobjective and, for simplicity of notation, rewrite it as:
$$
E_i(w) = \Big\| \vec{x} - \sum_{j=1}^{K} w_j \vec{\eta}_j \Big\|^2
$$
where $\vec{x}$ is the current data point (we have dropped the index $i$), where $\eta = (\vec{\eta}_1, \dots, \vec{\eta}_K)$ is a matrix of size $K \times d$ containing the $K$ nearest neighbors of $\vec{x}$, and $w$ is the vector of size $K$ containing the weights to optimize, subject to the constraint $\sum_{j=1}^{K} w_j = 1$.
(a) Prove that the optimal weights for $\vec{x}$ are found by solving the following optimization problem:
$$
\min_{w} \; w^\top C w \quad \text{subject to} \quad w^\top \mathbf{1} = 1,
$$
where $C = (\mathbf{1}\vec{x}^\top - \eta)(\mathbf{1}\vec{x}^\top - \eta)^\top$ is the covariance matrix associated to the data point $\vec{x}$ and $\mathbf{1}$ is a vector of ones of size $K$.
(b) Show using the method of Lagrange multipliers that the solution of the optimization problem found in (a) is given analytically as:
$$
w = \frac{C^{-1} \mathbf{1}}{\mathbf{1}^\top C^{-1} \mathbf{1}}.
$$
(c) Show that the optimal $w$ can be equivalently found by solving the equation $Cw = \mathbf{1}$ and then rescaling $w$ such that $w^\top \mathbf{1} = 1$.
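Note: the closed form in (b) and the procedure in (c) can be compared numerically. Below is a small NumPy sketch (not part of the exercise), assuming the neighbors of a point are stacked as rows of a $K \times d$ array eta; the small regularization term added to C is a common practical safeguard rather than something required here.

    import numpy as np

    def lle_weights(x, eta, reg=1e-6):
        # x: (d,) data point, eta: (K, d) matrix whose rows are its K nearest neighbors.
        K = eta.shape[0]
        G = x[None, :] - eta                     # rows of (1 x^T - eta), shape (K, d)
        C = G @ G.T                              # local covariance matrix, shape (K, K)
        C = C + reg * np.trace(C) * np.eye(K)    # regularize in case C is (near-)singular
        w = np.linalg.solve(C, np.ones(K))       # solve C w = 1
        return w / w.sum()                       # rescale so that w^T 1 = 1

Up to the regularization, the returned vector coincides with the expression $C^{-1}\mathbf{1} / (\mathbf{1}^\top C^{-1}\mathbf{1})$ from (b).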
Exercise 3: SNE and Kullback-Leibler Divergence (25 P)
SNE is an embedding algorithm that operates by minimizing the Kullback-Leibler divergence between two
discrete probability distributions p and q representing the input space and the embedding space respectively.
In ‘symmetric SNE’, these discrete distributions assign to each pair of data points (i, j) in the dataset the
probability scores pij and qij respectively, corresponding to how close the two data points are in the input
and embedding spaces. Once the exact probability functions are defined, the embedding algorithm proceeds
by optimizing the function:
$$
C = D_{\mathrm{KL}}(p \,\|\, q) = \sum_{i=1}^{N} \sum_{j=1}^{N} p_{ij} \log \frac{p_{ij}}{q_{ij}}
$$
where $p$ and $q$ are subject to the constraints $\sum_{i=1}^{N} \sum_{j=1}^{N} p_{ij} = 1$ and $\sum_{i=1}^{N} \sum_{j=1}^{N} q_{ij} = 1$. Specifically, the algorithm minimizes $C$ with respect to $q$, which is itself a function of the coordinates in the embedded space. Optimization is typically performed using gradient descent.
In this exercise, we derive the gradient of the Kullback-Leibler divergence, first with respect to the probability scores $q_{ij}$, and then with respect to the embedding coordinates of which $q_{ij}$ is a function.
(a) Show that
$$
\frac{\partial C}{\partial q_{ij}} = -\frac{p_{ij}}{q_{ij}}. \tag{1}
$$
(b) The probability matrix $q$ is now reparameterized using a ‘softargmax’ function:
$$
q_{ij} = \frac{\exp(z_{ij})}{\sum_{k=1}^{N} \sum_{l=1}^{N} \exp(z_{kl})}
$$
The new variables $z_{ij}$ can be interpreted as unnormalized log-probabilities. Show that
$$
\frac{\partial C}{\partial z_{ij}} = -p_{ij} + q_{ij}. \tag{2}
$$
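Note: identity (2) lends itself to a quick finite-difference check. The sketch below (not part of the exercise) compares the analytic gradient with numerical differentiation of the objective, using NumPy and illustrative variable names.

    import numpy as np

    def kl_from_logits(P, Z):
        Q = np.exp(Z) / np.exp(Z).sum()      # softargmax over all N*N entries
        return np.sum(P * np.log(P / Q))     # D_KL(p || q)

    N = 5
    rng = np.random.default_rng(0)
    P = rng.random((N, N)); P /= P.sum()     # an arbitrary valid distribution p
    Z = rng.standard_normal((N, N))          # unnormalized log-probabilities
    Q = np.exp(Z) / np.exp(Z).sum()

    grad_analytic = -P + Q                   # equation (2)
    grad_numeric = np.zeros_like(Z)
    eps = 1e-6
    for i in range(N):
        for j in range(N):
            Zp, Zm = Z.copy(), Z.copy()
            Zp[i, j] += eps; Zm[i, j] -= eps
            grad_numeric[i, j] = (kl_from_logits(P, Zp) - kl_from_logits(P, Zm)) / (2 * eps)

    print(np.abs(grad_analytic - grad_numeric).max())   # should be close to zero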
(c) Explain which of the two gradients, (1) or (2), is more appropriate for practical use in a gradient descent
algorithm. Motivate your choice, first in terms of the stability or boundedness of the gradient, and second
in terms of the ability to maintain a valid probability distribution during training.
(d) The scores $z_{ij}$ are now reparameterized as
$$
z_{ij} = -\|\vec{y}_i - \vec{y}_j\|^2
$$
where the coordinates $\vec{y}_i, \vec{y}_j \in \mathbb{R}^h$ of data points in embedded space now appear explicitly. Show using the chain rule for derivatives that
$$
\frac{\partial C}{\partial \vec{y}_i} = 4 \sum_{j=1}^{N} (p_{ij} - q_{ij}) \cdot (\vec{y}_i - \vec{y}_j).
$$
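Note: the gradient in (d) can be checked numerically in the same way, provided $p$ is chosen symmetric as in symmetric SNE. A brief NumPy sketch (not part of the exercise), with illustrative names:

    import numpy as np

    def sne_objective(P, Y):
        D = np.sum((Y[:, None, :] - Y[None, :, :]) ** 2, axis=-1)   # squared distances ||y_i - y_j||^2
        Q = np.exp(-D) / np.exp(-D).sum()                           # q_ij induced by z_ij = -||y_i - y_j||^2
        return np.sum(P * np.log(P / Q)), Q

    N, h = 6, 2
    rng = np.random.default_rng(0)
    P = rng.random((N, N)); P = P + P.T; P /= P.sum()    # symmetric p summing to one
    Y = rng.standard_normal((N, h))                      # embedding coordinates

    _, Q = sne_objective(P, Y)
    diff = Y[:, None, :] - Y[None, :, :]                              # (y_i - y_j), shape (N, N, h)
    grad_analytic = 4 * np.sum((P - Q)[:, :, None] * diff, axis=1)    # formula from (d)

    eps = 1e-6
    grad_numeric = np.zeros_like(Y)
    for i in range(N):
        for a in range(h):
            Yp, Ym = Y.copy(), Y.copy()
            Yp[i, a] += eps; Ym[i, a] -= eps
            grad_numeric[i, a] = (sne_objective(P, Yp)[0] - sne_objective(P, Ym)[0]) / (2 * eps)

    print(np.abs(grad_analytic - grad_numeric).max())   # should be close to zero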
Exercise 4: Programming (25 P)
Download the programming files on ISIS and follow the instructions.