Assignment 3
Instructions
Use of any function or library other than those mentioned in this assignment is not recommended. Use the library e1071, which contains the svm function, for this assignment. Unless specified otherwise, set the seed to 50 in all instances, i.e., set a seed whenever a function invokes the random number generator. Note that, owing to the way the plotting function is implemented in e1071, the decision boundary in the linear-kernel case may look jagged. You may use the "rep" and "sample" functions in addition to the functions already mentioned in the assignment.
2. Construct the data frame for the training data as "tdata = data.frame(x = x, y = as.factor(y))" and fit the support vector classifier using the svm function, setting the kernel to linear and the cost to 10; store the result in svmfit. Now plot the results with "plot(svmfit, tdata)". Also generate a summary from the object svmfit: how many support vectors are there in each class? (1 mark)
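The steps of question 2 can be sketched as below. Question 1 is not included in this excerpt, so the two-class data generation at the top is a hypothetical stand-in (200 points, classes labeled 1 and 2) used only to make the sketch self-contained; substitute your actual x and y from question 1.

```r
library(e1071)

# Hypothetical stand-in for question 1's data (not shown in this excerpt):
# 200 two-dimensional Gaussian points, first 100 in class 1, next 100 in class 2.
set.seed(50)
x <- matrix(rnorm(200 * 2), ncol = 2)
y <- c(rep(1, 100), rep(2, 100))
x[y == 2, ] <- x[y == 2, ] + 2   # shift class 2 so the classes partly separate

# Question 2: build the training frame and fit a linear support vector classifier.
tdata <- data.frame(x = x, y = as.factor(y))
svmfit <- svm(y ~ ., data = tdata, kernel = "linear", cost = 10)

plot(svmfit, tdata)   # decision boundary may look jagged, as noted above
summary(svmfit)       # reports the number of support vectors in each class
```

The formula interface `y ~ .` treats every remaining column of tdata as a predictor, which is why the response column must be a factor for classification.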
3. Using the training data from the previous question, perform ten-fold cross-validation with the function "tune", providing it the list of cost values 0.001, 0.01, 0.1, 1, 5, 10, 100. Use summary on the object returned by tune to find the cost value at which the minimum cross-validation error rate occurs. For this best cost value, did the number of support vectors increase? How many support vectors are there in each class? Also, save the best model returned by tune as "bestmod". (2 marks)
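A sketch of the cross-validation step, assuming tdata from question 2 is already in the workspace:

```r
# Question 3: ten-fold cross-validation over a grid of cost values.
# tune() performs 10-fold CV by default for svm.
set.seed(50)
tune.out <- tune(svm, y ~ ., data = tdata, kernel = "linear",
                 ranges = list(cost = c(0.001, 0.01, 0.1, 1, 5, 10, 100)))

summary(tune.out)                 # CV error for each cost; best cost highlighted
bestmod <- tune.out$best.model    # model refit at the best cost
summary(bestmod)                  # support vectors per class for the best cost
```

Seeding before tune matters because the fold assignment is random; without it the reported cross-validation errors would vary from run to run.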
4. Set the seed to 100 and generate test data following the exact approach of question 1, with the syntax "testdata = data.frame(x = xtest, y = as.factor(ytest))"; the only difference is that ytest is now labeled randomly with replacement, not in sequence (first 100 observations to class 1, and so on). Now use the predict function with "bestmod" (from the previous question) and "testdata" as input arguments to predict the class labels of these test observations, and store the results in yp. Use the function "table" to print, in the form of a table, the vector of predicted labels (yp) against the test labels ytest. How many observations are misclassified? Why, in one case, is the number of correctly classified observations greater than 100? (2 marks)
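The prediction step can be sketched as follows, assuming bestmod from question 3 exists. Since question 1's generation recipe is not shown here, the xtest generation and the sample-size/label choices (200 points, labels 1 and 2) below are assumptions for illustration; mirror your actual question 1 code.

```r
# Question 4: random test labels, then prediction with the tuned model.
set.seed(100)
# Hypothetical stand-in for question 1's feature generation (not shown here):
xtest <- matrix(rnorm(200 * 2), ncol = 2)
# Labels drawn randomly with replacement, NOT assigned in blocks of 100.
ytest <- sample(c(1, 2), 200, replace = TRUE)
xtest[ytest == 2, ] <- xtest[ytest == 2, ] + 2

testdata <- data.frame(x = xtest, y = as.factor(ytest))

yp <- predict(bestmod, testdata)          # predicted class labels
table(predict = yp, truth = testdata$y)   # confusion table; off-diagonal = errors
```

Because sample() draws labels with replacement, the two classes will generally not have exactly 100 members each, which bears on the final question about counts exceeding 100.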
5. Repeat questions 1 to 4 with a radial kernel. Initially, for training, set both cost and gamma to 1; then, for tuning, set their candidate values to 0.1, 1, 10, 100, 1000 and 0.5, 1, 2, 3, 4, respectively. How many observations are misclassified by the best model? Does the result imply that the data are linearly separable and that we do not need the radial kernel? What were the optimal (best) cost and gamma (the parameter of the radial basis function) in this case? (2 marks)
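The radial-kernel repetition can be sketched as below, assuming tdata (question 2) and testdata (question 4) are available in the workspace:

```r
# Question 5: radial kernel, first with fixed parameters, then tuned.
set.seed(50)
svmfit.rad <- svm(y ~ ., data = tdata, kernel = "radial",
                  cost = 1, gamma = 1)

# Joint grid search over cost and gamma via ten-fold cross-validation.
set.seed(50)
tune.rad <- tune(svm, y ~ ., data = tdata, kernel = "radial",
                 ranges = list(cost  = c(0.1, 1, 10, 100, 1000),
                               gamma = c(0.5, 1, 2, 3, 4)))

summary(tune.rad)                   # reports the best cost and gamma
bestmod.rad <- tune.rad$best.model

# Misclassification count on the question 4 test data.
yp.rad <- predict(bestmod.rad, testdata)
table(predict = yp.rad, truth = testdata$y)
```

Comparing this confusion table with the linear-kernel table from question 4 is what lets you argue whether the radial kernel is actually needed for these data.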