Tutorial 6 Machine Learning
For the above purpose, scientists have collected a number of measurements on over 4000 abalones.
The dataset is available at https://archive.ics.uci.edu/ml/datasets/abalone.
...
Given the above information, suppose you intend to solve the problem with machine learning
tools, i.e. to predict the age of an abalone from external measurements. Answer the following
questions:
c. Define the hypothesis, i.e. the equation.
d. How will you learn the parameters for the hypothesis? What will your training set
consist of (features, examples), and what will be your labels?
c. If unsupervised (skip b),
a. Propose a learning tool to solve the problem.
b. Define the hypothesis or model, i.e. the equation or network.
c. How will you learn the parameters for the hypothesis? What will your training set
consist of (features, examples), and how many classes will you set?
Note that the attribute “gender” has categorical values, so an error or distance on it cannot be
calculated in the same way as for the numeric features, for example with a Euclidean distance.
In your learning problem, you will exclude this feature.
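The note above can be illustrated with a minimal sketch. It assumes each record starts with the categorical "gender" value followed by numeric measurements. The three records and the column names (length, diameter, height) are hypothetical placeholders, not rows taken from the dataset.

```python
import numpy as np

# Hypothetical records for illustration only (NOT real rows from the dataset):
# (gender, length, diameter, height)
records = [
    ("M", 0.45, 0.35, 0.10),
    ("F", 0.53, 0.42, 0.14),
    ("I", 0.33, 0.25, 0.08),
]

# Keep only the numeric columns; the categorical "gender" column is dropped.
X = np.array([row[1:] for row in records], dtype=float)

def euclidean(a, b):
    """Euclidean distance between two numeric feature vectors."""
    return float(np.sqrt(np.sum((a - b) ** 2)))

# Distance between the first two records, using numeric features only.
d01 = euclidean(X[0], X[1])
```

Dropping the column is the simplest option and is what the tutorial asks for. Encoding the categories numerically would be an alternative, but it is outside the scope of this exercise.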
x1 x2 y
6 2 13
3 5 4
4 4 7
6 10 5
4 2 9
3 4 5
You want to learn a linear regression predictor for y. Answer the following questions.
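As a possible starting point, the sketch below fits a linear predictor to the six rows of the table above by least squares. The hypothesis form y ≈ theta0 + theta1*x1 + theta2*x2 and the names `theta`, `X_b` and `predict` are assumptions made for illustration; the tutorial may intend a different procedure such as gradient descent.

```python
import numpy as np

# Data copied from the table above: columns x1, x2 and target y.
data = np.array([
    [6, 2, 13],
    [3, 5, 4],
    [4, 4, 7],
    [6, 10, 5],
    [4, 2, 9],
    [3, 4, 5],
], dtype=float)

X = data[:, :2]   # features x1, x2
y = data[:, 2]    # labels y

# Prepend a column of ones so that theta[0] acts as the intercept.
X_b = np.hstack([np.ones((X.shape[0], 1)), X])

# Least-squares solution of X_b @ theta ≈ y.
theta, *_ = np.linalg.lstsq(X_b, y, rcond=None)

def predict(x1, x2):
    """Evaluate the assumed hypothesis theta0 + theta1*x1 + theta2*x2."""
    return theta[0] + theta[1] * x1 + theta[2] * x2

print("theta:", theta)
print("prediction for (x1=4, x2=4):", predict(4, 4))
```

At the least-squares solution the residual is orthogonal to every column of `X_b`, which gives a quick way to check the fitted parameters by hand.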
The data set consists of 50 samples from each of three species of Iris (Iris setosa, Iris virginica and Iris
versicolor). Four features were measured from each sample: the length and the width of the sepals
and petals, in centimetres. The dataset contains:
7 3.2 4.7 1.4 Iris-versicolor
6.4 3.2 4.5 1.5 Iris-versicolor
6.9 3.1 4.9 1.5 Iris-versicolor
...
...
You would like to implement an ANN to classify the flowers among the three species. Answer the
following questions:
a. Let’s use an ANN with only one hidden layer. Suggest the size (number of nodes) of the
input, hidden and output layers. Keep to minimal sizes.
b. Draw the ANN.
c. Write the vectorized representation of the activation (output) function for the hidden layer
and the output layer. Clearly write out the elements of each vector/matrix.
d. How many parameters are there to learn?
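To check an answer to parts c and d numerically, the following sketch runs one vectorized forward pass and counts the parameters. It assumes 4 inputs (the four measured features), 3 outputs (one per species), a hidden layer of 3 nodes, sigmoid activations and one bias per node. The hidden size, the activation and the random weights are illustrative assumptions, not values given in the tutorial.

```python
import numpy as np

# Layer sizes: 4 features in, 3 species out; n_hidden = 3 is an assumed choice.
n_in, n_hidden, n_out = 4, 3, 3

def sigmoid(z):
    """Element-wise logistic activation."""
    return 1.0 / (1.0 + np.exp(-z))

# Randomly initialised weights and zero biases (placeholders, not trained values).
rng = np.random.default_rng(0)
W1 = rng.normal(size=(n_hidden, n_in))   # hidden-layer weights, shape (3, 4)
b1 = np.zeros(n_hidden)                  # hidden-layer biases, shape (3,)
W2 = rng.normal(size=(n_out, n_hidden))  # output-layer weights, shape (3, 3)
b2 = np.zeros(n_out)                     # output-layer biases, shape (3,)

def forward(x):
    """Vectorized forward pass: a1 = g(W1 x + b1), a2 = g(W2 a1 + b2)."""
    a1 = sigmoid(W1 @ x + b1)
    a2 = sigmoid(W2 @ a1 + b2)
    return a1, a2

# First sample row listed in the dataset excerpt above.
x = np.array([7.0, 3.2, 4.7, 1.4])
a1, a2 = forward(x)

# Total number of learnable parameters (weights plus biases) for these sizes.
n_params = W1.size + b1.size + W2.size + b2.size
print("hidden activations:", a1)
print("output activations:", a2)
print("parameters:", n_params)
```

With a different hidden size the count changes as (n_in + 1) * n_hidden + (n_hidden + 1) * n_out.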