Neural Network from scratch in Python
by Omar Aflak, Towards Data Science
In this post we will go through the mathematics of machine learning and code from
scratch, in Python, a small library to build neural networks with a variety of layers
(Fully Connected, Convolutional, etc.). Eventually, we will be able to create networks
in a modular fashion:
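For instance, something along these lines, using the Network, FCLayer and ActivationLayer classes that we will write below (the fully runnable version of this snippet is the XOR example near the end of the post):

net = Network()
net.add(FCLayer(2, 3))
net.add(ActivationLayer(tanh, tanh_prime))
net.add(FCLayer(3, 1))
net.add(ActivationLayer(tanh, tanh_prime))
net.use(mse, mse_prime)
net.fit(x_train, y_train, epochs=1000, learning_rate=0.1)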
I’m assuming you already have some knowledge about neural networks. The purpose
here is not to explain why we make these models, but to show how to make a proper
implementation.
Layer by Layer
We need to keep in mind the big picture here:
1. We feed input data into the network.
2. The data flows from layer to layer until we have the output.
3. Once we have the output, we can calculate the error, which is a scalar.
4. Finally, we adjust a given parameter (weight or bias) by subtracting the derivative of the error with respect to that parameter.
The most important step is the 4th. We want to be able to have as many layers as we
want, and of any type. But if we modify/add/remove one layer from the network, the
output of the network is going to change, which is going to change the error, which
is going to change the derivative of the error with respect to the parameters. We
need to be able to compute the derivatives regardless of the network architecture,
regardless of the activation functions, regardless of the loss we use.
Forward propagation
We can already emphasize one important point which is: the output of one layer is
the input of the next one.
This is called forward propagation. Essentially, we give the input data to the first
layer, then the output of every layer becomes the input of the next layer until we
reach the end of the network. By comparing the result of the network (Y) with the
desired output (let’s say Y*), we can calculate an error E. The goal is to minimize that
error by changing the parameters in the network. That is backward propagation
(backpropagation).
Gradient Descent
This is a quick reminder, if you need to learn more about gradient descent there are
tons of resources on the internet.
Basically, we want to change some parameter in the network (call it w) so that the
total error E decreases. There is a clever way to do it (not randomly) which is the
following:
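Concretely, every trainable parameter is nudged in the direction opposite to the gradient:

w ← w − α · ∂E/∂w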
Where α is a parameter in the range [0,1] that we set and that is called the learning
rate. Anyway, the important thing here is ∂E/∂w (the derivative of E with respect to
w). We need to be able to find the value of that expression for any parameter of the
network regardless of its architecture.
Backward propagation
Suppose that we give a layer the derivative of the error with respect to its output
(∂E/∂Y), then it must be able to provide the derivative of the error with respect to its
input (∂E/∂X).
Let’s forget about ∂E/∂X for now. The trick here is that if we have access to ∂E/∂Y, we can very easily calculate ∂E/∂W (if the layer has any trainable parameters) without knowing anything about the network architecture! We simply use the chain rule:
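For a given parameter w of the layer, summing over all of the layer's output values y_j:

∂E/∂w = Σ_j ( ∂E/∂y_j · ∂y_j/∂w )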
The unknown is ∂y_j/∂w, which depends entirely on how the layer computes its output. So if every layer has access to ∂E/∂Y, where Y is its own output, then we can update our parameters! And since the output of one layer is the input of the next, the ∂E/∂X a layer computes is exactly the ∂E/∂Y needed by the layer before it; this is how the error derivative travels backwards through the whole network.
This is very important, it’s the key to understanding backpropagation! After that, we’ll be able to code a Deep Convolutional Neural Network from scratch in no time!
This may seem abstract here, but it will become very clear when we apply it to a specific type of layer. Speaking of abstract, now is a good time to write our first Python class.
# Base class
class Layer:
    def __init__(self):
        self.input = None
        self.output = None

    # computes the output Y of a layer for a given input X
    def forward_propagation(self, input):
        raise NotImplementedError

    # computes dE/dX for a given dE/dY (and updates parameters if any)
    def backward_propagation(self, output_error, learning_rate):
        raise NotImplementedError
Fully Connected Layer
We now implement the first concrete type of layer: the fully connected (FC) layer, which connects every input neuron to every output neuron.
Forward Propagation
The value of each output neuron can be calculated as follows:
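Writing x_1, …, x_i for the input values, w for the weights and b for the biases (with i input neurons and j output neurons):

y_j = b_j + x_1·w_1j + x_2·w_2j + … + x_i·w_ij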
With matrices, we can compute this formula for every output neuron in one shot using a dot product:
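Y = X·W + B

where X is the 1×i input row vector, W the i×j weight matrix, B the 1×j bias row vector and Y the 1×j output.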
We’re done with the forward pass. Now let’s do the backward pass of the FC layer.
Note that I’m not using any activation function yet, that’s because we will implement it in
a separate layer!
Backward Propagation
As we said, suppose we have a matrix containing the derivative of the error with respect to that layer’s output (∂E/∂Y). We need:
1. The derivative of the error with respect to the parameters (∂E/∂W, ∂E/∂B)
2. The derivative of the error with respect to the input (∂E/∂X)
Let’s calculate ∂E/∂W. This matrix should be the same size as W itself: i×j, where i is the number of input neurons and j the number of output neurons. We need one gradient for every weight:
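Since w_ij only appears in y_j, and ∂y_j/∂w_ij = x_i, the chain rule gives:

∂E/∂w_ij = ∂E/∂y_j · ∂y_j/∂w_ij = x_i · ∂E/∂y_j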
Therefore,
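in matrix form, with X^t the transpose of the input row vector:

∂E/∂W = X^t · ∂E/∂Y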
That’s it, we have the first formula to update the weights! Now let’s calculate ∂E/∂B. Again, ∂E/∂B needs to be of the same size as B itself, one gradient per bias. We can use the chain rule again:
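Since b_j appears only in y_j and ∂y_j/∂b_j = 1:

∂E/∂b_j = ∂E/∂y_j · ∂y_j/∂b_j = ∂E/∂y_j

so ∂E/∂B = ∂E/∂Y.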
Now that we have ∂E/∂W and ∂E/∂B, we are left with ∂E/∂X which is very important
as it will “act” as ∂E/∂Y for the layer before that one.
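Every input x_i contributes to every output y_j through the weight w_ij, so:

∂E/∂x_i = Σ_j ∂E/∂y_j · w_ij

or, in matrix form:

∂E/∂X = ∂E/∂Y · W^t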
That’s it! We have the three formulas we needed for the FC layer!
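Putting these three formulas into code, a fully connected layer might look like the following. This is a minimal sketch built on the Layer base class above; it assumes that class lives in a file called layer.py, and the exact implementation in the accompanying repository may differ slightly (e.g. in how the weights are initialized).

import numpy as np
from layer import Layer

# inherit from base class Layer
class FCLayer(Layer):
    # input_size = number of input neurons, output_size = number of output neurons
    def __init__(self, input_size, output_size):
        # small random initialization of weights and biases
        self.weights = np.random.rand(input_size, output_size) - 0.5
        self.bias = np.random.rand(1, output_size) - 0.5

    # returns output for a given input: Y = XW + B
    def forward_propagation(self, input_data):
        self.input = input_data
        self.output = np.dot(self.input, self.weights) + self.bias
        return self.output

    # computes dE/dW and dE/dB for a given output_error=dE/dY,
    # updates the parameters and returns input_error=dE/dX
    def backward_propagation(self, output_error, learning_rate):
        input_error = np.dot(output_error, self.weights.T)   # dE/dX = dE/dY . W^t
        weights_error = np.dot(self.input.T, output_error)   # dE/dW = X^t . dE/dY
        # dE/dB = dE/dY

        # gradient descent update
        self.weights -= learning_rate * weights_error
        self.bias -= learning_rate * output_error
        return input_error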
Activation Layer
All the calculations we did until now were completely linear, and it’s hopeless to learn anything with that kind of model. We need to add non-linearity to the model by applying non-linear functions to the output of some layers.
Now we need to redo the whole process for this new type of layer!
No worries, it’s going to be way faster as there are no learnable parameters. We just
need to calculate ∂E/∂X.
We will call f and f' the activation function and its derivative respectively.
Forward Propagation
As you will see, it is quite straightforward. For a given input X, the output is simply the activation function applied to every element of X, which means input and output have the same dimensions.
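In other words, for every element of X:

y_i = f(x_i), i.e. Y = f(X) applied elementwise.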
Backward Propagation
Given ∂E/∂Y, we want to calculate ∂E/∂X.
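Since y_i = f(x_i), each output element depends only on the matching input element:

∂E/∂x_i = ∂E/∂y_i · f'(x_i)

which is simply an elementwise product: ∂E/∂X = ∂E/∂Y ⊙ f'(X).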
You can also write some activation functions and their derivatives in a separate file.
These will be used later to create an ActivationLayer.
import numpy as np

# activation function and its derivative
def tanh(x):
    return np.tanh(x)

def tanh_prime(x):
    return 1 - np.tanh(x)**2
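The activation layer itself is then tiny. Here is a minimal sketch, again assuming the base class sits in layer.py; the version in the accompanying repository may differ slightly:

from layer import Layer

# inherit from base class Layer
class ActivationLayer(Layer):
    def __init__(self, activation, activation_prime):
        self.activation = activation
        self.activation_prime = activation_prime

    # returns the activated input: Y = f(X)
    def forward_propagation(self, input_data):
        self.input = input_data
        self.output = self.activation(self.input)
        return self.output

    # returns input_error=dE/dX for a given output_error=dE/dY
    # learning_rate is not used because there are no "learnable" parameters
    def backward_propagation(self, output_error, learning_rate):
        return self.activation_prime(self.input) * output_error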
Loss Function
Until now, for a given layer, we supposed that ∂E/∂Y was given (by the next layer).
But what happens to the last layer? How does it get ∂E/∂Y? We simply give it
manually, and it depends on how we define the error.
The error of the network, which measures how well or how badly the network did for a given input, is defined by you. There are many ways to define the error, and one of the best known is MSE, the Mean Squared Error.
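For an output made of n values:

E = (1/n) · Σ_i (y*_i − y_i)²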
where y* and y denote the desired output and the actual output respectively. You can think of the loss as a last layer which takes all the output neurons and squashes them into one single neuron. What we need now, as for every other layer, is to define ∂E/∂Y. Except now, we have finally reached E!
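Differentiating E with respect to each output value gives:

∂E/∂y_i = (2/n) · (y_i − y*_i)

which is exactly what mse_prime below computes.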
These are simply two python functions that you can put in a separate file. They will
be used when creating the network.
import numpy as np

# loss function and its derivative
def mse(y_true, y_pred):
    return np.mean(np.power(y_true - y_pred, 2))

def mse_prime(y_true, y_pred):
    return 2 * (y_pred - y_true) / y_true.size
Network Class
Almost done! We are going to make a Network class to create neural networks very easily, just like in the first picture!
class Network:
    def __init__(self):
        self.layers = []
        self.loss = None
        self.loss_prime = None

    # add layer to network
    def add(self, layer):
        self.layers.append(layer)

    # set loss to use
    def use(self, loss, loss_prime):
        self.loss = loss
        self.loss_prime = loss_prime

    # predict output for given input
    def predict(self, input_data):
        # sample dimension first
        samples = len(input_data)
        result = []

        # run network over all samples
        for i in range(samples):
            # forward propagation
            output = input_data[i]
            for layer in self.layers:
                output = layer.forward_propagation(output)
            result.append(output)

        return result

    # train the network
    def fit(self, x_train, y_train, epochs, learning_rate):
        # sample dimension first
        samples = len(x_train)

        # training loop
        for i in range(epochs):
            err = 0
            for j in range(samples):
                # forward propagation
                output = x_train[j]
                for layer in self.layers:
                    output = layer.forward_propagation(output)

                # compute loss (for display purposes only)
                err += self.loss(y_train[j], output)

                # backward propagation
                error = self.loss_prime(y_train[j], output)
                for layer in reversed(self.layers):
                    error = layer.backward_propagation(error, learning_rate)

            # calculate average error on all samples
            err /= samples
            print('epoch %d/%d error=%f' % (i+1, epochs, err))
Solve XOR
Starting with XOR is always important as it’s a simple way to tell if the network is
learning anything at all.
import numpy as np

from network import Network
from fc_layer import FCLayer
from activation_layer import ActivationLayer
from activations import tanh, tanh_prime
from losses import mse, mse_prime

# training data
x_train = np.array([[[0,0]], [[0,1]], [[1,0]], [[1,1]]])
y_train = np.array([[[0]], [[1]], [[1]], [[0]]])

# network
net = Network()
net.add(FCLayer(2, 3))
net.add(ActivationLayer(tanh, tanh_prime))
net.add(FCLayer(3, 1))
net.add(ActivationLayer(tanh, tanh_prime))

# train
net.use(mse, mse_prime)
net.fit(x_train, y_train, epochs=1000, learning_rate=0.1)

# test
out = net.predict(x_train)
print(out)
I don’t think I need to emphasize many things. Just be careful with the training data,
you should always have the sample dimension first. For example here, the input
shape is (4,1,2).
Result
$ python xor.py
epoch 1/1000 error=0.322980
epoch 2/1000 error=0.311174
epoch 3/1000 error=0.307195
...
epoch 998/1000 error=0.000243
epoch 999/1000 error=0.000242
epoch 1000/1000 error=0.000242
[
array([[ 0.00077435]]),
array([[ 0.97760742]]),
array([[ 0.97847793]]),
array([[-0.00131305]])
]
Clearly this is working, great! We can now solve something more interesting, let’s solve MNIST!
Solve MNIST
We didn’t implement the Convolutional Layer, but this is not a problem. All we need to do is reshape our data so that it fits into a Fully Connected Layer.
The MNIST dataset consists of images of digits from 0 to 9, each of shape 28x28x1. The goal is to predict which digit is drawn on each image.
import numpy as np

from network import Network
from fc_layer import FCLayer
from activation_layer import ActivationLayer
from activations import tanh, tanh_prime
from losses import mse, mse_prime

from keras.datasets import mnist
from keras.utils import np_utils

# load MNIST from server
(x_train, y_train), (x_test, y_test) = mnist.load_data()

# training data : 60000 samples
# reshape and normalize input data
x_train = x_train.reshape(x_train.shape[0], 1, 28*28)
x_train = x_train.astype('float32')
x_train /= 255
# encode output which is a number in range [0,9] into a vector of size 10
# e.g. number 3 will become [0, 0, 0, 1, 0, 0, 0, 0, 0, 0]
y_train = np_utils.to_categorical(y_train)

# same for test data : 10000 samples
x_test = x_test.reshape(x_test.shape[0], 1, 28*28)
x_test = x_test.astype('float32')
x_test /= 255
y_test = np_utils.to_categorical(y_test)

# Network
net = Network()
net.add(FCLayer(28*28, 100))                # input_shape=(1, 28*28) ; output_shape=(1, 100)
net.add(ActivationLayer(tanh, tanh_prime))
net.add(FCLayer(100, 50))                   # input_shape=(1, 100)   ; output_shape=(1, 50)
net.add(ActivationLayer(tanh, tanh_prime))
net.add(FCLayer(50, 10))                    # input_shape=(1, 50)    ; output_shape=(1, 10)
net.add(ActivationLayer(tanh, tanh_prime))

# train on 1000 samples
# as we didn't implement mini-batch GD, training will be pretty slow if we update at each iteration
net.use(mse, mse_prime)
net.fit(x_train[0:1000], y_train[0:1000], epochs=35, learning_rate=0.1)

# test on 3 samples
out = net.predict(x_test[0:3])
print("\n")
print("predicted values : ")
print(out, end="\n")
print("true values : ")
print(y_test[0:3])
$ python example_mnist_fc.py
epoch 1/30 error=0.238658
epoch 2/30 error=0.093187
epoch 3/30 error=0.073039
...
epoch 28/30 error=0.011636
epoch 29/30 error=0.011306
epoch 30/30 error=0.010901
predicted values :
[
array([[ 0.119, 0.084 , -0.081, 0.084, -0.068, 0.011, 0.057,
0.976, -0.042, -0.0462]]),
array([[ 0.071, 0.211, 0.501 , 0.058, -0.020, 0.175, 0.057 ,
0.037, 0.020, 0.107]]),
array([[ 1.197e-01, 8.794e-01, -4.410e-04, 4.407e-02, -4.213e-
02, 5.300e-02, 5.581e-02, 8.255e-02, -1.182e-01, 9.888e-02]])
]
true values :
[[0. 0. 0. 0. 0. 0. 0. 1. 0. 0.]
[0. 0. 1. 0. 0. 0. 0. 0. 0. 0.]
[0. 1. 0. 0. 0. 0. 0. 0. 0. 0.]]
The full code for this article is available on GitHub:
https://github.com/OmarAflak/Medium-Python-Neural-Network
I’ve recently put the content of that article into a beautifully animated video. You
can check it out on YouTube.
Neural Network from Scratch | Mathematics & Python Code — The Independent Code
Convolutional Neural Network from Scratch | Mathematics & Python Code — The Independent Code
If you liked this post — I’d really appreciate if you hit the clap button 👏 it
would help me a lot. Peace! 😎