Lecture Notes 6
RNN
General feedforward neural networks and CNNs are not well suited to
time-series data because these networks have no memory of their own.
Recurrent neural networks bring the unique ability to remember
important information over a period of time during training. This makes
them well suited for tasks such as natural language translation, speech
recognition, and image captioning. These networks have states defined
over a timeline and feed the output of the previous state into the
current input, as shown in Figure 1-32.
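The recurrence described above can be sketched in a few lines of NumPy. The weights and dimensions here are illustrative random values, not anything trained in this chapter; the point is only that the hidden state h carries information from earlier time steps into the current one.

```python
import numpy as np

# Illustrative RNN cell: the hidden state h is updated at every time
# step, so the final state depends on the whole sequence.
rng = np.random.default_rng(0)
input_dim, hidden_dim = 3, 4
W_xh = rng.normal(size=(input_dim, hidden_dim)) * 0.1   # input-to-hidden weights
W_hh = rng.normal(size=(hidden_dim, hidden_dim)) * 0.1  # hidden-to-hidden weights
b_h = np.zeros(hidden_dim)

def rnn_forward(inputs):
    """Run the recurrence h_t = tanh(x_t W_xh + h_{t-1} W_hh + b)."""
    h = np.zeros(hidden_dim)
    for x_t in inputs:          # one pass per time step
        h = np.tanh(x_t @ W_xh + h @ W_hh + b_h)
    return h                    # final state summarizes the sequence

sequence = rng.normal(size=(5, input_dim))  # a toy sequence of 5 time steps
final_state = rnn_forward(sequence)
print(final_state.shape)  # (4,)
```

Because the previous state feeds into each update, feeding the same steps in a different order generally produces a different final state, which is exactly the memory that feedforward networks lack.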
Chapter 1 Introduction to Machine Learning
Now we are going to use a small dataset and build a deep learning
model to predict the sentiment of a user review. We will make use of
TensorFlow and Keras to build this model. There are a couple of steps
that we need to complete before we train this model in Databricks. We
first need to go to the cluster and click Libraries. On the Libraries tab,
we select the PyPI option and enter Keras to get it installed. Similarly,
we need to enter TensorFlow once Keras is installed.
Once we upload the reviews dataset, we can create a pandas dataframe
like we did in the earlier case.
[In]: df=sparkDF.toPandas()
[In]: df.columns
[Out]: Index(['Sentiment', 'Summary'], dtype='object')
[In]: df.head(10)
[Out]:
[In]: df.Sentiment.value_counts()
[Out]:
1 1000
0 1000
We can confirm the class balance by taking the value counts of the
target column; the data appears well balanced. Before we go ahead with
building the model, since we are dealing with text data, we need to clean it
a little to ensure no unwanted errors are thrown at training time.
Hence, we write a small helper function using regular expressions.
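The same balance check can also be expressed as a proportion with `value_counts(normalize=True)`. The dataframe below is a hypothetical miniature of the reviews data, just to show the call:

```python
import pandas as pd

# A made-up, perfectly balanced stand-in for the reviews dataframe.
df = pd.DataFrame({"Sentiment": [1, 0, 1, 0],
                   "Summary": ["good", "bad", "fine", "poor"]})

# normalize=True returns class proportions instead of raw counts.
balance = df.Sentiment.value_counts(normalize=True)
print(balance.to_dict())  # {1: 0.5, 0: 0.5}
```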
[In]:
import re
def clean_reviews(text):
    text=re.sub("[^a-zA-Z]"," ",str(text))
    return re.sub(r"^\d+\s|\s\d+\s|\s\d+$"," ",text)
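To see what the helper does, we can try it on a sample string. The review text below is made up, and the function is restated so the snippet runs on its own; note the first substitution already replaces every non-letter (including digits) with a space, so the second pattern is a safety net for stray numbers:

```python
import re

def clean_reviews(text):
    # replace anything that is not a letter with a space
    text = re.sub("[^a-zA-Z]", " ", str(text))
    # drop any remaining standalone digit runs
    return re.sub(r"^\d+\s|\s\d+\s|\s\d+$", " ", text)

cleaned = clean_reviews("Great product!!! 10/10 would buy again :)")
print(cleaned)  # punctuation and digits are gone, words survive
```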
[In]: df['Summary']=df.Summary.apply(clean_reviews)
[In]: df.head(10)
[Out]:
The next step is to separate the input and output data. Since the dataset is
small, we are not going to split it into train and test sets; instead, we
will train the model on all the data.
[In]: X=df.Summary
[In]: y=df.Sentiment
We now create the tokenizer object with a vocabulary of 10,000 words
and specify an out-of-vocabulary (OOV) token to stand in for any unseen
words the model is exposed to that were not part of the training data.
[In]: from keras.preprocessing.text import Tokenizer
[In]: tokenizer=Tokenizer(num_words=10000,oov_token='xxxxxxx')
[In]: tokenizer.fit_on_texts(X)
[In]: X_dict=tokenizer.word_index
[In]: len(X_dict)
[Out]: 2018
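Under the hood, `fit_on_texts` builds a frequency-ranked word index, with the OOV token reserved so that unseen words can still be encoded. The plain-Python sketch below mimics that mechanism; it is not the Keras implementation, and the function names here are made up for illustration:

```python
from collections import Counter

def build_word_index(texts, oov_token="xxxxxxx"):
    """Rank words by frequency; index 1 is reserved for the OOV token."""
    counts = Counter(w for t in texts for w in t.lower().split())
    index = {oov_token: 1}
    for rank, (word, _) in enumerate(counts.most_common(), start=2):
        index[word] = rank
    return index

def encode(texts, index):
    # any word missing from the index falls back to the OOV index (1)
    return [[index.get(w, 1) for w in t.lower().split()] for t in texts]

idx = build_word_index(["good phone", "good battery"])
print(encode(["good screen"], idx))  # [[2, 1]] -- "screen" maps to OOV
```

This is why the OOV token matters: without it, a review containing a word outside the 2,018-word index would have no valid encoding at prediction time.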