In [17]: import nltk
import pandas as pd
import re
from nltk.corpus import stopwords
from nltk.tokenize import word_tokenize
from nltk.stem import WordNetLemmatizer
In [18]: nltk.download('punkt')
nltk.download('stopwords')
nltk.download('wordnet')
[nltk_data] Downloading package punkt to
[nltk_data] C:\Users\student\AppData\Roaming\nltk_data...
[nltk_data] Package punkt is already up-to-date!
[nltk_data] Downloading package stopwords to
[nltk_data] C:\Users\student\AppData\Roaming\nltk_data...
[nltk_data] Package stopwords is already up-to-date!
[nltk_data] Downloading package wordnet to
[nltk_data] C:\Users\student\AppData\Roaming\nltk_data...
[nltk_data] Package wordnet is already up-to-date!
Out[18]: True
Method 1: lexicon counts normalized by review length
In [19]: df = pd.read_csv('reviews.csv', usecols=['body'])
lemma = WordNetLemmatizer()
stop_words = stopwords.words('english')
In [20]: def text_prep(x):
    # lowercase, strip non-letters, tokenize, drop stopwords, lemmatize
    corp = str(x).lower()
    corp = re.sub('[^a-zA-Z]+', ' ', corp).strip()
    tokens = word_tokenize(corp)
    words = [t for t in tokens if t not in stop_words]
    lemmatized = [lemma.lemmatize(w) for w in words]
    return lemmatized
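A quick sanity check (illustrative, not from the original run) shows the whole pipeline: lowercasing, punctuation stripping, stopword removal, and noun lemmatization.

text_prep("The phones were AMAZING!!!")
# -> ['phone', 'amazing']   ('the'/'were' dropped, 'phones' lemmatized to 'phone')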
In [22]: preprocess_tag = [text_prep(i) for i in df['body']]
df["preprocess_txt"] = preprocess_tag
df['total_len'] = df['preprocess_txt'].map(lambda x: len(x))
In [24]: # load the opinion lexicons (one word per line); context managers close the files
with open('negative-words.txt', 'r') as file:
    neg_words = file.read().split()
with open('positive-words.txt', 'r') as file:
    pos_words = file.read().split()
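Both lexicons are plain Python lists, so the counting cell below scans a list once per token. An optional tweak (not in the original notebook) is to convert them, and stop_words above, to sets for O(1) membership tests:

pos_words = set(pos_words)
neg_words = set(neg_words)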
In [27]: num_pos = df['preprocess_txt'].map(lambda x: len([i for i in x if i in pos_words]))
df['pos_count'] = num_pos
num_neg = df['preprocess_txt'].map(lambda x: len([i for i in x if i in neg_words]))
df['neg_count'] = num_neg
df['sentiment'] = round((df['pos_count'] - df['neg_count']) / df['total_len'], 2)
df.head()
Out[27]:
                                                body                                     preprocess_txt  total_len  pos_count  neg_count  sentiment
0  I had the Samsung A600 for awhile which is abs...  [samsung, awhile, absolute, doo, doo, read, re...        162         18         18       0.00
1  Due to a software issue between Nokia and Spri...  [due, software, issue, nokia, sprint, phone, t...         67          8          3       0.07
2  This is a great, reliable phone. I also purcha...  [great, reliable, phone, also, purchased, phon...         68         10          4       0.09
3  I love the phone and all, because I really did...  [love, phone, really, need, one, expect, price...         41          3          0       0.07
4  The phone has been great for every purpose it ...  [phone, great, every, purpose, offer, except, ...         56          5          3       0.04
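Method 1's score is (pos_count - neg_count) / total_len, rounded to two decimals: a net-positivity rate per token. Row 1, for instance, gives (8 - 3) / 67 ≈ 0.07, while row 0, with equal positive and negative counts, scores exactly 0.00.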
Method 2: ratio of positive to negative counts
In [28]: df['sentiment'] = round(df['pos_count'] / (df['neg_count']+1), 2)
df.head()
Out[28]:
                                                body                                     preprocess_txt  total_len  pos_count  neg_count  sentiment
0  I had the Samsung A600 for awhile which is abs...  [samsung, awhile, absolute, doo, doo, read, re...        162         18         18       0.95
1  Due to a software issue between Nokia and Spri...  [due, software, issue, nokia, sprint, phone, t...         67          8          3       2.00
2  This is a great, reliable phone. I also purcha...  [great, reliable, phone, also, purchased, phon...         68         10          4       2.00
3  I love the phone and all, because I really did...  [love, phone, really, need, one, expect, price...         41          3          0       3.00
4  The phone has been great for every purpose it ...  [phone, great, every, purpose, offer, except, ...         56          5          3       1.25
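Method 2 swaps the difference for a ratio, pos_count / (neg_count + 1); the +1 keeps the division defined when a review has no negative words (row 3: 3 / (0 + 1) = 3.00). On this scale, values above 1 lean positive and values below 1 lean negative, so row 0 (18 / 19 ≈ 0.95) now reads slightly negative rather than neutral.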
In [30]: nltk.download('vader_lexicon')
[nltk_data] Downloading package vader_lexicon to
[nltk_data] C:\Users\student\AppData\Roaming\nltk_data...
Out[30]: True
Method 3: VADER compound score
In [35]: from nltk.sentiment.vader import SentimentIntensityAnalyzer
sent = SentimentIntensityAnalyzer()
df = pd.read_csv('reviews.csv', usecols=['body'])
df['body'] = df['body'].fillna('')
polarity = [round(sent.polarity_scores(str(i))['compound'], 2) for i in df['body']]
df['sentiment_score'] = polarity
print(df.head())
body sentiment_score
0 I had the Samsung A600 for awhile which is abs... 0.86
1 Due to a software issue between Nokia and Spri... 0.89
2 This is a great, reliable phone. I also purcha... 0.80
3 I love the phone and all, because I really did... 0.96
4 The phone has been great for every purpose it ... 0.77
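VADER's compound score is normalized to the range [-1, 1]. A common convention from the VADER authors (not applied in this notebook) treats scores >= 0.05 as positive, <= -0.05 as negative, and anything in between as neutral; a minimal sketch of that thresholding:

def vader_label(compound, eps=0.05):
    # map a compound score to a discrete label using the standard cutoffs
    if compound >= eps:
        return 'positive'
    if compound <= -eps:
        return 'negative'
    return 'neutral'

df['sentiment_label'] = df['sentiment_score'].map(vader_label)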
Extra
In [54]: # Create WordNetLemmatizer object
wnl = WordNetLemmatizer()
# single-word lemmatization examples
list1 = ['kites', 'babies', 'dogs', 'flying', 'smiling',
         'driving', 'tried', 'feet']
for words in list1:
    print(words + " ---> " + wnl.lemmatize(words))
print('better' + " ---> " + wnl.lemmatize('better', pos='a'))
kites ---> kite
babies ---> baby
dogs ---> dog
flying ---> flying
smiling ---> smiling
driving ---> driving
tried ---> tried
feet ---> foot
better ---> good
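Note that 'flying', 'smiling', 'driving', and 'tried' come back unchanged: lemmatize() treats its input as a noun unless told otherwise. Passing a part-of-speech tag, as done above for 'better' (pos='a'), fixes this; an illustrative check:

print(wnl.lemmatize('flying', pos='v'))  # ---> fly
print(wnl.lemmatize('tried', pos='v'))   # ---> try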
In [59]: sentence = 'I am good in cricket, but best in Football.'
# Tokenize the sentence
tokens = nltk.word_tokenize(sentence)
# Get English stopwords
english_stopwords = set(stopwords.words('english'))
# Filter out stopwords
filtered_tokens = [word for word in tokens if word.lower() not in english_stopwords]
print(filtered_tokens)
['good', 'cricket', ',', 'best', 'Football', '.']
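The ',' and '.' tokens survive because NLTK tokenizes punctuation separately and the stopword list contains only words; in Method 1 it was the regex substitution inside text_prep that stripped punctuation.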
In [60]: import nltk
from nltk.stem import PorterStemmer
# Sentence to stem
sentence = 'I am good in cricket, but best in Football.'
# Tokenize the sentence
tokens = nltk.word_tokenize(sentence)
# Initialize PorterStemmer
stemmer = PorterStemmer()
# Perform stemming on each token
stemmed_tokens = [stemmer.stem(word) for word in tokens]
print(stemmed_tokens)
['I', 'am', 'good', 'in', 'cricket', ',', 'but', 'best', 'in', 'footbal', '.']
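Note the contrast with lemmatization: the Porter stemmer chops suffixes by rule, so it happily produces non-words like 'footbal', while WordNetLemmatizer only returns dictionary forms. Stemming is faster; lemmatization is the better fit for lexicon lookups like those in Methods 1 and 2.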