Sentiment analysis using IMDB 50k data

Sarthak Dasgupta

Sentiment Analysis (Deep Learning)

Using the IMDB 50k dataset for sentiment analysis

Outcome:
Based on public reviews, we perform sentiment analysis and try to generalize the public's sentiment toward the movie.

First 5 public reviews and their sentiments:

Bar graph representing the count of each class
As we can see, half of the reviews have positive sentiment and the other half have negative sentiment.

Preprocessing the “Sentiment” column
The sentiment column has two values, ‘positive’ and ‘negative’, so we assign 1s and 0s to them.
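A minimal sketch of this encoding step. It assumes ‘positive’ maps to 1 and ‘negative’ to 0 (the slide does not say which label gets which value); `LABEL_TO_ID`, `encode_labels` and the sample `labels` list are hypothetical names standing in for the dataset's sentiment column.

```python
# Assumed mapping: the slides only say the two labels become 1s and 0s.
LABEL_TO_ID = {"positive": 1, "negative": 0}


def encode_labels(labels):
    """Map sentiment strings to integers, failing loudly on unknown values."""
    encoded = []
    for label in labels:
        key = label.strip().lower()
        if key not in LABEL_TO_ID:
            raise ValueError(f"unexpected sentiment label: {label!r}")
        encoded.append(LABEL_TO_ID[key])
    return encoded


# Hypothetical sample standing in for the "sentiment" column.
labels = ["positive", "negative", "negative", "positive"]
encoded = encode_labels(labels)
print(encoded)
```

Raising on an unknown label, rather than silently mapping it to 0, keeps a typo in the data from being counted as a negative review.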

Data Cleaning
Cleaning the data:
● removing punctuation marks.
● removing other unnecessary marks.
● converting capital letters to lowercase.
● collapsing extra spaces.
● removing stopwords from sentences.
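The cleaning steps listed above could be sketched roughly as follows, using only the Python standard library. The stopword set is a tiny illustrative assumption, and `clean_review` is a hypothetical helper, not the code used in the slides.

```python
import re
import string

# Tiny illustrative stopword list (an assumption); a real pipeline would
# likely use a fuller list such as the one shipped with NLTK.
STOPWORDS = {"a", "an", "the", "is", "was", "and", "of", "to", "in", "it", "this", "that"}

# Translation table that replaces every ASCII punctuation mark with a space.
_PUNCT_TO_SPACE = str.maketrans(string.punctuation, " " * len(string.punctuation))


def clean_review(text):
    """Apply the listed cleaning steps to one review."""
    text = text.lower()                      # capitalization to lowercase
    text = text.translate(_PUNCT_TO_SPACE)   # remove punctuation marks
    text = re.sub(r"[^a-z\s]", " ", text)    # drop digits and other stray marks
    words = text.split()                     # split() also collapses extra spaces
    return " ".join(w for w in words if w not in STOPWORDS)


cleaned = clean_review("The thesis  behind  the RISE of evil... is BAD!")
print(cleaned)
```

Note that stripping punctuation turns an HTML line break such as `<br />` into the word `br`, which would be consistent with the `br` tokens visible in the tokenized example in the slides.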

Tokenization:
Word          Token
1. thesis     13091
2. behind     383
3. rise       2007
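One common way to obtain ids like these is to rank words by frequency, so the most frequent word gets id 1. The sketch below does that in plain Python; it is an illustration of the idea under that assumption, not the tokenizer used in the slides, and the sample `reviews` are made up.

```python
from collections import Counter


def build_word_index(texts):
    """Rank words by frequency: the most common word gets id 1 (0 stays free for padding)."""
    counts = Counter(word for text in texts for word in text.split())
    # Sort by descending count, then alphabetically so ties are deterministic.
    ranked = sorted(counts.items(), key=lambda item: (-item[1], item[0]))
    return {word: rank for rank, (word, _) in enumerate(ranked, start=1)}


def texts_to_sequences(texts, word_index):
    """Replace each known word with its integer id; unknown words are skipped."""
    return [[word_index[w] for w in text.split() if w in word_index] for text in texts]


# Hypothetical cleaned reviews.
reviews = ["bad man bad film", "film br br bad"]
word_index = build_word_index(reviews)
sequences = texts_to_sequences(reviews, word_index)
print(word_index)
print(sequences)
```

The index should be built from the training reviews only, so that words seen only in the test set do not leak into the vocabulary.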

Splitting the Dataset for testing and training
We split the dataset randomly in an 80-20 ratio for training and testing our models, after removing stopwords from the reviews.
TRAIN size: 40000
TEST size: 10000
from sklearn.model_selection import train_test_split
# Hold out 20% of the reviews for testing; the fixed seed makes the split reproducible
x_train, x_test, y_train, y_test = train_test_split(data, sentiment, test_size=0.2, random_state=42)
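The TRAIN and TEST sizes follow directly from the ratio: 50000 × 0.2 = 10000 test reviews, leaving 40000 for training. The sketch below reproduces the same shuffle-and-slice idea with the standard library only. `shuffle_split` is a hypothetical helper and will not produce the same partition as scikit-learn, and the data is a stand-in with the same size as the dataset.

```python
import random


def shuffle_split(items, labels, test_size=0.2, seed=42):
    """Shuffle the row indices once, then hold out the last `test_size` share for testing."""
    if len(items) != len(labels):
        raise ValueError("items and labels must have the same length")
    indices = list(range(len(items)))
    random.Random(seed).shuffle(indices)
    n_train = len(indices) - round(len(indices) * test_size)
    train_idx, test_idx = indices[:n_train], indices[n_train:]
    x_train = [items[i] for i in train_idx]
    x_test = [items[i] for i in test_idx]
    y_train = [labels[i] for i in train_idx]
    y_test = [labels[i] for i in test_idx]
    return x_train, x_test, y_train, y_test


# Stand-in data with the same size as the IMDB 50k dataset.
data = [f"review {i}" for i in range(50_000)]
sentiment = [i % 2 for i in range(50_000)]
x_train, x_test, y_train, y_test = shuffle_split(data, sentiment)
print(len(x_train), len(x_test))
```

Shuffling the indices rather than the reviews themselves keeps each review aligned with its label.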

Words and their Equivalent Tokens:
Words: thesis behind rise evil seems br br hitler bad man bad man hated jews case miss going fact every scene film br br
Tokens: 13091 383 2007 337 85 1 1 2009 16 44 16 44 1631 4271 296 557 70 90 73 47 3 1 1
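Reviews produce token sequences of different lengths, while the later `x_train_pad` and `x_test_pad` names suggest they are padded to one fixed length before training. A minimal sketch of such padding, assuming zeros are added on the left and long sequences keep their last ids; `pad_pre` and the `maxlen` of 4 are hypothetical choices for illustration.

```python
def pad_pre(sequences, maxlen, value=0):
    """Left-pad short sequences with `value` and keep the last `maxlen` ids of long ones."""
    if maxlen < 1:
        raise ValueError("maxlen must be at least 1")
    padded = []
    for seq in sequences:
        trimmed = list(seq[-maxlen:])  # keep the tail of long reviews
        padded.append([value] * (maxlen - len(trimmed)) + trimmed)
    return padded


# The first five token ids from the example above, plus a shorter sequence.
sequences = [[13091, 383, 2007, 337, 85], [16, 44]]
x_pad = pad_pre(sequences, maxlen=4)
print(x_pad)
```

Id 0 is used for padding here, which is why a frequency-ranked vocabulary usually starts its word ids at 1.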

Sequential Model Summary

Model Training & Evaluating
Training the model with the necessary settings:
history = model.fit(x_train_pad, y_train, validation_split=0.3, epochs=5, batch_size=1000, shuffle=True, verbose=1)
Then we evaluate the model on the test data:
result = model.evaluate(x_test_pad, y_test)
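To make the fit settings concrete, the sketch below works out how many reviews end up in each role. It assumes the 40000-review training set from the split and that `validation_split=0.3` holds out 30% of those training reviews for validation, as Keras documents; `fit_schedule` is a hypothetical helper that only does the arithmetic.

```python
import math


def fit_schedule(n_samples, validation_split, batch_size, epochs):
    """Return the train/validation sample counts and the number of batches run."""
    n_val = round(n_samples * validation_split)
    n_train = n_samples - n_val
    steps_per_epoch = math.ceil(n_train / batch_size)
    return {
        "train": n_train,
        "validation": n_val,
        "steps_per_epoch": steps_per_epoch,
        "total_steps": steps_per_epoch * epochs,
    }


schedule = fit_schedule(n_samples=40_000, validation_split=0.3, batch_size=1000, epochs=5)
print(schedule)
```

Under these assumptions the weights are updated on 28000 reviews (28 batches per epoch, 140 in total), 12000 reviews are used for the `val_` metrics, and the 10000 test reviews are only touched by `model.evaluate`.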

Data visualization: Accuracy
import matplotlib.pyplot as plt

plt.figure()
plt.plot(history.history["accuracy"], label="Train")
# val_accuracy comes from the validation split, not from the held-out test set
plt.plot(history.history["val_accuracy"], label="Validation")
plt.title("Accuracy")
plt.ylabel("Acc")
plt.xlabel("epochs")
plt.legend()
plt.show()

Data visualization: Loss
plt.figure()
plt.plot(history.history["loss"], label="Train")
# val_loss comes from the validation split, not from the held-out test set
plt.plot(history.history["val_loss"], label="Validation")
plt.title("Loss")
plt.ylabel("Loss")
plt.xlabel("epochs")
plt.legend()
plt.show()

Creating a second model
We remove the dropout layer and create a new model.
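For context on what the removed layer does, here is a minimal sketch of inverted dropout on a plain list of activations. The rate of 0.5 is an assumption (the slides do not give one), and `dropout` is a hypothetical stand-in for a framework layer, not the author's code.

```python
import random


def dropout(activations, rate, training, rng):
    """Inverted dropout: during training, zero each value with probability `rate`
    and scale the survivors by 1 / (1 - rate); at inference, pass values through."""
    if not 0.0 <= rate < 1.0:
        raise ValueError("rate must be in [0, 1)")
    if not training:
        return list(activations)
    keep_scale = 1.0 / (1.0 - rate)
    return [a * keep_scale if rng.random() >= rate else 0.0 for a in activations]


rng = random.Random(0)
activations = [1.0] * 10
train_out = dropout(activations, rate=0.5, training=True, rng=rng)
eval_out = dropout(activations, rate=0.5, training=False, rng=rng)
print(train_out)
print(eval_out)
```

Dropout acts as a regularizer, so a model without it may fit the training reviews more closely; comparing the train and validation curves of the two models is one way to see whether that helps or hurts.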

Fitting the second model and evaluating accuracy
