SlideShare a Scribd company logo
1 of 16
Title of the Project:-
Detailed Classification of Customer Reviews
Using SentimentAnalysis
G.H Raisoni College of Engineering, Nagpur
Department of Information Technology and
Engineering
Project Phase –I
:
ABSTRACT
 With the rapid of Growth of E-Commerce websites,
there is no doubt that people are drawn to online
shopping more nowadays. As people are drawn to it, so
are the sellers. But sellers can differentiate in quality as
well as quantity.
 To make it easier for consumers to decide which product
suits for their demands and needs, customer review on
E-Commerce Websites is a proficient way for consumers
to get what they are looking for
INTRODUCTION
 Sentiment analysis (or opinion mining) is a natural language
processing (NLP) technique used to determine whether data is
positive, negative, or neutral.
 Sentiment analysis is often performed on text data to help
businesses monitor brand and product sentiment in customer
feedback and understand customer needs.
• The goal is to automatically recognize and classify opinions expressed in text to determine overall sentiment.
Sentiment analysis is the process of analyzing online writing to determine whether it is positive, negative, or neutral. Simply put,
sentiment analysis helps find the author's attitude towards a topic.
INTRODUCTION
Objective
We will be using Concepts likeVader and RoBERTa Model and comparing the results
between these two models.
• Vader Model:
Vader is model that is based on lexicon and rule based matching , sentiment analysis tool
that is specifically aware to sentiments expressed in social platform.
The VADER sentimental analysis uses a dictionary that converts lexical data into sentiment
scores, which measure the intensity of an emotion. By adding the intensity of each word in a
text, one can determine the sentiment score of that text.
• RoBERTa Model:
RoBERTa stands for Robustly Optimized BERT Pre-training Approach.
The goal of this model is to optimize the training of BERT architecture in order to take lesser
time during pre-training.
Methodology
Methodology
VADER Model:
This model for text sentiment is sensitive to both the polarity (positive/negative)
and intensity (strong) of emotion.TheVADER sentimental analysis uses a
dictionary that converts lexical data into sentiment scores, which measure the
intensity of an emotion. By adding the intensity of each word in a text, one can
determine the sentiment score of that text.
POSTaggingTable:
• Part-of-speech (POS) is the practice of
categorizing words in a text (corpus) in
accordance with a certain part of
speech, depending on the word's
definition and context, is known as
tagging in natural language processing
Working of
RoBERTa
Model
RoBERTa Model
 RoBERTa Model is a model trained of a large corpus of data
 Transformer model that accounts for the words but also the
context related to other words.
 teaches the computer to anticipate purposefully hidden text inside
examples of unannotated language
 RoBERTa outperforms BERT in terms of the masked language
modelling aim and performs better on subsequent tasks.
TESTING
Postive reviews and their Polarity Scores
Review – “I am so happy I orderd it”
Here the Comment is I am so happy I orderd it , as we can see the review is about
a satisfied customer who is
happy with this product, hence we can say this is a positive review.
Now if we see the polarity scores that are
Negative (neg) – 0.0
Here there is no negative words or emotion hence the neg score is 0
Positive(pos) – 0.517
The positive score here is good as the words suggest that the review is overall a
positive review.
Compound Score – 0.646
TESTING
Review – “This is bad I hated it”
Here the Comment isThis is bad I hated it, as we can see the review is about
unsatisfied customer who is
unhappy with this product, hence we can say this is a negative review.
Now if we see the polarity scores that are
Negative (neg) – 0.658
32
Here there is a customer who is unhappy with the product so the emotion here is
negative, Hence the negative
value will be higher here.
Positive(pos) – 0.0
Here there is no positive words or emotion hence the neg score is 0.
Compound Score – 0.8271
CONCLUSION
 As we can see in our model the ROBERta has scores that are much
confident than theVADER model as it used deep learning
approach. Due to this approach a difference can be seen between
the two models.
 One cannot dismiss the value that sentiment analysis offers to the
industry despite all the obstacles and potential issues that it faces.
Sentiment analysis is destined to become one of the key
determinants of many business decisions in the future because it
bases its findings on elements that are fundamentally
compassionate.
 Sentiment analysis’s results are helpful. It cannot be used to
forecast a company’s success or other measures. Sentiment
analysis may occasionally be unnecessary and only serve as a
reporting measure after the harm has already been done.
References
 1] A. Srithirath and P. Seresangtakul, "A hybrid approach to Lao word segmentation using longest syllable level
 matching with named entities recognition," 2013 10th International Conference on Electrical
 Engineering/Electronics, Computer, Telecommunications and Information Technology, 2013, pp. 1-5, doi:
 10.1109/ECTICon.2013.6559585.
 2] R. Jiamthapthaksin, P. Setthawong and N. Ratanasawetwad, "A system for popular Thai slang extraction
 from social media content with n-gram based tokenization," 2016 8th International Conference on Knowledge
 and SmartTechnology (KST), 2016, pp. 130-135, doi: 10.1109/KST.2016.7440478.
 3] I. Olenych, M. Prytula, O. Sinkevych and O. Khamar, "System of Automatic Determination of Ukrainian
 Text Tone," 2021 IEEE 12th International Conference on Electronics and Information Technologies (ELIT),
 2021, pp. 80-83, doi: 10.1109/ELIT53502.2021.9501124.
 4] S. J. Putra, M. N. Gunawan and A. Suryatno, "Tokenization and N-Gram for Indexing Indonesian
 Translation of the Quran," 2018 6th International Conference on Information and Communication Technology
 (ICoICT), 2018, pp. 158-161, doi:10.1109/ICoICT.2018.8528762.
 5] P. Prakrankamanant and E. Chuangsuwanich, "Tokenization-based data augmentation for text classification," 2022
 19th International Joint Conference on Computer Science and Software Engineering (JCSSE), 2022, pp. 1-6, doi:
 10.1109/JCSSE54890.2022.9836268.
 6] Abdul-Mageed, M., M.T. Diab, and M. Korayem. Subjectivity and sentiment analysis of modern standard Arabic.
 In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics:shortpapers, 2011.
 7] Akkaya, C., J. Wiebe, and R. Mihalcea. Subjectivity word sense disambiguation. In Proceedings of
 the 2009 Conference on Empirical Methods in Natural Language Processing (EMNLP-2009), 2009.
 8] Alm, C.O. Subjective natural language problems: motivations, applications, characterizations, and
 implications. In Proceedings of the 49th Annual Meeting of the Association for Computational
 Linguistics:shortpapers (ACL-2011), 2011.
ThankYou

More Related Content

Similar to Customer review using sentiment analysis.pptx

Dictionary Based Approach to Sentiment Analysis - A Review
Dictionary Based Approach to Sentiment Analysis - A ReviewDictionary Based Approach to Sentiment Analysis - A Review
Dictionary Based Approach to Sentiment Analysis - A Review
INFOGAIN PUBLICATION
 

Similar to Customer review using sentiment analysis.pptx (20)

Live Twitter Sentiment Analysis and Interactive Visualizations with PyLDAvis ...
Live Twitter Sentiment Analysis and Interactive Visualizations with PyLDAvis ...Live Twitter Sentiment Analysis and Interactive Visualizations with PyLDAvis ...
Live Twitter Sentiment Analysis and Interactive Visualizations with PyLDAvis ...
 
Hybrid Deep Learning Model for Multilingual Sentiment Analysis
Hybrid Deep Learning Model for Multilingual Sentiment AnalysisHybrid Deep Learning Model for Multilingual Sentiment Analysis
Hybrid Deep Learning Model for Multilingual Sentiment Analysis
 
Ieee format 5th nccci_a study on factors influencing as a best practice for...
Ieee format 5th nccci_a study on factors influencing as  a  best practice for...Ieee format 5th nccci_a study on factors influencing as  a  best practice for...
Ieee format 5th nccci_a study on factors influencing as a best practice for...
 
NLP Ecosystem
NLP EcosystemNLP Ecosystem
NLP Ecosystem
 
SPEECH EMOTION RECOGNITION
SPEECH EMOTION RECOGNITIONSPEECH EMOTION RECOGNITION
SPEECH EMOTION RECOGNITION
 
L017358286
L017358286L017358286
L017358286
 
Sentiment Features based Analysis of Online Reviews
Sentiment Features based Analysis of Online ReviewsSentiment Features based Analysis of Online Reviews
Sentiment Features based Analysis of Online Reviews
 
IRJET- BDI using NLP for Efficient Depression Identification
IRJET- BDI using NLP for Efficient Depression IdentificationIRJET- BDI using NLP for Efficient Depression Identification
IRJET- BDI using NLP for Efficient Depression Identification
 
How Does Customer Feedback Sentiment Analysis Work in Search Marketing?
How Does Customer Feedback Sentiment Analysis Work in Search Marketing?How Does Customer Feedback Sentiment Analysis Work in Search Marketing?
How Does Customer Feedback Sentiment Analysis Work in Search Marketing?
 
ANALYSING SPEECH EMOTION USING NEURAL NETWORK ALGORITHM
ANALYSING SPEECH EMOTION USING NEURAL NETWORK ALGORITHMANALYSING SPEECH EMOTION USING NEURAL NETWORK ALGORITHM
ANALYSING SPEECH EMOTION USING NEURAL NETWORK ALGORITHM
 
Sentiment analysis on unstructured review
Sentiment analysis on unstructured reviewSentiment analysis on unstructured review
Sentiment analysis on unstructured review
 
INFORMATION RETRIEVAL FROM TEXT
INFORMATION RETRIEVAL FROM TEXTINFORMATION RETRIEVAL FROM TEXT
INFORMATION RETRIEVAL FROM TEXT
 
Dictionary Based Approach to Sentiment Analysis - A Review
Dictionary Based Approach to Sentiment Analysis - A ReviewDictionary Based Approach to Sentiment Analysis - A Review
Dictionary Based Approach to Sentiment Analysis - A Review
 
IRJET- Survey of Classification of Business Reviews using Sentiment Analysis
IRJET- Survey of Classification of Business Reviews using Sentiment AnalysisIRJET- Survey of Classification of Business Reviews using Sentiment Analysis
IRJET- Survey of Classification of Business Reviews using Sentiment Analysis
 
IRJET- Sentimental Analysis on Audio and Video using Vader Algorithm -Monali ...
IRJET- Sentimental Analysis on Audio and Video using Vader Algorithm -Monali ...IRJET- Sentimental Analysis on Audio and Video using Vader Algorithm -Monali ...
IRJET- Sentimental Analysis on Audio and Video using Vader Algorithm -Monali ...
 
IRJET- Opinion Targets and Opinion Words Extraction for Online Reviews wi...
IRJET-  	  Opinion Targets and Opinion Words Extraction for Online Reviews wi...IRJET-  	  Opinion Targets and Opinion Words Extraction for Online Reviews wi...
IRJET- Opinion Targets and Opinion Words Extraction for Online Reviews wi...
 
IRJET- Interpreting Public Sentiments Variation by using FB-LDA Technique
IRJET- Interpreting Public Sentiments Variation by using FB-LDA TechniqueIRJET- Interpreting Public Sentiments Variation by using FB-LDA Technique
IRJET- Interpreting Public Sentiments Variation by using FB-LDA Technique
 
Sentence level sentiment polarity calculation for customer reviews by conside...
Sentence level sentiment polarity calculation for customer reviews by conside...Sentence level sentiment polarity calculation for customer reviews by conside...
Sentence level sentiment polarity calculation for customer reviews by conside...
 
NLP Techniques for Sentiment Anaysis.docx
NLP Techniques for Sentiment Anaysis.docxNLP Techniques for Sentiment Anaysis.docx
NLP Techniques for Sentiment Anaysis.docx
 
N01741100102
N01741100102N01741100102
N01741100102
 

Recently uploaded

Recently uploaded (20)

Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
Evaluating the top large language models.pdf
Evaluating the top large language models.pdfEvaluating the top large language models.pdf
Evaluating the top large language models.pdf
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Tech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdfTech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdf
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 

Customer review using sentiment analysis.pptx

  • 1. Title of the Project:- Detailed Classification of Customer Reviews Using SentimentAnalysis G.H Raisoni College of Engineering, Nagpur Department of Information Technology and Engineering Project Phase –I :
  • 2. ABSTRACT  With the rapid of Growth of E-Commerce websites, there is no doubt that people are drawn to online shopping more nowadays. As people are drawn to it, so are the sellers. But sellers can differentiate in quality as well as quantity.  To make it easier for consumers to decide which product suits for their demands and needs, customer review on E-Commerce Websites is a proficient way for consumers to get what they are looking for
  • 3.
  • 4. INTRODUCTION  Sentiment analysis (or opinion mining) is a natural language processing (NLP) technique used to determine whether data is positive, negative, or neutral.  Sentiment analysis is often performed on text data to help businesses monitor brand and product sentiment in customer feedback and understand customer needs.
  • 5. • The goal is to automatically recognize and classify opinions expressed in text to determine overall sentiment. Sentiment analysis is the process of analyzing online writing to determine whether it is positive, negative, or neutral. Simply put, sentiment analysis helps find the author's attitude towards a topic. INTRODUCTION
  • 6. Objective We will be using Concepts likeVader and RoBERTa Model and comparing the results between these two models. • Vader Model: Vader is model that is based on lexicon and rule based matching , sentiment analysis tool that is specifically aware to sentiments expressed in social platform. The VADER sentimental analysis uses a dictionary that converts lexical data into sentiment scores, which measure the intensity of an emotion. By adding the intensity of each word in a text, one can determine the sentiment score of that text. • RoBERTa Model: RoBERTa stands for Robustly Optimized BERT Pre-training Approach. The goal of this model is to optimize the training of BERT architecture in order to take lesser time during pre-training.
  • 8. Methodology VADER Model: This model for text sentiment is sensitive to both the polarity (positive/negative) and intensity (strong) of emotion.TheVADER sentimental analysis uses a dictionary that converts lexical data into sentiment scores, which measure the intensity of an emotion. By adding the intensity of each word in a text, one can determine the sentiment score of that text.
  • 9. POSTaggingTable: • Part-of-speech (POS) is the practice of categorizing words in a text (corpus) in accordance with a certain part of speech, depending on the word's definition and context, is known as tagging in natural language processing
  • 11. RoBERTa Model  RoBERTa Model is a model trained of a large corpus of data  Transformer model that accounts for the words but also the context related to other words.  teaches the computer to anticipate purposefully hidden text inside examples of unannotated language  RoBERTa outperforms BERT in terms of the masked language modelling aim and performs better on subsequent tasks.
  • 12. TESTING Postive reviews and their Polarity Scores Review – “I am so happy I orderd it” Here the Comment is I am so happy I orderd it , as we can see the review is about a satisfied customer who is happy with this product, hence we can say this is a positive review. Now if we see the polarity scores that are Negative (neg) – 0.0 Here there is no negative words or emotion hence the neg score is 0 Positive(pos) – 0.517 The positive score here is good as the words suggest that the review is overall a positive review. Compound Score – 0.646
  • 13. TESTING Review – “This is bad I hated it” Here the Comment isThis is bad I hated it, as we can see the review is about unsatisfied customer who is unhappy with this product, hence we can say this is a negative review. Now if we see the polarity scores that are Negative (neg) – 0.658 32 Here there is a customer who is unhappy with the product so the emotion here is negative, Hence the negative value will be higher here. Positive(pos) – 0.0 Here there is no positive words or emotion hence the neg score is 0. Compound Score – 0.8271
  • 14. CONCLUSION  As we can see in our model the ROBERta has scores that are much confident than theVADER model as it used deep learning approach. Due to this approach a difference can be seen between the two models.  One cannot dismiss the value that sentiment analysis offers to the industry despite all the obstacles and potential issues that it faces. Sentiment analysis is destined to become one of the key determinants of many business decisions in the future because it bases its findings on elements that are fundamentally compassionate.  Sentiment analysis’s results are helpful. It cannot be used to forecast a company’s success or other measures. Sentiment analysis may occasionally be unnecessary and only serve as a reporting measure after the harm has already been done.
  • 15. References  1] A. Srithirath and P. Seresangtakul, "A hybrid approach to Lao word segmentation using longest syllable level  matching with named entities recognition," 2013 10th International Conference on Electrical  Engineering/Electronics, Computer, Telecommunications and Information Technology, 2013, pp. 1-5, doi:  10.1109/ECTICon.2013.6559585.  2] R. Jiamthapthaksin, P. Setthawong and N. Ratanasawetwad, "A system for popular Thai slang extraction  from social media content with n-gram based tokenization," 2016 8th International Conference on Knowledge  and SmartTechnology (KST), 2016, pp. 130-135, doi: 10.1109/KST.2016.7440478.  3] I. Olenych, M. Prytula, O. Sinkevych and O. Khamar, "System of Automatic Determination of Ukrainian  Text Tone," 2021 IEEE 12th International Conference on Electronics and Information Technologies (ELIT),  2021, pp. 80-83, doi: 10.1109/ELIT53502.2021.9501124.  4] S. J. Putra, M. N. Gunawan and A. Suryatno, "Tokenization and N-Gram for Indexing Indonesian  Translation of the Quran," 2018 6th International Conference on Information and Communication Technology  (ICoICT), 2018, pp. 158-161, doi:10.1109/ICoICT.2018.8528762.  5] P. Prakrankamanant and E. Chuangsuwanich, "Tokenization-based data augmentation for text classification," 2022  19th International Joint Conference on Computer Science and Software Engineering (JCSSE), 2022, pp. 1-6, doi:  10.1109/JCSSE54890.2022.9836268.  6] Abdul-Mageed, M., M.T. Diab, and M. Korayem. Subjectivity and sentiment analysis of modern standard Arabic.  In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics:shortpapers, 2011.  7] Akkaya, C., J. Wiebe, and R. Mihalcea. Subjectivity word sense disambiguation. In Proceedings of  the 2009 Conference on Empirical Methods in Natural Language Processing (EMNLP-2009), 2009.  8] Alm, C.O. Subjective natural language problems: motivations, applications, characterizations, and  implications. In Proceedings of the 49th Annual Meeting of the Association for Computational  Linguistics:shortpapers (ACL-2011), 2011.