SlideShare a Scribd company logo
1 of 24
Sentiment Analysis
Of Nepali Text
Using Naïve Bayes
MAHESH ACHARYA
ABHISHEK SAPKOTA
ASHOK CHHETRI
RABIN BHANDARI
UNDER SUPERVISION OF
MR. BIKASH BALAMI
INTRODUCTION
 Contextual mining of text in Negative, Positive or Neutral
 Analysis the opinion of customer on products and services
 Study the political views of people of current affairs
1
PROBLEM DEFINITION
 Sentiment analysis is in research phase in Nepali Language
 What if business owners make strategic plan based on customers
reviews?
 E-commerce sites not utilizing the reviews of the customers
2
OBJECTIVES
 To analysis the opinion of customer on products and services
 To study the political views of people of current affairs
3
SCOPE
SCOPE
 Holds benefit for ecommerce sites
4
USE CASE DIAGRAM
5
METHODOLOGY
Data Collection
Nepali contextual data were collected manually
6
METHODOLOGY
Data Collection
Nepali contextual data were collected manually
7
DATA PREPROCESSING
आह! कती राम्रो समान रहेछ यो ।
:)
अहँ! यो भनाइ ठीक छैन । :(
8
आह कती राम्रो समान रहेछ यो
अहँ यो भनाइ ठीक छैन
अहँ यो भनाइ ठीक छैन
आह कती राम्रो समान रहेछ यो
अहँ यो भनाइ ठीक छैन
TOKENIZATION
आह कती राम्रो समान रहेछ यो
REMOVE STOP WORDS
आह कती राम्रो समान रहेछ यो
आह कती राम्रो समान रहेछ
अहँ भनाइ ठीक छैन
अहँ यो भनाइ ठीक छैन
VECTORIZE - TF(TERM FREQUENCY
)
आह कती राम्रो समान रहेछ यो
0.20 0.20 0.20 0.20 0.20
0.25 0.25 0.25 0.25
अहँ यो भनाइ ठीक छैन
IDF(INVERSE DOCUMENT FREQUENCY)
0.30 0.30 0.30 0.30 0.30
0.30 0.30 0.30 0.30
अहँ यो भनाइ ठीक छैन
आह कती राम्रो समान रहेछ यो
TFIDF(TERM FREQUENCY INVERSE
DOCUMENT FREQUENCY)
0.60 0.60 0.60 0.60 0.60
0.075 0.075 0.075 0.075
अहँ यो भनाइ ठीक छैन
आह कती राम्रो समान रहेछ यो
TRAIN PHASE
Naïve Bayes
Train Model
{ आह:{pos:2,neg:1,neu:1} ,कती :{pos:2,neg:1,neu:1} राम्रो :{pos:2,neg:1,neu:1}
समान :{pos:2,neg:1,neu:1} रहेछ :{pos:2,neg:1,neu:1} यो :{pos:2,neg:1,neu:1}
0.075 0.075 0.075 0.075 0.075 1(pos)
0.60 0.60 0.60 0.60 0(neg)
TEST PHASE
Naïve Bayes
Testing Model
0.0775 0.0775 0.0775 0.0775 0.0775
0.680 0.680 0.880 0.180
PREDICTION
User Input
यो ठाउँ घुम्न को लागि राम्रो छ । 
Remove Punctuation
यो ठाउँ घुम्न को लागि राम्रो छ
(Note : User input also vectorize like above method.)
RESULT
Positive
ACCURACY MEASURE
SN. Train Data (%) Precision (%) Recall (%) F-Score (%)
1. 60 56.27 53.68 54.68
2. 70 57.55 54.91 56.20
3. 80 59.03 57.05 58.02
4. 90 61.45 59.33 60.37
LIMITATION
 The accuracy of the reviews is low when there is the mixture of
Nepali as well as English language in comments.
 Ambiguity can occurs and semantic analysis is difficult to address.
19
TOOLS USED
Front-End Back-End
Demo
22
23

More Related Content

Recently uploaded

Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
WSO2
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 

Recently uploaded (20)

AI+A11Y 11MAY2024 HYDERBAD GAAD 2024 - HelloA11Y (11 May 2024)
AI+A11Y 11MAY2024 HYDERBAD GAAD 2024 - HelloA11Y (11 May 2024)AI+A11Y 11MAY2024 HYDERBAD GAAD 2024 - HelloA11Y (11 May 2024)
AI+A11Y 11MAY2024 HYDERBAD GAAD 2024 - HelloA11Y (11 May 2024)
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
JohnPollard-hybrid-app-RailsConf2024.pptx
JohnPollard-hybrid-app-RailsConf2024.pptxJohnPollard-hybrid-app-RailsConf2024.pptx
JohnPollard-hybrid-app-RailsConf2024.pptx
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptx
 
How to Check CNIC Information Online with Pakdata cf
How to Check CNIC Information Online with Pakdata cfHow to Check CNIC Information Online with Pakdata cf
How to Check CNIC Information Online with Pakdata cf
 
Less Is More: Utilizing Ballerina to Architect a Cloud Data Platform
Less Is More: Utilizing Ballerina to Architect a Cloud Data PlatformLess Is More: Utilizing Ballerina to Architect a Cloud Data Platform
Less Is More: Utilizing Ballerina to Architect a Cloud Data Platform
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
 
The Zero-ETL Approach: Enhancing Data Agility and Insight
The Zero-ETL Approach: Enhancing Data Agility and InsightThe Zero-ETL Approach: Enhancing Data Agility and Insight
The Zero-ETL Approach: Enhancing Data Agility and Insight
 
Design and Development of a Provenance Capture Platform for Data Science
Design and Development of a Provenance Capture Platform for Data ScienceDesign and Development of a Provenance Capture Platform for Data Science
Design and Development of a Provenance Capture Platform for Data Science
 
Understanding the FAA Part 107 License ..
Understanding the FAA Part 107 License ..Understanding the FAA Part 107 License ..
Understanding the FAA Part 107 License ..
 
Choreo: Empowering the Future of Enterprise Software Engineering
Choreo: Empowering the Future of Enterprise Software EngineeringChoreo: Empowering the Future of Enterprise Software Engineering
Choreo: Empowering the Future of Enterprise Software Engineering
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
ChatGPT and Beyond - Elevating DevOps Productivity
ChatGPT and Beyond - Elevating DevOps ProductivityChatGPT and Beyond - Elevating DevOps Productivity
ChatGPT and Beyond - Elevating DevOps Productivity
 
Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with Milvus
 
Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​
 
Stronger Together: Developing an Organizational Strategy for Accessible Desig...
Stronger Together: Developing an Organizational Strategy for Accessible Desig...Stronger Together: Developing an Organizational Strategy for Accessible Desig...
Stronger Together: Developing an Organizational Strategy for Accessible Desig...
 
Navigating Identity and Access Management in the Modern Enterprise
Navigating Identity and Access Management in the Modern EnterpriseNavigating Identity and Access Management in the Modern Enterprise
Navigating Identity and Access Management in the Modern Enterprise
 
Introduction to use of FHIR Documents in ABDM
Introduction to use of FHIR Documents in ABDMIntroduction to use of FHIR Documents in ABDM
Introduction to use of FHIR Documents in ABDM
 
Vector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptxVector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptx
 

Featured

Social Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsSocial Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie Insights
Kurio // The Social Media Age(ncy)
 

Featured (20)

PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024
 
Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)
 
How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024
 
Social Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsSocial Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie Insights
 
Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024
 
5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary
 
ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd
 
Getting into the tech field. what next
Getting into the tech field. what next Getting into the tech field. what next
Getting into the tech field. what next
 
Google's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search IntentGoogle's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search Intent
 
How to have difficult conversations
How to have difficult conversations How to have difficult conversations
How to have difficult conversations
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data Science
 
Time Management & Productivity - Best Practices
Time Management & Productivity -  Best PracticesTime Management & Productivity -  Best Practices
Time Management & Productivity - Best Practices
 
The six step guide to practical project management
The six step guide to practical project managementThe six step guide to practical project management
The six step guide to practical project management
 
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
 
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
 
12 Ways to Increase Your Influence at Work
12 Ways to Increase Your Influence at Work12 Ways to Increase Your Influence at Work
12 Ways to Increase Your Influence at Work
 
ChatGPT webinar slides
ChatGPT webinar slidesChatGPT webinar slides
ChatGPT webinar slides
 
More than Just Lines on a Map: Best Practices for U.S Bike Routes
More than Just Lines on a Map: Best Practices for U.S Bike RoutesMore than Just Lines on a Map: Best Practices for U.S Bike Routes
More than Just Lines on a Map: Best Practices for U.S Bike Routes
 
Ride the Storm: Navigating Through Unstable Periods / Katerina Rudko (Belka G...
Ride the Storm: Navigating Through Unstable Periods / Katerina Rudko (Belka G...Ride the Storm: Navigating Through Unstable Periods / Katerina Rudko (Belka G...
Ride the Storm: Navigating Through Unstable Periods / Katerina Rudko (Belka G...
 
Barbie - Brand Strategy Presentation
Barbie - Brand Strategy PresentationBarbie - Brand Strategy Presentation
Barbie - Brand Strategy Presentation
 

Sentiment Analysis on Nepali sentences

  • 1. Sentiment Analysis Of Nepali Text Using Naïve Bayes MAHESH ACHARYA ABHISHEK SAPKOTA ASHOK CHHETRI RABIN BHANDARI UNDER SUPERVISION OF MR. BIKASH BALAMI
  • 2. INTRODUCTION  Contextual mining of text in Negative, Positive or Neutral  Analysis the opinion of customer on products and services  Study the political views of people of current affairs 1
  • 3. PROBLEM DEFINITION  Sentiment analysis is in research phase in Nepali Language  What if business owners make strategic plan based on customers reviews?  E-commerce sites not utilizing the reviews of the customers 2
  • 4. OBJECTIVES  To analysis the opinion of customer on products and services  To study the political views of people of current affairs 3
  • 5. SCOPE SCOPE  Holds benefit for ecommerce sites 4
  • 7. METHODOLOGY Data Collection Nepali contextual data were collected manually 6
  • 8. METHODOLOGY Data Collection Nepali contextual data were collected manually 7
  • 9. DATA PREPROCESSING आह! कती राम्रो समान रहेछ यो । :) अहँ! यो भनाइ ठीक छैन । :( 8 आह कती राम्रो समान रहेछ यो अहँ यो भनाइ ठीक छैन
  • 10. अहँ यो भनाइ ठीक छैन आह कती राम्रो समान रहेछ यो अहँ यो भनाइ ठीक छैन TOKENIZATION आह कती राम्रो समान रहेछ यो
  • 11. REMOVE STOP WORDS आह कती राम्रो समान रहेछ यो आह कती राम्रो समान रहेछ अहँ भनाइ ठीक छैन अहँ यो भनाइ ठीक छैन
  • 12. VECTORIZE - TF(TERM FREQUENCY ) आह कती राम्रो समान रहेछ यो 0.20 0.20 0.20 0.20 0.20 0.25 0.25 0.25 0.25 अहँ यो भनाइ ठीक छैन
  • 13. IDF(INVERSE DOCUMENT FREQUENCY) 0.30 0.30 0.30 0.30 0.30 0.30 0.30 0.30 0.30 अहँ यो भनाइ ठीक छैन आह कती राम्रो समान रहेछ यो
  • 14. TFIDF(TERM FREQUENCY INVERSE DOCUMENT FREQUENCY) 0.60 0.60 0.60 0.60 0.60 0.075 0.075 0.075 0.075 अहँ यो भनाइ ठीक छैन आह कती राम्रो समान रहेछ यो
  • 15. TRAIN PHASE Naïve Bayes Train Model { आह:{pos:2,neg:1,neu:1} ,कती :{pos:2,neg:1,neu:1} राम्रो :{pos:2,neg:1,neu:1} समान :{pos:2,neg:1,neu:1} रहेछ :{pos:2,neg:1,neu:1} यो :{pos:2,neg:1,neu:1} 0.075 0.075 0.075 0.075 0.075 1(pos) 0.60 0.60 0.60 0.60 0(neg)
  • 16. TEST PHASE Naïve Bayes Testing Model 0.0775 0.0775 0.0775 0.0775 0.0775 0.680 0.680 0.880 0.180
  • 17. PREDICTION User Input यो ठाउँ घुम्न को लागि राम्रो छ ।  Remove Punctuation यो ठाउँ घुम्न को लागि राम्रो छ (Note : User input also vectorize like above method.)
  • 19. ACCURACY MEASURE SN. Train Data (%) Precision (%) Recall (%) F-Score (%) 1. 60 56.27 53.68 54.68 2. 70 57.55 54.91 56.20 3. 80 59.03 57.05 58.02 4. 90 61.45 59.33 60.37
  • 20. LIMITATION  The accuracy of the reviews is low when there is the mixture of Nepali as well as English language in comments.  Ambiguity can occurs and semantic analysis is difficult to address. 19
  • 22. Demo
  • 23. 22
  • 24. 23

Editor's Notes

  1. Punctuation Removal