SlideShare a Scribd company logo
Sentiment Analysis Project
Report
By:
Rationale behind the project
• Using computers to perceive human emotions
• Understand how words can reveal sentiments
• Discover interesting books from reviews
• Gain deeper understanding of GRU
Ethical Considerations made during the
implementation
• Eliminating user identifiable attributes
• Minimizing bias
• Dealing with profane words
• Labels of the sentiments
Accuracy of the Output
• Training set (0.9944) versus validation set accuracy (0.6275)
• Model able to generally classify review’s sentiments
• Few exceptions of wrong classification
• Constructive criticism labelled negative
• Sarcastic remarks labelled positive
Implementation Details
• Used Google Colab
• Dataset loaded using wget
• XML file parsed using beautiful soup library & regular expressions
• Negative and positive book reviews selected to train model
Implementation Details
• Truncating sentences
• Lemmatization
• Removing stop words
• Lowercasing
• Converting reviews to integer sequences
Implementation
• Training validation split
• Using GRUs
• 5 epochs
• Testing the model by inputting own reviews

More Related Content

Similar to Sentiment Analysis Project Report.pptx

Talk at Axiom Summit 2022: Exploratory Testing Driven by Mind Maps
Talk at Axiom Summit 2022: Exploratory Testing Driven by Mind MapsTalk at Axiom Summit 2022: Exploratory Testing Driven by Mind Maps
Talk at Axiom Summit 2022: Exploratory Testing Driven by Mind Maps
Claudia Badell
 
Ch11lect1 ud
Ch11lect1 udCh11lect1 ud
Ch11lect1 ud
Ahmet Balkan
 
Carl shaulis agile_td2014
Carl shaulis agile_td2014Carl shaulis agile_td2014
Carl shaulis agile_td2014
Carl Shaulis
 
Introduction to Test Driven Development
Introduction to Test Driven DevelopmentIntroduction to Test Driven Development
Introduction to Test Driven Development
Sarah Dutkiewicz
 
Testing & Scrum
Testing & ScrumTesting & Scrum
Testing & Scrum
Johan Hoberg
 
Agile Testing: The Role Of The Agile Tester
Agile Testing: The Role Of The Agile TesterAgile Testing: The Role Of The Agile Tester
Agile Testing: The Role Of The Agile Tester
Declan Whelan
 
Remote research uxpa talk
Remote research uxpa talkRemote research uxpa talk
Remote research uxpa talk
Gavin Fung
 
Implementing Agile : Do's and Don'ts
Implementing Agile : Do's and Don'tsImplementing Agile : Do's and Don'ts
Implementing Agile : Do's and Don'ts
Anay Kamat
 
Test planning and software's engineering
Test planning and software's engineeringTest planning and software's engineering
Test planning and software's engineering
MansiganeshJawale
 
Get the most out of getting out of the building
Get the most out of getting out of the buildingGet the most out of getting out of the building
Get the most out of getting out of the building
Todd Warren
 
Driving application development through behavior driven development
Driving application development through behavior driven developmentDriving application development through behavior driven development
Driving application development through behavior driven development
Einar Ingebrigtsen
 
Bosnia Agile Tuzla Meetup
Bosnia Agile Tuzla MeetupBosnia Agile Tuzla Meetup
Bosnia Agile Tuzla Meetup
Bosnia Agile
 
Agile Testing - What, why and how.
Agile Testing - What, why and how.Agile Testing - What, why and how.
Agile Testing - What, why and how.
Asim Kazmi
 
TMPA-2015: Towards a Usable Defect Prediction Tool: Crossbreeding Machine Lea...
TMPA-2015: Towards a Usable Defect Prediction Tool: Crossbreeding Machine Lea...TMPA-2015: Towards a Usable Defect Prediction Tool: Crossbreeding Machine Lea...
TMPA-2015: Towards a Usable Defect Prediction Tool: Crossbreeding Machine Lea...
Iosif Itkin
 
productionising-recommenders
productionising-recommendersproductionising-recommenders
productionising-recommenders
Ludovik Coba
 
Agile Software Development and Test Driven Development: Agil8's Dave Putman 3...
Agile Software Development and Test Driven Development: Agil8's Dave Putman 3...Agile Software Development and Test Driven Development: Agil8's Dave Putman 3...
Agile Software Development and Test Driven Development: Agil8's Dave Putman 3...
agil8 Ltd
 
Agile Methodology
Agile MethodologyAgile Methodology
Agile Methodology
Md. Shafiuzzaman Hira
 
Usability testing
Usability testingUsability testing
Usability testing
pfasolo
 
The Importance of Culture: Building and Sustaining Effective Engineering Org...
The Importance of Culture:  Building and Sustaining Effective Engineering Org...The Importance of Culture:  Building and Sustaining Effective Engineering Org...
The Importance of Culture: Building and Sustaining Effective Engineering Org...
Randy Shoup
 
My Experiments In Agile Testing in Yahoo.pptx
My Experiments In Agile Testing in Yahoo.pptxMy Experiments In Agile Testing in Yahoo.pptx
My Experiments In Agile Testing in Yahoo.pptx
Baiju Joseph
 

Similar to Sentiment Analysis Project Report.pptx (20)

Talk at Axiom Summit 2022: Exploratory Testing Driven by Mind Maps
Talk at Axiom Summit 2022: Exploratory Testing Driven by Mind MapsTalk at Axiom Summit 2022: Exploratory Testing Driven by Mind Maps
Talk at Axiom Summit 2022: Exploratory Testing Driven by Mind Maps
 
Ch11lect1 ud
Ch11lect1 udCh11lect1 ud
Ch11lect1 ud
 
Carl shaulis agile_td2014
Carl shaulis agile_td2014Carl shaulis agile_td2014
Carl shaulis agile_td2014
 
Introduction to Test Driven Development
Introduction to Test Driven DevelopmentIntroduction to Test Driven Development
Introduction to Test Driven Development
 
Testing & Scrum
Testing & ScrumTesting & Scrum
Testing & Scrum
 
Agile Testing: The Role Of The Agile Tester
Agile Testing: The Role Of The Agile TesterAgile Testing: The Role Of The Agile Tester
Agile Testing: The Role Of The Agile Tester
 
Remote research uxpa talk
Remote research uxpa talkRemote research uxpa talk
Remote research uxpa talk
 
Implementing Agile : Do's and Don'ts
Implementing Agile : Do's and Don'tsImplementing Agile : Do's and Don'ts
Implementing Agile : Do's and Don'ts
 
Test planning and software's engineering
Test planning and software's engineeringTest planning and software's engineering
Test planning and software's engineering
 
Get the most out of getting out of the building
Get the most out of getting out of the buildingGet the most out of getting out of the building
Get the most out of getting out of the building
 
Driving application development through behavior driven development
Driving application development through behavior driven developmentDriving application development through behavior driven development
Driving application development through behavior driven development
 
Bosnia Agile Tuzla Meetup
Bosnia Agile Tuzla MeetupBosnia Agile Tuzla Meetup
Bosnia Agile Tuzla Meetup
 
Agile Testing - What, why and how.
Agile Testing - What, why and how.Agile Testing - What, why and how.
Agile Testing - What, why and how.
 
TMPA-2015: Towards a Usable Defect Prediction Tool: Crossbreeding Machine Lea...
TMPA-2015: Towards a Usable Defect Prediction Tool: Crossbreeding Machine Lea...TMPA-2015: Towards a Usable Defect Prediction Tool: Crossbreeding Machine Lea...
TMPA-2015: Towards a Usable Defect Prediction Tool: Crossbreeding Machine Lea...
 
productionising-recommenders
productionising-recommendersproductionising-recommenders
productionising-recommenders
 
Agile Software Development and Test Driven Development: Agil8's Dave Putman 3...
Agile Software Development and Test Driven Development: Agil8's Dave Putman 3...Agile Software Development and Test Driven Development: Agil8's Dave Putman 3...
Agile Software Development and Test Driven Development: Agil8's Dave Putman 3...
 
Agile Methodology
Agile MethodologyAgile Methodology
Agile Methodology
 
Usability testing
Usability testingUsability testing
Usability testing
 
The Importance of Culture: Building and Sustaining Effective Engineering Org...
The Importance of Culture:  Building and Sustaining Effective Engineering Org...The Importance of Culture:  Building and Sustaining Effective Engineering Org...
The Importance of Culture: Building and Sustaining Effective Engineering Org...
 
My Experiments In Agile Testing in Yahoo.pptx
My Experiments In Agile Testing in Yahoo.pptxMy Experiments In Agile Testing in Yahoo.pptx
My Experiments In Agile Testing in Yahoo.pptx
 

Recently uploaded

The History of Stoke Newington Street Names
The History of Stoke Newington Street NamesThe History of Stoke Newington Street Names
The History of Stoke Newington Street Names
History of Stoke Newington
 
clinical examination of hip joint (1).pdf
clinical examination of hip joint (1).pdfclinical examination of hip joint (1).pdf
clinical examination of hip joint (1).pdf
Priyankaranawat4
 
World environment day ppt For 5 June 2024
World environment day ppt For 5 June 2024World environment day ppt For 5 June 2024
World environment day ppt For 5 June 2024
ak6969907
 
Top five deadliest dog breeds in America
Top five deadliest dog breeds in AmericaTop five deadliest dog breeds in America
Top five deadliest dog breeds in America
Bisnar Chase Personal Injury Attorneys
 
Chapter 4 - Islamic Financial Institutions in Malaysia.pptx
Chapter 4 - Islamic Financial Institutions in Malaysia.pptxChapter 4 - Islamic Financial Institutions in Malaysia.pptx
Chapter 4 - Islamic Financial Institutions in Malaysia.pptx
Mohd Adib Abd Muin, Senior Lecturer at Universiti Utara Malaysia
 
DRUGS AND ITS classification slide share
DRUGS AND ITS classification slide shareDRUGS AND ITS classification slide share
DRUGS AND ITS classification slide share
taiba qazi
 
The simplified electron and muon model, Oscillating Spacetime: The Foundation...
The simplified electron and muon model, Oscillating Spacetime: The Foundation...The simplified electron and muon model, Oscillating Spacetime: The Foundation...
The simplified electron and muon model, Oscillating Spacetime: The Foundation...
RitikBhardwaj56
 
Main Java[All of the Base Concepts}.docx
Main Java[All of the Base Concepts}.docxMain Java[All of the Base Concepts}.docx
Main Java[All of the Base Concepts}.docx
adhitya5119
 
S1-Introduction-Biopesticides in ICM.pptx
S1-Introduction-Biopesticides in ICM.pptxS1-Introduction-Biopesticides in ICM.pptx
S1-Introduction-Biopesticides in ICM.pptx
tarandeep35
 
MARY JANE WILSON, A “BOA MÃE” .
MARY JANE WILSON, A “BOA MÃE”           .MARY JANE WILSON, A “BOA MÃE”           .
MARY JANE WILSON, A “BOA MÃE” .
Colégio Santa Teresinha
 
Introduction to AI for Nonprofits with Tapp Network
Introduction to AI for Nonprofits with Tapp NetworkIntroduction to AI for Nonprofits with Tapp Network
Introduction to AI for Nonprofits with Tapp Network
TechSoup
 
CACJapan - GROUP Presentation 1- Wk 4.pdf
CACJapan - GROUP Presentation 1- Wk 4.pdfCACJapan - GROUP Presentation 1- Wk 4.pdf
CACJapan - GROUP Presentation 1- Wk 4.pdf
camakaiclarkmusic
 
Lapbook sobre os Regimes Totalitários.pdf
Lapbook sobre os Regimes Totalitários.pdfLapbook sobre os Regimes Totalitários.pdf
Lapbook sobre os Regimes Totalitários.pdf
Jean Carlos Nunes Paixão
 
Liberal Approach to the Study of Indian Politics.pdf
Liberal Approach to the Study of Indian Politics.pdfLiberal Approach to the Study of Indian Politics.pdf
Liberal Approach to the Study of Indian Politics.pdf
WaniBasim
 
Pride Month Slides 2024 David Douglas School District
Pride Month Slides 2024 David Douglas School DistrictPride Month Slides 2024 David Douglas School District
Pride Month Slides 2024 David Douglas School District
David Douglas School District
 
Advanced Java[Extra Concepts, Not Difficult].docx
Advanced Java[Extra Concepts, Not Difficult].docxAdvanced Java[Extra Concepts, Not Difficult].docx
Advanced Java[Extra Concepts, Not Difficult].docx
adhitya5119
 
RPMS TEMPLATE FOR SCHOOL YEAR 2023-2024 FOR TEACHER 1 TO TEACHER 3
RPMS TEMPLATE FOR SCHOOL YEAR 2023-2024 FOR TEACHER 1 TO TEACHER 3RPMS TEMPLATE FOR SCHOOL YEAR 2023-2024 FOR TEACHER 1 TO TEACHER 3
RPMS TEMPLATE FOR SCHOOL YEAR 2023-2024 FOR TEACHER 1 TO TEACHER 3
IreneSebastianRueco1
 
ANATOMY AND BIOMECHANICS OF HIP JOINT.pdf
ANATOMY AND BIOMECHANICS OF HIP JOINT.pdfANATOMY AND BIOMECHANICS OF HIP JOINT.pdf
ANATOMY AND BIOMECHANICS OF HIP JOINT.pdf
Priyankaranawat4
 
The basics of sentences session 6pptx.pptx
The basics of sentences session 6pptx.pptxThe basics of sentences session 6pptx.pptx
The basics of sentences session 6pptx.pptx
heathfieldcps1
 
How to Fix the Import Error in the Odoo 17
How to Fix the Import Error in the Odoo 17How to Fix the Import Error in the Odoo 17
How to Fix the Import Error in the Odoo 17
Celine George
 

Recently uploaded (20)

The History of Stoke Newington Street Names
The History of Stoke Newington Street NamesThe History of Stoke Newington Street Names
The History of Stoke Newington Street Names
 
clinical examination of hip joint (1).pdf
clinical examination of hip joint (1).pdfclinical examination of hip joint (1).pdf
clinical examination of hip joint (1).pdf
 
World environment day ppt For 5 June 2024
World environment day ppt For 5 June 2024World environment day ppt For 5 June 2024
World environment day ppt For 5 June 2024
 
Top five deadliest dog breeds in America
Top five deadliest dog breeds in AmericaTop five deadliest dog breeds in America
Top five deadliest dog breeds in America
 
Chapter 4 - Islamic Financial Institutions in Malaysia.pptx
Chapter 4 - Islamic Financial Institutions in Malaysia.pptxChapter 4 - Islamic Financial Institutions in Malaysia.pptx
Chapter 4 - Islamic Financial Institutions in Malaysia.pptx
 
DRUGS AND ITS classification slide share
DRUGS AND ITS classification slide shareDRUGS AND ITS classification slide share
DRUGS AND ITS classification slide share
 
The simplified electron and muon model, Oscillating Spacetime: The Foundation...
The simplified electron and muon model, Oscillating Spacetime: The Foundation...The simplified electron and muon model, Oscillating Spacetime: The Foundation...
The simplified electron and muon model, Oscillating Spacetime: The Foundation...
 
Main Java[All of the Base Concepts}.docx
Main Java[All of the Base Concepts}.docxMain Java[All of the Base Concepts}.docx
Main Java[All of the Base Concepts}.docx
 
S1-Introduction-Biopesticides in ICM.pptx
S1-Introduction-Biopesticides in ICM.pptxS1-Introduction-Biopesticides in ICM.pptx
S1-Introduction-Biopesticides in ICM.pptx
 
MARY JANE WILSON, A “BOA MÃE” .
MARY JANE WILSON, A “BOA MÃE”           .MARY JANE WILSON, A “BOA MÃE”           .
MARY JANE WILSON, A “BOA MÃE” .
 
Introduction to AI for Nonprofits with Tapp Network
Introduction to AI for Nonprofits with Tapp NetworkIntroduction to AI for Nonprofits with Tapp Network
Introduction to AI for Nonprofits with Tapp Network
 
CACJapan - GROUP Presentation 1- Wk 4.pdf
CACJapan - GROUP Presentation 1- Wk 4.pdfCACJapan - GROUP Presentation 1- Wk 4.pdf
CACJapan - GROUP Presentation 1- Wk 4.pdf
 
Lapbook sobre os Regimes Totalitários.pdf
Lapbook sobre os Regimes Totalitários.pdfLapbook sobre os Regimes Totalitários.pdf
Lapbook sobre os Regimes Totalitários.pdf
 
Liberal Approach to the Study of Indian Politics.pdf
Liberal Approach to the Study of Indian Politics.pdfLiberal Approach to the Study of Indian Politics.pdf
Liberal Approach to the Study of Indian Politics.pdf
 
Pride Month Slides 2024 David Douglas School District
Pride Month Slides 2024 David Douglas School DistrictPride Month Slides 2024 David Douglas School District
Pride Month Slides 2024 David Douglas School District
 
Advanced Java[Extra Concepts, Not Difficult].docx
Advanced Java[Extra Concepts, Not Difficult].docxAdvanced Java[Extra Concepts, Not Difficult].docx
Advanced Java[Extra Concepts, Not Difficult].docx
 
RPMS TEMPLATE FOR SCHOOL YEAR 2023-2024 FOR TEACHER 1 TO TEACHER 3
RPMS TEMPLATE FOR SCHOOL YEAR 2023-2024 FOR TEACHER 1 TO TEACHER 3RPMS TEMPLATE FOR SCHOOL YEAR 2023-2024 FOR TEACHER 1 TO TEACHER 3
RPMS TEMPLATE FOR SCHOOL YEAR 2023-2024 FOR TEACHER 1 TO TEACHER 3
 
ANATOMY AND BIOMECHANICS OF HIP JOINT.pdf
ANATOMY AND BIOMECHANICS OF HIP JOINT.pdfANATOMY AND BIOMECHANICS OF HIP JOINT.pdf
ANATOMY AND BIOMECHANICS OF HIP JOINT.pdf
 
The basics of sentences session 6pptx.pptx
The basics of sentences session 6pptx.pptxThe basics of sentences session 6pptx.pptx
The basics of sentences session 6pptx.pptx
 
How to Fix the Import Error in the Odoo 17
How to Fix the Import Error in the Odoo 17How to Fix the Import Error in the Odoo 17
How to Fix the Import Error in the Odoo 17
 

Sentiment Analysis Project Report.pptx

  • 2. Rationale behind the project • Using computers to perceive human emotions • Understand how words can reveal sentiments • Discover interesting books from reviews • Gain deeper understanding of GRU
  • 3. Ethical Considerations made during the implementation • Eliminating user identifiable attributes • Minimizing bias • Dealing with profane words • Labels of the sentiments
  • 4. Accuracy of the Output • Training set (0.9944) versus validation set accuracy (0.6275) • Model able to generally classify review’s sentiments • Few exceptions of wrong classification • Constructive criticism labelled negative • Sarcastic remarks labelled positive
  • 5. Implementation Details • Used Google Colab • Dataset loaded using wget • XML file parsed using beautiful soup library & regular expressions • Negative and positive book reviews selected to train model
  • 6. Implementation Details • Truncating sentences • Lemmatization • Removing stop words • Lowercasing • Converting reviews to integer sequences
  • 7. Implementation • Training validation split • Using GRUs • 5 epochs • Testing the model by inputting own reviews

Editor's Notes

  1. I preferred NLP project that involved training a computer to predict the sentiments of people. How computers can perceive human emotions has always been a subject of fascination to me. I also have deep interests in linguistics and wanted to explore how words reveal the emotions of people. I was also hoping to find some interesting books to read by going over the reviews of different customers. Most importantly, I wanted to understand how I could Deep Neural Networks such as Gated Recurrent Units work to process complex information and train models that can make accurate and reliable predictions.
  2. The dataset contained a lot of user identifiable information such as their name, location and email address. I had to eliminate all those attributes and remain with the review texts only and their corresponding sentimental labels. I also eliminated the other attributes to minimize bias by avoiding stereotyping users according to reviews provided by the dataset. Bias could take place, for example making the model learn that women might have more negative sentiments compared to men. Another issue was dealing with profane words that might have been used in the reviews to the data. Additionally, the label of the sentiments seemed extreme to me. I though happy or sad could have been better than negative or positive.
  3. The model had a very high accuracy score of 99.44% on the training set and lower accuracy of 62.75% of the validation set. This implied that there was a problem of overfitting. However, the model was able to generally classify the emotions present in the reviews as negative or positive accurately. However, there were some few exceptions of wrong classifications. For instance, constructive criticism towards a book was labelled negative while sarcastic remarks were labelled positive. In my opinion, some of the classifications of the review
  4. The implementation was performed using Google Colab since training the model required high computation resources such as GPUs and TPUs. The dataset was loaded to Google Colab using the wget bash command. Since the original data was an XML file, it was parsed using beautiful soup library and regular expresso. The negative and positive book reviews were selected to be used to train the model/
  5. The data preprocessing stage involved truncating the reviews of customers to remain with the first 150 characters. Truncating could make training the model to take less time and be more effective. It was presumed that a customer’s sentiments towards a product can be detected while using the last few sentences. Lemmatization involved converting the words into their base form to derive meaning. Removing stop words such as and, the, I, and we was necessary to reduce the number of vocabularies that did not carry much meaning. Lowercasing enabled the text to remain uniform. Finally, the vocabularies of the reviews were transformed into integer indexes enabling conversion of the reviews to integer sequences. This step was necessary to make it easier for the machine learning model to learn from the data.
  6. Before training the model, the data was split into training and validation sets in the ratio of 80:20. This was necessary to track the performance of the model during training. Gated Recurrent Units were selected because they are one of the most advanced forms of Recurrent Neural Networks which determine which information is to be forgotten and which is to carried on to the next stage of processing. The model was trained using 5 epochs. The small number of epochs was selected to prevent the model from overfitting. The model was tested using user input where users could write their own reviews and the model would predict whether it was positive or negative.