SlideShare a Scribd company logo
1 of 13
Bug Report, Feature Request, or Simply
Praise? On Automatically Classifying App
Reviews
Authors: Walid Maalej and Hader Nabil
IEEE RE - 2015
Presented by:
Oğuzhan Çalıkkasap
App Market
• Apple App Store
• Google Play Store
https://www.statista.com/statistics/276623/number-of-apps-available-in-leading-app-stores/
2
Motivation
• Better reviews, higher sales
• Rich source
Information about bugs
Ideas for new features
• Benefitted by
App vendors
Developers
3
Purpose
• Classify reviews into four categories
Bug reports
Feature requests
User experiences
Ratings
4
Paper Outline
• Introduce probabilistic techniques for review classification
• Compare accuracy of the review classification techniques
• Derive insights into review analytics tool design
5
Review Classification Techniques
1. String matching
2. Bag of Words
3. Text Preprocessing
4. Review Metadata
5. Sentiment Analysis
6. Binary and Multiclass Classifiers
6
Research Method and Data
• Collect reviews from app store, extract their metadata
• Manually label reviews and create a truth set
• Implement each classifier at different parts of truth set
• Evaluate classifiers’ accuracies
• Data: Review text, title, app name, category, store, submission
date, star rating
7
Evaluation Data 8
Evaluation Metrics
• Precision: Fraction of reviews that are classified correctly for type i
• Recall: Fraction of reviews of type i which are classified correctly
• F1: Harmonic mean
9
𝑇𝑃𝑖
𝑇𝑃𝑖 + 𝐹𝑃𝑖
𝑇𝑃𝑖
𝑇𝑃𝑖 + 𝐹𝑁𝑖
Research Results - Features 10
Research Results - Classifiers 11
Study Outcomes
• Probabilistic approaches always outperformed the basic classifiers
• Combination of text classifiers, metadata, NLP and sentiments usually
resulted in high precision – above about 70%
• Does not mean combination of techniques always rank best
• No clear trend like ‘more NLP leads to a better result’
• Binary classifiers more accurate than multiclass classifiers
• No single classifier works best for all review types and data sources
• Naive Bayes seems to be a more appropriate classifier for review
classification
12
Thank you for listening 13
https://mast.informatik.uni-hamburg.de/wp-content/uploads/2015/06/review_classification_preprint.pdf
Original paper:

More Related Content

What's hot (7)

Genevieve De La Cruz .Net Portfolio
Genevieve De La Cruz .Net PortfolioGenevieve De La Cruz .Net Portfolio
Genevieve De La Cruz .Net Portfolio
 
Final Presentation
Final PresentationFinal Presentation
Final Presentation
 
_Search? en toute simplicité : Elastic + App Search
_Search? en toute simplicité : Elastic + App Search_Search? en toute simplicité : Elastic + App Search
_Search? en toute simplicité : Elastic + App Search
 
Birds-Other
Birds-OtherBirds-Other
Birds-Other
 
Mobipedia presentation
Mobipedia presentationMobipedia presentation
Mobipedia presentation
 
Psm digital-olympus-slides-09.19.18
Psm digital-olympus-slides-09.19.18Psm digital-olympus-slides-09.19.18
Psm digital-olympus-slides-09.19.18
 
Webinar: Ecommerce, Rules, and Relevance
Webinar: Ecommerce, Rules, and RelevanceWebinar: Ecommerce, Rules, and Relevance
Webinar: Ecommerce, Rules, and Relevance
 

Similar to Oguzhan nlp presentation

Using microsoft application insights to implement a build, measure, learn loop
Using microsoft application insights to implement a build, measure, learn loopUsing microsoft application insights to implement a build, measure, learn loop
Using microsoft application insights to implement a build, measure, learn loop
Marcel de Vries
 
Mobile App Analytics
Mobile App AnalyticsMobile App Analytics
Mobile App Analytics
pragativbora
 
Option 2015- Getting Started with Optimizely for Mobile
Option 2015- Getting Started with Optimizely for MobileOption 2015- Getting Started with Optimizely for Mobile
Option 2015- Getting Started with Optimizely for Mobile
Optimizely
 
Taking the Pain out of Data Science - RecSys Machine Learning Framework Over ...
Taking the Pain out of Data Science - RecSys Machine Learning Framework Over ...Taking the Pain out of Data Science - RecSys Machine Learning Framework Over ...
Taking the Pain out of Data Science - RecSys Machine Learning Framework Over ...
Sonya Liberman
 

Similar to Oguzhan nlp presentation (20)

Webinar: Increase Conversion With Better Search
Webinar: Increase Conversion With Better SearchWebinar: Increase Conversion With Better Search
Webinar: Increase Conversion With Better Search
 
App testing and publishing
App testing and publishingApp testing and publishing
App testing and publishing
 
How To Implement Your Online Search Quality Evaluation With Kibana
How To Implement Your Online Search Quality Evaluation With KibanaHow To Implement Your Online Search Quality Evaluation With Kibana
How To Implement Your Online Search Quality Evaluation With Kibana
 
Using microsoft application insights to implement a build, measure, learn loop
Using microsoft application insights to implement a build, measure, learn loopUsing microsoft application insights to implement a build, measure, learn loop
Using microsoft application insights to implement a build, measure, learn loop
 
UCL M.Sc. Technology Entrepreneurship 2015 - Launching Digital Products
UCL M.Sc. Technology Entrepreneurship 2015 -  Launching Digital ProductsUCL M.Sc. Technology Entrepreneurship 2015 -  Launching Digital Products
UCL M.Sc. Technology Entrepreneurship 2015 - Launching Digital Products
 
Optimizing Dev Portals with Analytics and Feedback
Optimizing Dev Portals with Analytics and FeedbackOptimizing Dev Portals with Analytics and Feedback
Optimizing Dev Portals with Analytics and Feedback
 
FineAI Recommendation Engine
FineAI Recommendation EngineFineAI Recommendation Engine
FineAI Recommendation Engine
 
ppt.pptx
ppt.pptxppt.pptx
ppt.pptx
 
Mobile App Analytics
Mobile App AnalyticsMobile App Analytics
Mobile App Analytics
 
Measuring the Right App Metrics - Guide for Beginners
Measuring the Right App Metrics - Guide for BeginnersMeasuring the Right App Metrics - Guide for Beginners
Measuring the Right App Metrics - Guide for Beginners
 
Smartone v1.0
Smartone v1.0Smartone v1.0
Smartone v1.0
 
Modern Perspectives on Recommender Systems and their Applications in Mendeley
Modern Perspectives on Recommender Systems and their Applications in MendeleyModern Perspectives on Recommender Systems and their Applications in Mendeley
Modern Perspectives on Recommender Systems and their Applications in Mendeley
 
Maruti gollapudi cv
Maruti gollapudi cvMaruti gollapudi cv
Maruti gollapudi cv
 
Modern Perspectives on Recommender Systems and their Applications in Mendeley
Modern Perspectives on Recommender Systems and their Applications in MendeleyModern Perspectives on Recommender Systems and their Applications in Mendeley
Modern Perspectives on Recommender Systems and their Applications in Mendeley
 
Option 2015- Getting Started with Optimizely for Mobile
Option 2015- Getting Started with Optimizely for MobileOption 2015- Getting Started with Optimizely for Mobile
Option 2015- Getting Started with Optimizely for Mobile
 
Cultural change of testing
Cultural change of testingCultural change of testing
Cultural change of testing
 
Google Tag Manager and Google Analytics
Google Tag Manager and Google AnalyticsGoogle Tag Manager and Google Analytics
Google Tag Manager and Google Analytics
 
Taking the Pain out of Data Science - RecSys Machine Learning Framework Over ...
Taking the Pain out of Data Science - RecSys Machine Learning Framework Over ...Taking the Pain out of Data Science - RecSys Machine Learning Framework Over ...
Taking the Pain out of Data Science - RecSys Machine Learning Framework Over ...
 
Mobile Healthcare Apps: 7 things to remember to get your app noticed
Mobile Healthcare Apps: 7 things to remember to get your app noticedMobile Healthcare Apps: 7 things to remember to get your app noticed
Mobile Healthcare Apps: 7 things to remember to get your app noticed
 
User-Driven Development: The Art of Creating a Software Product - Velvetech
User-Driven Development: The Art of Creating a Software Product - VelvetechUser-Driven Development: The Art of Creating a Software Product - Velvetech
User-Driven Development: The Art of Creating a Software Product - Velvetech
 

Recently uploaded

Orientation Canvas Course Presentation.pdf
Orientation Canvas Course Presentation.pdfOrientation Canvas Course Presentation.pdf
Orientation Canvas Course Presentation.pdf
Elizabeth Walsh
 
Personalisation of Education by AI and Big Data - Lourdes Guàrdia
Personalisation of Education by AI and Big Data - Lourdes GuàrdiaPersonalisation of Education by AI and Big Data - Lourdes Guàrdia
Personalisation of Education by AI and Big Data - Lourdes Guàrdia
EADTU
 
Contoh Aksi Nyata Refleksi Diri ( NUR ).pdf
Contoh Aksi Nyata Refleksi Diri ( NUR ).pdfContoh Aksi Nyata Refleksi Diri ( NUR ).pdf
Contoh Aksi Nyata Refleksi Diri ( NUR ).pdf
cupulin
 
Spellings Wk 4 and Wk 5 for Grade 4 at CAPS
Spellings Wk 4 and Wk 5 for Grade 4 at CAPSSpellings Wk 4 and Wk 5 for Grade 4 at CAPS
Spellings Wk 4 and Wk 5 for Grade 4 at CAPS
AnaAcapella
 

Recently uploaded (20)

HMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptx
HMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptxHMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptx
HMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptx
 
How to Manage Call for Tendor in Odoo 17
How to Manage Call for Tendor in Odoo 17How to Manage Call for Tendor in Odoo 17
How to Manage Call for Tendor in Odoo 17
 
COMMUNICATING NEGATIVE NEWS - APPROACHES .pptx
COMMUNICATING NEGATIVE NEWS - APPROACHES .pptxCOMMUNICATING NEGATIVE NEWS - APPROACHES .pptx
COMMUNICATING NEGATIVE NEWS - APPROACHES .pptx
 
Model Attribute _rec_name in the Odoo 17
Model Attribute _rec_name in the Odoo 17Model Attribute _rec_name in the Odoo 17
Model Attribute _rec_name in the Odoo 17
 
OS-operating systems- ch05 (CPU Scheduling) ...
OS-operating systems- ch05 (CPU Scheduling) ...OS-operating systems- ch05 (CPU Scheduling) ...
OS-operating systems- ch05 (CPU Scheduling) ...
 
80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...
80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...
80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...
 
UGC NET Paper 1 Unit 7 DATA INTERPRETATION.pdf
UGC NET Paper 1 Unit 7 DATA INTERPRETATION.pdfUGC NET Paper 1 Unit 7 DATA INTERPRETATION.pdf
UGC NET Paper 1 Unit 7 DATA INTERPRETATION.pdf
 
How to Manage Website in Odoo 17 Studio App.pptx
How to Manage Website in Odoo 17 Studio App.pptxHow to Manage Website in Odoo 17 Studio App.pptx
How to Manage Website in Odoo 17 Studio App.pptx
 
What is 3 Way Matching Process in Odoo 17.pptx
What is 3 Way Matching Process in Odoo 17.pptxWhat is 3 Way Matching Process in Odoo 17.pptx
What is 3 Way Matching Process in Odoo 17.pptx
 
Orientation Canvas Course Presentation.pdf
Orientation Canvas Course Presentation.pdfOrientation Canvas Course Presentation.pdf
Orientation Canvas Course Presentation.pdf
 
Personalisation of Education by AI and Big Data - Lourdes Guàrdia
Personalisation of Education by AI and Big Data - Lourdes GuàrdiaPersonalisation of Education by AI and Big Data - Lourdes Guàrdia
Personalisation of Education by AI and Big Data - Lourdes Guàrdia
 
Including Mental Health Support in Project Delivery, 14 May.pdf
Including Mental Health Support in Project Delivery, 14 May.pdfIncluding Mental Health Support in Project Delivery, 14 May.pdf
Including Mental Health Support in Project Delivery, 14 May.pdf
 
Contoh Aksi Nyata Refleksi Diri ( NUR ).pdf
Contoh Aksi Nyata Refleksi Diri ( NUR ).pdfContoh Aksi Nyata Refleksi Diri ( NUR ).pdf
Contoh Aksi Nyata Refleksi Diri ( NUR ).pdf
 
How to Send Pro Forma Invoice to Your Customers in Odoo 17
How to Send Pro Forma Invoice to Your Customers in Odoo 17How to Send Pro Forma Invoice to Your Customers in Odoo 17
How to Send Pro Forma Invoice to Your Customers in Odoo 17
 
Graduate Outcomes Presentation Slides - English
Graduate Outcomes Presentation Slides - EnglishGraduate Outcomes Presentation Slides - English
Graduate Outcomes Presentation Slides - English
 
Graduate Outcomes Presentation Slides - English (v3).pptx
Graduate Outcomes Presentation Slides - English (v3).pptxGraduate Outcomes Presentation Slides - English (v3).pptx
Graduate Outcomes Presentation Slides - English (v3).pptx
 
FSB Advising Checklist - Orientation 2024
FSB Advising Checklist - Orientation 2024FSB Advising Checklist - Orientation 2024
FSB Advising Checklist - Orientation 2024
 
FICTIONAL SALESMAN/SALESMAN SNSW 2024.pdf
FICTIONAL SALESMAN/SALESMAN SNSW 2024.pdfFICTIONAL SALESMAN/SALESMAN SNSW 2024.pdf
FICTIONAL SALESMAN/SALESMAN SNSW 2024.pdf
 
Spellings Wk 4 and Wk 5 for Grade 4 at CAPS
Spellings Wk 4 and Wk 5 for Grade 4 at CAPSSpellings Wk 4 and Wk 5 for Grade 4 at CAPS
Spellings Wk 4 and Wk 5 for Grade 4 at CAPS
 
UChicago CMSC 23320 - The Best Commit Messages of 2024
UChicago CMSC 23320 - The Best Commit Messages of 2024UChicago CMSC 23320 - The Best Commit Messages of 2024
UChicago CMSC 23320 - The Best Commit Messages of 2024
 

Oguzhan nlp presentation

  • 1. Bug Report, Feature Request, or Simply Praise? On Automatically Classifying App Reviews Authors: Walid Maalej and Hader Nabil IEEE RE - 2015 Presented by: Oğuzhan Çalıkkasap
  • 2. App Market • Apple App Store • Google Play Store https://www.statista.com/statistics/276623/number-of-apps-available-in-leading-app-stores/ 2
  • 3. Motivation • Better reviews, higher sales • Rich source Information about bugs Ideas for new features • Benefitted by App vendors Developers 3
  • 4. Purpose • Classify reviews into four categories Bug reports Feature requests User experiences Ratings 4
  • 5. Paper Outline • Introduce probabilistic techniques for review classification • Compare accuracy of the review classification techniques • Derive insights into review analytics tool design 5
  • 6. Review Classification Techniques 1. String matching 2. Bag of Words 3. Text Preprocessing 4. Review Metadata 5. Sentiment Analysis 6. Binary and Multiclass Classifiers 6
  • 7. Research Method and Data • Collect reviews from app store, extract their metadata • Manually label reviews and create a truth set • Implement each classifier at different parts of truth set • Evaluate classifiers’ accuracies • Data: Review text, title, app name, category, store, submission date, star rating 7
  • 9. Evaluation Metrics • Precision: Fraction of reviews that are classified correctly for type i • Recall: Fraction of reviews of type i which are classified correctly • F1: Harmonic mean 9 𝑇𝑃𝑖 𝑇𝑃𝑖 + 𝐹𝑃𝑖 𝑇𝑃𝑖 𝑇𝑃𝑖 + 𝐹𝑁𝑖
  • 10. Research Results - Features 10
  • 11. Research Results - Classifiers 11
  • 12. Study Outcomes • Probabilistic approaches always outperformed the basic classifiers • Combination of text classifiers, metadata, NLP and sentiments usually resulted in high precision – above about 70% • Does not mean combination of techniques always rank best • No clear trend like ‘more NLP leads to a better result’ • Binary classifiers more accurate than multiclass classifiers • No single classifier works best for all review types and data sources • Naive Bayes seems to be a more appropriate classifier for review classification 12
  • 13. Thank you for listening 13 https://mast.informatik.uni-hamburg.de/wp-content/uploads/2015/06/review_classification_preprint.pdf Original paper: