SlideShare a Scribd company logo
“Estimating the Helpfulness and Economic Impact of Product Reviews: Mining Text and Reviewer Characteristics”  Authors: AnindyaGhose, Panagiotis G. Ipeirotis, Member, IEEE Course: Topics in Data mining Presenter: NobalNiraula December 8, 2010 @ UOM 1
Introduction Gathering variables (Attributes) Explanatory study using Econometric Regression Hypothesis for sales Hypothesis for perceived usefulness Prediction Helpfulness Impact on sales Conclusion Outline 2
Product related word-of-mouth conversations in online markets Reviewers contribute time and energy Volume of review could be high Benefits Customers: Usefulness / Helpfulness Average Star Rating Bimodel Peer Review  Biased Helpfulness = helpful votes / total votes “Spotlight Review” in Amazon.com Manufacturers: Influence on Sales Helpful reviews are not necessarily the ones that lead to increases in sales ! Reviews that affect most should be presented first to manufacturers Introduction (1) 3
The paper is unique in looking at how subjectivity level, readabilityandspelling errors in the text of reviews affect product sales and the perceived helpfulness of these reviews. 4 Introduction (2)
Two Level Study Explanatory Econometric Analysis Identify aspects of  a review  a reviewer Prediction Model using “Random Forests” How peer consumers are going to rate a review How sales will be affected by the posted review Predicting Helpfulness and Importance 5
Product Reviews 6
Sample Review 7
Reviewer’s Profile 8
Product Rank 9
Variables Collection Products Audio and video players (144 products), Digital cameras (109 products), and DVDs (158 products). Product and Sales Data :Retail Price, Sales Rank, Average Rating, Number of Reviews, Elapsed Date  Reviewer History: Number of Past Reviews,  Reviewer History Micro, Reviewer History Micro,  Past Helpful Votes, Past Total Votes Reviewer Characteristics: Reviewer Rank, Top-10 Reviewer, Top-50 Reviewer, Top-100 Reviewer, Top-500 Reviewer, Real Name, Nick Name, Hobbies, Birthday, Location, Web page, Interests, Snippet, Any Discloser Individual Review:  Moderate Review, Helpful Votes, Total Votes, Helpfulness Review Readability : Length(Chars), Length (Words), Length(Sentence), Spelling Error, ARI, Gunning Index, Coleman–Liau index, Flesch Reading Ease, Flesch–Kincaid Grade Level, SMOG Review Subjectivity: AvgProb, DevProb 10
Readability Analysis Automated Readability Index Coleman-Liau Index Flesch-Kincaid Grade Level Gunning fog index SMOG Subjectivity Analysis Stylistic Choices : “Subjective” vs “Objective” Each document gets a “Subjectivity Score” AvgProb (r) : High value Many Subjective sentences DevProb (r) : High Value Mixed (Subj+Obj) sentences 11 Text of a Review Matters !
Hypothesis 1a:  All else equal, a change in the subjectivity level and mixture of objective and subjective statements in reviews will be associated with a change in sales. Hypothesis 1b:  All else equal, a change in the readability score of reviews will be associated with a change in sales. Hypothesis 1c:  All else equal, a decrease in the proportion of spelling errors in reviews will be positively related to sales. Hypothesis for Sales 12
ln(D) = a + b * ln(S) D is the unobserved product demand S is its observed sales rank Pareto Distribution High sales rank  low demand  Key Observation: “Sales rank” in Amazon.com can be taken as PROXY of Demand ! 13 Effect on Product Sales
14 Descriptive Statistics for Econometric Analysis
15 Model to test Hypothesis1 ,[object Object]
Control Variables: Retail Price, Avg. Numeric Rating, Elapsed Date, Number of Reviews,[object Object]
Hypothesis 1a: High subjective sentences increase sales Mixture of subjective and objective sentences are negatively associated with product sales compared to highly subjective and objective sentences. Hypothesis 1b: Higher readability scores are associated with higher sales Hypothesis 1c An increase in proportion of spelling mistakesdecreases product sales for some “experience products” like DVDs however the proportion of spelling errors doesn’t have significant impact on sales for “search products” Reviews with that rate products negatively can be associated with increased product sales when the review text is informative and detailed !!! 17 Conclusion (1)
Hypothesis 2a:  All else equal, a change in the subjectivity level and mixture of objective and subjective statements in a review will be associated with a change in the perceived helpfulness of that review  Hypothesis 2b:  All else equal, a change in the readability of a review will be associated with a change the perceived helpfulness of that review. Hypothesis 2c:  All else equal, a decrease in the proportion of spelling errors in a review will be positively related to perceived helpfulness of that review. Hypothesis 2d:  All else equal, an increase in the average helpfulness of a reviewer’s historical reviews will be positively related to perceived helpfulness of a review posted by that reviewer. Hypothesis for Helpfulness  18
19 Effect on Helpfulness ,[object Object],[object Object]
Hypothesis 2a: In general, mixture of subjective and objective elements more informative (helpful) by the users. For feature-based goods users prefer reviews having more objective information and less subjective sentences  For experience goods, e.g. DVD, users expect few objective sentences but more subjective sentences Hypothesis 2b – 2d :  Increase in the readability of reviews has a positive and statistically impact on review helpfulness An increase in proportion of spelling errors has a negative and statistically significant impact review helpfulness for audio-video products and DVDs. Past historical information about reviewers has a statistically significant effect on the perceived helpfulness of reviews 21 Conclusion (2)
Main goal Is the review informative or not ? Does the review impact on sales or not ? Question: given a helpfulness value of a review, decide whether it is useful or not Helpfulness = (Helpful votes/ Total votes) Continuous to binary conversion Threshold found is 60 % Classification Regression Model can be used Binary Classification  22 Predictive Modeling
Classifiers SVM VS Random Forest SVM consistently performed worse unlike reported in reports  Training time for SVM was significantly higher than that of Random Forest 23 Predicting Helpfulness (1)
24 Predicting Helpfulness (2)
Examining whether the difference SalesRankt(r)+T − SalesRankt(r) where t(r) is the time the review is posted, is positive or negative. 25 Predicting Impact on Sales
Random Forest based prediction For experience goods such as DVDs classifier has lower performance Observed high correlation of “classification error” with “distribution of review ratings” Reviews that have received widely fluctuating ratings also have reviews with widely fluctuating helpfulness votes. Highly detailed and readable reviews can have low helpfulness votes  “reviewer-related”, “review subjectivity” and “review readability” features sets are interchangeable! 26 Conclusion (3)
Subjectivity level, readabilityandspelling errors in the text of reviews affect product sales and the perceived helpfulness 27 Overall Conclusion
AnindyaGhose, Panagiotis G. Ipeirotis, "Estimating the Helpfulness and Economic Impact of Product Reviews: Mining Text and Reviewer Characteristics," IEEE Transactions on Knowledge and Data Engineering, vol. 99, no. PrePrints, , 2010 28 References
Thank You ! 29
Estimating the Helpfulness and Economic Impact of Product Reviews: Mining Text and Reviewer Characteristics

More Related Content

Viewers also liked

LET Practice test in Professional Education-principles and strategies in teac...
LET Practice test in Professional Education-principles and strategies in teac...LET Practice test in Professional Education-principles and strategies in teac...
LET Practice test in Professional Education-principles and strategies in teac...
University of Santo Tomas
 
Let reviewer prof ed
Let reviewer prof edLet reviewer prof ed
Let reviewer prof ed
Jenny Rose Basa
 
The most incredible shopping malls
The most incredible shopping mallsThe most incredible shopping malls
The most incredible shopping malls
Fernando Perdomo
 
Kms2013 ise1
Kms2013 ise1Kms2013 ise1
Kms2013 ise1
Sivaraman KM
 
Noor ka khilona urdu (نور کا کھلونا)
Noor ka khilona urdu (نور کا کھلونا)Noor ka khilona urdu (نور کا کھلونا)
Noor ka khilona urdu (نور کا کھلونا)
dawateislami
 
Getting Started with Splunk Breakout Session
Getting Started with Splunk Breakout SessionGetting Started with Splunk Breakout Session
Getting Started with Splunk Breakout Session
Splunk
 
Ignition toyota igt
Ignition toyota igtIgnition toyota igt
Ignition toyota igt
Toni Gim
 
XPath
XPathXPath
XPath
Raji Ghawi
 
Android dynamic module
Android dynamic moduleAndroid dynamic module
Android dynamic modulegdgvietnam
 
INGLES A1
INGLES A1INGLES A1
Rescue1.asd
Rescue1.asdRescue1.asd
Rescue1.asd
mahmoud shetewy
 
Published Sox9 Paper!
Published Sox9 Paper!Published Sox9 Paper!
Published Sox9 Paper!
Tim Rutkowski
 
英语魔方秀English mofunshow
英语魔方秀English mofunshow英语魔方秀English mofunshow
英语魔方秀English mofunshow果成 李
 
Kessan 1708682945115077
Kessan 1708682945115077Kessan 1708682945115077
Kessan 1708682945115077
yoshikawa0521
 
Final oral test
Final oral testFinal oral test
Final oral test
Tricia Terzaghi
 
Slide Workshop Bakordik - Penilaian
Slide Workshop Bakordik - PenilaianSlide Workshop Bakordik - Penilaian
Slide Workshop Bakordik - Penilaian
fkkumjkepaniteraan
 
Hap7 17
Hap7 17Hap7 17
Hap7 17
TheSlaps
 
Little italy meat menu dec12
Little italy meat menu dec12Little italy meat menu dec12
Little italy meat menu dec12
weiss2001
 
Happy birthday to Prof. Yunus - you change foundation
Happy birthday to Prof.  Yunus - you change foundationHappy birthday to Prof.  Yunus - you change foundation
Happy birthday to Prof. Yunus - you change foundation
The Grameen Creative Lab
 
Dapan
DapanDapan

Viewers also liked (20)

LET Practice test in Professional Education-principles and strategies in teac...
LET Practice test in Professional Education-principles and strategies in teac...LET Practice test in Professional Education-principles and strategies in teac...
LET Practice test in Professional Education-principles and strategies in teac...
 
Let reviewer prof ed
Let reviewer prof edLet reviewer prof ed
Let reviewer prof ed
 
The most incredible shopping malls
The most incredible shopping mallsThe most incredible shopping malls
The most incredible shopping malls
 
Kms2013 ise1
Kms2013 ise1Kms2013 ise1
Kms2013 ise1
 
Noor ka khilona urdu (نور کا کھلونا)
Noor ka khilona urdu (نور کا کھلونا)Noor ka khilona urdu (نور کا کھلونا)
Noor ka khilona urdu (نور کا کھلونا)
 
Getting Started with Splunk Breakout Session
Getting Started with Splunk Breakout SessionGetting Started with Splunk Breakout Session
Getting Started with Splunk Breakout Session
 
Ignition toyota igt
Ignition toyota igtIgnition toyota igt
Ignition toyota igt
 
XPath
XPathXPath
XPath
 
Android dynamic module
Android dynamic moduleAndroid dynamic module
Android dynamic module
 
INGLES A1
INGLES A1INGLES A1
INGLES A1
 
Rescue1.asd
Rescue1.asdRescue1.asd
Rescue1.asd
 
Published Sox9 Paper!
Published Sox9 Paper!Published Sox9 Paper!
Published Sox9 Paper!
 
英语魔方秀English mofunshow
英语魔方秀English mofunshow英语魔方秀English mofunshow
英语魔方秀English mofunshow
 
Kessan 1708682945115077
Kessan 1708682945115077Kessan 1708682945115077
Kessan 1708682945115077
 
Final oral test
Final oral testFinal oral test
Final oral test
 
Slide Workshop Bakordik - Penilaian
Slide Workshop Bakordik - PenilaianSlide Workshop Bakordik - Penilaian
Slide Workshop Bakordik - Penilaian
 
Hap7 17
Hap7 17Hap7 17
Hap7 17
 
Little italy meat menu dec12
Little italy meat menu dec12Little italy meat menu dec12
Little italy meat menu dec12
 
Happy birthday to Prof. Yunus - you change foundation
Happy birthday to Prof.  Yunus - you change foundationHappy birthday to Prof.  Yunus - you change foundation
Happy birthday to Prof. Yunus - you change foundation
 
Dapan
DapanDapan
Dapan
 

Similar to Estimating the Helpfulness and Economic Impact of Product Reviews: Mining Text and Reviewer Characteristics

JUS 455 Final Project Milestone Two Guidelines and Rubric
 JUS 455 Final Project Milestone Two Guidelines and Rubric   JUS 455 Final Project Milestone Two Guidelines and Rubric
JUS 455 Final Project Milestone Two Guidelines and Rubric
MoseStaton39
 
Market analysis tools in npd (final)
Market analysis tools in npd (final)Market analysis tools in npd (final)
Market analysis tools in npd (final)
Omid Aminzadeh Gohari
 
E_WOM THESIS PRESENTATION
E_WOM THESIS PRESENTATIONE_WOM THESIS PRESENTATION
E_WOM THESIS PRESENTATION
Evangelia Makri
 
APA style 500 – 600 words This week we dove into a bit mor.docx
APA style 500 – 600 words This week we dove into a bit mor.docxAPA style 500 – 600 words This week we dove into a bit mor.docx
APA style 500 – 600 words This week we dove into a bit mor.docx
armitageclaire49
 
International Journal of Engineering Research and Development (IJERD)
International Journal of Engineering Research and Development (IJERD)International Journal of Engineering Research and Development (IJERD)
International Journal of Engineering Research and Development (IJERD)
IJERD Editor
 
International Journal of Engineering Research and Development (IJERD)
International Journal of Engineering Research and Development (IJERD)International Journal of Engineering Research and Development (IJERD)
International Journal of Engineering Research and Development (IJERD)
IJERD Editor
 
Online feedback correlation using clustering
Online feedback correlation using clusteringOnline feedback correlation using clustering
Online feedback correlation using clustering
awesomesos
 
iaetsd Co extracting opinion targets and opinion words from online reviews ba...
iaetsd Co extracting opinion targets and opinion words from online reviews ba...iaetsd Co extracting opinion targets and opinion words from online reviews ba...
iaetsd Co extracting opinion targets and opinion words from online reviews ba...
Iaetsd Iaetsd
 
Chapter 9
Chapter 9Chapter 9
Chapter 9
Cheryl Lawson
 
Summarizing and Enriched Extracting technique using Review Data by Users to t...
Summarizing and Enriched Extracting technique using Review Data by Users to t...Summarizing and Enriched Extracting technique using Review Data by Users to t...
Summarizing and Enriched Extracting technique using Review Data by Users to t...
IRJET Journal
 
IRJET- Implementation of Review Selection using Deep Learning
IRJET-  	  Implementation of Review Selection using Deep LearningIRJET-  	  Implementation of Review Selection using Deep Learning
IRJET- Implementation of Review Selection using Deep Learning
IRJET Journal
 
IRJET- Predicting Review Ratings for Product Marketing
IRJET- Predicting Review Ratings for Product MarketingIRJET- Predicting Review Ratings for Product Marketing
IRJET- Predicting Review Ratings for Product Marketing
IRJET Journal
 
On the validity and impact of metacritic scores adams greenwood-ericksen
On the validity and impact of metacritic scores  adams greenwood-ericksenOn the validity and impact of metacritic scores  adams greenwood-ericksen
On the validity and impact of metacritic scores adams greenwood-ericksen
Mary Chan
 
IRJET- Survey of Classification of Business Reviews using Sentiment Analysis
IRJET- Survey of Classification of Business Reviews using Sentiment AnalysisIRJET- Survey of Classification of Business Reviews using Sentiment Analysis
IRJET- Survey of Classification of Business Reviews using Sentiment Analysis
IRJET Journal
 
Collective Opinion Spam Detection Bridging Review Networks and Metadata
Collective Opinion Spam Detection Bridging Review Networks and MetadataCollective Opinion Spam Detection Bridging Review Networks and Metadata
Collective Opinion Spam Detection Bridging Review Networks and Metadata
Shebuti Rayana
 
Customer_Analysis.docx
Customer_Analysis.docxCustomer_Analysis.docx
Customer_Analysis.docx
KevalKabariya
 
Electronic word-of-mouth, management response and behavioural intentions
Electronic word-of-mouth, management response and behavioural intentionsElectronic word-of-mouth, management response and behavioural intentions
Electronic word-of-mouth, management response and behavioural intentions
obinnachristianojiak
 
1. Click here to retrieve the Risk Management Template. Working wi.docx
1. Click here to retrieve the Risk Management Template. Working wi.docx1. Click here to retrieve the Risk Management Template. Working wi.docx
1. Click here to retrieve the Risk Management Template. Working wi.docx
jackiewalcutt
 
Innovation Intro to Conjoint
Innovation Intro to ConjointInnovation Intro to Conjoint
Innovation Intro to Conjoint
WahyuRafdinal1
 
2014 CrossRef Annual Meeting Peer Review Panel: PRE: Securing Trust & Transpa...
2014 CrossRef Annual Meeting Peer Review Panel: PRE: Securing Trust & Transpa...2014 CrossRef Annual Meeting Peer Review Panel: PRE: Securing Trust & Transpa...
2014 CrossRef Annual Meeting Peer Review Panel: PRE: Securing Trust & Transpa...
Crossref
 

Similar to Estimating the Helpfulness and Economic Impact of Product Reviews: Mining Text and Reviewer Characteristics (20)

JUS 455 Final Project Milestone Two Guidelines and Rubric
 JUS 455 Final Project Milestone Two Guidelines and Rubric   JUS 455 Final Project Milestone Two Guidelines and Rubric
JUS 455 Final Project Milestone Two Guidelines and Rubric
 
Market analysis tools in npd (final)
Market analysis tools in npd (final)Market analysis tools in npd (final)
Market analysis tools in npd (final)
 
E_WOM THESIS PRESENTATION
E_WOM THESIS PRESENTATIONE_WOM THESIS PRESENTATION
E_WOM THESIS PRESENTATION
 
APA style 500 – 600 words This week we dove into a bit mor.docx
APA style 500 – 600 words This week we dove into a bit mor.docxAPA style 500 – 600 words This week we dove into a bit mor.docx
APA style 500 – 600 words This week we dove into a bit mor.docx
 
International Journal of Engineering Research and Development (IJERD)
International Journal of Engineering Research and Development (IJERD)International Journal of Engineering Research and Development (IJERD)
International Journal of Engineering Research and Development (IJERD)
 
International Journal of Engineering Research and Development (IJERD)
International Journal of Engineering Research and Development (IJERD)International Journal of Engineering Research and Development (IJERD)
International Journal of Engineering Research and Development (IJERD)
 
Online feedback correlation using clustering
Online feedback correlation using clusteringOnline feedback correlation using clustering
Online feedback correlation using clustering
 
iaetsd Co extracting opinion targets and opinion words from online reviews ba...
iaetsd Co extracting opinion targets and opinion words from online reviews ba...iaetsd Co extracting opinion targets and opinion words from online reviews ba...
iaetsd Co extracting opinion targets and opinion words from online reviews ba...
 
Chapter 9
Chapter 9Chapter 9
Chapter 9
 
Summarizing and Enriched Extracting technique using Review Data by Users to t...
Summarizing and Enriched Extracting technique using Review Data by Users to t...Summarizing and Enriched Extracting technique using Review Data by Users to t...
Summarizing and Enriched Extracting technique using Review Data by Users to t...
 
IRJET- Implementation of Review Selection using Deep Learning
IRJET-  	  Implementation of Review Selection using Deep LearningIRJET-  	  Implementation of Review Selection using Deep Learning
IRJET- Implementation of Review Selection using Deep Learning
 
IRJET- Predicting Review Ratings for Product Marketing
IRJET- Predicting Review Ratings for Product MarketingIRJET- Predicting Review Ratings for Product Marketing
IRJET- Predicting Review Ratings for Product Marketing
 
On the validity and impact of metacritic scores adams greenwood-ericksen
On the validity and impact of metacritic scores  adams greenwood-ericksenOn the validity and impact of metacritic scores  adams greenwood-ericksen
On the validity and impact of metacritic scores adams greenwood-ericksen
 
IRJET- Survey of Classification of Business Reviews using Sentiment Analysis
IRJET- Survey of Classification of Business Reviews using Sentiment AnalysisIRJET- Survey of Classification of Business Reviews using Sentiment Analysis
IRJET- Survey of Classification of Business Reviews using Sentiment Analysis
 
Collective Opinion Spam Detection Bridging Review Networks and Metadata
Collective Opinion Spam Detection Bridging Review Networks and MetadataCollective Opinion Spam Detection Bridging Review Networks and Metadata
Collective Opinion Spam Detection Bridging Review Networks and Metadata
 
Customer_Analysis.docx
Customer_Analysis.docxCustomer_Analysis.docx
Customer_Analysis.docx
 
Electronic word-of-mouth, management response and behavioural intentions
Electronic word-of-mouth, management response and behavioural intentionsElectronic word-of-mouth, management response and behavioural intentions
Electronic word-of-mouth, management response and behavioural intentions
 
1. Click here to retrieve the Risk Management Template. Working wi.docx
1. Click here to retrieve the Risk Management Template. Working wi.docx1. Click here to retrieve the Risk Management Template. Working wi.docx
1. Click here to retrieve the Risk Management Template. Working wi.docx
 
Innovation Intro to Conjoint
Innovation Intro to ConjointInnovation Intro to Conjoint
Innovation Intro to Conjoint
 
2014 CrossRef Annual Meeting Peer Review Panel: PRE: Securing Trust & Transpa...
2014 CrossRef Annual Meeting Peer Review Panel: PRE: Securing Trust & Transpa...2014 CrossRef Annual Meeting Peer Review Panel: PRE: Securing Trust & Transpa...
2014 CrossRef Annual Meeting Peer Review Panel: PRE: Securing Trust & Transpa...
 

Recently uploaded

Pushing the limits of ePRTC: 100ns holdover for 100 days
Pushing the limits of ePRTC: 100ns holdover for 100 daysPushing the limits of ePRTC: 100ns holdover for 100 days
Pushing the limits of ePRTC: 100ns holdover for 100 days
Adtran
 
National Security Agency - NSA mobile device best practices
National Security Agency - NSA mobile device best practicesNational Security Agency - NSA mobile device best practices
National Security Agency - NSA mobile device best practices
Quotidiano Piemontese
 
Building Production Ready Search Pipelines with Spark and Milvus
Building Production Ready Search Pipelines with Spark and MilvusBuilding Production Ready Search Pipelines with Spark and Milvus
Building Production Ready Search Pipelines with Spark and Milvus
Zilliz
 
HCL Notes and Domino License Cost Reduction in the World of DLAU
HCL Notes and Domino License Cost Reduction in the World of DLAUHCL Notes and Domino License Cost Reduction in the World of DLAU
HCL Notes and Domino License Cost Reduction in the World of DLAU
panagenda
 
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdfObservability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
Paige Cruz
 
“I’m still / I’m still / Chaining from the Block”
“I’m still / I’m still / Chaining from the Block”“I’m still / I’m still / Chaining from the Block”
“I’m still / I’m still / Chaining from the Block”
Claudio Di Ciccio
 
Video Streaming: Then, Now, and in the Future
Video Streaming: Then, Now, and in the FutureVideo Streaming: Then, Now, and in the Future
Video Streaming: Then, Now, and in the Future
Alpen-Adria-Universität
 
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
Speck&Tech
 
20240609 QFM020 Irresponsible AI Reading List May 2024
20240609 QFM020 Irresponsible AI Reading List May 202420240609 QFM020 Irresponsible AI Reading List May 2024
20240609 QFM020 Irresponsible AI Reading List May 2024
Matthew Sinclair
 
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
Neo4j
 
RESUME BUILDER APPLICATION Project for students
RESUME BUILDER APPLICATION Project for studentsRESUME BUILDER APPLICATION Project for students
RESUME BUILDER APPLICATION Project for students
KAMESHS29
 
How to Get CNIC Information System with Paksim Ga.pptx
How to Get CNIC Information System with Paksim Ga.pptxHow to Get CNIC Information System with Paksim Ga.pptx
How to Get CNIC Information System with Paksim Ga.pptx
danishmna97
 
20240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 202420240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 2024
Matthew Sinclair
 
Microsoft - Power Platform_G.Aspiotis.pdf
Microsoft - Power Platform_G.Aspiotis.pdfMicrosoft - Power Platform_G.Aspiotis.pdf
Microsoft - Power Platform_G.Aspiotis.pdf
Uni Systems S.M.S.A.
 
“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...
“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...
“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...
Edge AI and Vision Alliance
 
Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!
Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!
Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!
SOFTTECHHUB
 
Serial Arm Control in Real Time Presentation
Serial Arm Control in Real Time PresentationSerial Arm Control in Real Time Presentation
Serial Arm Control in Real Time Presentation
tolgahangng
 
Mind map of terminologies used in context of Generative AI
Mind map of terminologies used in context of Generative AIMind map of terminologies used in context of Generative AI
Mind map of terminologies used in context of Generative AI
Kumud Singh
 
GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...
GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...
GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...
Neo4j
 
GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
Neo4j
 

Recently uploaded (20)

Pushing the limits of ePRTC: 100ns holdover for 100 days
Pushing the limits of ePRTC: 100ns holdover for 100 daysPushing the limits of ePRTC: 100ns holdover for 100 days
Pushing the limits of ePRTC: 100ns holdover for 100 days
 
National Security Agency - NSA mobile device best practices
National Security Agency - NSA mobile device best practicesNational Security Agency - NSA mobile device best practices
National Security Agency - NSA mobile device best practices
 
Building Production Ready Search Pipelines with Spark and Milvus
Building Production Ready Search Pipelines with Spark and MilvusBuilding Production Ready Search Pipelines with Spark and Milvus
Building Production Ready Search Pipelines with Spark and Milvus
 
HCL Notes and Domino License Cost Reduction in the World of DLAU
HCL Notes and Domino License Cost Reduction in the World of DLAUHCL Notes and Domino License Cost Reduction in the World of DLAU
HCL Notes and Domino License Cost Reduction in the World of DLAU
 
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdfObservability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
 
“I’m still / I’m still / Chaining from the Block”
“I’m still / I’m still / Chaining from the Block”“I’m still / I’m still / Chaining from the Block”
“I’m still / I’m still / Chaining from the Block”
 
Video Streaming: Then, Now, and in the Future
Video Streaming: Then, Now, and in the FutureVideo Streaming: Then, Now, and in the Future
Video Streaming: Then, Now, and in the Future
 
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
 
20240609 QFM020 Irresponsible AI Reading List May 2024
20240609 QFM020 Irresponsible AI Reading List May 202420240609 QFM020 Irresponsible AI Reading List May 2024
20240609 QFM020 Irresponsible AI Reading List May 2024
 
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
 
RESUME BUILDER APPLICATION Project for students
RESUME BUILDER APPLICATION Project for studentsRESUME BUILDER APPLICATION Project for students
RESUME BUILDER APPLICATION Project for students
 
How to Get CNIC Information System with Paksim Ga.pptx
How to Get CNIC Information System with Paksim Ga.pptxHow to Get CNIC Information System with Paksim Ga.pptx
How to Get CNIC Information System with Paksim Ga.pptx
 
20240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 202420240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 2024
 
Microsoft - Power Platform_G.Aspiotis.pdf
Microsoft - Power Platform_G.Aspiotis.pdfMicrosoft - Power Platform_G.Aspiotis.pdf
Microsoft - Power Platform_G.Aspiotis.pdf
 
“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...
“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...
“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...
 
Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!
Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!
Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!
 
Serial Arm Control in Real Time Presentation
Serial Arm Control in Real Time PresentationSerial Arm Control in Real Time Presentation
Serial Arm Control in Real Time Presentation
 
Mind map of terminologies used in context of Generative AI
Mind map of terminologies used in context of Generative AIMind map of terminologies used in context of Generative AI
Mind map of terminologies used in context of Generative AI
 
GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...
GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...
GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...
 
GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
 

Estimating the Helpfulness and Economic Impact of Product Reviews: Mining Text and Reviewer Characteristics

  • 1. “Estimating the Helpfulness and Economic Impact of Product Reviews: Mining Text and Reviewer Characteristics” Authors: AnindyaGhose, Panagiotis G. Ipeirotis, Member, IEEE Course: Topics in Data mining Presenter: NobalNiraula December 8, 2010 @ UOM 1
  • 2. Introduction Gathering variables (Attributes) Explanatory study using Econometric Regression Hypothesis for sales Hypothesis for perceived usefulness Prediction Helpfulness Impact on sales Conclusion Outline 2
  • 3. Product related word-of-mouth conversations in online markets Reviewers contribute time and energy Volume of review could be high Benefits Customers: Usefulness / Helpfulness Average Star Rating Bimodel Peer Review  Biased Helpfulness = helpful votes / total votes “Spotlight Review” in Amazon.com Manufacturers: Influence on Sales Helpful reviews are not necessarily the ones that lead to increases in sales ! Reviews that affect most should be presented first to manufacturers Introduction (1) 3
  • 4. The paper is unique in looking at how subjectivity level, readabilityandspelling errors in the text of reviews affect product sales and the perceived helpfulness of these reviews. 4 Introduction (2)
  • 5. Two Level Study Explanatory Econometric Analysis Identify aspects of a review a reviewer Prediction Model using “Random Forests” How peer consumers are going to rate a review How sales will be affected by the posted review Predicting Helpfulness and Importance 5
  • 10. Variables Collection Products Audio and video players (144 products), Digital cameras (109 products), and DVDs (158 products). Product and Sales Data :Retail Price, Sales Rank, Average Rating, Number of Reviews, Elapsed Date Reviewer History: Number of Past Reviews, Reviewer History Micro, Reviewer History Micro, Past Helpful Votes, Past Total Votes Reviewer Characteristics: Reviewer Rank, Top-10 Reviewer, Top-50 Reviewer, Top-100 Reviewer, Top-500 Reviewer, Real Name, Nick Name, Hobbies, Birthday, Location, Web page, Interests, Snippet, Any Discloser Individual Review: Moderate Review, Helpful Votes, Total Votes, Helpfulness Review Readability : Length(Chars), Length (Words), Length(Sentence), Spelling Error, ARI, Gunning Index, Coleman–Liau index, Flesch Reading Ease, Flesch–Kincaid Grade Level, SMOG Review Subjectivity: AvgProb, DevProb 10
  • 11. Readability Analysis Automated Readability Index Coleman-Liau Index Flesch-Kincaid Grade Level Gunning fog index SMOG Subjectivity Analysis Stylistic Choices : “Subjective” vs “Objective” Each document gets a “Subjectivity Score” AvgProb (r) : High value Many Subjective sentences DevProb (r) : High Value Mixed (Subj+Obj) sentences 11 Text of a Review Matters !
  • 12. Hypothesis 1a: All else equal, a change in the subjectivity level and mixture of objective and subjective statements in reviews will be associated with a change in sales. Hypothesis 1b: All else equal, a change in the readability score of reviews will be associated with a change in sales. Hypothesis 1c: All else equal, a decrease in the proportion of spelling errors in reviews will be positively related to sales. Hypothesis for Sales 12
  • 13. ln(D) = a + b * ln(S) D is the unobserved product demand S is its observed sales rank Pareto Distribution High sales rank  low demand Key Observation: “Sales rank” in Amazon.com can be taken as PROXY of Demand ! 13 Effect on Product Sales
  • 14. 14 Descriptive Statistics for Econometric Analysis
  • 15.
  • 16.
  • 17. Hypothesis 1a: High subjective sentences increase sales Mixture of subjective and objective sentences are negatively associated with product sales compared to highly subjective and objective sentences. Hypothesis 1b: Higher readability scores are associated with higher sales Hypothesis 1c An increase in proportion of spelling mistakesdecreases product sales for some “experience products” like DVDs however the proportion of spelling errors doesn’t have significant impact on sales for “search products” Reviews with that rate products negatively can be associated with increased product sales when the review text is informative and detailed !!! 17 Conclusion (1)
  • 18. Hypothesis 2a: All else equal, a change in the subjectivity level and mixture of objective and subjective statements in a review will be associated with a change in the perceived helpfulness of that review Hypothesis 2b: All else equal, a change in the readability of a review will be associated with a change the perceived helpfulness of that review. Hypothesis 2c: All else equal, a decrease in the proportion of spelling errors in a review will be positively related to perceived helpfulness of that review. Hypothesis 2d: All else equal, an increase in the average helpfulness of a reviewer’s historical reviews will be positively related to perceived helpfulness of a review posted by that reviewer. Hypothesis for Helpfulness 18
  • 19.
  • 20. Hypothesis 2a: In general, mixture of subjective and objective elements more informative (helpful) by the users. For feature-based goods users prefer reviews having more objective information and less subjective sentences For experience goods, e.g. DVD, users expect few objective sentences but more subjective sentences Hypothesis 2b – 2d : Increase in the readability of reviews has a positive and statistically impact on review helpfulness An increase in proportion of spelling errors has a negative and statistically significant impact review helpfulness for audio-video products and DVDs. Past historical information about reviewers has a statistically significant effect on the perceived helpfulness of reviews 21 Conclusion (2)
  • 21. Main goal Is the review informative or not ? Does the review impact on sales or not ? Question: given a helpfulness value of a review, decide whether it is useful or not Helpfulness = (Helpful votes/ Total votes) Continuous to binary conversion Threshold found is 60 % Classification Regression Model can be used Binary Classification 22 Predictive Modeling
  • 22. Classifiers SVM VS Random Forest SVM consistently performed worse unlike reported in reports Training time for SVM was significantly higher than that of Random Forest 23 Predicting Helpfulness (1)
  • 24. Examining whether the difference SalesRankt(r)+T − SalesRankt(r) where t(r) is the time the review is posted, is positive or negative. 25 Predicting Impact on Sales
  • 25. Random Forest based prediction For experience goods such as DVDs classifier has lower performance Observed high correlation of “classification error” with “distribution of review ratings” Reviews that have received widely fluctuating ratings also have reviews with widely fluctuating helpfulness votes. Highly detailed and readable reviews can have low helpfulness votes “reviewer-related”, “review subjectivity” and “review readability” features sets are interchangeable! 26 Conclusion (3)
  • 26. Subjectivity level, readabilityandspelling errors in the text of reviews affect product sales and the perceived helpfulness 27 Overall Conclusion
  • 27. AnindyaGhose, Panagiotis G. Ipeirotis, "Estimating the Helpfulness and Economic Impact of Product Reviews: Mining Text and Reviewer Characteristics," IEEE Transactions on Knowledge and Data Engineering, vol. 99, no. PrePrints, , 2010 28 References