SlideShare a Scribd company logo
Predic'ng	
  Helpfulness	
  of	
  	
  
Amazon’s	
  User-­‐Generated	
  Product	
  Reviews	
  
Ankita	
  Kaul	
  &	
  Nicholas	
  Baladis	
  
MIT	
  Sloan	
  –	
  Spring	
  2015	
  
Project	
  
Mo'va'on	
  
Amazon	
  prioriAzes	
  product	
  reviews	
  that	
  customers	
  
deem	
  ‘helpful’,	
  only	
  a@er	
  customers	
  have	
  
voluntarily	
  voted	
  so.	
  
Customers	
  can	
  
voluntarily	
  vote	
  
here	
  
Ankita	
  Kaul	
  &	
  Nick	
  Baladis	
  |	
  MIT	
  Sloan	
  
…Amazon	
  could	
  predict	
  which	
  reviews	
  are	
  helpful,	
  
	
  the	
  moment	
  they	
  are	
  posted?	
  
Product	
  Ra5ng	
  
Helpfulness	
  score	
  
Ankita	
  Kaul	
  &	
  Nick	
  Baladis	
  |	
  MIT	
  Sloan	
  
Data	
  Galore*	
  
Our	
  data	
  consisted	
  of	
  
Amazon	
  user-­‐generated	
  
product	
  reviews,	
  spanning	
  all	
  
product	
  categories,	
  and	
  
spanning	
  a	
  Ame	
  of	
  18	
  years.	
  
Each	
  ‘observaAon’	
  is	
  a	
  
customer’s	
  review.	
  	
  
•  Reviewer	
  ID	
   •  Helpfulness	
  RaAng	
  
•  Product	
  ID	
   •  Product	
  Price	
  
•  Timestamp	
  of	
  
review	
  
•  Review	
  Prose	
  	
  
•  Score	
  
Data	
  Structure:	
  
~35M	
  Reviews,	
  
All	
  Categories	
  
~1.2M,	
  Electronics	
  
Categories	
  	
  
~18K,	
  
Only	
  Reviews	
  with	
  
	
  >10	
  votes	
  
Downsize	
  Downsize	
  
We	
  had	
  to	
  downsize:	
  
*Data	
  procured	
  from	
  Stanford	
  University	
  
J.	
  McAuley	
  and	
  J.	
  Leskovec.	
  Hidden	
  factors	
  and	
  
hidden	
  topics:	
  understanding	
  ra5ng	
  dimensions	
  
with	
  review	
  text.	
  RecSys,	
  2013.	
  
Ankita	
  Kaul	
  &	
  Nick	
  Baladis	
  |	
  MIT	
  Sloan	
  
Analysis	
  Approach	
  
The	
  Setup	
  
The	
  Methodology	
  
Dependent	
  
variable	
  
Is	
  a	
  review	
  helpful	
  or	
  
not?	
  	
  
• ‘Yes’	
  if	
  >75%	
  voters	
  
agree	
  
• Binary	
  variable	
  	
  
Independent	
  
variables	
  
Pre-­‐Exis'ng	
  from	
  data	
  set:	
  
• Product	
  Price	
  
• Overall	
  product	
  raAng	
  
	
  
Newly	
  calculated:	
  
• Word	
  count	
  of	
  review	
  prose	
  
• Readability	
  grade-­‐level	
  score	
  	
  
On	
  unclustered	
  
data	
  set	
  
• Linear	
  Regression	
  
• LogisAc	
  Regression	
  
• CART	
  
• Cross-­‐Validated	
  CART	
  
• Random	
  Forest	
  
• Bag	
  of	
  Words	
  
On	
  clustered	
  data	
  
set	
  
• LogisAc	
  Regression	
  
• CART	
  
• Cross-­‐Validated	
  CART	
  
• Random	
  Forest	
  
• Bag	
  of	
  Words	
  
Flesch-­‐Kincaid	
  method:	
  
Ankita	
  Kaul	
  &	
  Nick	
  Baladis	
  |	
  MIT	
  Sloan	
  
PredicAons	
  on	
  Unclustered	
  Data	
  Set	
  
	
  Methodology	
   Accuracy	
  
	
  Baseline	
   74.95%	
  
	
  Linear	
  Regression	
  	
   R2	
  =	
  0.273	
  
	
  LogisAc	
  Regression	
   81.44%	
  
	
  CART	
   80.88%	
  
	
  Cross-­‐V	
  CART	
   81.84%	
  
	
  Random	
  Forest	
   81.94%	
  
	
  BoW	
  &	
  LogisAc	
  Reg	
   81.08%	
  
	
  BoW	
  &	
  CART	
   79.80%	
  
	
  BoW	
  &	
  Cross-­‐V	
  CART	
   78.16%	
  
	
  BoW	
  &	
  Random	
  Forest	
   82.08%	
  
score >= 2.5
price < 210
work >= 0.5
score >= 1.5
price < 30
FALSE
FALSE
FALSE
FALSE TRUE
TRUE
yes no
BoW	
  &	
  CART	
  Tree	
  
Our	
  predic've	
  models	
  look	
  promising:	
  
Ankita	
  Kaul	
  &	
  Nick	
  Baladis	
  |	
  MIT	
  Sloan	
  
Clustering	
  The	
  Data	
  Set	
  
Cluster	
  1	
  -­‐	
  Eloquent	
  &	
  wordy	
  
•  Highest	
  word	
  count	
  
•  Highest	
  grade	
  score	
  
Cluster	
  2	
  –	
  Cheap	
  products	
  &	
  less	
  
wordy	
  
•  Lowest	
  price	
  
•  Low	
  word	
  count	
  
Cluster	
  3	
  –Worse	
  products	
  &	
  shortest	
  
reviews	
  
•  Lowest	
  word	
  count	
  
•  Lowest	
  product	
  score	
  
Cluster	
  4	
  –	
  The	
  ‘average’	
  group	
  
•  Average	
  in	
  all	
  variables	
  
Cluster	
  5	
  –	
  Expensive	
  products	
  &	
  
least	
  arAculate	
  reviews	
  
•  Highest	
  price	
  
•  Low	
  grade	
  score	
  
15%	
  
35%	
  
31%	
  
14%	
  
5%	
  
05000001000000
Cluster Dendrogram
Height
Ankita	
  Kaul	
  &	
  Nick	
  Baladis	
  |	
  MIT	
  Sloan	
  
 Cluster	
  
Baseline	
  
Accuracy	
  
Best	
  Performing	
  
Accuracy	
  
Best	
  Performing	
  
Methodology	
  
	
  Cluster	
  1	
   90.52%	
   90.52%	
   Baseline	
  
	
  Cluster	
  2	
   85.24%	
   86.08%	
   Random	
  Forest	
  
	
  Cluster	
  3	
   65.31%	
   76.74%	
  
Bag	
  of	
  Words	
  &	
  Random	
  
Forest	
  
	
  Cluster	
  4	
   68.63%	
   82.24%	
  
Bag	
  of	
  Words	
  &	
  Cross-­‐
Validated	
  CART	
  
	
  Cluster	
  5	
   70.31%	
   84.34%	
   LogisAc	
  Regression	
  
Clustered	
  Data	
  Set	
  Results	
  
No	
  improvement	
  
through	
  modeling	
  
+14%	
  improvement	
  
Cluster-­‐then-­‐predict	
  total	
  accuracy	
  =	
  76.81%	
  
Clustering	
  provided	
  us	
  mixed	
  results	
  on	
  our	
  models:	
  
Ankita	
  Kaul	
  &	
  Nick	
  Baladis	
  |	
  MIT	
  Sloan	
  
Bag	
  of	
  Words	
  Text	
  AnalyAcs	
  +	
  CART	
  
	
  Examples	
  on	
  Clustered	
  Set	
  
score >= 3.5
wordcoun >= 58
grade_sc >= 5.4
wordcoun >= 96
epson >= 2.5
might >= 0.5
keep >= 0.5
pretti >= 0.5
wordcoun < 102
FALSE
FALSE TRUE FALSE
FALSE
FALSE
FALSE
FALSE TRUE
TRUE
yes no
score >= 3.5
wordcoun >= 50 wordcoun >= 124
score >= 2.5
speaker < 1.5
fine >= 0.5
chang < 0.5
window >= 0.5
issu >= 0.5
real >= 0.5
FALSE TRUE
FALSE
FALSE
FALSE
FALSE
FALSE TRUE
TRUE
TRUE
TRUE
yes no
Cluster	
  4	
   Cluster	
  5	
  
Ankita	
  Kaul	
  &	
  Nick	
  Baladis	
  |	
  MIT	
  Sloan	
  
Conclusions	
  
Our	
  best	
  performer	
  was	
  Bag	
  of	
  Words	
  +	
  Random	
  Forests	
  on	
  the	
  complete	
  data	
  set	
  
	
  
	
  
	
  
	
  
The	
  cluster-­‐then-­‐predict	
  methodology	
  did	
  not	
  beat	
  modeling	
  the	
  enAre	
  set	
  
	
  
	
  
	
  
	
  
However,	
  clustering	
  gave	
  us	
  other	
  interesAng	
  results:	
  
•  Clusters	
  1,2,4,5	
  beat	
  even	
  our	
  best	
  models	
  we	
  developed	
  on	
  the	
  enAre	
  data	
  set	
  
•  Cluster	
  1	
  had	
  such	
  a	
  high	
  baseline	
  (90.52%),	
  no	
  model	
  is	
  needed	
  
•  Cluster	
  5	
  had	
  a	
  +14%	
  improvement,	
  higher	
  than	
  any	
  other	
  model	
  
	
  
74.95%	
  
(Baseline)	
  
82.08%	
  
	
  (BoW	
  +	
  RF)	
  
74.95%	
  
(Baseline)	
  
76.81%	
  
	
  (Cluster-­‐then-­‐Predict)	
  
Amazon	
  can	
  predict	
  the	
  helpfulness	
  of	
  reviews	
  at	
  the	
  moment	
  they	
  are	
  posted	
  with	
  
reasonable	
  accuracy	
  with	
  a	
  2-­‐step	
  model	
  (1)	
  cluster,	
  2)	
  predict	
  by	
  cluster).	
  By	
  applying	
  
such	
  analy'cs,	
  they	
  can	
  poten'ally	
  flag	
  unhelpful	
  reviews	
  at	
  'me	
  of	
  pos'ng	
  and	
  help	
  
develop	
  a	
  be_er	
  decision	
  making	
  experience	
  for	
  customers.	
  	
  
Conclusions:	
  
Ankita	
  Kaul	
  &	
  Nick	
  Baladis	
  |	
  MIT	
  Sloan	
  

More Related Content

Similar to Predicting Helpfulness of User-Generated Product Reviews Through Analytical Models

Statistical methods for questionnaire development: Questionnaire reliability ...
Statistical methods for questionnaire development: Questionnaire reliability ...Statistical methods for questionnaire development: Questionnaire reliability ...
Statistical methods for questionnaire development: Questionnaire reliability ...
Ahmed Negida
 
Clinical Data Classification of alzheimer's disease
Clinical Data Classification of alzheimer's diseaseClinical Data Classification of alzheimer's disease
Clinical Data Classification of alzheimer's disease
George Kalangi
 
Classification techniques in data mining
Classification techniques in data miningClassification techniques in data mining
Classification techniques in data mining
Kamal Acharya
 
Barga Data Science lecture 10
Barga Data Science lecture 10Barga Data Science lecture 10
Barga Data Science lecture 10
Roger Barga
 
Predictive Analysis
Predictive AnalysisPredictive Analysis
Predictive Analysis
Michael Bystry
 
Barga Data Science lecture 9
Barga Data Science lecture 9Barga Data Science lecture 9
Barga Data Science lecture 9
Roger Barga
 
Anton Muzhailo - Practical Test Process Improvement using ISTQB
Anton Muzhailo - Practical Test Process Improvement using ISTQBAnton Muzhailo - Practical Test Process Improvement using ISTQB
Anton Muzhailo - Practical Test Process Improvement using ISTQB
Ievgenii Katsan
 
Introduction to Data Mining
Introduction to Data MiningIntroduction to Data Mining
Introduction to Data Mining
Kai Koenig
 
Mastering the 80% of Analytics: What Data Scientists Really Do
Mastering the 80% of Analytics: What Data Scientists Really DoMastering the 80% of Analytics: What Data Scientists Really Do
Mastering the 80% of Analytics: What Data Scientists Really Do
Avrio Analytics
 
Introduction to Data Analysis With R and R Studio
Introduction to Data Analysis With R and R StudioIntroduction to Data Analysis With R and R Studio
Introduction to Data Analysis With R and R Studio
Azmi Mohd Tamil
 
Tips and tricks to win kaggle data science competitions
Tips and tricks to win kaggle data science competitionsTips and tricks to win kaggle data science competitions
Tips and tricks to win kaggle data science competitions
Darius Barušauskas
 
Traditional versus adaptive techniques
Traditional versus adaptive techniquesTraditional versus adaptive techniques
Traditional versus adaptive techniques
Aravind Ganesh
 
Data analytics, a (short) tour
Data analytics, a (short) tourData analytics, a (short) tour
Data analytics, a (short) tour
Venkatesh Prasad Ranganath
 
Multi-method Evaluation in Scientific Paper Recommender Systems
Multi-method Evaluation in Scientific Paper Recommender SystemsMulti-method Evaluation in Scientific Paper Recommender Systems
Multi-method Evaluation in Scientific Paper Recommender Systems
Aravind Sesagiri Raamkumar
 
Data mining on yelp dataset
Data mining on yelp datasetData mining on yelp dataset
Data mining on yelp dataset
Parineetha Tirumali
 
Creating AnswerBot with Keras and TensorFlow (TensorBeat)
Creating AnswerBot with Keras and TensorFlow (TensorBeat)Creating AnswerBot with Keras and TensorFlow (TensorBeat)
Creating AnswerBot with Keras and TensorFlow (TensorBeat)
Avkash Chauhan
 
Meetup_Consumer_Credit_Default_Vers_2_All
Meetup_Consumer_Credit_Default_Vers_2_AllMeetup_Consumer_Credit_Default_Vers_2_All
Meetup_Consumer_Credit_Default_Vers_2_All
Bernard Ong
 
Top 10 Data Science Practitioner Pitfalls
Top 10 Data Science Practitioner PitfallsTop 10 Data Science Practitioner Pitfalls
Top 10 Data Science Practitioner Pitfalls
Sri Ambati
 
Kaggle Gold Medal Case Study
Kaggle Gold Medal Case StudyKaggle Gold Medal Case Study
Kaggle Gold Medal Case Study
Alon Bochman, CFA
 
Mixed Effects Models - Random Intercepts
Mixed Effects Models - Random InterceptsMixed Effects Models - Random Intercepts
Mixed Effects Models - Random Intercepts
Scott Fraundorf
 

Similar to Predicting Helpfulness of User-Generated Product Reviews Through Analytical Models (20)

Statistical methods for questionnaire development: Questionnaire reliability ...
Statistical methods for questionnaire development: Questionnaire reliability ...Statistical methods for questionnaire development: Questionnaire reliability ...
Statistical methods for questionnaire development: Questionnaire reliability ...
 
Clinical Data Classification of alzheimer's disease
Clinical Data Classification of alzheimer's diseaseClinical Data Classification of alzheimer's disease
Clinical Data Classification of alzheimer's disease
 
Classification techniques in data mining
Classification techniques in data miningClassification techniques in data mining
Classification techniques in data mining
 
Barga Data Science lecture 10
Barga Data Science lecture 10Barga Data Science lecture 10
Barga Data Science lecture 10
 
Predictive Analysis
Predictive AnalysisPredictive Analysis
Predictive Analysis
 
Barga Data Science lecture 9
Barga Data Science lecture 9Barga Data Science lecture 9
Barga Data Science lecture 9
 
Anton Muzhailo - Practical Test Process Improvement using ISTQB
Anton Muzhailo - Practical Test Process Improvement using ISTQBAnton Muzhailo - Practical Test Process Improvement using ISTQB
Anton Muzhailo - Practical Test Process Improvement using ISTQB
 
Introduction to Data Mining
Introduction to Data MiningIntroduction to Data Mining
Introduction to Data Mining
 
Mastering the 80% of Analytics: What Data Scientists Really Do
Mastering the 80% of Analytics: What Data Scientists Really DoMastering the 80% of Analytics: What Data Scientists Really Do
Mastering the 80% of Analytics: What Data Scientists Really Do
 
Introduction to Data Analysis With R and R Studio
Introduction to Data Analysis With R and R StudioIntroduction to Data Analysis With R and R Studio
Introduction to Data Analysis With R and R Studio
 
Tips and tricks to win kaggle data science competitions
Tips and tricks to win kaggle data science competitionsTips and tricks to win kaggle data science competitions
Tips and tricks to win kaggle data science competitions
 
Traditional versus adaptive techniques
Traditional versus adaptive techniquesTraditional versus adaptive techniques
Traditional versus adaptive techniques
 
Data analytics, a (short) tour
Data analytics, a (short) tourData analytics, a (short) tour
Data analytics, a (short) tour
 
Multi-method Evaluation in Scientific Paper Recommender Systems
Multi-method Evaluation in Scientific Paper Recommender SystemsMulti-method Evaluation in Scientific Paper Recommender Systems
Multi-method Evaluation in Scientific Paper Recommender Systems
 
Data mining on yelp dataset
Data mining on yelp datasetData mining on yelp dataset
Data mining on yelp dataset
 
Creating AnswerBot with Keras and TensorFlow (TensorBeat)
Creating AnswerBot with Keras and TensorFlow (TensorBeat)Creating AnswerBot with Keras and TensorFlow (TensorBeat)
Creating AnswerBot with Keras and TensorFlow (TensorBeat)
 
Meetup_Consumer_Credit_Default_Vers_2_All
Meetup_Consumer_Credit_Default_Vers_2_AllMeetup_Consumer_Credit_Default_Vers_2_All
Meetup_Consumer_Credit_Default_Vers_2_All
 
Top 10 Data Science Practitioner Pitfalls
Top 10 Data Science Practitioner PitfallsTop 10 Data Science Practitioner Pitfalls
Top 10 Data Science Practitioner Pitfalls
 
Kaggle Gold Medal Case Study
Kaggle Gold Medal Case StudyKaggle Gold Medal Case Study
Kaggle Gold Medal Case Study
 
Mixed Effects Models - Random Intercepts
Mixed Effects Models - Random InterceptsMixed Effects Models - Random Intercepts
Mixed Effects Models - Random Intercepts
 

Recently uploaded

STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
sameer shah
 
06-12-2024-BudapestDataForum-BuildingReal-timePipelineswithFLaNK AIM
06-12-2024-BudapestDataForum-BuildingReal-timePipelineswithFLaNK AIM06-12-2024-BudapestDataForum-BuildingReal-timePipelineswithFLaNK AIM
06-12-2024-BudapestDataForum-BuildingReal-timePipelineswithFLaNK AIM
Timothy Spann
 
Analysis insight about a Flyball dog competition team's performance
Analysis insight about a Flyball dog competition team's performanceAnalysis insight about a Flyball dog competition team's performance
Analysis insight about a Flyball dog competition team's performance
roli9797
 
End-to-end pipeline agility - Berlin Buzzwords 2024
End-to-end pipeline agility - Berlin Buzzwords 2024End-to-end pipeline agility - Berlin Buzzwords 2024
End-to-end pipeline agility - Berlin Buzzwords 2024
Lars Albertsson
 
Learn SQL from basic queries to Advance queries
Learn SQL from basic queries to Advance queriesLearn SQL from basic queries to Advance queries
Learn SQL from basic queries to Advance queries
manishkhaire30
 
My burning issue is homelessness K.C.M.O.
My burning issue is homelessness K.C.M.O.My burning issue is homelessness K.C.M.O.
My burning issue is homelessness K.C.M.O.
rwarrenll
 
4th Modern Marketing Reckoner by MMA Global India & Group M: 60+ experts on W...
4th Modern Marketing Reckoner by MMA Global India & Group M: 60+ experts on W...4th Modern Marketing Reckoner by MMA Global India & Group M: 60+ experts on W...
4th Modern Marketing Reckoner by MMA Global India & Group M: 60+ experts on W...
Social Samosa
 
The Ipsos - AI - Monitor 2024 Report.pdf
The  Ipsos - AI - Monitor 2024 Report.pdfThe  Ipsos - AI - Monitor 2024 Report.pdf
The Ipsos - AI - Monitor 2024 Report.pdf
Social Samosa
 
Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You...
Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You...Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You...
Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You...
Aggregage
 
一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理
一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理
一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理
nuttdpt
 
DSSML24_tspann_CodelessGenerativeAIPipelines
DSSML24_tspann_CodelessGenerativeAIPipelinesDSSML24_tspann_CodelessGenerativeAIPipelines
DSSML24_tspann_CodelessGenerativeAIPipelines
Timothy Spann
 
Udemy_2024_Global_Learning_Skills_Trends_Report (1).pdf
Udemy_2024_Global_Learning_Skills_Trends_Report (1).pdfUdemy_2024_Global_Learning_Skills_Trends_Report (1).pdf
Udemy_2024_Global_Learning_Skills_Trends_Report (1).pdf
Fernanda Palhano
 
一比一原版(牛布毕业证书)牛津布鲁克斯大学毕业证如何办理
一比一原版(牛布毕业证书)牛津布鲁克斯大学毕业证如何办理一比一原版(牛布毕业证书)牛津布鲁克斯大学毕业证如何办理
一比一原版(牛布毕业证书)牛津布鲁克斯大学毕业证如何办理
74nqk8xf
 
The Building Blocks of QuestDB, a Time Series Database
The Building Blocks of QuestDB, a Time Series DatabaseThe Building Blocks of QuestDB, a Time Series Database
The Building Blocks of QuestDB, a Time Series Database
javier ramirez
 
Population Growth in Bataan: The effects of population growth around rural pl...
Population Growth in Bataan: The effects of population growth around rural pl...Population Growth in Bataan: The effects of population growth around rural pl...
Population Growth in Bataan: The effects of population growth around rural pl...
Bill641377
 
ViewShift: Hassle-free Dynamic Policy Enforcement for Every Data Lake
ViewShift: Hassle-free Dynamic Policy Enforcement for Every Data LakeViewShift: Hassle-free Dynamic Policy Enforcement for Every Data Lake
ViewShift: Hassle-free Dynamic Policy Enforcement for Every Data Lake
Walaa Eldin Moustafa
 
在线办理(英国UCA毕业证书)创意艺术大学毕业证在读证明一模一样
在线办理(英国UCA毕业证书)创意艺术大学毕业证在读证明一模一样在线办理(英国UCA毕业证书)创意艺术大学毕业证在读证明一模一样
在线办理(英国UCA毕业证书)创意艺术大学毕业证在读证明一模一样
v7oacc3l
 
Experts live - Improving user adoption with AI
Experts live - Improving user adoption with AIExperts live - Improving user adoption with AI
Experts live - Improving user adoption with AI
jitskeb
 
一比一原版(UO毕业证)渥太华大学毕业证如何办理
一比一原版(UO毕业证)渥太华大学毕业证如何办理一比一原版(UO毕业证)渥太华大学毕业证如何办理
一比一原版(UO毕业证)渥太华大学毕业证如何办理
aqzctr7x
 
Challenges of Nation Building-1.pptx with more important
Challenges of Nation Building-1.pptx with more importantChallenges of Nation Building-1.pptx with more important
Challenges of Nation Building-1.pptx with more important
Sm321
 

Recently uploaded (20)

STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
 
06-12-2024-BudapestDataForum-BuildingReal-timePipelineswithFLaNK AIM
06-12-2024-BudapestDataForum-BuildingReal-timePipelineswithFLaNK AIM06-12-2024-BudapestDataForum-BuildingReal-timePipelineswithFLaNK AIM
06-12-2024-BudapestDataForum-BuildingReal-timePipelineswithFLaNK AIM
 
Analysis insight about a Flyball dog competition team's performance
Analysis insight about a Flyball dog competition team's performanceAnalysis insight about a Flyball dog competition team's performance
Analysis insight about a Flyball dog competition team's performance
 
End-to-end pipeline agility - Berlin Buzzwords 2024
End-to-end pipeline agility - Berlin Buzzwords 2024End-to-end pipeline agility - Berlin Buzzwords 2024
End-to-end pipeline agility - Berlin Buzzwords 2024
 
Learn SQL from basic queries to Advance queries
Learn SQL from basic queries to Advance queriesLearn SQL from basic queries to Advance queries
Learn SQL from basic queries to Advance queries
 
My burning issue is homelessness K.C.M.O.
My burning issue is homelessness K.C.M.O.My burning issue is homelessness K.C.M.O.
My burning issue is homelessness K.C.M.O.
 
4th Modern Marketing Reckoner by MMA Global India & Group M: 60+ experts on W...
4th Modern Marketing Reckoner by MMA Global India & Group M: 60+ experts on W...4th Modern Marketing Reckoner by MMA Global India & Group M: 60+ experts on W...
4th Modern Marketing Reckoner by MMA Global India & Group M: 60+ experts on W...
 
The Ipsos - AI - Monitor 2024 Report.pdf
The  Ipsos - AI - Monitor 2024 Report.pdfThe  Ipsos - AI - Monitor 2024 Report.pdf
The Ipsos - AI - Monitor 2024 Report.pdf
 
Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You...
Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You...Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You...
Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You...
 
一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理
一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理
一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理
 
DSSML24_tspann_CodelessGenerativeAIPipelines
DSSML24_tspann_CodelessGenerativeAIPipelinesDSSML24_tspann_CodelessGenerativeAIPipelines
DSSML24_tspann_CodelessGenerativeAIPipelines
 
Udemy_2024_Global_Learning_Skills_Trends_Report (1).pdf
Udemy_2024_Global_Learning_Skills_Trends_Report (1).pdfUdemy_2024_Global_Learning_Skills_Trends_Report (1).pdf
Udemy_2024_Global_Learning_Skills_Trends_Report (1).pdf
 
一比一原版(牛布毕业证书)牛津布鲁克斯大学毕业证如何办理
一比一原版(牛布毕业证书)牛津布鲁克斯大学毕业证如何办理一比一原版(牛布毕业证书)牛津布鲁克斯大学毕业证如何办理
一比一原版(牛布毕业证书)牛津布鲁克斯大学毕业证如何办理
 
The Building Blocks of QuestDB, a Time Series Database
The Building Blocks of QuestDB, a Time Series DatabaseThe Building Blocks of QuestDB, a Time Series Database
The Building Blocks of QuestDB, a Time Series Database
 
Population Growth in Bataan: The effects of population growth around rural pl...
Population Growth in Bataan: The effects of population growth around rural pl...Population Growth in Bataan: The effects of population growth around rural pl...
Population Growth in Bataan: The effects of population growth around rural pl...
 
ViewShift: Hassle-free Dynamic Policy Enforcement for Every Data Lake
ViewShift: Hassle-free Dynamic Policy Enforcement for Every Data LakeViewShift: Hassle-free Dynamic Policy Enforcement for Every Data Lake
ViewShift: Hassle-free Dynamic Policy Enforcement for Every Data Lake
 
在线办理(英国UCA毕业证书)创意艺术大学毕业证在读证明一模一样
在线办理(英国UCA毕业证书)创意艺术大学毕业证在读证明一模一样在线办理(英国UCA毕业证书)创意艺术大学毕业证在读证明一模一样
在线办理(英国UCA毕业证书)创意艺术大学毕业证在读证明一模一样
 
Experts live - Improving user adoption with AI
Experts live - Improving user adoption with AIExperts live - Improving user adoption with AI
Experts live - Improving user adoption with AI
 
一比一原版(UO毕业证)渥太华大学毕业证如何办理
一比一原版(UO毕业证)渥太华大学毕业证如何办理一比一原版(UO毕业证)渥太华大学毕业证如何办理
一比一原版(UO毕业证)渥太华大学毕业证如何办理
 
Challenges of Nation Building-1.pptx with more important
Challenges of Nation Building-1.pptx with more importantChallenges of Nation Building-1.pptx with more important
Challenges of Nation Building-1.pptx with more important
 

Predicting Helpfulness of User-Generated Product Reviews Through Analytical Models

  • 1. Predic'ng  Helpfulness  of     Amazon’s  User-­‐Generated  Product  Reviews   Ankita  Kaul  &  Nicholas  Baladis   MIT  Sloan  –  Spring  2015  
  • 2. Project   Mo'va'on   Amazon  prioriAzes  product  reviews  that  customers   deem  ‘helpful’,  only  a@er  customers  have   voluntarily  voted  so.   Customers  can   voluntarily  vote   here   Ankita  Kaul  &  Nick  Baladis  |  MIT  Sloan  
  • 3.
  • 4. …Amazon  could  predict  which  reviews  are  helpful,    the  moment  they  are  posted?   Product  Ra5ng   Helpfulness  score   Ankita  Kaul  &  Nick  Baladis  |  MIT  Sloan  
  • 5. Data  Galore*   Our  data  consisted  of   Amazon  user-­‐generated   product  reviews,  spanning  all   product  categories,  and   spanning  a  Ame  of  18  years.   Each  ‘observaAon’  is  a   customer’s  review.     •  Reviewer  ID   •  Helpfulness  RaAng   •  Product  ID   •  Product  Price   •  Timestamp  of   review   •  Review  Prose     •  Score   Data  Structure:   ~35M  Reviews,   All  Categories   ~1.2M,  Electronics   Categories     ~18K,   Only  Reviews  with    >10  votes   Downsize  Downsize   We  had  to  downsize:   *Data  procured  from  Stanford  University   J.  McAuley  and  J.  Leskovec.  Hidden  factors  and   hidden  topics:  understanding  ra5ng  dimensions   with  review  text.  RecSys,  2013.   Ankita  Kaul  &  Nick  Baladis  |  MIT  Sloan  
  • 6. Analysis  Approach   The  Setup   The  Methodology   Dependent   variable   Is  a  review  helpful  or   not?     • ‘Yes’  if  >75%  voters   agree   • Binary  variable     Independent   variables   Pre-­‐Exis'ng  from  data  set:   • Product  Price   • Overall  product  raAng     Newly  calculated:   • Word  count  of  review  prose   • Readability  grade-­‐level  score     On  unclustered   data  set   • Linear  Regression   • LogisAc  Regression   • CART   • Cross-­‐Validated  CART   • Random  Forest   • Bag  of  Words   On  clustered  data   set   • LogisAc  Regression   • CART   • Cross-­‐Validated  CART   • Random  Forest   • Bag  of  Words   Flesch-­‐Kincaid  method:   Ankita  Kaul  &  Nick  Baladis  |  MIT  Sloan  
  • 7. PredicAons  on  Unclustered  Data  Set    Methodology   Accuracy    Baseline   74.95%    Linear  Regression     R2  =  0.273    LogisAc  Regression   81.44%    CART   80.88%    Cross-­‐V  CART   81.84%    Random  Forest   81.94%    BoW  &  LogisAc  Reg   81.08%    BoW  &  CART   79.80%    BoW  &  Cross-­‐V  CART   78.16%    BoW  &  Random  Forest   82.08%   score >= 2.5 price < 210 work >= 0.5 score >= 1.5 price < 30 FALSE FALSE FALSE FALSE TRUE TRUE yes no BoW  &  CART  Tree   Our  predic've  models  look  promising:   Ankita  Kaul  &  Nick  Baladis  |  MIT  Sloan  
  • 8. Clustering  The  Data  Set   Cluster  1  -­‐  Eloquent  &  wordy   •  Highest  word  count   •  Highest  grade  score   Cluster  2  –  Cheap  products  &  less   wordy   •  Lowest  price   •  Low  word  count   Cluster  3  –Worse  products  &  shortest   reviews   •  Lowest  word  count   •  Lowest  product  score   Cluster  4  –  The  ‘average’  group   •  Average  in  all  variables   Cluster  5  –  Expensive  products  &   least  arAculate  reviews   •  Highest  price   •  Low  grade  score   15%   35%   31%   14%   5%   05000001000000 Cluster Dendrogram Height Ankita  Kaul  &  Nick  Baladis  |  MIT  Sloan  
  • 9.  Cluster   Baseline   Accuracy   Best  Performing   Accuracy   Best  Performing   Methodology    Cluster  1   90.52%   90.52%   Baseline    Cluster  2   85.24%   86.08%   Random  Forest    Cluster  3   65.31%   76.74%   Bag  of  Words  &  Random   Forest    Cluster  4   68.63%   82.24%   Bag  of  Words  &  Cross-­‐ Validated  CART    Cluster  5   70.31%   84.34%   LogisAc  Regression   Clustered  Data  Set  Results   No  improvement   through  modeling   +14%  improvement   Cluster-­‐then-­‐predict  total  accuracy  =  76.81%   Clustering  provided  us  mixed  results  on  our  models:   Ankita  Kaul  &  Nick  Baladis  |  MIT  Sloan  
  • 10. Bag  of  Words  Text  AnalyAcs  +  CART    Examples  on  Clustered  Set   score >= 3.5 wordcoun >= 58 grade_sc >= 5.4 wordcoun >= 96 epson >= 2.5 might >= 0.5 keep >= 0.5 pretti >= 0.5 wordcoun < 102 FALSE FALSE TRUE FALSE FALSE FALSE FALSE FALSE TRUE TRUE yes no score >= 3.5 wordcoun >= 50 wordcoun >= 124 score >= 2.5 speaker < 1.5 fine >= 0.5 chang < 0.5 window >= 0.5 issu >= 0.5 real >= 0.5 FALSE TRUE FALSE FALSE FALSE FALSE FALSE TRUE TRUE TRUE TRUE yes no Cluster  4   Cluster  5   Ankita  Kaul  &  Nick  Baladis  |  MIT  Sloan  
  • 12. Our  best  performer  was  Bag  of  Words  +  Random  Forests  on  the  complete  data  set           The  cluster-­‐then-­‐predict  methodology  did  not  beat  modeling  the  enAre  set           However,  clustering  gave  us  other  interesAng  results:   •  Clusters  1,2,4,5  beat  even  our  best  models  we  developed  on  the  enAre  data  set   •  Cluster  1  had  such  a  high  baseline  (90.52%),  no  model  is  needed   •  Cluster  5  had  a  +14%  improvement,  higher  than  any  other  model     74.95%   (Baseline)   82.08%    (BoW  +  RF)   74.95%   (Baseline)   76.81%    (Cluster-­‐then-­‐Predict)   Amazon  can  predict  the  helpfulness  of  reviews  at  the  moment  they  are  posted  with   reasonable  accuracy  with  a  2-­‐step  model  (1)  cluster,  2)  predict  by  cluster).  By  applying   such  analy'cs,  they  can  poten'ally  flag  unhelpful  reviews  at  'me  of  pos'ng  and  help   develop  a  be_er  decision  making  experience  for  customers.     Conclusions:   Ankita  Kaul  &  Nick  Baladis  |  MIT  Sloan