SlideShare a Scribd company logo
Turning the Tide: Curbing Deceptive
Yelp Behaviors
Mahmudur Rahman
Bogdan Carbunar, Jaime Ballesteros, George Burri, Duen Horng Chau
SIAM SDM 2014
Problem Statement
Detect deceptive behaviors in review centered
online social networks
2
Post review, check-in
Review centered social networks
3
Challenges
• Up to 25% of Yelp reviews are claimed to be fraudulent [3]
• SEO companies offer review campaigns for business owners to
manipulate venues’ ratings
• An extra half-star rating on Yelp causes a restaurant to sell out
19% more often [1], and a one-star increase leads to a 5–9%
increase in revenue [2]
[1] Michael Anderson and Jeremy Magruder. Learning from the crowd: Regression discontinuity estimates
of the effects of an online review database. Economic Journal, 122(563):957–989, 2012.
[2] Michael Luca. Reviews, Reputation, and Revenue: The Case of Yelp.com. Available at
hbswk.hbs.edu/item/6833.html.
[3] Yelp admits a quarter of submitted reviews could be fake. BBC, www.bbc.co.uk/news/technology-
24299742
4
Background
• Yelp’s Review System
• Fraudulent Reviews
• Deceptive Venues
– Consumer Alert
Marco: Identify Deceptive Behaviors
• Automatic detection of fraudulent reviews, deceptive
venues and impactful review campaigns
• Leveraged the wealth of spatial, temporal and social
information provided by Yelp
• Increased the cost and complexity of attacks by
imposing a tradeoff on fraudsters
6
Adversary Model
• Model as a corrupt SEO
– Needs to adjust the rating of V according to a finite budget
– Controls a set of unique (IP address, Yelp Sybil account) pairs
– Has access to a market of review writers, with valid yelp
accounts
Malory
7
Collected Yelp Dataset
• Our data consists of:
• (i) 90 deceptive and 100 legitimate venues
• (ii) 200 fraudulent and 202 genuine reviews, and
• (iii) 7,435 venues and their 270,121 reviews from 195,417
reviewers from San Francisco, New York City and Miami
• Collection process took 3 weeks
YCrawl
• Fetches raw HTML pages of Yelp venue and user
accounts
• Uses a pool of servers, IP proxies & DeathByCaptcha
• Uses breadth-first crawling strategy and stratified
sampling
9
The Data
• Ground truth deceptive venues
– “Consumer Alert” feature
• Gold standard legitimate venues
– Well known consistent quality
• Gold standard fraudulent reviews
– Spelp sites, forums used
• Gold standard genuine reviews
– Remove plagiarized text and reviewer photos
– Discard short, generic reviews
– Give preference to active users
Marco System Overview
11
1. Friend & review count
2. Venue “expertise”
3. Venue activities
4. …
FRI Module
RSD Module
Venue timeline
ARD Module
Review ratings
Venue Classifier
Deceptive &
legitimate venues Features
7,435 venues
195,417 users
270,121 reviews
Train
Train Label
Fraudulent & genuine reviews
Review Classifier
Users Venues
Friend
relations Spatial
vicinity
Reviews/Time
Review Spike Detection (RSD) Module
• Identify venues that receive higher number of positive
(negative) reviews than normal
• Uses the measures of dispersion of Box-and-Whisker
plots to detect outliers
• Outputs two features -
• the number of spikes detected for a venue
• the normalized amplitude of the highest spike
12
Timelines of positive reviews of 3 deceptive venues
Review Spike Detection (RSD) Module (Cont.)
• Assumption: A review campaign is successful if it
increases the rating of the target venue by at least half
a star
• Claim #1: The minimum number of reviews the
adversary needs to post in order to (fraudulently)
increase the rating of a venue by half a star is n/7
14
Marco System Overview
15
1. Friend & review count
2. Venue “expertise”
3. Venue activities
4. …
FRI Module
RSD Module
Venue timeline
ARD Module
Review ratings
Venue Classifier
Deceptive &
legitimate venues Features
7,435 venues
195,417 users
270,121 reviews
Train
Train Label
Fraudulent & genuine reviews
Review Classifier
Users Venues
Friend
relations Spatial
vicinity
Reviews/Time
Aggregate Rating Disparity (ARD) Module
• Measures the divergence of reviews involved in a
campaign from the average rating of the venue
• Calculates the aggregated divergence of that venue
• Outputs one feature - the ARD score
16
Evolution in time of the average rating of the venue “Azure Nail & Waxing
Studio” of Chicago, IL, compared against the ratings assigned by its reviews.
Marco System Overview
18
1. Friend & review count
2. Venue “expertise”
3. Venue activities
4. …
FRI Module
RSD Module
Venue timeline Review ratings
Venue Classifier
Deceptive &
legitimate venues Features
7,435 venues
195,417 users
270,121 reviews
Train
Train Label
Fraudulent & genuine reviews
Review Classifier
Users Venues
Friend
relations Spatial
vicinity
Reviews/Time
ARD Module
Fraudulent Review Impact (FRI) Module
• Challenges:
• Venues that receive few genuine reviews are particularly
vulnerable to review campaigns
• Long term review campaigns can re-define the “normal”
review posting behavior, flatten spikes and escape detection
by the RSD module
• Our Solution: detect such behaviors through fraudulent
reviews that significantly impact the aggregate rating of venues
• Uses features extracted from each review, its reviewer
and the relation (user expertise) between the reviewer
and the target venue
19
Fraudulent Review Impact (FRI) Module
• Uses following features:
• The number of friends
• The number of reviews written
• The expertise of U around V
• The number of check-ins of U at V
• The number of photos of U at V
• The feedback count of R
• Age of U’s account when R was posted
• Contributes two features
• The FRI score
• The percentage of reviews classified as fraudulent
20
Marco System Overview
21
1. Friend & review count
2. Venue “expertise”
3. Venue activities
4. …
FRI Module
RSD Module
Venue timeline Review ratings
Venue Classifier
Deceptive &
legitimate venues Features
7,435 venues
195,417 users
270,121 reviews
Train
Train Label
Fraudulent & genuine reviews
Review Classifier
Users Venues
Friend
relations Spatial
vicinity
Reviews/Time
ARD Module
Venue Classification Features
• Uses following features:
• The number of review spikes for V
• The amplitude of the highest spike
• Aggregate rating disparity
• The fraudulent review impact of V
• Count of reviews classified fraudulent
• The rating of V
• The number of reviews of V
• The number of reviews with check-ins
• The number of reviews with photos
• The age of V
22
Review Classification
• The overall accuracy of RF, Bagging and DT is 94%, 93.5% and 93%
respectively.
• Top 2 most important features are: 1) number of reviews written by
U and 2) The expertise of U around V
23
Venue Classification
RF and DT are tied for best accuracy, of 95.8%.
24
Comparison with state-of-the-art
• Compared Marco with the three deceptive venue
detection strategies of Feng et al. [1], avg∆, distΦ
and peak↑
Strategy Accuracy (%)
Marco/RF 95.8
avg∆ 66.3
distΦ 72.1
peak↑ 58.9
Marco’s Overhead
Per-module overhead Zoom-in of FRI module overhead
Experimental Results on Live Data
City Car Shop Mover Spa
Miami, FL 1000 (6) 348 (8) 1000 (21)
San Frans., CA 612 (59) 475 (45) 1000 (42)
NYC, NY 1000 (8) 1000 (27) 1000 (28)
Detected deceptive venues by Marco out of collected venues in Yelp
27
Marco flags almost 10% of its car repair and moving companies
as suspicious
Our Contributions
28
• Introduce a lower bound on the number of reviews
required to launch a review campaign
• Presented Marco to flag suspicious reviews and
venues
• Contributed a novel dataset of reviews and venues
• Demonstrated that Marco is effective and fast
Future works
29
• Deceptive opinion spam detection (feature-based
sentiment analysis)
• Identify plagiarized reviews and outlier reviews
Questions and Comments
30

More Related Content

Similar to Curbing Deceptive Yelp Behaviors

ReviewPro Webinar - Managing Guest Satisfaction in the Age of Social Media
ReviewPro Webinar - Managing Guest Satisfaction in the Age of Social MediaReviewPro Webinar - Managing Guest Satisfaction in the Age of Social Media
ReviewPro Webinar - Managing Guest Satisfaction in the Age of Social Media
mbeneteau
 
ReviewPro Webinar - Managing Guest Satisfaction in the Age of Social Media
ReviewPro Webinar - Managing Guest Satisfaction in the Age of Social MediaReviewPro Webinar - Managing Guest Satisfaction in the Age of Social Media
ReviewPro Webinar - Managing Guest Satisfaction in the Age of Social Media
mbeneteau
 
Social media and reviews are changing the landscape of revenue management
Social media and reviews are changing the landscape of revenue managementSocial media and reviews are changing the landscape of revenue management
Social media and reviews are changing the landscape of revenue management
Kevin May
 
Marriott User Research Findings
Marriott User Research FindingsMarriott User Research Findings
Marriott User Research Findings
Jonathan Coen
 
Business impact of online institutional recommendation - DigiBiz'09
Business impact of online institutional recommendation - DigiBiz'09Business impact of online institutional recommendation - DigiBiz'09
Business impact of online institutional recommendation - DigiBiz'09
Digibiz'09 Conference
 
How to Benchmark Your Online Customer Experience Against Competition
How to Benchmark Your Online Customer Experience Against CompetitionHow to Benchmark Your Online Customer Experience Against Competition
How to Benchmark Your Online Customer Experience Against Competition
UserZoom
 
Praveen cv performancetesting
Praveen cv performancetestingPraveen cv performancetesting
Praveen cv performancetesting
praveen manchukonda
 
The Influence Of Reputation Analytics On Hotel Revenue and Financial Performance
The Influence Of Reputation Analytics On Hotel Revenue and Financial PerformanceThe Influence Of Reputation Analytics On Hotel Revenue and Financial Performance
The Influence Of Reputation Analytics On Hotel Revenue and Financial Performance
eCornell
 
Yelp Product Challenge
Yelp Product ChallengeYelp Product Challenge
Yelp Product Challenge
Hisham Radwan
 
Mike rich
Mike richMike rich
Mike rich
MediaPost
 
Call center assessment[1]
Call center assessment[1]Call center assessment[1]
Call center assessment[1]
Chris Simiriotis
 
OPINION MINING BASED FAKE PRODUCT REVIEW MONITORING AND REMOVAL SYSTEM
OPINION MINING BASED FAKE PRODUCT REVIEW MONITORING AND REMOVAL SYSTEMOPINION MINING BASED FAKE PRODUCT REVIEW MONITORING AND REMOVAL SYSTEM
OPINION MINING BASED FAKE PRODUCT REVIEW MONITORING AND REMOVAL SYSTEM
shivayogihiremath201
 
Cam specialist-unit-web-analytics-and-social-media
Cam specialist-unit-web-analytics-and-social-mediaCam specialist-unit-web-analytics-and-social-media
Cam specialist-unit-web-analytics-and-social-media
SHARP TRADERS & COMMODITIES GROUP
 
Review Pro - Reputation Management for Secret Retreats
Review Pro - Reputation Management for Secret Retreats Review Pro - Reputation Management for Secret Retreats
Review Pro - Reputation Management for Secret Retreats
www.bookgreener.com
 
How to Conduct UX Benchmarking
How to Conduct UX BenchmarkingHow to Conduct UX Benchmarking
How to Conduct UX Benchmarking
UserZoom
 
Fuzzy Rough Set Feature Selection to Enhance Phishing Attack Detection
Fuzzy Rough Set Feature Selection to Enhance Phishing Attack Detection Fuzzy Rough Set Feature Selection to Enhance Phishing Attack Detection
Fuzzy Rough Set Feature Selection to Enhance Phishing Attack Detection
Wright State University, Dayton, OH, USA
 
SEO optimizacijos gidas pradedantiesiems
SEO optimizacijos gidas pradedantiesiemsSEO optimizacijos gidas pradedantiesiems
SEO optimizacijos gidas pradedantiesiems
Tomas Morkūnas
 
Estudio34 Presents- Dara Fitzgerald Brighton SEO-Next Gen Measurement With Go...
Estudio34 Presents- Dara Fitzgerald Brighton SEO-Next Gen Measurement With Go...Estudio34 Presents- Dara Fitzgerald Brighton SEO-Next Gen Measurement With Go...
Estudio34 Presents- Dara Fitzgerald Brighton SEO-Next Gen Measurement With Go...
William Renedo
 
Modern Perspectives on Recommender Systems and their Applications in Mendeley
Modern Perspectives on Recommender Systems and their Applications in MendeleyModern Perspectives on Recommender Systems and their Applications in Mendeley
Modern Perspectives on Recommender Systems and their Applications in Mendeley
Maya Hristakeva
 
Praveen cv
Praveen cvPraveen cv

Similar to Curbing Deceptive Yelp Behaviors (20)

ReviewPro Webinar - Managing Guest Satisfaction in the Age of Social Media
ReviewPro Webinar - Managing Guest Satisfaction in the Age of Social MediaReviewPro Webinar - Managing Guest Satisfaction in the Age of Social Media
ReviewPro Webinar - Managing Guest Satisfaction in the Age of Social Media
 
ReviewPro Webinar - Managing Guest Satisfaction in the Age of Social Media
ReviewPro Webinar - Managing Guest Satisfaction in the Age of Social MediaReviewPro Webinar - Managing Guest Satisfaction in the Age of Social Media
ReviewPro Webinar - Managing Guest Satisfaction in the Age of Social Media
 
Social media and reviews are changing the landscape of revenue management
Social media and reviews are changing the landscape of revenue managementSocial media and reviews are changing the landscape of revenue management
Social media and reviews are changing the landscape of revenue management
 
Marriott User Research Findings
Marriott User Research FindingsMarriott User Research Findings
Marriott User Research Findings
 
Business impact of online institutional recommendation - DigiBiz'09
Business impact of online institutional recommendation - DigiBiz'09Business impact of online institutional recommendation - DigiBiz'09
Business impact of online institutional recommendation - DigiBiz'09
 
How to Benchmark Your Online Customer Experience Against Competition
How to Benchmark Your Online Customer Experience Against CompetitionHow to Benchmark Your Online Customer Experience Against Competition
How to Benchmark Your Online Customer Experience Against Competition
 
Praveen cv performancetesting
Praveen cv performancetestingPraveen cv performancetesting
Praveen cv performancetesting
 
The Influence Of Reputation Analytics On Hotel Revenue and Financial Performance
The Influence Of Reputation Analytics On Hotel Revenue and Financial PerformanceThe Influence Of Reputation Analytics On Hotel Revenue and Financial Performance
The Influence Of Reputation Analytics On Hotel Revenue and Financial Performance
 
Yelp Product Challenge
Yelp Product ChallengeYelp Product Challenge
Yelp Product Challenge
 
Mike rich
Mike richMike rich
Mike rich
 
Call center assessment[1]
Call center assessment[1]Call center assessment[1]
Call center assessment[1]
 
OPINION MINING BASED FAKE PRODUCT REVIEW MONITORING AND REMOVAL SYSTEM
OPINION MINING BASED FAKE PRODUCT REVIEW MONITORING AND REMOVAL SYSTEMOPINION MINING BASED FAKE PRODUCT REVIEW MONITORING AND REMOVAL SYSTEM
OPINION MINING BASED FAKE PRODUCT REVIEW MONITORING AND REMOVAL SYSTEM
 
Cam specialist-unit-web-analytics-and-social-media
Cam specialist-unit-web-analytics-and-social-mediaCam specialist-unit-web-analytics-and-social-media
Cam specialist-unit-web-analytics-and-social-media
 
Review Pro - Reputation Management for Secret Retreats
Review Pro - Reputation Management for Secret Retreats Review Pro - Reputation Management for Secret Retreats
Review Pro - Reputation Management for Secret Retreats
 
How to Conduct UX Benchmarking
How to Conduct UX BenchmarkingHow to Conduct UX Benchmarking
How to Conduct UX Benchmarking
 
Fuzzy Rough Set Feature Selection to Enhance Phishing Attack Detection
Fuzzy Rough Set Feature Selection to Enhance Phishing Attack Detection Fuzzy Rough Set Feature Selection to Enhance Phishing Attack Detection
Fuzzy Rough Set Feature Selection to Enhance Phishing Attack Detection
 
SEO optimizacijos gidas pradedantiesiems
SEO optimizacijos gidas pradedantiesiemsSEO optimizacijos gidas pradedantiesiems
SEO optimizacijos gidas pradedantiesiems
 
Estudio34 Presents- Dara Fitzgerald Brighton SEO-Next Gen Measurement With Go...
Estudio34 Presents- Dara Fitzgerald Brighton SEO-Next Gen Measurement With Go...Estudio34 Presents- Dara Fitzgerald Brighton SEO-Next Gen Measurement With Go...
Estudio34 Presents- Dara Fitzgerald Brighton SEO-Next Gen Measurement With Go...
 
Modern Perspectives on Recommender Systems and their Applications in Mendeley
Modern Perspectives on Recommender Systems and their Applications in MendeleyModern Perspectives on Recommender Systems and their Applications in Mendeley
Modern Perspectives on Recommender Systems and their Applications in Mendeley
 
Praveen cv
Praveen cvPraveen cv
Praveen cv
 

Recently uploaded

Microsoft - Power Platform_G.Aspiotis.pdf
Microsoft - Power Platform_G.Aspiotis.pdfMicrosoft - Power Platform_G.Aspiotis.pdf
Microsoft - Power Platform_G.Aspiotis.pdf
Uni Systems S.M.S.A.
 
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
名前 です男
 
GenAI Pilot Implementation in the organizations
GenAI Pilot Implementation in the organizationsGenAI Pilot Implementation in the organizations
GenAI Pilot Implementation in the organizations
kumardaparthi1024
 
Driving Business Innovation: Latest Generative AI Advancements & Success Story
Driving Business Innovation: Latest Generative AI Advancements & Success StoryDriving Business Innovation: Latest Generative AI Advancements & Success Story
Driving Business Innovation: Latest Generative AI Advancements & Success Story
Safe Software
 
Best 20 SEO Techniques To Improve Website Visibility In SERP
Best 20 SEO Techniques To Improve Website Visibility In SERPBest 20 SEO Techniques To Improve Website Visibility In SERP
Best 20 SEO Techniques To Improve Website Visibility In SERP
Pixlogix Infotech
 
GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
Neo4j
 
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with SlackLet's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
shyamraj55
 
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
Speck&Tech
 
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
SOFTTECHHUB
 
20240609 QFM020 Irresponsible AI Reading List May 2024
20240609 QFM020 Irresponsible AI Reading List May 202420240609 QFM020 Irresponsible AI Reading List May 2024
20240609 QFM020 Irresponsible AI Reading List May 2024
Matthew Sinclair
 
GraphSummit Singapore | The Art of the Possible with Graph - Q2 2024
GraphSummit Singapore | The Art of the  Possible with Graph - Q2 2024GraphSummit Singapore | The Art of the  Possible with Graph - Q2 2024
GraphSummit Singapore | The Art of the Possible with Graph - Q2 2024
Neo4j
 
Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!
Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!
Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!
SOFTTECHHUB
 
UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6
DianaGray10
 
Infrastructure Challenges in Scaling RAG with Custom AI models
Infrastructure Challenges in Scaling RAG with Custom AI modelsInfrastructure Challenges in Scaling RAG with Custom AI models
Infrastructure Challenges in Scaling RAG with Custom AI models
Zilliz
 
TrustArc Webinar - 2024 Global Privacy Survey
TrustArc Webinar - 2024 Global Privacy SurveyTrustArc Webinar - 2024 Global Privacy Survey
TrustArc Webinar - 2024 Global Privacy Survey
TrustArc
 
20240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 202420240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 2024
Matthew Sinclair
 
RESUME BUILDER APPLICATION Project for students
RESUME BUILDER APPLICATION Project for studentsRESUME BUILDER APPLICATION Project for students
RESUME BUILDER APPLICATION Project for students
KAMESHS29
 
Essentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FMEEssentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FME
Safe Software
 
GraphRAG for Life Science to increase LLM accuracy
GraphRAG for Life Science to increase LLM accuracyGraphRAG for Life Science to increase LLM accuracy
GraphRAG for Life Science to increase LLM accuracy
Tomaz Bratanic
 
20240605 QFM017 Machine Intelligence Reading List May 2024
20240605 QFM017 Machine Intelligence Reading List May 202420240605 QFM017 Machine Intelligence Reading List May 2024
20240605 QFM017 Machine Intelligence Reading List May 2024
Matthew Sinclair
 

Recently uploaded (20)

Microsoft - Power Platform_G.Aspiotis.pdf
Microsoft - Power Platform_G.Aspiotis.pdfMicrosoft - Power Platform_G.Aspiotis.pdf
Microsoft - Power Platform_G.Aspiotis.pdf
 
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
 
GenAI Pilot Implementation in the organizations
GenAI Pilot Implementation in the organizationsGenAI Pilot Implementation in the organizations
GenAI Pilot Implementation in the organizations
 
Driving Business Innovation: Latest Generative AI Advancements & Success Story
Driving Business Innovation: Latest Generative AI Advancements & Success StoryDriving Business Innovation: Latest Generative AI Advancements & Success Story
Driving Business Innovation: Latest Generative AI Advancements & Success Story
 
Best 20 SEO Techniques To Improve Website Visibility In SERP
Best 20 SEO Techniques To Improve Website Visibility In SERPBest 20 SEO Techniques To Improve Website Visibility In SERP
Best 20 SEO Techniques To Improve Website Visibility In SERP
 
GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
 
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with SlackLet's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
 
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
 
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
 
20240609 QFM020 Irresponsible AI Reading List May 2024
20240609 QFM020 Irresponsible AI Reading List May 202420240609 QFM020 Irresponsible AI Reading List May 2024
20240609 QFM020 Irresponsible AI Reading List May 2024
 
GraphSummit Singapore | The Art of the Possible with Graph - Q2 2024
GraphSummit Singapore | The Art of the  Possible with Graph - Q2 2024GraphSummit Singapore | The Art of the  Possible with Graph - Q2 2024
GraphSummit Singapore | The Art of the Possible with Graph - Q2 2024
 
Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!
Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!
Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!
 
UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6
 
Infrastructure Challenges in Scaling RAG with Custom AI models
Infrastructure Challenges in Scaling RAG with Custom AI modelsInfrastructure Challenges in Scaling RAG with Custom AI models
Infrastructure Challenges in Scaling RAG with Custom AI models
 
TrustArc Webinar - 2024 Global Privacy Survey
TrustArc Webinar - 2024 Global Privacy SurveyTrustArc Webinar - 2024 Global Privacy Survey
TrustArc Webinar - 2024 Global Privacy Survey
 
20240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 202420240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 2024
 
RESUME BUILDER APPLICATION Project for students
RESUME BUILDER APPLICATION Project for studentsRESUME BUILDER APPLICATION Project for students
RESUME BUILDER APPLICATION Project for students
 
Essentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FMEEssentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FME
 
GraphRAG for Life Science to increase LLM accuracy
GraphRAG for Life Science to increase LLM accuracyGraphRAG for Life Science to increase LLM accuracy
GraphRAG for Life Science to increase LLM accuracy
 
20240605 QFM017 Machine Intelligence Reading List May 2024
20240605 QFM017 Machine Intelligence Reading List May 202420240605 QFM017 Machine Intelligence Reading List May 2024
20240605 QFM017 Machine Intelligence Reading List May 2024
 

Curbing Deceptive Yelp Behaviors

  • 1. Turning the Tide: Curbing Deceptive Yelp Behaviors Mahmudur Rahman Bogdan Carbunar, Jaime Ballesteros, George Burri, Duen Horng Chau SIAM SDM 2014
  • 2. Problem Statement Detect deceptive behaviors in review centered online social networks 2
  • 3. Post review, check-in Review centered social networks 3
  • 4. Challenges • Up to 25% of Yelp reviews are claimed to be fraudulent [3] • SEO companies offer review campaigns for business owners to manipulate venues’ ratings • An extra half-star rating on Yelp causes a restaurant to sell out 19% more often [1], and a one-star increase leads to a 5–9% increase in revenue [2] [1] Michael Anderson and Jeremy Magruder. Learning from the crowd: Regression discontinuity estimates of the effects of an online review database. Economic Journal, 122(563):957–989, 2012. [2] Michael Luca. Reviews, Reputation, and Revenue: The Case of Yelp.com. Available at hbswk.hbs.edu/item/6833.html. [3] Yelp admits a quarter of submitted reviews could be fake. BBC, www.bbc.co.uk/news/technology- 24299742 4
  • 5. Background • Yelp’s Review System • Fraudulent Reviews • Deceptive Venues – Consumer Alert
  • 6. Marco: Identify Deceptive Behaviors • Automatic detection of fraudulent reviews, deceptive venues and impactful review campaigns • Leveraged the wealth of spatial, temporal and social information provided by Yelp • Increased the cost and complexity of attacks by imposing a tradeoff on fraudsters 6
  • 7. Adversary Model • Model as a corrupt SEO – Needs to adjust the rating of V according to a finite budget – Controls a set of unique (IP address, Yelp Sybil account) pairs – Has access to a market of review writers, with valid yelp accounts Malory 7
  • 8. Collected Yelp Dataset • Our data consists of: • (i) 90 deceptive and 100 legitimate venues • (ii) 200 fraudulent and 202 genuine reviews, and • (iii) 7,435 venues and their 270,121 reviews from 195,417 reviewers from San Francisco, New York City and Miami • Collection process took 3 weeks
  • 9. YCrawl • Fetches raw HTML pages of Yelp venue and user accounts • Uses a pool of servers, IP proxies & DeathByCaptcha • Uses breadth-first crawling strategy and stratified sampling 9
  • 10. The Data • Ground truth deceptive venues – “Consumer Alert” feature • Gold standard legitimate venues – Well known consistent quality • Gold standard fraudulent reviews – Spelp sites, forums used • Gold standard genuine reviews – Remove plagiarized text and reviewer photos – Discard short, generic reviews – Give preference to active users
  • 11. Marco System Overview 11 1. Friend & review count 2. Venue “expertise” 3. Venue activities 4. … FRI Module RSD Module Venue timeline ARD Module Review ratings Venue Classifier Deceptive & legitimate venues Features 7,435 venues 195,417 users 270,121 reviews Train Train Label Fraudulent & genuine reviews Review Classifier Users Venues Friend relations Spatial vicinity Reviews/Time
  • 12. Review Spike Detection (RSD) Module • Identify venues that receive higher number of positive (negative) reviews than normal • Uses the measures of dispersion of Box-and-Whisker plots to detect outliers • Outputs two features - • the number of spikes detected for a venue • the normalized amplitude of the highest spike 12
  • 13. Timelines of positive reviews of 3 deceptive venues
  • 14. Review Spike Detection (RSD) Module (Cont.) • Assumption: A review campaign is successful if it increases the rating of the target venue by at least half a star • Claim #1: The minimum number of reviews the adversary needs to post in order to (fraudulently) increase the rating of a venue by half a star is n/7 14
  • 15. Marco System Overview 15 1. Friend & review count 2. Venue “expertise” 3. Venue activities 4. … FRI Module RSD Module Venue timeline ARD Module Review ratings Venue Classifier Deceptive & legitimate venues Features 7,435 venues 195,417 users 270,121 reviews Train Train Label Fraudulent & genuine reviews Review Classifier Users Venues Friend relations Spatial vicinity Reviews/Time
  • 16. Aggregate Rating Disparity (ARD) Module • Measures the divergence of reviews involved in a campaign from the average rating of the venue • Calculates the aggregated divergence of that venue • Outputs one feature - the ARD score 16
  • 17. Evolution in time of the average rating of the venue “Azure Nail & Waxing Studio” of Chicago, IL, compared against the ratings assigned by its reviews.
  • 18. Marco System Overview 18 1. Friend & review count 2. Venue “expertise” 3. Venue activities 4. … FRI Module RSD Module Venue timeline Review ratings Venue Classifier Deceptive & legitimate venues Features 7,435 venues 195,417 users 270,121 reviews Train Train Label Fraudulent & genuine reviews Review Classifier Users Venues Friend relations Spatial vicinity Reviews/Time ARD Module
  • 19. Fraudulent Review Impact (FRI) Module • Challenges: • Venues that receive few genuine reviews are particularly vulnerable to review campaigns • Long term review campaigns can re-define the “normal” review posting behavior, flatten spikes and escape detection by the RSD module • Our Solution: detect such behaviors through fraudulent reviews that significantly impact the aggregate rating of venues • Uses features extracted from each review, its reviewer and the relation (user expertise) between the reviewer and the target venue 19
  • 20. Fraudulent Review Impact (FRI) Module • Uses following features: • The number of friends • The number of reviews written • The expertise of U around V • The number of check-ins of U at V • The number of photos of U at V • The feedback count of R • Age of U’s account when R was posted • Contributes two features • The FRI score • The percentage of reviews classified as fraudulent 20
  • 21. Marco System Overview 21 1. Friend & review count 2. Venue “expertise” 3. Venue activities 4. … FRI Module RSD Module Venue timeline Review ratings Venue Classifier Deceptive & legitimate venues Features 7,435 venues 195,417 users 270,121 reviews Train Train Label Fraudulent & genuine reviews Review Classifier Users Venues Friend relations Spatial vicinity Reviews/Time ARD Module
  • 22. Venue Classification Features • Uses following features: • The number of review spikes for V • The amplitude of the highest spike • Aggregate rating disparity • The fraudulent review impact of V • Count of reviews classified fraudulent • The rating of V • The number of reviews of V • The number of reviews with check-ins • The number of reviews with photos • The age of V 22
  • 23. Review Classification • The overall accuracy of RF, Bagging and DT is 94%, 93.5% and 93% respectively. • Top 2 most important features are: 1) number of reviews written by U and 2) The expertise of U around V 23
  • 24. Venue Classification RF and DT are tied for best accuracy, of 95.8%. 24
  • 25. Comparison with state-of-the-art • Compared Marco with the three deceptive venue detection strategies of Feng et al. [1], avg∆, distΦ and peak↑ Strategy Accuracy (%) Marco/RF 95.8 avg∆ 66.3 distΦ 72.1 peak↑ 58.9
  • 26. Marco’s Overhead Per-module overhead Zoom-in of FRI module overhead
  • 27. Experimental Results on Live Data City Car Shop Mover Spa Miami, FL 1000 (6) 348 (8) 1000 (21) San Frans., CA 612 (59) 475 (45) 1000 (42) NYC, NY 1000 (8) 1000 (27) 1000 (28) Detected deceptive venues by Marco out of collected venues in Yelp 27 Marco flags almost 10% of its car repair and moving companies as suspicious
  • 28. Our Contributions 28 • Introduce a lower bound on the number of reviews required to launch a review campaign • Presented Marco to flag suspicious reviews and venues • Contributed a novel dataset of reviews and venues • Demonstrated that Marco is effective and fast
  • 29. Future works 29 • Deceptive opinion spam detection (feature-based sentiment analysis) • Identify plagiarized reviews and outlier reviews

Editor's Notes

  1. The goal of this thesis will be to significantly extend the state-of-the-art approaches to (provide)……
  2. In review cen…, users post reviews and checks in at venues, see other user’s reviews/comments, make friendship. They collect all these information along with location updates from users. People rely on online reviews to make decisions on purchases, services and others.
  3. Review centered social networks have been shown to receive significant numbers of fraudulent reviews. For instance, … malicious behaviors may also be professionally organized. For business owners, profit seems to be the main incentive to engage in deceptive activities.
  4. We present Marco, a system for automatic detection of fraudulent reviews, deceptive venues and impactful review campaigns. We introduced a lower bound on the number of reviews required to launch a review campaign that impacts a target venue’s rating.
  5. Before jumping into the system model, let’s discuss the adversary model at first. We assume that … and does not collude with attackers to facilitate false data reports. However … A malicious user can attempt to learn and modify fitness information reported by the trackers of other users or fraudulently claim ownership of videos that they did not create. They can even post fraudulent reviews or be involved in a deceptive review campaign. attempting to perform the above attacks to access more information
  6. DeathByCaptcha to collect CAPTCHA protected reviews filtered by Yelp. We collected 100 venues randomly selected from 10 major US cities (e.g., NY, San Francisco, LA, Chicago, Miami). Second, we used YCrawl to collect basic account information of the 10,031 Yelp users who reviewed those venues. Third, we randomly selected a subset of 16,199 venues from all the venues reviewed by those users.
  7. Consists of 3 modules which we will describe in the following slides. Each module produces several features that being fed into a venue classifier, trained on the datasets. ---------------------------------------------------------------------------------------------------------------------------------------- Not included (RSD: relies on temporal, inter-review relations to identify venues receiving suspiciously high numbers of positive or negative reviews. ARD: uses relations between review ratings and the aggregate rating of their venue, at the time of their posting, to identify venues that exhibit a “bipolar” review FRI: first classifies reviews as fraudulent or genuine based on their social, spatial and temporal features. It then identifies venues whose aggregate rating is significantly impacted by reviews classified as fraudulent.)
  8. Identify venues receiving suspiciously high numbers of positive or negative reviews.
  9. For instance, the “South Bay BMW” venue has a UOF of 9 for positive reviews: any day with more than 9 positive reviews is considered to be a spike.
  10. n: the number of genuine reviews V has at the completion of the campaign
  11. Consists of 3 modules which we will describe in the following slides. Each module produces several features that being fed into a venue classifier, trained on the datasets. ---------------------------------------------------------------------------------------------------------------------------------------- Not included (RSD: relies on temporal, inter-review relations to identify venues receiving suspiciously high numbers of positive or negative reviews. ARD: uses relations between review ratings and the aggregate rating of their venue, at the time of their posting, to identify venues that exhibit a “bipolar” review FRI: first classifies reviews as fraudulent or genuine based on their social, spatial and temporal features. It then identifies venues whose aggregate rating is significantly impacted by reviews classified as fraudulent.)
  12. A venue that is the target of a review campaign is likely to receive reviews that do not agree with its genuine reviews. Also following a successful review campaign, the venue is likely to receive reviews from genuine users that do not agree with the venue’s newly engineered rating.
  13. Consists of 3 modules which we will describe in the following slides. Each module produces several features that being fed into a venue classifier, trained on the datasets. ---------------------------------------------------------------------------------------------------------------------------------------- Not included (RSD: relies on temporal, inter-review relations to identify venues receiving suspiciously high numbers of positive or negative reviews. ARD: uses relations between review ratings and the aggregate rating of their venue, at the time of their posting, to identify venues that exhibit a “bipolar” review FRI: first classifies reviews as fraudulent or genuine based on their social, spatial and temporal features. It then identifies venues whose aggregate rating is significantly impacted by reviews classified as fraudulent.)
  14. First classifies reviews as fraudulent or genuine based on their social, spatial and temporal features. It then identifies venues whose aggregate rating is significantly impacted by reviews classified as fraudulent.
  15. First classifies reviews as fraudulent or genuine based on their social, spatial and temporal features. It then identifies venues whose aggregate rating is significantly impacted by reviews classified as fraudulent.
  16. Consists of 3 modules which we will describe in the following slides. Each module produces several features that being fed into a venue classifier, trained on the datasets. ---------------------------------------------------------------------------------------------------------------------------------------- Not included (RSD: relies on temporal, inter-review relations to identify venues receiving suspiciously high numbers of positive or negative reviews. ARD: uses relations between review ratings and the aggregate rating of their venue, at the time of their posting, to identify venues that exhibit a “bipolar” review FRI: first classifies reviews as fraudulent or genuine based on their social, spatial and temporal features. It then identifies venues whose aggregate rating is significantly impacted by reviews classified as fraudulent.)
  17. First classifies reviews as fraudulent or genuine based on their social, spatial and temporal features. It then identifies venues whose aggregate rating is significantly impacted by reviews classified as fraudulent.
  18. The first one is the ROC plot for review classification. RF performs best, at 94% accuracy. The second one is the ROC curve for venue classification. RF and DT are tied for best accuracy, of 95.8%.
  19. The first one is the ROC plot for review classification. RF performs best, at 94% accuracy. The second one is the ROC curve for venue classification. RF and DT are tied for best accuracy, of 95.8%.
  20. Yelp limits the results for a search to the first 1000 matching venues. Entries with values less than 1000 correspond to cities with fewer than 1000 venues of the corresponding type.
  21. Yelp limits the results for a search to the first 1000 matching venues. Entries with values less than 1000 correspond to cities with fewer than 1000 venues of the corresponding type.
  22. Yelp limits the results for a search to the first 1000 matching venues. Entries with values less than 1000 correspond to cities with fewer than 1000 venues of the corresponding type.