SlideShare a Scribd company logo
Measuring the Quality
of Online Service
Jin Young Kim
Senior Applied Scientist
Microsoft Web Search and AI
About Jin Young Kim
• Data Scientist at Microsoft
• Quantified Self Enthusiast
(10 years of happiness tracking)
• Author of ‘Hello, Data Science’
(#1 Bestseller in Korea)
Data is the ingredients for all these issues
• Data for training and evaluating ML models
• Data for discovering the defect and issues
• Data for monitoring the health of existing service
• Data for measuring the value of new service
Issues in Online Service Development
• Planning
• How to set business objective & plan?
• Implementation
• How to train and improve ML models?
• Evaluation
• How much are users satisfied with the service?
Plan
ExecuteEvaluate
How can we collect data for these purposes?
Case Study: Data Collection for Restaurants
• Customer Behavior
• Facial expression
• Quantity of leftovers
• Pace of dining
Only limited type of data is
available, possibly with lots
of noise
Case Study: Data Collection for Restaurants
• Panel Survey
• Satisfaction for Food
• Satisfaction for Service
• Satisfaction for Environment
Survey can provide
insights into customer
satisfaction, but with
some caveat
Data Collection for Online Service
• User Behavior
• Various ‘signals’ from behavioral data
• Limited type of data is available, with lots of noise
• Needs substantial user base required
• Panel Survey
• Hire a group of panels, or use crowdsourcing
• Collect feedback for all aspects of service quality
• Cost of hiring and maintaining panel
Data Collection for Online Service (2)
• Direct User Feedback
• Request real-time feedback from customers
• Typically low response rate, with potential nuisance
• Widely used for personalized services (i.e., recommendation)
Panel Survey User Behavior
User Feedback
How does major online service companies
collect data for measurement?
Search Engine: Google / Bing
• Early stage: panel-based survey
• Late stage: user behavior-based experiments
• Source: Google
How to evaluate the quality of this SERP?
Social Network: Facebook
• Before: use only user behavior
• Nowadays: user behavior + panel survey + user feedback
• Source: Slate / Quora
We could expose contents users are
actually satisfied instead of click-baits by
using panel survey and user feedback in
addition to signals from user behavior
- Julie Zhuo, Product Design VP at
Facebook
User feedback for Facebook News Feed
Recommendation System: Netflix
• Combine user feedback and behavior for measurement
• Source: Netflix
Movie Recommendations from Netflix
Algorithm
A
Algorithm
B
Can you tell if algorithm A vs. B is better?
Even the users
themselves
can’t!!!
Movie Recommendations from Netflix (2)
Results below are more relevant, but users engage more with the above
So, how should I collect data for my service?
• What signals can we extract out of user behavior?
• Are there incentives for users to provide feedback?
Service
Characteristics
• Do you already have substantial volume of active users?
• Can a panel evaluate user experience as a substitute?
Feasibility of
Collection
• Do you have marketing budget for building a user base,
or for a panel survey?
Cost of
Collection
How to evaluate the quality of this SERP?
Evaluation based on
user behavior
• Which result did users click?
• Is click the only measure of satisfaction?
• How long did a user stay on a result?
• Is longer dwell-time already better?
• Do users perform search repeatedly?
• Does loyalty mean satisfaction?
User behavior is an important clue, but a noisy one.
How can you design a panel survey for SERP
evaluation?
How would you evaluate
the search results for
query ‘crowdsourcing’?
Bad
Good
Excellent
Perfect
Q: Who do you think so?
Alternative: Evaluating a Webpage
How would you evaluate
the search results for
query ‘crowdsourcing’?
Bad
Good
Excellent
Perfect
Q: Who do you think so?
Alternative: Side-by-Side SERP Evaluation
Q: How would you
compare two results?
Left much better
Left slightly better
About the same
Right slightly better
Right much better
Q: Why do you think so?
Conclusions
Summary…
• As a first step in data science, plan on collecting high-quality data
• Combine various data collection methods depending on the
characteristics and lifecycle of service
• It takes a lot of consideration to get the panel survey done right
For more information…
• What you need to know about data even if you’re not a Data Scientist
• SIGIR’2015 Tutorial on Offline Search Evaluation
• Offline Evaluation for Information Retrieval
Foundation and Trend in IR Journal (To Appear)

More Related Content

What's hot

How Machine Learning Can Transform The Customer Experience
How Machine Learning Can Transform The Customer ExperienceHow Machine Learning Can Transform The Customer Experience
How Machine Learning Can Transform The Customer Experience
Product School
 
Mobile research best practices
Mobile research best practicesMobile research best practices
Mobile research best practices
Apptentive
 
Lean User Testing Intro
Lean User Testing IntroLean User Testing Intro
Lean User Testing Intro
Jessica DuVerneay
 
User testing on a diet
User testing on a dietUser testing on a diet
User testing on a diet
Paul Veugen
 
UX STRAT Online 2020: Dr. Martin Tingley, Netflix
UX STRAT Online 2020: Dr. Martin Tingley, NetflixUX STRAT Online 2020: Dr. Martin Tingley, Netflix
UX STRAT Online 2020: Dr. Martin Tingley, Netflix
UX STRAT
 
Maxdiff webinar_10_19_10
 Maxdiff webinar_10_19_10 Maxdiff webinar_10_19_10
Maxdiff webinar_10_19_10
QuestionPro
 
Analytics Academy 2015 Presentation Slides
Analytics Academy 2015 Presentation SlidesAnalytics Academy 2015 Presentation Slides
Analytics Academy 2015 Presentation Slides
HarvardComms
 
User testing methodology
User testing methodologyUser testing methodology
User testing methodology
James Hatfield
 
Max diff scaling for research access(4)
Max diff scaling for research access(4)Max diff scaling for research access(4)
Max diff scaling for research access(4)QuestionPro
 
User research for Product Managers - Product Tank London Jan 17
User research for Product Managers - Product Tank London Jan 17User research for Product Managers - Product Tank London Jan 17
User research for Product Managers - Product Tank London Jan 17
Morag McLaren
 
Basics of AB testing in online products
Basics of AB testing in online productsBasics of AB testing in online products
Basics of AB testing in online products
Ashish Dua
 
Everything You Always Wanted to Know About Bad UX Research But Were Afraid to...
Everything You Always Wanted to Know About Bad UX Research But Were Afraid to...Everything You Always Wanted to Know About Bad UX Research But Were Afraid to...
Everything You Always Wanted to Know About Bad UX Research But Were Afraid to...
Valsplat
 
Understanding Online Audiences Bazley Ma Wonder Web 10 Jun09
Understanding Online Audiences Bazley Ma Wonder Web 10 Jun09Understanding Online Audiences Bazley Ma Wonder Web 10 Jun09
Understanding Online Audiences Bazley Ma Wonder Web 10 Jun09
Martin Bazley
 
Younus poonawala Web Application Testing
Younus poonawala   Web Application TestingYounus poonawala   Web Application Testing
Younus poonawala Web Application Testing
Younus Poonawala
 
Successfully Managing Customer Experience Combining VoC and UX Testing
Successfully Managing Customer Experience Combining VoC and UX TestingSuccessfully Managing Customer Experience Combining VoC and UX Testing
Successfully Managing Customer Experience Combining VoC and UX Testing
UserZoom
 
Intro to User Journey Maps for Building Better Websites - Cornell Drupal Camp...
Intro to User Journey Maps for Building Better Websites - Cornell Drupal Camp...Intro to User Journey Maps for Building Better Websites - Cornell Drupal Camp...
Intro to User Journey Maps for Building Better Websites - Cornell Drupal Camp...
Anthony D. Paul
 
You can't manage what you can't measure - User Vision Breakfast Briefing
You can't manage what you can't measure - User Vision Breakfast BriefingYou can't manage what you can't measure - User Vision Breakfast Briefing
You can't manage what you can't measure - User Vision Breakfast Briefing
User Vision
 
Crash Course on Startup Analytics
Crash Course on Startup AnalyticsCrash Course on Startup Analytics
Crash Course on Startup Analytics
Growth Hacking Asia
 
Driving Business Goals with Recommender Systems @ YAC/m 2015
Driving Business Goals with Recommender Systems @ YAC/m 2015Driving Business Goals with Recommender Systems @ YAC/m 2015
Driving Business Goals with Recommender Systems @ YAC/m 2015
Konstantin Savenkov
 
Machine learning applications nurturing growth of various business domains
Machine learning applications nurturing growth of various business domainsMachine learning applications nurturing growth of various business domains
Machine learning applications nurturing growth of various business domains
Shrutika Oswal
 

What's hot (20)

How Machine Learning Can Transform The Customer Experience
How Machine Learning Can Transform The Customer ExperienceHow Machine Learning Can Transform The Customer Experience
How Machine Learning Can Transform The Customer Experience
 
Mobile research best practices
Mobile research best practicesMobile research best practices
Mobile research best practices
 
Lean User Testing Intro
Lean User Testing IntroLean User Testing Intro
Lean User Testing Intro
 
User testing on a diet
User testing on a dietUser testing on a diet
User testing on a diet
 
UX STRAT Online 2020: Dr. Martin Tingley, Netflix
UX STRAT Online 2020: Dr. Martin Tingley, NetflixUX STRAT Online 2020: Dr. Martin Tingley, Netflix
UX STRAT Online 2020: Dr. Martin Tingley, Netflix
 
Maxdiff webinar_10_19_10
 Maxdiff webinar_10_19_10 Maxdiff webinar_10_19_10
Maxdiff webinar_10_19_10
 
Analytics Academy 2015 Presentation Slides
Analytics Academy 2015 Presentation SlidesAnalytics Academy 2015 Presentation Slides
Analytics Academy 2015 Presentation Slides
 
User testing methodology
User testing methodologyUser testing methodology
User testing methodology
 
Max diff scaling for research access(4)
Max diff scaling for research access(4)Max diff scaling for research access(4)
Max diff scaling for research access(4)
 
User research for Product Managers - Product Tank London Jan 17
User research for Product Managers - Product Tank London Jan 17User research for Product Managers - Product Tank London Jan 17
User research for Product Managers - Product Tank London Jan 17
 
Basics of AB testing in online products
Basics of AB testing in online productsBasics of AB testing in online products
Basics of AB testing in online products
 
Everything You Always Wanted to Know About Bad UX Research But Were Afraid to...
Everything You Always Wanted to Know About Bad UX Research But Were Afraid to...Everything You Always Wanted to Know About Bad UX Research But Were Afraid to...
Everything You Always Wanted to Know About Bad UX Research But Were Afraid to...
 
Understanding Online Audiences Bazley Ma Wonder Web 10 Jun09
Understanding Online Audiences Bazley Ma Wonder Web 10 Jun09Understanding Online Audiences Bazley Ma Wonder Web 10 Jun09
Understanding Online Audiences Bazley Ma Wonder Web 10 Jun09
 
Younus poonawala Web Application Testing
Younus poonawala   Web Application TestingYounus poonawala   Web Application Testing
Younus poonawala Web Application Testing
 
Successfully Managing Customer Experience Combining VoC and UX Testing
Successfully Managing Customer Experience Combining VoC and UX TestingSuccessfully Managing Customer Experience Combining VoC and UX Testing
Successfully Managing Customer Experience Combining VoC and UX Testing
 
Intro to User Journey Maps for Building Better Websites - Cornell Drupal Camp...
Intro to User Journey Maps for Building Better Websites - Cornell Drupal Camp...Intro to User Journey Maps for Building Better Websites - Cornell Drupal Camp...
Intro to User Journey Maps for Building Better Websites - Cornell Drupal Camp...
 
You can't manage what you can't measure - User Vision Breakfast Briefing
You can't manage what you can't measure - User Vision Breakfast BriefingYou can't manage what you can't measure - User Vision Breakfast Briefing
You can't manage what you can't measure - User Vision Breakfast Briefing
 
Crash Course on Startup Analytics
Crash Course on Startup AnalyticsCrash Course on Startup Analytics
Crash Course on Startup Analytics
 
Driving Business Goals with Recommender Systems @ YAC/m 2015
Driving Business Goals with Recommender Systems @ YAC/m 2015Driving Business Goals with Recommender Systems @ YAC/m 2015
Driving Business Goals with Recommender Systems @ YAC/m 2015
 
Machine learning applications nurturing growth of various business domains
Machine learning applications nurturing growth of various business domainsMachine learning applications nurturing growth of various business domains
Machine learning applications nurturing growth of various business domains
 

Viewers also liked

Subtleties in Tracking Happiness -- Seattle QS#10
Subtleties in Tracking Happiness -- Seattle QS#10Subtleties in Tracking Happiness -- Seattle QS#10
Subtleties in Tracking Happiness -- Seattle QS#10
Jin Young Kim
 
Social Entrepreneur meets Technology by 황진솔 대표
Social Entrepreneur meets Technology by 황진솔 대표Social Entrepreneur meets Technology by 황진솔 대표
Social Entrepreneur meets Technology by 황진솔 대표
Jin Young Kim
 
150613 당신이 아마도 몰랐을 빅데이터 이야기 (YEF 공유)
150613 당신이 아마도 몰랐을 빅데이터 이야기 (YEF 공유)150613 당신이 아마도 몰랐을 빅데이터 이야기 (YEF 공유)
150613 당신이 아마도 몰랐을 빅데이터 이야기 (YEF 공유)
Jade Jongdae Lee
 
헬로 데이터 과학: 삶과 업무를 개선하는 데이터 과학 이야기 (스타트업 얼라이언스 강연)
헬로 데이터 과학: 삶과 업무를 개선하는 데이터 과학 이야기 (스타트업 얼라이언스 강연)헬로 데이터 과학: 삶과 업무를 개선하는 데이터 과학 이야기 (스타트업 얼라이언스 강연)
헬로 데이터 과학: 삶과 업무를 개선하는 데이터 과학 이야기 (스타트업 얼라이언스 강연)
Jin Young Kim
 
반상식적이고 주관적인 (CS) 유학 이야기
반상식적이고 주관적인 (CS) 유학 이야기반상식적이고 주관적인 (CS) 유학 이야기
반상식적이고 주관적인 (CS) 유학 이야기
Jin Young Kim
 
빅 데이터 개요 및 활용
빅 데이터 개요 및 활용빅 데이터 개요 및 활용
빅 데이터 개요 및 활용
Jin wook
 
빅데이터의 개념과 이해 그리고 활용사례 (Introduction to big data and use cases)
빅데이터의 개념과 이해 그리고 활용사례 (Introduction to big data and use cases)빅데이터의 개념과 이해 그리고 활용사례 (Introduction to big data and use cases)
빅데이터의 개념과 이해 그리고 활용사례 (Introduction to big data and use cases)
Wonjin Lee
 
빅데이터의 활용
빅데이터의 활용빅데이터의 활용
빅데이터의 활용
수보 김
 
빅데이터의 이해
빅데이터의 이해빅데이터의 이해
빅데이터의 이해
수보 김
 

Viewers also liked (10)

Subtleties in Tracking Happiness -- Seattle QS#10
Subtleties in Tracking Happiness -- Seattle QS#10Subtleties in Tracking Happiness -- Seattle QS#10
Subtleties in Tracking Happiness -- Seattle QS#10
 
Social Entrepreneur meets Technology by 황진솔 대표
Social Entrepreneur meets Technology by 황진솔 대표Social Entrepreneur meets Technology by 황진솔 대표
Social Entrepreneur meets Technology by 황진솔 대표
 
150613 당신이 아마도 몰랐을 빅데이터 이야기 (YEF 공유)
150613 당신이 아마도 몰랐을 빅데이터 이야기 (YEF 공유)150613 당신이 아마도 몰랐을 빅데이터 이야기 (YEF 공유)
150613 당신이 아마도 몰랐을 빅데이터 이야기 (YEF 공유)
 
헬로 데이터 과학: 삶과 업무를 개선하는 데이터 과학 이야기 (스타트업 얼라이언스 강연)
헬로 데이터 과학: 삶과 업무를 개선하는 데이터 과학 이야기 (스타트업 얼라이언스 강연)헬로 데이터 과학: 삶과 업무를 개선하는 데이터 과학 이야기 (스타트업 얼라이언스 강연)
헬로 데이터 과학: 삶과 업무를 개선하는 데이터 과학 이야기 (스타트업 얼라이언스 강연)
 
반상식적이고 주관적인 (CS) 유학 이야기
반상식적이고 주관적인 (CS) 유학 이야기반상식적이고 주관적인 (CS) 유학 이야기
반상식적이고 주관적인 (CS) 유학 이야기
 
빅 데이터 개요 및 활용
빅 데이터 개요 및 활용빅 데이터 개요 및 활용
빅 데이터 개요 및 활용
 
빅데이터의 개념과 이해 그리고 활용사례 (Introduction to big data and use cases)
빅데이터의 개념과 이해 그리고 활용사례 (Introduction to big data and use cases)빅데이터의 개념과 이해 그리고 활용사례 (Introduction to big data and use cases)
빅데이터의 개념과 이해 그리고 활용사례 (Introduction to big data and use cases)
 
빅데이터의 활용
빅데이터의 활용빅데이터의 활용
빅데이터의 활용
 
빅데이터의 이해
빅데이터의 이해빅데이터의 이해
빅데이터의 이해
 
Big data ppt
Big  data pptBig  data ppt
Big data ppt
 

Similar to Measuring the Quality of Online Service - Jinyoung kim

Continuous Improvement
Continuous ImprovementContinuous Improvement
Continuous Improvement
Reading Room
 
SPSCT15 - An Independent Evaluation of Third-Party SharePoint Analytics Offer...
SPSCT15 - An Independent Evaluation of Third-Party SharePoint Analytics Offer...SPSCT15 - An Independent Evaluation of Third-Party SharePoint Analytics Offer...
SPSCT15 - An Independent Evaluation of Third-Party SharePoint Analytics Offer...
Jonathan Ralton
 
UX Webinar: Always Be Testing
UX Webinar: Always Be TestingUX Webinar: Always Be Testing
UX Webinar: Always Be Testing
Charity Dynamics
 
SPSNYC15 - An Independent Evaluation of Third-Party SharePoint Analytics Offe...
SPSNYC15 - An Independent Evaluation of Third-Party SharePoint Analytics Offe...SPSNYC15 - An Independent Evaluation of Third-Party SharePoint Analytics Offe...
SPSNYC15 - An Independent Evaluation of Third-Party SharePoint Analytics Offe...
Jonathan Ralton
 
An Introduction to the World of User Research
An Introduction to the World of User ResearchAn Introduction to the World of User Research
An Introduction to the World of User Research
Methods
 
Mktg350 lecture 10212013
Mktg350 lecture 10212013Mktg350 lecture 10212013
Mktg350 lecture 10212013lkirkman
 
UCO16 - An Independent Evaluation of Third-Party SharePoint Analytics Offerings
UCO16 - An Independent Evaluation of Third-Party SharePoint Analytics OfferingsUCO16 - An Independent Evaluation of Third-Party SharePoint Analytics Offerings
UCO16 - An Independent Evaluation of Third-Party SharePoint Analytics Offerings
Jonathan Ralton
 
Share and Tell Stanford 2016
Share and Tell Stanford 2016Share and Tell Stanford 2016
Share and Tell Stanford 2016
Stanford University
 
PQF Overview
PQF OverviewPQF Overview
PQF Overview
Martin Hutchings
 
Using online tools to help us assess our public legal education work
Using online tools to help us assess our public legal education work Using online tools to help us assess our public legal education work
Using online tools to help us assess our public legal education work
Community Legal Education Ontario (CLEO)
 
Why your analytics land with a thud
Why your analytics land with a thudWhy your analytics land with a thud
Why your analytics land with a thud
One North
 
Google Analytics 101
Google Analytics 101Google Analytics 101
Google Analytics 101
Francesca Lorenzoni
 
PAS: The Planning Quality Framework
PAS: The Planning Quality FrameworkPAS: The Planning Quality Framework
PAS: The Planning Quality Framework
PAS_Team
 
Digital Marketing Course Week 4: Digital Analytics
Digital Marketing Course Week 4: Digital AnalyticsDigital Marketing Course Week 4: Digital Analytics
Digital Marketing Course Week 4: Digital Analytics
Ayca Turhan
 
Website Analytics and Measurement
Website Analytics and MeasurementWebsite Analytics and Measurement
Website Analytics and MeasurementAdam Lee
 
Analytics Best Practice for the Travel Industry
Analytics Best Practice for the Travel IndustryAnalytics Best Practice for the Travel Industry
Analytics Best Practice for the Travel IndustryAdam Lee
 
Life in a Not Provided World at SMX West 2012 by Micah Fisher-Kirshner
Life in a Not Provided World at SMX West 2012 by Micah Fisher-KirshnerLife in a Not Provided World at SMX West 2012 by Micah Fisher-Kirshner
Life in a Not Provided World at SMX West 2012 by Micah Fisher-Kirshner
Micah Fisher-Kirshner
 
Design Recommender systems from scratch
Design Recommender systems from scratchDesign Recommender systems from scratch
Design Recommender systems from scratch
Dr. Amit Sachan
 
Data - How to Use it & When by Square and Call Rail Product Leader
Data - How to Use it & When by Square and Call Rail Product LeaderData - How to Use it & When by Square and Call Rail Product Leader
Data - How to Use it & When by Square and Call Rail Product Leader
Product School
 
Marketing in a social age travel updated (feb)
Marketing in a social age   travel updated (feb)Marketing in a social age   travel updated (feb)
Marketing in a social age travel updated (feb)Luke Hay
 

Similar to Measuring the Quality of Online Service - Jinyoung kim (20)

Continuous Improvement
Continuous ImprovementContinuous Improvement
Continuous Improvement
 
SPSCT15 - An Independent Evaluation of Third-Party SharePoint Analytics Offer...
SPSCT15 - An Independent Evaluation of Third-Party SharePoint Analytics Offer...SPSCT15 - An Independent Evaluation of Third-Party SharePoint Analytics Offer...
SPSCT15 - An Independent Evaluation of Third-Party SharePoint Analytics Offer...
 
UX Webinar: Always Be Testing
UX Webinar: Always Be TestingUX Webinar: Always Be Testing
UX Webinar: Always Be Testing
 
SPSNYC15 - An Independent Evaluation of Third-Party SharePoint Analytics Offe...
SPSNYC15 - An Independent Evaluation of Third-Party SharePoint Analytics Offe...SPSNYC15 - An Independent Evaluation of Third-Party SharePoint Analytics Offe...
SPSNYC15 - An Independent Evaluation of Third-Party SharePoint Analytics Offe...
 
An Introduction to the World of User Research
An Introduction to the World of User ResearchAn Introduction to the World of User Research
An Introduction to the World of User Research
 
Mktg350 lecture 10212013
Mktg350 lecture 10212013Mktg350 lecture 10212013
Mktg350 lecture 10212013
 
UCO16 - An Independent Evaluation of Third-Party SharePoint Analytics Offerings
UCO16 - An Independent Evaluation of Third-Party SharePoint Analytics OfferingsUCO16 - An Independent Evaluation of Third-Party SharePoint Analytics Offerings
UCO16 - An Independent Evaluation of Third-Party SharePoint Analytics Offerings
 
Share and Tell Stanford 2016
Share and Tell Stanford 2016Share and Tell Stanford 2016
Share and Tell Stanford 2016
 
PQF Overview
PQF OverviewPQF Overview
PQF Overview
 
Using online tools to help us assess our public legal education work
Using online tools to help us assess our public legal education work Using online tools to help us assess our public legal education work
Using online tools to help us assess our public legal education work
 
Why your analytics land with a thud
Why your analytics land with a thudWhy your analytics land with a thud
Why your analytics land with a thud
 
Google Analytics 101
Google Analytics 101Google Analytics 101
Google Analytics 101
 
PAS: The Planning Quality Framework
PAS: The Planning Quality FrameworkPAS: The Planning Quality Framework
PAS: The Planning Quality Framework
 
Digital Marketing Course Week 4: Digital Analytics
Digital Marketing Course Week 4: Digital AnalyticsDigital Marketing Course Week 4: Digital Analytics
Digital Marketing Course Week 4: Digital Analytics
 
Website Analytics and Measurement
Website Analytics and MeasurementWebsite Analytics and Measurement
Website Analytics and Measurement
 
Analytics Best Practice for the Travel Industry
Analytics Best Practice for the Travel IndustryAnalytics Best Practice for the Travel Industry
Analytics Best Practice for the Travel Industry
 
Life in a Not Provided World at SMX West 2012 by Micah Fisher-Kirshner
Life in a Not Provided World at SMX West 2012 by Micah Fisher-KirshnerLife in a Not Provided World at SMX West 2012 by Micah Fisher-Kirshner
Life in a Not Provided World at SMX West 2012 by Micah Fisher-Kirshner
 
Design Recommender systems from scratch
Design Recommender systems from scratchDesign Recommender systems from scratch
Design Recommender systems from scratch
 
Data - How to Use it & When by Square and Call Rail Product Leader
Data - How to Use it & When by Square and Call Rail Product LeaderData - How to Use it & When by Square and Call Rail Product Leader
Data - How to Use it & When by Square and Call Rail Product Leader
 
Marketing in a social age travel updated (feb)
Marketing in a social age   travel updated (feb)Marketing in a social age   travel updated (feb)
Marketing in a social age travel updated (feb)
 

Recently uploaded

一比一原版(UVic毕业证)维多利亚大学毕业证成绩单
一比一原版(UVic毕业证)维多利亚大学毕业证成绩单一比一原版(UVic毕业证)维多利亚大学毕业证成绩单
一比一原版(UVic毕业证)维多利亚大学毕业证成绩单
ukgaet
 
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Subhajit Sahu
 
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
ewymefz
 
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
slg6lamcq
 
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
ahzuo
 
Sample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdf
Sample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdfSample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdf
Sample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdf
Linda486226
 
社内勉強会資料_LLM Agents                              .
社内勉強会資料_LLM Agents                              .社内勉強会資料_LLM Agents                              .
社内勉強会資料_LLM Agents                              .
NABLAS株式会社
 
Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...
Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...
Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...
John Andrews
 
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
vcaxypu
 
Q1’2024 Update: MYCI’s Leap Year Rebound
Q1’2024 Update: MYCI’s Leap Year ReboundQ1’2024 Update: MYCI’s Leap Year Rebound
Q1’2024 Update: MYCI’s Leap Year Rebound
Oppotus
 
一比一原版(NYU毕业证)纽约大学毕业证成绩单
一比一原版(NYU毕业证)纽约大学毕业证成绩单一比一原版(NYU毕业证)纽约大学毕业证成绩单
一比一原版(NYU毕业证)纽约大学毕业证成绩单
ewymefz
 
一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单
enxupq
 
Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...
Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...
Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...
pchutichetpong
 
Machine learning and optimization techniques for electrical drives.pptx
Machine learning and optimization techniques for electrical drives.pptxMachine learning and optimization techniques for electrical drives.pptx
Machine learning and optimization techniques for electrical drives.pptx
balafet
 
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
NABLAS株式会社
 
Adjusting primitives for graph : SHORT REPORT / NOTES
Adjusting primitives for graph : SHORT REPORT / NOTESAdjusting primitives for graph : SHORT REPORT / NOTES
Adjusting primitives for graph : SHORT REPORT / NOTES
Subhajit Sahu
 
The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...
jerlynmaetalle
 
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
axoqas
 
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
axoqas
 
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
u86oixdj
 

Recently uploaded (20)

一比一原版(UVic毕业证)维多利亚大学毕业证成绩单
一比一原版(UVic毕业证)维多利亚大学毕业证成绩单一比一原版(UVic毕业证)维多利亚大学毕业证成绩单
一比一原版(UVic毕业证)维多利亚大学毕业证成绩单
 
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
 
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
 
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
 
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
 
Sample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdf
Sample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdfSample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdf
Sample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdf
 
社内勉強会資料_LLM Agents                              .
社内勉強会資料_LLM Agents                              .社内勉強会資料_LLM Agents                              .
社内勉強会資料_LLM Agents                              .
 
Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...
Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...
Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...
 
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
 
Q1’2024 Update: MYCI’s Leap Year Rebound
Q1’2024 Update: MYCI’s Leap Year ReboundQ1’2024 Update: MYCI’s Leap Year Rebound
Q1’2024 Update: MYCI’s Leap Year Rebound
 
一比一原版(NYU毕业证)纽约大学毕业证成绩单
一比一原版(NYU毕业证)纽约大学毕业证成绩单一比一原版(NYU毕业证)纽约大学毕业证成绩单
一比一原版(NYU毕业证)纽约大学毕业证成绩单
 
一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单
 
Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...
Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...
Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...
 
Machine learning and optimization techniques for electrical drives.pptx
Machine learning and optimization techniques for electrical drives.pptxMachine learning and optimization techniques for electrical drives.pptx
Machine learning and optimization techniques for electrical drives.pptx
 
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
 
Adjusting primitives for graph : SHORT REPORT / NOTES
Adjusting primitives for graph : SHORT REPORT / NOTESAdjusting primitives for graph : SHORT REPORT / NOTES
Adjusting primitives for graph : SHORT REPORT / NOTES
 
The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...
 
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
 
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
 
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
 

Measuring the Quality of Online Service - Jinyoung kim

  • 1. Measuring the Quality of Online Service Jin Young Kim Senior Applied Scientist Microsoft Web Search and AI
  • 2. About Jin Young Kim • Data Scientist at Microsoft • Quantified Self Enthusiast (10 years of happiness tracking) • Author of ‘Hello, Data Science’ (#1 Bestseller in Korea)
  • 3. Data is the ingredients for all these issues • Data for training and evaluating ML models • Data for discovering the defect and issues • Data for monitoring the health of existing service • Data for measuring the value of new service
  • 4. Issues in Online Service Development • Planning • How to set business objective & plan? • Implementation • How to train and improve ML models? • Evaluation • How much are users satisfied with the service? Plan ExecuteEvaluate
  • 5. How can we collect data for these purposes?
  • 6. Case Study: Data Collection for Restaurants • Customer Behavior • Facial expression • Quantity of leftovers • Pace of dining Only limited type of data is available, possibly with lots of noise
  • 7. Case Study: Data Collection for Restaurants • Panel Survey • Satisfaction for Food • Satisfaction for Service • Satisfaction for Environment Survey can provide insights into customer satisfaction, but with some caveat
  • 8. Data Collection for Online Service • User Behavior • Various ‘signals’ from behavioral data • Limited type of data is available, with lots of noise • Needs substantial user base required • Panel Survey • Hire a group of panels, or use crowdsourcing • Collect feedback for all aspects of service quality • Cost of hiring and maintaining panel
  • 9. Data Collection for Online Service (2) • Direct User Feedback • Request real-time feedback from customers • Typically low response rate, with potential nuisance • Widely used for personalized services (i.e., recommendation) Panel Survey User Behavior User Feedback
  • 10.
  • 11. How does major online service companies collect data for measurement?
  • 12. Search Engine: Google / Bing • Early stage: panel-based survey • Late stage: user behavior-based experiments • Source: Google
  • 13. How to evaluate the quality of this SERP?
  • 14. Social Network: Facebook • Before: use only user behavior • Nowadays: user behavior + panel survey + user feedback • Source: Slate / Quora We could expose contents users are actually satisfied instead of click-baits by using panel survey and user feedback in addition to signals from user behavior - Julie Zhuo, Product Design VP at Facebook
  • 15. User feedback for Facebook News Feed
  • 16. Recommendation System: Netflix • Combine user feedback and behavior for measurement • Source: Netflix
  • 17. Movie Recommendations from Netflix Algorithm A Algorithm B Can you tell if algorithm A vs. B is better? Even the users themselves can’t!!!
  • 18. Movie Recommendations from Netflix (2) Results below are more relevant, but users engage more with the above
  • 19. So, how should I collect data for my service? • What signals can we extract out of user behavior? • Are there incentives for users to provide feedback? Service Characteristics • Do you already have substantial volume of active users? • Can a panel evaluate user experience as a substitute? Feasibility of Collection • Do you have marketing budget for building a user base, or for a panel survey? Cost of Collection
  • 20. How to evaluate the quality of this SERP?
  • 21. Evaluation based on user behavior • Which result did users click? • Is click the only measure of satisfaction? • How long did a user stay on a result? • Is longer dwell-time already better? • Do users perform search repeatedly? • Does loyalty mean satisfaction? User behavior is an important clue, but a noisy one.
  • 22. How can you design a panel survey for SERP evaluation? How would you evaluate the search results for query ‘crowdsourcing’? Bad Good Excellent Perfect Q: Who do you think so?
  • 23. Alternative: Evaluating a Webpage How would you evaluate the search results for query ‘crowdsourcing’? Bad Good Excellent Perfect Q: Who do you think so?
  • 24. Alternative: Side-by-Side SERP Evaluation Q: How would you compare two results? Left much better Left slightly better About the same Right slightly better Right much better Q: Why do you think so?
  • 26. Summary… • As a first step in data science, plan on collecting high-quality data • Combine various data collection methods depending on the characteristics and lifecycle of service • It takes a lot of consideration to get the panel survey done right
  • 27. For more information… • What you need to know about data even if you’re not a Data Scientist • SIGIR’2015 Tutorial on Offline Search Evaluation • Offline Evaluation for Information Retrieval Foundation and Trend in IR Journal (To Appear)

Editor's Notes

  1. 행사 제목이 ‘우리가 데이터를 쓰는 법'인데요, 저는 오늘 데이터 수집에 초점을 맞추어 볼까 한다. 데이터로 일을 해보신 분들은 공감하겠지만 제대로 된 데이터가 있으면 이를 가공하는 것도, 사용하는 것도 상대적으로 쉽다.
  2. 이들 대부분은 데이터 문제다. 이처럼 다양한 유형의 데이터가 있지만 핵심은 서비스에 대한 고객의 반응을 측정하는 것이다.
  3. 온라인 서비스의 개발 과정은 크게 ~ 로 나눌 수 있다. 각 단계별로 다양한 이슈가 존재한다.
  4. 이해를 돕기 위해 식당을 예로 들어보자. 고객의 행동에서 얻을 수 있는 데이터는 무엇일까?
  5. 부족한 데이터는 패널 서베이를 통해 얻을 수 있다. 패널 서베이는 고객의 의견을 대표하는 패널을 고용하여 그들의 의견을 청취하는 것이다.
  6. 이런 데이터 수집 방법은 온라인 서비스의 개선에도 그대로 적용할 수 있다.
  7. 지금까지 두가지 방법을 알아보았는데, 이를 결합하면 어떨까? 사용자에게 실시간으로 피드백을 받는 것이다. 하지만, 이를 제대로 하지 않으면 낮은 응답률에 오히려 사용자를 성가시게 할 수도 있다.
  8. 이제 이런 데이터 수집 방법을 주요 온라인 서비스 기업에서 어떻게 활용하는지 알아보자.
  9. 우선 필자의 업무 영역인 검색 서비스 사례를 생각해보자. 검색서비스 개선을 위해서는 다양한 실험 기법이 사용되는데 ~ (뒤에 자세히 다룬다)
  10. 페이스북은 서비스 초기에 사용자 로그만 사용했다고 한다. 하지만 최근에는 ~ 페이스북 피드 랭킹에 사용자 로그와 함께 패널 서베이와 사용자 피드백을 추가로 사용함으로써 클릭만을 유도하는 컨텐츠 대신 사용자가 만족하는 컨텐츠를 더 많이 노출시킬 수 있었다.
  11. 넷플릭스에서는 검색 서비스와 추천 서비스의 평가에 각각 다른 데이터를 사용한다고 한다.
  12. 그 이유중 하나는 개인화된 추천 서비스의 결과를 서베이로 평가하기 어렵다는 것이다. 예를 들어 두가지 추천 알고리즘에서 나온 결과를 비교해보자. 이용자 자신도 우열을 판단하기가 쉽지 않다!
  13. No ground for comparison / What if the judge doesn’t understand the intent?
  14. No ground for comparison / What if the judge doesn’t understand the intent?
  15. Should we use ‘about the same’ vs. ‘the same??