This document discusses methods for measuring the quality of online services. It describes how major companies like Google, Facebook, and Netflix collect data through user behavior, panel surveys, and direct user feedback at different stages of their services. Panel surveys can provide insights but have limitations, while user behavior data is abundant but noisy. The document also provides examples of how to design panel surveys and side-by-side evaluations to assess search engine result pages. It concludes that the best approach is to combine various data collection methods depending on the service characteristics and lifecycle.
Fairness in Search & RecSys (NAVER Search Colloquium) - Jin Young Kim
As search and recommendation systems play a growing social role, the fairness of their results has recently become a major concern. This talk covers fairness issues in search and recommendation systems and their possible solutions: the various ways of defining fair search and recommendation results, the resource-allocation and stereotyping problems caused by a lack of fairness, and the remedies available at each stage of system development, drawing on recent research. Finally, practical considerations for building fair systems in the real world are discussed.
SIGIR Tutorial on IR Evaluation: Designing an End-to-End Offline Evaluation P...Jin Young Kim
This tutorial aims to provide attendees with a detailed understanding of an end-to-end evaluation pipeline based on human judgments (offline measurement). The tutorial will give an overview of the state-of-the-art methods, techniques, and metrics necessary for each stage of the evaluation process. We will mostly focus on evaluating an information retrieval (search) system, but other tasks such as recommendation and classification will also be discussed. Practical examples will be drawn both from the literature and from real-world usage scenarios in industry.
Little Known Features of Qualtrics Research Suite That Will Make Your Life Ea...Qualtrics
Have you ever had one of those moments where you think to yourself, "How did I not know this before?" Join us for a fast-paced webinar as we uncover some of our favorite features to help you make a bigger impact in your research.
Presented: April 1, 2015
A bridge between two worlds – where qual and quant meet: Slides from UX Austr...U1 Group
In a combined presentation with Telstra, we take a unique, fresh, and evidence-based approach to the often-controversial topic of qual or quant. We will demonstrate how linking quantitative with qualitative techniques can significantly improve the ability to understand customers, and consequently design services to meet their needs, improve experiences, and ultimately measure success.
Leveraging business intelligence with service design frameworks
Most companies collect a large amount of data in the form of customer feedback, but due to the structure and size it is often underutilised. Let us show you how we created a service framework using this information for Telstra – one that tests the end-to-end customer experience by aligning both quantitative and qualitative research, the best of both worlds! See the techniques we applied, as well as how the framework for Telstra’s products and services relates to service design and testing.
This service framework has provided a better, more holistic service experience for customers. The feedback from our qualitative counterparts has been amazing; it has revolutionised the way they do UX and CX research. Not only do they use it as a tool to understand existing service environments, they can now prioritise findings on key user and customer experiences that have the biggest impact in driving changes and improvements.
Instead of just relying on a small sample of information to make a conclusion about a market or experience, researchers now have the added value of quantitative information to gain further credibility with stakeholders – and ultimately drive better business outcomes.
We hope that our presentation will help you take away what we have learned, and what strategies we recommend, to maximise outcomes for your business too.
Tips and tricks for effective in-app customer research and surveys. This slideshare walks you through the growing importance of mobile research, the questions you should be asking when designing an in-app survey, and a few best practices we've discovered while working with our enterprise customers.
Intro talk on lean unmoderated user testing given at General Assembly, Los Angeles in spring 2013. Covers basics, benefits & limitations, when to test, what to test, and a case study.
Remove the weight of user testing. Make user testing as light weight as possible, to embed it in your design processes without slowing down your development.
UX STRAT Online 2020: Dr. Martin Tingley, NetflixUX STRAT
Over the years, the Netflix UI has evolved from a sparse and static webpage into an immersive, video-centric experience tailored to a variety of platforms. In this talk, I’ll describe the simple but powerful framework that Netflix uses to evolve the product experience: we ask our members, through online A/B tests, which of several possible experiences resonate with them. I’ll also describe the steps we are taking to democratize access to experimentation across the company so that we can explore more ideas and identify those that deliver more value to our members.
User research for Product Managers - Product Tank London Jan 17Morag McLaren
As the head of product for a User Experience Research company I gathered feedback from our clients to help other product managers get user research embedded within their companies.
We talked about getting buy-in from stakeholders, getting started with UX and proving its value and also some of the common tools and methodologies involved.
A primer on A/B testing and its application in ecommerce. A necessary tool in every product manager's arsenal. Covers the principles behind setting up a good test and the statistical tools required to analyze results.
Intro to User Journey Maps for Building Better Websites - Cornell Drupal Camp...Anthony D. Paul
You’ve asked the right questions and maybe you have some personas. There’s a heap of feature requests from your client and a whole lot of content to organize into a sitemap (IA) document and wireframes. However, something’s not sitting right and you wonder how the site fits into the bigger customer journey with the client’s brand, business, and products.
In this talk, I’ll show you how to get started with taking all of that subject matter expertise you’ve been collecting in your mind, and to convert it into one of several useful types of journey maps. I’ll share process, examples, context on how they fit into a larger project, and show how they help bring agreement among your client decision-makers.
• Understand the benefits of thinking through a user journey outside of your website.
• See the variety of types of journey maps and identify where and when to use them.
• Build and use journey maps to shape client conversations and audit decisions.
Machine learning applications nurturing growth of various business domainsShrutika Oswal
Machine learning is a field in which machines become smarter and help humans make better decisions based on patterns learned from previous data. The technique is not new, but it is gaining fresh momentum. A machine learning algorithm learns from previous records and analyzes the data; without human intervention, it generates its own recommendations, adds them to its database as experience, and uses them for further processing. In short, the machine learns from its own experience and gives you progressively better output.
Machine learning is an iterative process: as more data is added, machines learn from fresh feeds of data and independently adapt to handle new data without constant human intervention. Machine learning was earlier used to predict what is happening in a business, but now machine learning algorithms also suggest what action should be taken to move the business forward.
This presentation reports the results of a literature survey of machine learning applications nurturing the growth of various business domains. More specifically, it gives a brief introduction to machine learning, the four major types of machine learning, and the enhancements that various machine learning algorithms bring to different business domains.
Social Entrepreneur meets Technology, by CEO 황진솔 (Jinsol Hwang) - Jin Young Kim
With the year-end approaching, many of you will be taking part in giving. Our guest, CEO Jinsol Hwang, runs The Bridge, a startup with a new business model that combines donation and investment. In this seminar he will talk about The Bridge's business, as well as appropriate technology suited to conditions in less-developed countries.
Hello Data Science: Improving Life and Work with Data Science (Startup Alliance talk) - Jin Young Kim
Slides from a public talk on data science given at Startup Alliance on December 22. This is an extended version of the deck actually presented, with slides omitted for time and various links added back in.
- Myths and truths about data
- The data science process and points to watch out for
- Data science cases for business growth
- Writing a book with the help of data science
You can find more materials on data science on my homepage, Facebook, Twitter, and Brunch.
http://www.hellodatascience.com/
For more details about the event, see the Onoffmix page: http://onoffmix.com/event/59334
Thanks to your participation, the CS study-abroad meetup concluded successfully. As with the previous data science meetup, I feel we learned more from your experiences and knowledge than from my own presentation. I would like to thank our panelists once again: Byungho Lee (who runs CSUhak.info) and Wook Park (a senior alumnus of my school, appointed this year as an assistant professor of Electronic Engineering at Kyung Hee University). Byungho Lee shared his experience of pursuing a master's degree abroad, and Wook Park, having completed his degree in Korea, balanced a discussion that could easily have tilted toward studying abroad.
To share the outcome of the event with more people, the slides and videos are posted here.
Slides presented at a liberal-arts seminar at a large hospital.
The audience already had plenty of technical knowledge and was mostly curious about real cases, so I shared many insights from my own experience. However, I came away thinking that in clinical settings, solutions are unlikely to be adopted unless approached as medical devices rather than as platforms or technologies.
An Introduction to the World of User ResearchMethods
What is user research? Why do we do it? How do we do it? User research consultants Dr Jennifer Klatt and Ben Smith from Methods Digital (https://methodsdigital.co.uk/) have kindly put together this slide deck to take you through the basics.
The future for performance management, quality and true continuous improvement for local council planning services. Uses much of the data that councils already send to government, supplements it with some new approaches to customer and quality feedback, and brings it all together in one tidy, holistic report.
October 2013 - Public legal education (PLE) is increasingly delivered online. This webinar will look at how to leverage a number of free or low-cost online tools (including Google Analytics and iPerceptions surveys) to acquire data to measure your impact and align with your key performance indicators or KPIs.
Other tools that will be discussed include online user testing tools and what metrics matter when it comes to social media evaluation.
It may be easier than ever today to collect data, but many marketers still find themselves scratching their heads when trying to decide how best to sift through it to uncover the gems. What’s often even more difficult, however, is developing reports that incite action and encourage future investment in the right strategies and optimizations – especially when findings challenge the status quo.
In this session, Ben Magnuson, Senior Data Strategist at One North, explores how to deliver reports that your stakeholders will actually care to read. Specifically, he dives into how you can shift your reporting strategy to ensure you are:
* Establishing the right baselines and goals to help you more accurately benchmark your progress towards KPIs
* Moving beyond simply showing your work to provide the right level of context around data trends that matter
* Including stakeholders in the development of metrics to prevent surrogation, or the confusion of strategic intent with the metrics meant to represent it
* Creating an influential narrative around your results that helps you overcome bias, combat conventional thought and improve decision making
• Why Analytics
• Google Analytics Step by Step
• Who are the people who visit my website?
• What brought visitors to your site?
• What do they do once they get on the website?
• Did they do what you wanted them to do?
Digital Marketing Course Week 4: Digital AnalyticsAyca Turhan
Fourth week slides of eMarketing Course at Hacettepe University taught by Ayca Turhan.
Topics covered within the presentation include:
Digital Analytics
Conversion Optimization
Testing
Google Analytics Examples
For more please visit: www.aycaturhan.com/man423
Data - How to Use it & When by Square and Call Rail Product LeaderProduct School
Main Takeaways:
-It’s important to define success metrics before you start building your product or feature
-The goal is to validate a hypothesis--what you think users want--and the data might invalidate your hypothesis. This is a good thing! Keep iterating in that direction to identify and test the next hypothesis.
-Data is important to point you in the right direction, but it won’t answer “what is the perfect product?” To figure this out, you need to experiment, fail and repeat as fast as possible to find success.
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...Subhajit Sahu
Abstract — Levelwise PageRank is an alternative method of PageRank computation which decomposes the input graph into a directed acyclic block-graph of strongly connected components, and processes them in topological order, one level at a time. This enables calculation of ranks in a distributed fashion without per-iteration communication, unlike the standard method where all vertices are processed in each iteration. It however comes with a precondition of the absence of dead ends in the input graph. Here, the native non-distributed performance of Levelwise PageRank was compared against Monolithic PageRank on a CPU as well as a GPU. To ensure a fair comparison, Monolithic PageRank was also performed on a graph where vertices were split by components. Results indicate that Levelwise PageRank is about as fast as Monolithic PageRank on the CPU, but quite a bit slower on the GPU. The slowdown on the GPU is likely caused by a large submission of small workloads, and is expected to be a non-issue when the computation is performed on massive graphs.
Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...John Andrews
SlideShare Description for "Chatty Kathy - UNC Bootcamp Final Project Presentation"
Title: Chatty Kathy: Enhancing Physical Activity Among Older Adults
Description:
Discover how Chatty Kathy, an innovative project developed at the UNC Bootcamp, aims to tackle the challenge of low physical activity among older adults. Our AI-driven solution uses peer interaction to boost and sustain exercise levels, significantly improving health outcomes. This presentation covers our problem statement, the rationale behind Chatty Kathy, synthetic data and persona creation, model performance metrics, a visual demonstration of the project, and potential future developments. Join us for an insightful Q&A session to explore the potential of this groundbreaking project.
Project Team: Jay Requarth, Jana Avery, John Andrews, Dr. Dick Davis II, Nee Buntoum, Nam Yeongjin & Mat Nicholas
Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...pchutichetpong
M Capital Group (“MCG”) expects demand to grow and supply to evolve, driven by institutional investment rotating out of offices and into work-from-home (“WFH”) infrastructure, and by the ever-expanding need for data storage as global internet usage expands, with experts predicting 5.3 billion users by 2023. These market factors will be underpinned by technological changes, such as progressing cloud services and edge sites, allowing the industry to see strong expected annual growth of 13% over the next 4 years.
Whilst competitive headwinds remain, represented through the recent second bankruptcy filing of Sungard, which blames “COVID-19 and other macroeconomic trends including delayed customer spending decisions, insourcing and reductions in IT spending, energy inflation and reduction in demand for certain services”, the industry has seen key adjustments, where MCG believes that engineering cost management and technological innovation will be paramount to success.
MCG reports that the more favorable market conditions expected over the next few years, helped by the winding down of pandemic restrictions and a hybrid working environment will be driving market momentum forward. The continuous injection of capital by alternative investment firms, as well as the growing infrastructural investment from cloud service providers and social media companies, whose revenues are expected to grow over 3.6x larger by value in 2026, will likely help propel center provision and innovation. These factors paint a promising picture for the industry players that offset rising input costs and adapt to new technologies.
According to M Capital Group: “Specifically, the long-term cost-saving opportunities available from the rise of remote managing will likely aid value growth for the industry. Through margin optimization and further availability of capital for reinvestment, strong players will maintain their competitive foothold, while weaker players exit the market to balance supply and demand.”
Adjusting primitives for graph : SHORT REPORT / NOTESSubhajit Sahu
Graph algorithms such as PageRank typically operate on Compressed Sparse Row (CSR), an adjacency-list-based graph representation.
Multiply with different modes (map)
1. Performance of sequential execution based vs OpenMP based vector multiply.
2. Comparing various launch configs for CUDA based vector multiply.
Sum with different storage types (reduce)
1. Performance of vector element sum using float vs bfloat16 as the storage type.
Sum with different modes (reduce)
1. Performance of sequential execution based vs OpenMP based vector element sum.
2. Performance of memcpy vs in-place based CUDA based vector element sum.
3. Comparing various launch configs for CUDA based vector element sum (memcpy).
4. Comparing various launch configs for CUDA based vector element sum (in-place).
Sum with in-place strategies of CUDA mode (reduce)
1. Comparing various launch configs for CUDA based vector element sum (in-place).
Measuring the Quality of Online Service - Jin Young Kim
1. Measuring the Quality of Online Service
Jin Young Kim
Senior Applied Scientist
Microsoft Web Search and AI
2. About Jin Young Kim
• Data Scientist at Microsoft
• Quantified Self Enthusiast
(10 years of happiness tracking)
• Author of ‘Hello, Data Science’
(#1 Bestseller in Korea)
3. Data is the key ingredient for all of these issues
• Data for training and evaluating ML models
• Data for discovering the defect and issues
• Data for monitoring the health of existing service
• Data for measuring the value of new service
4. Issues in Online Service Development
• Planning
• How to set business objective & plan?
• Implementation
• How to train and improve ML models?
• Evaluation
• How much are users satisfied with the service?
(Cycle: Plan → Execute → Evaluate)
6. Case Study: Data Collection for Restaurants
• Customer Behavior
• Facial expression
• Quantity of leftovers
• Pace of dining
Only a limited range of data is available, possibly with a lot of noise
7. Case Study: Data Collection for Restaurants
• Panel Survey
• Satisfaction with the food
• Satisfaction with the service
• Satisfaction with the environment
Surveys can provide insights into customer satisfaction, but with some caveats
8. Data Collection for Online Service
• User Behavior
• Various ‘signals’ from behavioral data
• Only a limited range of data is available, with a lot of noise
• Requires a substantial user base
• Panel Survey
• Hire a group of panelists, or use crowdsourcing
• Collect feedback on all aspects of service quality
• Incurs the cost of hiring and maintaining a panel
9. Data Collection for Online Service (2)
• Direct User Feedback
• Request real-time feedback from customers
• Typically low response rate, with potential nuisance to users
• Widely used for personalized services (e.g., recommendation)
(Diagram: Panel Survey, User Behavior, User Feedback)
11. How do major online service companies collect data for measurement?
12. Search Engine: Google / Bing
• Early stage: panel-based survey
• Late stage: user behavior-based experiments
• Source: Google
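The behavior-based experiments mentioned here are typically online A/B tests. As a minimal sketch of the statistics behind one (the click counts below are made up for illustration, not taken from any real experiment), a two-proportion z-test compares the click-through rates of a control and a treatment:

```python
from math import sqrt

def two_proportion_z(clicks_a, n_a, clicks_b, n_b):
    """z-statistic for comparing the click-through rates of two variants."""
    p_a, p_b = clicks_a / n_a, clicks_b / n_b
    p_pool = (clicks_a + clicks_b) / (n_a + n_b)            # pooled rate
    se = sqrt(p_pool * (1 - p_pool) * (1 / n_a + 1 / n_b))  # pooled std. error
    return (p_a - p_b) / se

# Hypothetical experiment: 10,000 impressions per variant.
z = two_proportion_z(520, 10_000, 480, 10_000)
print(round(z, 2))  # |z| > 1.96 would be significant at the 5% level
```

Here |z| is about 1.30, so this made-up CTR difference would not reach significance; production experiments rely on far larger samples and more refined satisfaction metrics than raw clicks.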
14. Social Network: Facebook
• Before: use only user behavior
• Nowadays: user behavior + panel survey + user feedback
• Source: Slate / Quora
“We could surface content users are actually satisfied with, instead of click-bait, by using panel surveys and user feedback in addition to signals from user behavior.”
- Julie Zhuo, Product Design VP at Facebook
17. Movie Recommendations from Netflix
Algorithm A vs. Algorithm B
Can you tell which algorithm is better? Even the users themselves can't!
18. Movie Recommendations from Netflix (2)
The results below are more relevant, but users engage more with the ones above
19. So, how should I collect data for my service?
Service characteristics:
• What signals can we extract from user behavior?
• Are there incentives for users to provide feedback?
Feasibility of collection:
• Do you already have a substantial volume of active users?
• Can a panel evaluate the user experience as a substitute?
Cost of collection:
• Do you have a marketing budget for building a user base, or for a panel survey?
21. Evaluation based on user behavior
• Which result did users click?
• Is a click the only measure of satisfaction?
• How long did a user stay on a result?
• Is a longer dwell time always better?
• Do users perform searches repeatedly?
• Does loyalty mean satisfaction?
User behavior is an important clue, but a noisy one.
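The questions on this slide can be made concrete as simple log-derived metrics. A minimal sketch, assuming a hypothetical log schema with `clicked` and `dwell_seconds` fields; the 30-second SAT-click threshold is a common convention in the IR literature, not a universal rule:

```python
SAT_DWELL_SECONDS = 30  # common heuristic: a >=30s dwell suggests satisfaction

def sat_click_rate(log):
    """Fraction of clicks whose dwell time suggests a satisfied user."""
    clicks = [r for r in log if r["clicked"]]
    if not clicks:
        return 0.0
    sat = sum(r["dwell_seconds"] >= SAT_DWELL_SECONDS for r in clicks)
    return sat / len(clicks)

log = [
    {"query": "crowdsourcing", "clicked": True,  "dwell_seconds": 95},
    {"query": "crowdsourcing", "clicked": True,  "dwell_seconds": 4},
    {"query": "crowdsourcing", "clicked": False, "dwell_seconds": 0},
]
print(sat_click_rate(log))  # 0.5
```

Even this tiny example shows the noise problem: the 4-second click counts as dissatisfaction, but the user may simply have found the answer instantly.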
22. How can you design a panel survey for SERP evaluation?
How would you evaluate the search results for the query ‘crowdsourcing’?
Bad
Good
Excellent
Perfect
Q: Why do you think so?
23. Alternative: Evaluating a Webpage
How would you evaluate the search results for the query ‘crowdsourcing’?
Bad
Good
Excellent
Perfect
Q: Why do you think so?
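Graded labels like these are usually rolled up into a ranking metric such as DCG. A sketch, assuming the common gain mapping Bad=0, Good=1, Excellent=2, Perfect=3 (actual mappings vary by team):

```python
import math

GAIN = {"Bad": 0, "Good": 1, "Excellent": 2, "Perfect": 3}

def dcg(labels):
    """Discounted Cumulative Gain over a ranked list of judged results."""
    return sum(GAIN[label] / math.log2(rank + 2)   # rank 0 -> discount of 1
               for rank, label in enumerate(labels))

# Hypothetical judgments for a SERP, top result first:
print(round(dcg(["Perfect", "Good", "Bad", "Excellent"]), 2))  # 4.49
```

The log discount encodes the assumption that results further down the page matter less, which is exactly why behavioral clicks alone (heavily biased toward the top position) need judged labels as a complement.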
24. Alternative: Side-by-Side SERP Evaluation
Q: How would you compare two results?
Left much better
Left slightly better
About the same
Right slightly better
Right much better
Q: Why do you think so?
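Side-by-side judgments are typically aggregated into a per-query preference score. A minimal sketch, mapping the five options onto a -2..+2 scale (the numeric values are an illustrative assumption, not a standard):

```python
SCALE = {
    "Left much better":       2,
    "Left slightly better":   1,
    "About the same":         0,
    "Right slightly better": -1,
    "Right much better":     -2,
}

def mean_preference(votes):
    """Mean judge preference: > 0 favors the left SERP, < 0 the right."""
    return sum(SCALE[v] for v in votes) / len(votes)

votes = ["Left much better", "About the same",
         "Right slightly better", "Left slightly better"]
print(mean_preference(votes))  # 0.5 -> judges lean toward the left SERP
```

A follow-up significance test across many queries would then decide whether the new ranker actually wins overall.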
26. Summary…
• As a first step in data science, plan on collecting high-quality data
• Combine various data collection methods depending on the characteristics and lifecycle of the service
• It takes a lot of care to get a panel survey done right
27. For more information…
• What you need to know about data even if you’re not a Data Scientist
• SIGIR’2015 Tutorial on Offline Search Evaluation
• Offline Evaluation for Information Retrieval
Foundations and Trends in Information Retrieval (to appear)
Editor's Notes
The title of this event is ‘How We Use Data’; today I will focus on data collection.
As anyone who has worked with data will agree, once you have good data, processing and using it becomes relatively easy.
Most of these are data problems.
There are many types of data, but the key is measuring customers' responses to the service.
The development process of an online service can be roughly divided into ~.
Various issues exist at each stage.
To aid understanding, let's take a restaurant as an example. What data can we obtain from customer behavior?
The missing data can be obtained through a panel survey.
A panel survey hires a panel that represents the customers and listens to their opinions.
These data collection methods can be applied directly to improving online services as well.
We have looked at two methods so far; what if we combine them? That is, collect feedback from users in real time.
If not done properly, however, this yields low response rates and may even annoy users.
Now let's look at how major online service companies use these data collection methods.
First, consider the case of search services, my own area of work.
A variety of experimental techniques are used to improve search services ~
(covered in more detail later)
Facebook reportedly used only user logs in its early days, but recently ~
By adding panel surveys and user feedback to user-behavior logs in its feed ranking, Facebook could surface content users are satisfied with instead of click-bait content.
Netflix reportedly uses different data for evaluating its search and its recommendation services.
One reason is that it is hard to evaluate the results of a personalized recommendation service with a survey.
For example, compare the results of two recommendation algorithms: even the users themselves cannot easily judge which is better!
No ground for comparison / What if the judge doesn’t understand the intent?
No ground for comparison / What if the judge doesn’t understand the intent?