As Andrew Ng says, AI is the new electricity and it will transform many industries, therefore dating is going to be transform by the use of AI.
With the experience of having learned AI during the university several years ago, and having updated it since 2016, plus 10 years working experience in the dating industry, this presentation shows the evolution of AI during these last years, and shows some AI examples already used in Dating services.
Then shows where AI can be applied to dating services, what is needed, which models can be used, shows the building process, and how can be done.
8. 8
SOME DATING SERVICES ALREADY USING AI
BADOO
LOOKILIKE
TINDER
LIKEABLE
MEETIC
CHATBOT
MATCH GROUP
Q3 2017
● Largest tech resource in
the category devoted to
machine learning in
order to improve
discovery.
● New AI-driven data
products that
personalize experience
● Enhanced analytics
9. 9
AI APPLICATIONS FOR DATING
● CONVERSION PREDICTIONS
● CHURN PREDICTIONS
● DYNAMICS PRICING
● CHATBOT ASSISTANT
● USERS' TEXT MODERATION
● USERS' PHOTOS MODERATION
● DETECTING SCAMMERS/TROLLS
● SIMILARITIES RECOMMENDATIONS
● BIDIRECCIONAL RECOMMENDATIONS
10. 10
WHAT YOU NEED TO TRAIN, TEST AND DEPLOY AI
BUSINESS
NEED
DATA
ANALYSIS
DATA
PREPARATION
TRAINING
MODEL
ANALYZE
MODEL
MODEL
SERVING
MONITORING
80% DATA ANALYSIS/PREPARATION - 20% MODELING
11. 11
AI – DATA SPLIT
DATA SIZE
TRAIN SET 10.000TEST
60% 20%
DEV
20%
TRAIN SET 100.000
80% 10%10%
D T
TRAIN SET 1.000.000
98% 1%1%
TRAINING SET → Data set used to train the model
DEVELOPMENT SET → To adjust the model
TEST SET → To verify the model (good accuracy and no overfitting)
12. 12
AI – INTERNATIONAL DATING DATA SPLIT
Randomly shuffle before splitting.
Dev set and Test set should reflect data you expect to get in the
future and consider important to do well on.
Test set should be big enough to give high confidence in the
overall performance of the model.
Data features should be independent.
13. 13
MODELS – FROM SIMPLE TO COMPLEX
Linear Regression
Logistic Regression
Neural Network
Convolutional Neural Network (CNN)
Recurrent Neural Network (RNN)
15. 15
MODELS
For some predictions simpler models can be good enough.
Do not use deep learning for all.
Simple model = White box (You can see what it does)
Complex model = Black box (You don't know how it does it)
16. 16
ERROR - ACCURACY
ERROR -> % of wrong predictions
ACCURACY = 100% - ERROR
Accuracy measure how good the system is.
It cannot reach 100%. Only can get close to Bayes optimal
error.
We can use the best accuracy that an expert or group of
expert can get as a proxy for Bayes error
19. 19
ERROR ANALYSIS
CARRYING OUT ERROR ANALYSIS
Look at dev examples to evaluate ideas
CLEANING UP INCORRECT LABELED DATA
ADDRESS MISMATCHED TRAINING AND DEV/TEST DATA
DECIDE IF YOU SHOULD USE DIFFERENT MODELS FOR SOME
COUNTRIES OR SEGMENTS
21. 21
HOW WE CAN DO IT
USE EXTERNAL SOLUTION WHEN:
● Common problem
● Continuous improvement
● Not too expensive
DEVELOP WHEN:
● External solution too expensive for you. ROI << LTV
● Have thousands/millions of good labeled data
25. 25
DEVELOPMENT OPTIONS
● AUTO/NON PROGRAMABLE OPTION
● INHOUSE DEVELOPERS AFTER MOOC TRAINING
● INHOUSE DEVELOPERS WITH EXTERNAL CONSULTANT
Freelance University Research Research Center Company
● HIRE DATA SCIENTIST/DATA ENGINEER
● EXTERNAL DATA SCIENTIST/DATA ENGINEER
● EXTERNAL BIG DATA/MACHINE LEARNING COMPANY
● UNIVERSITY RESEARCH / RESEARCH CENTER
26. 26
CONCLUSIONS
1. AI WILL TRANSFORM DATING. GET READY!
2. GOOD LABELED DATA IS THE BASE
3. FOR SOME PREDICTIONS SIMPLE MODELS CAN BE GOOD ENOUGH
4. IF A GOOD SOLUTION EXIST, USE IT!
5. CHOOSE THE DEVELOPMENT OPTION THAT FITS YOU BEST