SlideShare a Scribd company logo
1 of 57
Download to read offline
1	
Claudia Perlich
Chief Scientist
@claudia_perlich
Predictability	and	other	Predicament
2	
Certainty Vortex ©	2013	Foster	
Provost	
“Models tend to go where the signal is”
3	
How ‘predictable’ is a predictive modeling task?
(given the data)
Random Deterministic
Sexual
Orientation
Pizza for
Dinner?
4	4	
For now let’s try to predict who is male …
5	5	
Predicting Probability (Male) in Facebook
Data:
Facebook public dataset with 200K
anonymized users, their
demographics and their likes
Methodology:
Logistic regression on sparse
representation
00000d41ed774823fca142945ec915c0,1,,,,,,,,,,,en_GB,,
00000dee02d70cf8c0d79f96b6d1c59d,0,,,,,,,,,,,en_US,,
00000f232abfe25a80156fe069395460,0,1992,20,2,2,,,,,,,,19,-5
00000f4ba0cff946b1c0e3b051287ede,0,1993,19,2,,,,,,,,en_US,310,8
0000130571654e3afaa62f4e9d2e4f63,0,,,2,2,,,,,,,en_US,193,7
00001544469ae9b408869a463a1dd77a,1,1984,28,2,,,,,,,,en_US,,-4
00001544469ae9b408869a463a1dd77a 100198443380917
00001544469ae9b408869a463a1dd77a 100248613695
00001544469ae9b408869a463a1dd77a 10050726267
00001544469ae9b408869a463a1dd77a 101021248409
00001544469ae9b408869a463a1dd77a 101054236602446
00001544469ae9b408869a463a1dd77a 101425333232551
00001544469ae9b408869a463a1dd77a 10148199466
00001544469ae9b408869a463a1dd77a 10150154095435553
00001544469ae9b408869a463a1dd77a 10161539667
00001544469ae9b408869a463a1dd77a 101844593108
00001544469ae9b408869a463a1dd77a 101936079845392
00001544469ae9b408869a463a1dd77a 101987301816
00001544469ae9b408869a463a1dd77a 102038567018
00001544469ae9b408869a463a1dd77a 102040023230884
00001544469ae9b408869a463a1dd77a 10212595263
00001544469ae9b408869a463a1dd77a 102168219824412
00001544469ae9b408869a463a1dd77a 1022548733
40% Men
6	6	
Take 1: Predict Gender Based on age …
7	7	
Gender based on age: very little signal …
Overall Accuracy: 60%
AUC: 58%
p(male|age)
8	8	
Gender based on age: very little signal …
Overall Accuracy: 60%
AUC: 58%
Target the 1% with highest
probability:
Accuracy: 75%
p(male|age)
9	9	
From small to bigger data …
10	10	
Take 2: Gender based all your likes
11	11	
Predict gender base on all likes: a lot of signal …
Overall Accuracy: 83%
Target top 1%
Accuracy: 100%
p(male|likes)
12	12	
Progression: from age to all ‘likes’
75% 86% 100%
100%100%100%
Age
13	
But what happens if your problem is a mixture of both?
Random Deterministic
14	14	
Witness a spike in human predictability ..
2 weeks in 2012
Median5%Lift
2x
Death of
free will?
15	15	
buzzfeed.com	
3/20/16	
amazon.com
4/11/16
nytimes.com
2/15/16
Mlb.com
3/15/16
Etrade.com
4/25/16
Predicting digital events based on browsing histories
azure.microsoft.com
6/10/16
16	
Logistic Regression
Stochastic gradient descent
Hashing
Streaming
L1 & L2 Penalties
17	
Oddly predictive websites?
18	18	
URL’s that are very predictive for more than 10 brands
www.womenshealthbase.com
www.filmannex.com
www.ffog.net
www.drugsnews.org
www.menshealthbase.com
www.dailyfreshies.com
www.hark.com
www.gossipcenter.com
www.articletrunk.com
www.411answers.com
www.dailyrx.com
www.all-allergies.com
www.knowvehicles.com
www.chinaflix.com
www.parentingnewsstories.com
www.wrestlingnewz.com
www.gourmandia.com
19	
Traffic Patterns Are ‘Non - Human’
website	1	 website	2	
50%	
Traffic overlap of cookies from Bid Request
Boston Herald
23
24
25	25	
Witness a spike in human predictability ..
2 weeks in 2012
PredictionPerformance
2x
Death of
free will?
26	26	
Meet ‘Non-Human traffic’
6%
2011	
36%
2015
27	27	
Bots are executing conversion events
• ‘Cookie Stuffing’ increases the value of the ad for retargeting
• Messing up Web analytics …
• Messes up my models because a bot is easier to predict than a human
28	28	
Percent bot traffic on conversion metrics
29	29	
Two populations surfing the we: Bot vs. Human
36%
2015
30	
Random Deterministic
31	31	
Bot activity has more signal – higher predictions
32	32	
Eliminate labels generated by bots ….
32	
3 more weeks in spring 2012
PerformanceIndex
33	33	
What about Clicks?
“Measure of consumers
interest in the product’
34	34
35	35	
Accidental clicks are more predictable than intentional
Model learns to predict the accidental much more easily than the
intentional ones …
I will only target
clicks that are
likely to happen
accidentally but
not randomly …
accidental
intentional
36	36	
Accidents: People fumbling in the dark …
37	37	
Old days of the Click Metric …
Strategy 1:
target women
Strategy 2:
target men
accidental
intentional
<
38	38	
Finding good audiences for luxury cars?
Predict dealership visits?
39	39	
buzzfeed.com	
3/20/16	
amazon.com
4/11/16
nytimes.com
2/15/16
50.240.135.41
166. 216.165.92
207.246.152.60
108.49.133.218
38.104.253.134
208.76.113.13
Mlb.com
3/15/16
Etrade.com
4/25/16
Potterybarn.com
4/10/16
Zip 4
10023-4592
04/15/16
Zip 4
10023-6924
04/20/16
Zip 4
10016-2324
03/10/16
03/14/16
Web App
Phone/Tablet
Desktop
Phone/Tablet
com.mlb.atbat
04/20/16
com.rovio.angry
02/26/16
com.myfitnesspal.android
3/25/16
4/18/16
IDFA
31AB-26FC-
94AE-756B
My actual data: 100 Billion daily events across devices …
?
?
40	40	
Potentially three populations in the location prediction
•  People who are indeed at dealership
and their history
•  People who are somewhere close
•  People who hacked into your WIFI
41	41	
Identify people who will
go to Mercedes dealerships
42	42	
‘In the market’ signal
43	43	
Real Estate
44	44	
Fitness …
45	45	
How much randomness can a model absorb?
We will randomly switch the gender value for increasing percentages
00000d41ed774823fca142945ec915c0,1,,,,,,,,,,,en_GB,,
00000dee02d70cf8c0d79f96b6d1c59d,0,,,,,,,,,,,en_US,,
00000f232abfe25a80156fe069395460,0,1992,20,2,2,,,,,,,,19,-5
00000f4ba0cff946b1c0e3b051287ede,0,1993,19,2,,,,,,,,en_US,310,8
0000130571654e3afaa62f4e9d2e4f63,0,,,2,2,,,,,,,en_US,193,7
00001544469ae9b408869a463a1dd77a,1,1984,28,2,,,,,,,,en_US,,-4
00000d41ed774823fca142945ec915c0,0,,,,,,,,,,,en_GB,,
00000dee02d70cf8c0d79f96b6d1c59d,0,,,,,,,,,,,en_US,,
00000f232abfe25a80156fe069395460,1,1992,20,2,2,,,,,,,,19,-5
00000f4ba0cff946b1c0e3b051287ede,0,1993,19,2,,,,,,,,en_US,310,8
0000130571654e3afaa62f4e9d2e4f63,1,,,2,2,,,,,,,en_US,193,7
00001544469ae9b408869a463a1dd77a,1,1984,28,2,,,,,,,,en_US,,-4
46	46	
Performance is surprisingly stable even under random noise
Percent	original	labels		
60.00%	
65.00%	
70.00%	
75.00%	
80.00%	
85.00%	
90.00%	
95.00%	
100.00%	
100%	 75%	 50%	 25%	
Percent	Men	in	top	1%	
50%	original	labels		
25%	original	labels
47	47	
Identify people who will go to Mercedes dealerships?
My predictions might be better
than my ‘ground truth’
48	48	
Where do we find frequent traveler?
49	49	
What do you think
indicates people going to JFK?
URL	 Logis)c	Parameter	
www.iglesiaelfaroinc.org	 2.38	
www.jumpseatnews.com	 2.25	
www.bluelineprop.com	 2.21	
www.ktxdtv.com	 2.14	
www.southjefffootball.org	 2.1	
www.unitedafa.org	 2.09	
www.parliamenthouse.com	 2.07	
www.yunghova.com	 2.06	
www.interlinetravel.com	 2.03	
www.aclin.org	 2.03	
www.swissport.com	 2.03	
www.gcsanc.com	 2.01	
www.swacu.org	 2.01	
www.airlinepilotcentral.com	 1.97	
www.homotrophy.com	 1.97	
www.beggsfuneralhome.net	 1.94	
www.tvathleFcs.org	 1.92	
www.2shopper.com	 1.91	
www.nextmagazine.com	 1.91	
www.dailyjocks.com	 1.87	
www.pullzone.com	 1.87	
www.diamondoffshore.com	 1.86	
www.myerspolaris.com	 1.86	
www.ryandeyer.com	 1.86	
www.okllo.com	 1.84	
www.ifihadtochoose.com	 1.83	
www.ivoirmixdj.com	 1.83
50	50	
URL	 Logis)c	Parameter	
www.iglesiaelfaroinc.org	 2.38	
www.jumpseatnews.com	 2.25	
www.bluelineprop.com	 2.21	
www.ktxdtv.com	 2.14	
www.southjefffootball.org	 2.1	
www.unitedafa.org	 2.09	
www.parliamenthouse.com	 2.07	
www.yunghova.com	 2.06	
www.interlinetravel.com	 2.03	
www.aclin.org	 2.03	
www.swissport.com	 2.03	
www.gcsanc.com	 2.01	
www.swacu.org	 2.01	
www.airlinepilotcentral.com	 1.97	
www.homotrophy.com	 1.97	
www.beggsfuneralhome.net	 1.94	
www.tvathleFcs.org	 1.92	
www.2shopper.com	 1.91	
www.nextmagazine.com	 1.91	
www.dailyjocks.com	 1.87	
www.pullzone.com	 1.87	
www.diamondoffshore.com	 1.86	
www.myerspolaris.com	 1.86	
www.ryandeyer.com	 1.86	
www.okllo.com	 1.84	
www.ifihadtochoose.com	 1.83	
www.ivoirmixdj.com	 1.83
51	51	
URL	 Logis)c	Parameter	
www.iglesiaelfaroinc.org	 2.38	
www.jumpseatnews.com	 2.25	
www.bluelineprop.com	 2.21	
www.ktxdtv.com	 2.14	
www.southjefffootball.org	 2.1	
www.unitedafa.org	 2.09	
www.parliamenthouse.com	 2.07	
www.yunghova.com	 2.06	
www.interlinetravel.com	 2.03	
www.aclin.org	 2.03	
www.swissport.com	 2.03	
www.gcsanc.com	 2.01	
www.swacu.org	 2.01	
www.airlinepilotcentral.com	 1.97	
www.homotrophy.com	 1.97	
www.beggsfuneralhome.net	 1.94	
www.tvathleFcs.org	 1.92	
www.2shopper.com	 1.91	
www.nextmagazine.com	 1.91	
www.dailyjocks.com	 1.87	
www.pullzone.com	 1.87	
www.diamondoffshore.com	 1.86	
www.myerspolaris.com	 1.86	
www.ryandeyer.com	 1.86	
www.okllo.com	 1.84	
www.ifihadtochoose.com	 1.83	
www.ivoirmixdj.com	 1.83
52	52	
URL	 Logis)c	Parameter	
www.iglesiaelfaroinc.org	 2.38	
www.jumpseatnews.com	 2.25	
www.bluelineprop.com	 2.21	
www.ktxdtv.com	 2.14	
www.southjefffootball.org	 2.1	
www.unitedafa.org 	 2.09	
www.parliamenthouse.com	 2.07	
www.yunghova.com	 2.06	
www.interlinetravel.com	 2.03	
www.aclin.org	 2.03	
www.swissport.com	 2.03	
www.gcsanc.com	 2.01	
www.swacu.org	 2.01	
www.airlinepilotcentral.com	 1.97	
www.homotrophy.com	 1.97	
www.beggsfuneralhome.net	 1.94	
www.tvathleFcs.org	 1.92	
www.2shopper.com	 1.91	
www.nextmagazine.com	 1.91	
www.dailyjocks.com	 1.87	
www.pullzone.com	 1.87	
www.diamondoffshore.com	 1.86	
www.myerspolaris.com	 1.86	
www.ryandeyer.com	 1.86	
www.okllo.com	 1.84	
www.ifihadtochoose.com	 1.83	
www.ivoirmixdj.com	 1.83
53	53	
URL	 Logis)c	Parameter	
www.iglesiaelfaroinc.org	 2.38	
www.jumpseatnews.com	 2.25	
www.bluelineprop.com	 2.21	
www.ktxdtv.com	 2.14	
www.southjefffootball.org	 2.1	
www.unitedafa.org	 2.09	
www.parliamenthouse.com	 2.07	
www.yunghova.com	 2.06	
www.interlinetravel.com	 2.03	
www.aclin.org	 2.03	
www.swissport.com	 2.03	
www.gcsanc.com	 2.01	
www.swacu.org	 2.01	
www.airlinepilotcentral.com	 1.97	
www.homotrophy.com	 1.97	
www.beggsfuneralhome.net	 1.94	
www.tvathleFcs.org	 1.92	
www.2shopper.com	 1.91	
www.nextmagazine.com	 1.91	
www.dailyjocks.com	 1.87	
www.pullzone.com	 1.87	
www.diamondoffshore.com	 1.86	
www.myerspolaris.com	 1.86	
www.ryandeyer.com	 1.86	
www.okllo.com	 1.84	
www.ifihadtochoose.com	 1.83	
www.ivoirmixdj.com	 1.83
54	
Predict who goes to JFK?
Random Deterministic
People who
work there …
55	55	
Big	Picture?
56	
56
Beware of (unintentional) discrimination based on
predictability
57	
Thank You!
Claudia Perlich

More Related Content

Viewers also liked

Ashrith Barthur, Security Scientist, H2o.ai, at MLconf 2017
Ashrith Barthur, Security Scientist, H2o.ai, at MLconf 2017Ashrith Barthur, Security Scientist, H2o.ai, at MLconf 2017
Ashrith Barthur, Security Scientist, H2o.ai, at MLconf 2017MLconf
 
Alexandra Johnson, Software Engineer, SigOpt at MLconf ATL 2017
Alexandra Johnson, Software Engineer, SigOpt at MLconf ATL 2017Alexandra Johnson, Software Engineer, SigOpt at MLconf ATL 2017
Alexandra Johnson, Software Engineer, SigOpt at MLconf ATL 2017MLconf
 
Hanjun Dai, PhD Student, School of Computational Science and Engineering, Geo...
Hanjun Dai, PhD Student, School of Computational Science and Engineering, Geo...Hanjun Dai, PhD Student, School of Computational Science and Engineering, Geo...
Hanjun Dai, PhD Student, School of Computational Science and Engineering, Geo...MLconf
 
Aran Khanna, Software Engineer, Amazon Web Services at MLconf ATL 2017
Aran Khanna, Software Engineer, Amazon Web Services at MLconf ATL 2017Aran Khanna, Software Engineer, Amazon Web Services at MLconf ATL 2017
Aran Khanna, Software Engineer, Amazon Web Services at MLconf ATL 2017MLconf
 
Jacob Eisenstein, Assistant Professor, School of Interactive Computing, Georg...
Jacob Eisenstein, Assistant Professor, School of Interactive Computing, Georg...Jacob Eisenstein, Assistant Professor, School of Interactive Computing, Georg...
Jacob Eisenstein, Assistant Professor, School of Interactive Computing, Georg...MLconf
 
Jeremy Nixon, Machine Learning Engineer, Spark Technology Center at MLconf AT...
Jeremy Nixon, Machine Learning Engineer, Spark Technology Center at MLconf AT...Jeremy Nixon, Machine Learning Engineer, Spark Technology Center at MLconf AT...
Jeremy Nixon, Machine Learning Engineer, Spark Technology Center at MLconf AT...MLconf
 
Artemy Malkov, CEO, Data Monsters at The AI Conference 2017
Artemy Malkov, CEO, Data Monsters at The AI Conference 2017 Artemy Malkov, CEO, Data Monsters at The AI Conference 2017
Artemy Malkov, CEO, Data Monsters at The AI Conference 2017 MLconf
 
Qiaoling Liu, Lead Data Scientist, CareerBuilder at MLconf ATL 2017
Qiaoling Liu, Lead Data Scientist, CareerBuilder at MLconf ATL 2017Qiaoling Liu, Lead Data Scientist, CareerBuilder at MLconf ATL 2017
Qiaoling Liu, Lead Data Scientist, CareerBuilder at MLconf ATL 2017MLconf
 
Will Murphy, VP of Business Development & Co-Founder, Talla at The AI Confere...
Will Murphy, VP of Business Development & Co-Founder, Talla at The AI Confere...Will Murphy, VP of Business Development & Co-Founder, Talla at The AI Confere...
Will Murphy, VP of Business Development & Co-Founder, Talla at The AI Confere...MLconf
 
Ryan West, Machine Learning Engineer, Nexosis at MLconf ATL 2017
Ryan West, Machine Learning Engineer, Nexosis at MLconf ATL 2017Ryan West, Machine Learning Engineer, Nexosis at MLconf ATL 2017
Ryan West, Machine Learning Engineer, Nexosis at MLconf ATL 2017MLconf
 
Rahul Mehrotra, Product Manager, Maluuba at The AI Conference 2017
Rahul Mehrotra, Product Manager, Maluuba at The AI Conference 2017Rahul Mehrotra, Product Manager, Maluuba at The AI Conference 2017
Rahul Mehrotra, Product Manager, Maluuba at The AI Conference 2017MLconf
 
LN Renganarayana, Architect, ML Platform and Services and Madhura Dudhgaonkar...
LN Renganarayana, Architect, ML Platform and Services and Madhura Dudhgaonkar...LN Renganarayana, Architect, ML Platform and Services and Madhura Dudhgaonkar...
LN Renganarayana, Architect, ML Platform and Services and Madhura Dudhgaonkar...MLconf
 
Talha Obaid, Email Security, Symantec at MLconf ATL 2017
Talha Obaid, Email Security, Symantec at MLconf ATL 2017Talha Obaid, Email Security, Symantec at MLconf ATL 2017
Talha Obaid, Email Security, Symantec at MLconf ATL 2017MLconf
 
Jennifer Marsman, Principal Software Development Engineer, Microsoft at MLcon...
Jennifer Marsman, Principal Software Development Engineer, Microsoft at MLcon...Jennifer Marsman, Principal Software Development Engineer, Microsoft at MLcon...
Jennifer Marsman, Principal Software Development Engineer, Microsoft at MLcon...MLconf
 
Venkatesh Ramanathan, Data Scientist, PayPal at MLconf ATL 2017
Venkatesh Ramanathan, Data Scientist, PayPal at MLconf ATL 2017Venkatesh Ramanathan, Data Scientist, PayPal at MLconf ATL 2017
Venkatesh Ramanathan, Data Scientist, PayPal at MLconf ATL 2017MLconf
 
Daniel Shank, Data Scientist, Talla at MLconf SF 2017
Daniel Shank, Data Scientist, Talla at MLconf SF 2017Daniel Shank, Data Scientist, Talla at MLconf SF 2017
Daniel Shank, Data Scientist, Talla at MLconf SF 2017MLconf
 
Jonas Schneider, Head of Engineering for Robotics, OpenAI
Jonas Schneider, Head of Engineering for Robotics, OpenAIJonas Schneider, Head of Engineering for Robotics, OpenAI
Jonas Schneider, Head of Engineering for Robotics, OpenAIMLconf
 
Sara Hooker & Sean McPherson, Delta Analytics, at MLconf Seattle 2017
Sara Hooker & Sean McPherson, Delta Analytics, at MLconf Seattle 2017Sara Hooker & Sean McPherson, Delta Analytics, at MLconf Seattle 2017
Sara Hooker & Sean McPherson, Delta Analytics, at MLconf Seattle 2017MLconf
 

Viewers also liked (18)

Ashrith Barthur, Security Scientist, H2o.ai, at MLconf 2017
Ashrith Barthur, Security Scientist, H2o.ai, at MLconf 2017Ashrith Barthur, Security Scientist, H2o.ai, at MLconf 2017
Ashrith Barthur, Security Scientist, H2o.ai, at MLconf 2017
 
Alexandra Johnson, Software Engineer, SigOpt at MLconf ATL 2017
Alexandra Johnson, Software Engineer, SigOpt at MLconf ATL 2017Alexandra Johnson, Software Engineer, SigOpt at MLconf ATL 2017
Alexandra Johnson, Software Engineer, SigOpt at MLconf ATL 2017
 
Hanjun Dai, PhD Student, School of Computational Science and Engineering, Geo...
Hanjun Dai, PhD Student, School of Computational Science and Engineering, Geo...Hanjun Dai, PhD Student, School of Computational Science and Engineering, Geo...
Hanjun Dai, PhD Student, School of Computational Science and Engineering, Geo...
 
Aran Khanna, Software Engineer, Amazon Web Services at MLconf ATL 2017
Aran Khanna, Software Engineer, Amazon Web Services at MLconf ATL 2017Aran Khanna, Software Engineer, Amazon Web Services at MLconf ATL 2017
Aran Khanna, Software Engineer, Amazon Web Services at MLconf ATL 2017
 
Jacob Eisenstein, Assistant Professor, School of Interactive Computing, Georg...
Jacob Eisenstein, Assistant Professor, School of Interactive Computing, Georg...Jacob Eisenstein, Assistant Professor, School of Interactive Computing, Georg...
Jacob Eisenstein, Assistant Professor, School of Interactive Computing, Georg...
 
Jeremy Nixon, Machine Learning Engineer, Spark Technology Center at MLconf AT...
Jeremy Nixon, Machine Learning Engineer, Spark Technology Center at MLconf AT...Jeremy Nixon, Machine Learning Engineer, Spark Technology Center at MLconf AT...
Jeremy Nixon, Machine Learning Engineer, Spark Technology Center at MLconf AT...
 
Artemy Malkov, CEO, Data Monsters at The AI Conference 2017
Artemy Malkov, CEO, Data Monsters at The AI Conference 2017 Artemy Malkov, CEO, Data Monsters at The AI Conference 2017
Artemy Malkov, CEO, Data Monsters at The AI Conference 2017
 
Qiaoling Liu, Lead Data Scientist, CareerBuilder at MLconf ATL 2017
Qiaoling Liu, Lead Data Scientist, CareerBuilder at MLconf ATL 2017Qiaoling Liu, Lead Data Scientist, CareerBuilder at MLconf ATL 2017
Qiaoling Liu, Lead Data Scientist, CareerBuilder at MLconf ATL 2017
 
Will Murphy, VP of Business Development & Co-Founder, Talla at The AI Confere...
Will Murphy, VP of Business Development & Co-Founder, Talla at The AI Confere...Will Murphy, VP of Business Development & Co-Founder, Talla at The AI Confere...
Will Murphy, VP of Business Development & Co-Founder, Talla at The AI Confere...
 
Ryan West, Machine Learning Engineer, Nexosis at MLconf ATL 2017
Ryan West, Machine Learning Engineer, Nexosis at MLconf ATL 2017Ryan West, Machine Learning Engineer, Nexosis at MLconf ATL 2017
Ryan West, Machine Learning Engineer, Nexosis at MLconf ATL 2017
 
Rahul Mehrotra, Product Manager, Maluuba at The AI Conference 2017
Rahul Mehrotra, Product Manager, Maluuba at The AI Conference 2017Rahul Mehrotra, Product Manager, Maluuba at The AI Conference 2017
Rahul Mehrotra, Product Manager, Maluuba at The AI Conference 2017
 
LN Renganarayana, Architect, ML Platform and Services and Madhura Dudhgaonkar...
LN Renganarayana, Architect, ML Platform and Services and Madhura Dudhgaonkar...LN Renganarayana, Architect, ML Platform and Services and Madhura Dudhgaonkar...
LN Renganarayana, Architect, ML Platform and Services and Madhura Dudhgaonkar...
 
Talha Obaid, Email Security, Symantec at MLconf ATL 2017
Talha Obaid, Email Security, Symantec at MLconf ATL 2017Talha Obaid, Email Security, Symantec at MLconf ATL 2017
Talha Obaid, Email Security, Symantec at MLconf ATL 2017
 
Jennifer Marsman, Principal Software Development Engineer, Microsoft at MLcon...
Jennifer Marsman, Principal Software Development Engineer, Microsoft at MLcon...Jennifer Marsman, Principal Software Development Engineer, Microsoft at MLcon...
Jennifer Marsman, Principal Software Development Engineer, Microsoft at MLcon...
 
Venkatesh Ramanathan, Data Scientist, PayPal at MLconf ATL 2017
Venkatesh Ramanathan, Data Scientist, PayPal at MLconf ATL 2017Venkatesh Ramanathan, Data Scientist, PayPal at MLconf ATL 2017
Venkatesh Ramanathan, Data Scientist, PayPal at MLconf ATL 2017
 
Daniel Shank, Data Scientist, Talla at MLconf SF 2017
Daniel Shank, Data Scientist, Talla at MLconf SF 2017Daniel Shank, Data Scientist, Talla at MLconf SF 2017
Daniel Shank, Data Scientist, Talla at MLconf SF 2017
 
Jonas Schneider, Head of Engineering for Robotics, OpenAI
Jonas Schneider, Head of Engineering for Robotics, OpenAIJonas Schneider, Head of Engineering for Robotics, OpenAI
Jonas Schneider, Head of Engineering for Robotics, OpenAI
 
Sara Hooker & Sean McPherson, Delta Analytics, at MLconf Seattle 2017
Sara Hooker & Sean McPherson, Delta Analytics, at MLconf Seattle 2017Sara Hooker & Sean McPherson, Delta Analytics, at MLconf Seattle 2017
Sara Hooker & Sean McPherson, Delta Analytics, at MLconf Seattle 2017
 

Similar to Predicting Probability (Male) in Facebook Data

1330 keynote perlich_ using her laptop
1330 keynote perlich_ using her laptop1330 keynote perlich_ using her laptop
1330 keynote perlich_ using her laptopRising Media, Inc.
 
Detecting Malicious Websites using Machine Learning
Detecting Malicious Websites using Machine LearningDetecting Malicious Websites using Machine Learning
Detecting Malicious Websites using Machine LearningAndrew Beard
 
Data Visualization: A Marketing Superpower - Clark Boyd
Data Visualization: A Marketing Superpower - Clark BoydData Visualization: A Marketing Superpower - Clark Boyd
Data Visualization: A Marketing Superpower - Clark BoydClark Boyd
 
DMS: OkCupid: Building the Data-Driven Brand
DMS: OkCupid: Building the Data-Driven BrandDMS: OkCupid: Building the Data-Driven Brand
DMS: OkCupid: Building the Data-Driven BrandDigiday
 
Case Study 1.pptx
Case Study 1.pptxCase Study 1.pptx
Case Study 1.pptxLi Zhang
 
Introduction & Hands-on with H2O Driverless AI
Introduction & Hands-on with H2O Driverless AIIntroduction & Hands-on with H2O Driverless AI
Introduction & Hands-on with H2O Driverless AISri Ambati
 
Why Marketers Will Rule The World: Rise of the Marketing Technologist
Why Marketers Will Rule The World: Rise of the Marketing TechnologistWhy Marketers Will Rule The World: Rise of the Marketing Technologist
Why Marketers Will Rule The World: Rise of the Marketing TechnologistSocial Media Group
 
What do wine and AI have in common - Ronan Lundy, Quantcast
What do wine and AI have in common - Ronan Lundy, QuantcastWhat do wine and AI have in common - Ronan Lundy, Quantcast
What do wine and AI have in common - Ronan Lundy, QuantcastSavage Marketing
 
Inside Sales and Social Selling for Extreme Results - By Ken Krogue, Founder ...
Inside Sales and Social Selling for Extreme Results - By Ken Krogue, Founder ...Inside Sales and Social Selling for Extreme Results - By Ken Krogue, Founder ...
Inside Sales and Social Selling for Extreme Results - By Ken Krogue, Founder ...Ken Krogue
 
Marketing Research: Thriving or Dying?
Marketing Research: Thriving or Dying? Marketing Research: Thriving or Dying?
Marketing Research: Thriving or Dying? Sulin Lau
 
Video Killed the Text Star: OSINT Approach
Video Killed the Text Star: OSINT ApproachVideo Killed the Text Star: OSINT Approach
Video Killed the Text Star: OSINT ApproachDevo
 
Baltimore Comic Con 2014: Comics by the Numbers
Baltimore Comic Con 2014: Comics by the NumbersBaltimore Comic Con 2014: Comics by the Numbers
Baltimore Comic Con 2014: Comics by the Numbersbhschenker
 
Creating a Big data Strategy with Tactics for Quick Implementation
Creating a Big data Strategy with Tactics for Quick ImplementationCreating a Big data Strategy with Tactics for Quick Implementation
Creating a Big data Strategy with Tactics for Quick ImplementationLewandog, Inc,
 
Thinking in Mobile
Thinking in MobileThinking in Mobile
Thinking in MobileKeynes Cheng
 
Export Day - José Luis Pulpón (Google)
Export Day -  José Luis Pulpón (Google)Export Day -  José Luis Pulpón (Google)
Export Day - José Luis Pulpón (Google)eventos2014
 
Blockchain and internet of things: A glimpse into the future of fashion, Glos...
Blockchain and internet of things: A glimpse into the future of fashion, Glos...Blockchain and internet of things: A glimpse into the future of fashion, Glos...
Blockchain and internet of things: A glimpse into the future of fashion, Glos...Digiday
 
Customer segmentation - Games Analytics and Business Intelligence, Sep 2015
Customer segmentation - Games Analytics and Business Intelligence, Sep 2015Customer segmentation - Games Analytics and Business Intelligence, Sep 2015
Customer segmentation - Games Analytics and Business Intelligence, Sep 2015Volodymyr Kazantsev
 
Keynote @ Digital Cream Hong Kong
Keynote @ Digital Cream Hong Kong Keynote @ Digital Cream Hong Kong
Keynote @ Digital Cream Hong Kong Eddie Choi
 
Sports & fitness
Sports & fitnessSports & fitness
Sports & fitnessAsphri457
 

Similar to Predicting Probability (Male) in Facebook Data (20)

1330 keynote perlich_ using her laptop
1330 keynote perlich_ using her laptop1330 keynote perlich_ using her laptop
1330 keynote perlich_ using her laptop
 
Detecting Malicious Websites using Machine Learning
Detecting Malicious Websites using Machine LearningDetecting Malicious Websites using Machine Learning
Detecting Malicious Websites using Machine Learning
 
Data Visualization: A Marketing Superpower - Clark Boyd
Data Visualization: A Marketing Superpower - Clark BoydData Visualization: A Marketing Superpower - Clark Boyd
Data Visualization: A Marketing Superpower - Clark Boyd
 
DMS: OkCupid: Building the Data-Driven Brand
DMS: OkCupid: Building the Data-Driven BrandDMS: OkCupid: Building the Data-Driven Brand
DMS: OkCupid: Building the Data-Driven Brand
 
Machine Learning in Big Data
Machine Learning in Big DataMachine Learning in Big Data
Machine Learning in Big Data
 
Case Study 1.pptx
Case Study 1.pptxCase Study 1.pptx
Case Study 1.pptx
 
Introduction & Hands-on with H2O Driverless AI
Introduction & Hands-on with H2O Driverless AIIntroduction & Hands-on with H2O Driverless AI
Introduction & Hands-on with H2O Driverless AI
 
Why Marketers Will Rule The World: Rise of the Marketing Technologist
Why Marketers Will Rule The World: Rise of the Marketing TechnologistWhy Marketers Will Rule The World: Rise of the Marketing Technologist
Why Marketers Will Rule The World: Rise of the Marketing Technologist
 
What do wine and AI have in common - Ronan Lundy, Quantcast
What do wine and AI have in common - Ronan Lundy, QuantcastWhat do wine and AI have in common - Ronan Lundy, Quantcast
What do wine and AI have in common - Ronan Lundy, Quantcast
 
Inside Sales and Social Selling for Extreme Results - By Ken Krogue, Founder ...
Inside Sales and Social Selling for Extreme Results - By Ken Krogue, Founder ...Inside Sales and Social Selling for Extreme Results - By Ken Krogue, Founder ...
Inside Sales and Social Selling for Extreme Results - By Ken Krogue, Founder ...
 
Marketing Research: Thriving or Dying?
Marketing Research: Thriving or Dying? Marketing Research: Thriving or Dying?
Marketing Research: Thriving or Dying?
 
Video Killed the Text Star: OSINT Approach
Video Killed the Text Star: OSINT ApproachVideo Killed the Text Star: OSINT Approach
Video Killed the Text Star: OSINT Approach
 
Baltimore Comic Con 2014: Comics by the Numbers
Baltimore Comic Con 2014: Comics by the NumbersBaltimore Comic Con 2014: Comics by the Numbers
Baltimore Comic Con 2014: Comics by the Numbers
 
Creating a Big data Strategy with Tactics for Quick Implementation
Creating a Big data Strategy with Tactics for Quick ImplementationCreating a Big data Strategy with Tactics for Quick Implementation
Creating a Big data Strategy with Tactics for Quick Implementation
 
Thinking in Mobile
Thinking in MobileThinking in Mobile
Thinking in Mobile
 
Export Day - José Luis Pulpón (Google)
Export Day -  José Luis Pulpón (Google)Export Day -  José Luis Pulpón (Google)
Export Day - José Luis Pulpón (Google)
 
Blockchain and internet of things: A glimpse into the future of fashion, Glos...
Blockchain and internet of things: A glimpse into the future of fashion, Glos...Blockchain and internet of things: A glimpse into the future of fashion, Glos...
Blockchain and internet of things: A glimpse into the future of fashion, Glos...
 
Customer segmentation - Games Analytics and Business Intelligence, Sep 2015
Customer segmentation - Games Analytics and Business Intelligence, Sep 2015Customer segmentation - Games Analytics and Business Intelligence, Sep 2015
Customer segmentation - Games Analytics and Business Intelligence, Sep 2015
 
Keynote @ Digital Cream Hong Kong
Keynote @ Digital Cream Hong Kong Keynote @ Digital Cream Hong Kong
Keynote @ Digital Cream Hong Kong
 
Sports & fitness
Sports & fitnessSports & fitness
Sports & fitness
 

More from MLconf

Jamila Smith-Loud - Understanding Human Impact: Social and Equity Assessments...
Jamila Smith-Loud - Understanding Human Impact: Social and Equity Assessments...Jamila Smith-Loud - Understanding Human Impact: Social and Equity Assessments...
Jamila Smith-Loud - Understanding Human Impact: Social and Equity Assessments...MLconf
 
Ted Willke - The Brain’s Guide to Dealing with Context in Language Understanding
Ted Willke - The Brain’s Guide to Dealing with Context in Language UnderstandingTed Willke - The Brain’s Guide to Dealing with Context in Language Understanding
Ted Willke - The Brain’s Guide to Dealing with Context in Language UnderstandingMLconf
 
Justin Armstrong - Applying Computer Vision to Reduce Contamination in the Re...
Justin Armstrong - Applying Computer Vision to Reduce Contamination in the Re...Justin Armstrong - Applying Computer Vision to Reduce Contamination in the Re...
Justin Armstrong - Applying Computer Vision to Reduce Contamination in the Re...MLconf
 
Igor Markov - Quantum Computing: a Treasure Hunt, not a Gold Rush
Igor Markov - Quantum Computing: a Treasure Hunt, not a Gold RushIgor Markov - Quantum Computing: a Treasure Hunt, not a Gold Rush
Igor Markov - Quantum Computing: a Treasure Hunt, not a Gold RushMLconf
 
Josh Wills - Data Labeling as Religious Experience
Josh Wills - Data Labeling as Religious ExperienceJosh Wills - Data Labeling as Religious Experience
Josh Wills - Data Labeling as Religious ExperienceMLconf
 
Vinay Prabhu - Project GaitNet: Ushering in the ImageNet moment for human Gai...
Vinay Prabhu - Project GaitNet: Ushering in the ImageNet moment for human Gai...Vinay Prabhu - Project GaitNet: Ushering in the ImageNet moment for human Gai...
Vinay Prabhu - Project GaitNet: Ushering in the ImageNet moment for human Gai...MLconf
 
Jekaterina Novikova - Machine Learning Methods in Detecting Alzheimer’s Disea...
Jekaterina Novikova - Machine Learning Methods in Detecting Alzheimer’s Disea...Jekaterina Novikova - Machine Learning Methods in Detecting Alzheimer’s Disea...
Jekaterina Novikova - Machine Learning Methods in Detecting Alzheimer’s Disea...MLconf
 
Meghana Ravikumar - Optimized Image Classification on the Cheap
Meghana Ravikumar - Optimized Image Classification on the CheapMeghana Ravikumar - Optimized Image Classification on the Cheap
Meghana Ravikumar - Optimized Image Classification on the CheapMLconf
 
Noam Finkelstein - The Importance of Modeling Data Collection
Noam Finkelstein - The Importance of Modeling Data CollectionNoam Finkelstein - The Importance of Modeling Data Collection
Noam Finkelstein - The Importance of Modeling Data CollectionMLconf
 
June Andrews - The Uncanny Valley of ML
June Andrews - The Uncanny Valley of MLJune Andrews - The Uncanny Valley of ML
June Andrews - The Uncanny Valley of MLMLconf
 
Sneha Rajana - Deep Learning Architectures for Semantic Relation Detection Tasks
Sneha Rajana - Deep Learning Architectures for Semantic Relation Detection TasksSneha Rajana - Deep Learning Architectures for Semantic Relation Detection Tasks
Sneha Rajana - Deep Learning Architectures for Semantic Relation Detection TasksMLconf
 
Anoop Deoras - Building an Incrementally Trained, Local Taste Aware, Global D...
Anoop Deoras - Building an Incrementally Trained, Local Taste Aware, Global D...Anoop Deoras - Building an Incrementally Trained, Local Taste Aware, Global D...
Anoop Deoras - Building an Incrementally Trained, Local Taste Aware, Global D...MLconf
 
Vito Ostuni - The Voice: New Challenges in a Zero UI World
Vito Ostuni - The Voice: New Challenges in a Zero UI WorldVito Ostuni - The Voice: New Challenges in a Zero UI World
Vito Ostuni - The Voice: New Challenges in a Zero UI WorldMLconf
 
Anna choromanska - Data-driven Challenges in AI: Scale, Information Selection...
Anna choromanska - Data-driven Challenges in AI: Scale, Information Selection...Anna choromanska - Data-driven Challenges in AI: Scale, Information Selection...
Anna choromanska - Data-driven Challenges in AI: Scale, Information Selection...MLconf
 
Janani Kalyanam - Machine Learning to Detect Illegal Online Sales of Prescrip...
Janani Kalyanam - Machine Learning to Detect Illegal Online Sales of Prescrip...Janani Kalyanam - Machine Learning to Detect Illegal Online Sales of Prescrip...
Janani Kalyanam - Machine Learning to Detect Illegal Online Sales of Prescrip...MLconf
 
Esperanza Lopez Aguilera - Using a Bayesian Neural Network in the Detection o...
Esperanza Lopez Aguilera - Using a Bayesian Neural Network in the Detection o...Esperanza Lopez Aguilera - Using a Bayesian Neural Network in the Detection o...
Esperanza Lopez Aguilera - Using a Bayesian Neural Network in the Detection o...MLconf
 
Neel Sundaresan - Teaching a machine to code
Neel Sundaresan - Teaching a machine to codeNeel Sundaresan - Teaching a machine to code
Neel Sundaresan - Teaching a machine to codeMLconf
 
Rishabh Mehrotra - Recommendations in a Marketplace: Personalizing Explainabl...
Rishabh Mehrotra - Recommendations in a Marketplace: Personalizing Explainabl...Rishabh Mehrotra - Recommendations in a Marketplace: Personalizing Explainabl...
Rishabh Mehrotra - Recommendations in a Marketplace: Personalizing Explainabl...MLconf
 
Soumith Chintala - Increasing the Impact of AI Through Better Software
Soumith Chintala - Increasing the Impact of AI Through Better SoftwareSoumith Chintala - Increasing the Impact of AI Through Better Software
Soumith Chintala - Increasing the Impact of AI Through Better SoftwareMLconf
 
Roy Lowrance - Predicting Bond Prices: Regime Changes
Roy Lowrance - Predicting Bond Prices: Regime ChangesRoy Lowrance - Predicting Bond Prices: Regime Changes
Roy Lowrance - Predicting Bond Prices: Regime ChangesMLconf
 

More from MLconf (20)

Jamila Smith-Loud - Understanding Human Impact: Social and Equity Assessments...
Jamila Smith-Loud - Understanding Human Impact: Social and Equity Assessments...Jamila Smith-Loud - Understanding Human Impact: Social and Equity Assessments...
Jamila Smith-Loud - Understanding Human Impact: Social and Equity Assessments...
 
Ted Willke - The Brain’s Guide to Dealing with Context in Language Understanding
Ted Willke - The Brain’s Guide to Dealing with Context in Language UnderstandingTed Willke - The Brain’s Guide to Dealing with Context in Language Understanding
Ted Willke - The Brain’s Guide to Dealing with Context in Language Understanding
 
Justin Armstrong - Applying Computer Vision to Reduce Contamination in the Re...
Justin Armstrong - Applying Computer Vision to Reduce Contamination in the Re...Justin Armstrong - Applying Computer Vision to Reduce Contamination in the Re...
Justin Armstrong - Applying Computer Vision to Reduce Contamination in the Re...
 
Igor Markov - Quantum Computing: a Treasure Hunt, not a Gold Rush
Igor Markov - Quantum Computing: a Treasure Hunt, not a Gold RushIgor Markov - Quantum Computing: a Treasure Hunt, not a Gold Rush
Igor Markov - Quantum Computing: a Treasure Hunt, not a Gold Rush
 
Josh Wills - Data Labeling as Religious Experience
Josh Wills - Data Labeling as Religious ExperienceJosh Wills - Data Labeling as Religious Experience
Josh Wills - Data Labeling as Religious Experience
 
Vinay Prabhu - Project GaitNet: Ushering in the ImageNet moment for human Gai...
Vinay Prabhu - Project GaitNet: Ushering in the ImageNet moment for human Gai...Vinay Prabhu - Project GaitNet: Ushering in the ImageNet moment for human Gai...
Vinay Prabhu - Project GaitNet: Ushering in the ImageNet moment for human Gai...
 
Jekaterina Novikova - Machine Learning Methods in Detecting Alzheimer’s Disea...
Jekaterina Novikova - Machine Learning Methods in Detecting Alzheimer’s Disea...Jekaterina Novikova - Machine Learning Methods in Detecting Alzheimer’s Disea...
Jekaterina Novikova - Machine Learning Methods in Detecting Alzheimer’s Disea...
 
Meghana Ravikumar - Optimized Image Classification on the Cheap
Meghana Ravikumar - Optimized Image Classification on the CheapMeghana Ravikumar - Optimized Image Classification on the Cheap
Meghana Ravikumar - Optimized Image Classification on the Cheap
 
Noam Finkelstein - The Importance of Modeling Data Collection
Noam Finkelstein - The Importance of Modeling Data CollectionNoam Finkelstein - The Importance of Modeling Data Collection
Noam Finkelstein - The Importance of Modeling Data Collection
 
June Andrews - The Uncanny Valley of ML
June Andrews - The Uncanny Valley of MLJune Andrews - The Uncanny Valley of ML
June Andrews - The Uncanny Valley of ML
 
Sneha Rajana - Deep Learning Architectures for Semantic Relation Detection Tasks
Sneha Rajana - Deep Learning Architectures for Semantic Relation Detection TasksSneha Rajana - Deep Learning Architectures for Semantic Relation Detection Tasks
Sneha Rajana - Deep Learning Architectures for Semantic Relation Detection Tasks
 
Anoop Deoras - Building an Incrementally Trained, Local Taste Aware, Global D...
Anoop Deoras - Building an Incrementally Trained, Local Taste Aware, Global D...Anoop Deoras - Building an Incrementally Trained, Local Taste Aware, Global D...
Anoop Deoras - Building an Incrementally Trained, Local Taste Aware, Global D...
 
Vito Ostuni - The Voice: New Challenges in a Zero UI World
Vito Ostuni - The Voice: New Challenges in a Zero UI WorldVito Ostuni - The Voice: New Challenges in a Zero UI World
Vito Ostuni - The Voice: New Challenges in a Zero UI World
 
Anna choromanska - Data-driven Challenges in AI: Scale, Information Selection...
Anna choromanska - Data-driven Challenges in AI: Scale, Information Selection...Anna choromanska - Data-driven Challenges in AI: Scale, Information Selection...
Anna choromanska - Data-driven Challenges in AI: Scale, Information Selection...
 
Janani Kalyanam - Machine Learning to Detect Illegal Online Sales of Prescrip...
Janani Kalyanam - Machine Learning to Detect Illegal Online Sales of Prescrip...Janani Kalyanam - Machine Learning to Detect Illegal Online Sales of Prescrip...
Janani Kalyanam - Machine Learning to Detect Illegal Online Sales of Prescrip...
 
Esperanza Lopez Aguilera - Using a Bayesian Neural Network in the Detection o...
Esperanza Lopez Aguilera - Using a Bayesian Neural Network in the Detection o...Esperanza Lopez Aguilera - Using a Bayesian Neural Network in the Detection o...
Esperanza Lopez Aguilera - Using a Bayesian Neural Network in the Detection o...
 
Neel Sundaresan - Teaching a machine to code
Neel Sundaresan - Teaching a machine to codeNeel Sundaresan - Teaching a machine to code
Neel Sundaresan - Teaching a machine to code
 
Rishabh Mehrotra - Recommendations in a Marketplace: Personalizing Explainabl...
Rishabh Mehrotra - Recommendations in a Marketplace: Personalizing Explainabl...Rishabh Mehrotra - Recommendations in a Marketplace: Personalizing Explainabl...
Rishabh Mehrotra - Recommendations in a Marketplace: Personalizing Explainabl...
 
Soumith Chintala - Increasing the Impact of AI Through Better Software
Soumith Chintala - Increasing the Impact of AI Through Better SoftwareSoumith Chintala - Increasing the Impact of AI Through Better Software
Soumith Chintala - Increasing the Impact of AI Through Better Software
 
Roy Lowrance - Predicting Bond Prices: Regime Changes
Roy Lowrance - Predicting Bond Prices: Regime ChangesRoy Lowrance - Predicting Bond Prices: Regime Changes
Roy Lowrance - Predicting Bond Prices: Regime Changes
 

Recently uploaded

"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek SchlawackFwdays
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupFlorian Wilhelm
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfAlex Barbosa Coqueiro
 
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Wonjun Hwang
 
Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time ClashPowerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clashcharlottematthew16
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyAlfredo García Lavilla
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Manik S Magar
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brandgvaughan
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfAddepto
 
My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024The Digital Insurer
 
Story boards and shot lists for my a level piece
Story boards and shot lists for my a level pieceStory boards and shot lists for my a level piece
Story boards and shot lists for my a level piececharlottematthew16
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Patryk Bandurski
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationSlibray Presentation
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebUiPathCommunity
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Mattias Andersson
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024Stephanie Beckett
 
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostLeverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostZilliz
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationRidwan Fadjar
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLScyllaDB
 

Recently uploaded (20)

"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project Setup
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdf
 
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
 
Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time ClashPowerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clash
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easy
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brand
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdf
 
My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024
 
Story boards and shot lists for my a level piece
Story boards and shot lists for my a level pieceStory boards and shot lists for my a level piece
Story boards and shot lists for my a level piece
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck Presentation
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio Web
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024
 
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostLeverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 Presentation
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQL
 

Predicting Probability (Male) in Facebook Data

  • 2. 2 Certainty Vortex © 2013 Foster Provost “Models tend to go where the signal is”
  • 3. 3 How ‘predictable’ is a predictive modeling task? (given the data) Random Deterministic Sexual Orientation Pizza for Dinner?
  • 4. 4 4 For now let’s try to predict who is male …
  • 5. 5 5 Predicting Probability (Male) in Facebook Data: Facebook public dataset with 200K anonymized users, their demographics and their likes Methodology: Logistic regression on sparse representation 00000d41ed774823fca142945ec915c0,1,,,,,,,,,,,en_GB,, 00000dee02d70cf8c0d79f96b6d1c59d,0,,,,,,,,,,,en_US,, 00000f232abfe25a80156fe069395460,0,1992,20,2,2,,,,,,,,19,-5 00000f4ba0cff946b1c0e3b051287ede,0,1993,19,2,,,,,,,,en_US,310,8 0000130571654e3afaa62f4e9d2e4f63,0,,,2,2,,,,,,,en_US,193,7 00001544469ae9b408869a463a1dd77a,1,1984,28,2,,,,,,,,en_US,,-4 00001544469ae9b408869a463a1dd77a 100198443380917 00001544469ae9b408869a463a1dd77a 100248613695 00001544469ae9b408869a463a1dd77a 10050726267 00001544469ae9b408869a463a1dd77a 101021248409 00001544469ae9b408869a463a1dd77a 101054236602446 00001544469ae9b408869a463a1dd77a 101425333232551 00001544469ae9b408869a463a1dd77a 10148199466 00001544469ae9b408869a463a1dd77a 10150154095435553 00001544469ae9b408869a463a1dd77a 10161539667 00001544469ae9b408869a463a1dd77a 101844593108 00001544469ae9b408869a463a1dd77a 101936079845392 00001544469ae9b408869a463a1dd77a 101987301816 00001544469ae9b408869a463a1dd77a 102038567018 00001544469ae9b408869a463a1dd77a 102040023230884 00001544469ae9b408869a463a1dd77a 10212595263 00001544469ae9b408869a463a1dd77a 102168219824412 00001544469ae9b408869a463a1dd77a 1022548733 40% Men
  • 6. 6 6 Take 1: Predict Gender Based on age …
  • 7. 7 7 Gender based on age: very little signal … Overall Accuracy: 60% AUC: 58% p(male|age)
  • 8. 8 8 Gender based on age: very little signal … Overall Accuracy: 60% AUC: 58% Target the 1% with highest probability: Accuracy: 75% p(male|age)
  • 9. 9 9 From small to bigger data …
  • 10. 10 10 Take 2: Gender based all your likes
  • 11. 11 11 Predict gender base on all likes: a lot of signal … Overall Accuracy: 83% Target top 1% Accuracy: 100% p(male|likes)
  • 12. 12 12 Progression: from age to all ‘likes’ 75% 86% 100% 100%100%100% Age
  • 13. 13 But what happens if your problem is a mixture of both? Random Deterministic
  • 14. 14 14 Witness a spike in human predictability .. 2 weeks in 2012 Median5%Lift 2x Death of free will?
  • 16. 16 Logistic Regression Stochastic gradient descent Hashing Streaming L1 & L2 Penalties
  • 18. 18 18 URL’s that are very predictive for more than 10 brands www.womenshealthbase.com www.filmannex.com www.ffog.net www.drugsnews.org www.menshealthbase.com www.dailyfreshies.com www.hark.com www.gossipcenter.com www.articletrunk.com www.411answers.com www.dailyrx.com www.all-allergies.com www.knowvehicles.com www.chinaflix.com www.parentingnewsstories.com www.wrestlingnewz.com www.gourmandia.com
  • 19. 19 Traffic Patterns Are ‘Non - Human’ website 1 website 2 50% Traffic overlap of cookies from Bid Request
  • 20.
  • 22.
  • 23. 23
  • 24. 24
  • 25. 25 25 Witness a spike in human predictability .. 2 weeks in 2012 PredictionPerformance 2x Death of free will?
  • 27. 27 27 Bots are executing conversion events • ‘Cookie Stuffing’ increases the value of the ad for retargeting • Messing up Web analytics … • Messes up my models because a bot is easier to predict than a human
  • 28. 28 28 Percent bot traffic on conversion metrics
  • 29. 29 29 Two populations surfing the we: Bot vs. Human 36% 2015
  • 31. 31 31 Bot activity has more signal – higher predictions
  • 32. 32 32 Eliminate labels generated by bots …. 32 3 more weeks in spring 2012 PerformanceIndex
  • 33. 33 33 What about Clicks? “Measure of consumers interest in the product’
  • 34. 34 34
  • 35. 35 35 Accidental clicks are more predictable than intentional Model learns to predict the accidental much more easily than the intentional ones … I will only target clicks that are likely to happen accidentally but not randomly … accidental intentional
  • 37. 37 37 Old days of the Click Metric … Strategy 1: target women Strategy 2: target men accidental intentional <
  • 38. 38 38 Finding good audiences for luxury cars? Predict dealership visits?
  • 39. 39 39 buzzfeed.com 3/20/16 amazon.com 4/11/16 nytimes.com 2/15/16 50.240.135.41 166. 216.165.92 207.246.152.60 108.49.133.218 38.104.253.134 208.76.113.13 Mlb.com 3/15/16 Etrade.com 4/25/16 Potterybarn.com 4/10/16 Zip 4 10023-4592 04/15/16 Zip 4 10023-6924 04/20/16 Zip 4 10016-2324 03/10/16 03/14/16 Web App Phone/Tablet Desktop Phone/Tablet com.mlb.atbat 04/20/16 com.rovio.angry 02/26/16 com.myfitnesspal.android 3/25/16 4/18/16 IDFA 31AB-26FC- 94AE-756B My actual data: 100 Billion daily events across devices … ? ?
  • 40. 40 40 Potentially three populations in the location prediction •  People who are indeed at dealership and their history •  People who are somewhere close •  People who hacked into your WIFI
  • 41. 41 41 Identify people who will go to Mercedes dealerships
  • 45. 45 45 How much randomness can a model absorb? We will randomly switch the gender value for increasing percentages 00000d41ed774823fca142945ec915c0,1,,,,,,,,,,,en_GB,, 00000dee02d70cf8c0d79f96b6d1c59d,0,,,,,,,,,,,en_US,, 00000f232abfe25a80156fe069395460,0,1992,20,2,2,,,,,,,,19,-5 00000f4ba0cff946b1c0e3b051287ede,0,1993,19,2,,,,,,,,en_US,310,8 0000130571654e3afaa62f4e9d2e4f63,0,,,2,2,,,,,,,en_US,193,7 00001544469ae9b408869a463a1dd77a,1,1984,28,2,,,,,,,,en_US,,-4 00000d41ed774823fca142945ec915c0,0,,,,,,,,,,,en_GB,, 00000dee02d70cf8c0d79f96b6d1c59d,0,,,,,,,,,,,en_US,, 00000f232abfe25a80156fe069395460,1,1992,20,2,2,,,,,,,,19,-5 00000f4ba0cff946b1c0e3b051287ede,0,1993,19,2,,,,,,,,en_US,310,8 0000130571654e3afaa62f4e9d2e4f63,1,,,2,2,,,,,,,en_US,193,7 00001544469ae9b408869a463a1dd77a,1,1984,28,2,,,,,,,,en_US,,-4
  • 46. 46 46 Performance is surprisingly stable even under random noise Percent original labels 60.00% 65.00% 70.00% 75.00% 80.00% 85.00% 90.00% 95.00% 100.00% 100% 75% 50% 25% Percent Men in top 1% 50% original labels 25% original labels
  • 47. 47 47 Identify people who will go to Mercedes dealerships? My predictions might be better than my ‘ground truth’
  • 48. 48 48 Where do we find frequent traveler?
  • 49. 49 49 What do you think indicates people going to JFK? URL Logis)c Parameter www.iglesiaelfaroinc.org 2.38 www.jumpseatnews.com 2.25 www.bluelineprop.com 2.21 www.ktxdtv.com 2.14 www.southjefffootball.org 2.1 www.unitedafa.org 2.09 www.parliamenthouse.com 2.07 www.yunghova.com 2.06 www.interlinetravel.com 2.03 www.aclin.org 2.03 www.swissport.com 2.03 www.gcsanc.com 2.01 www.swacu.org 2.01 www.airlinepilotcentral.com 1.97 www.homotrophy.com 1.97 www.beggsfuneralhome.net 1.94 www.tvathleFcs.org 1.92 www.2shopper.com 1.91 www.nextmagazine.com 1.91 www.dailyjocks.com 1.87 www.pullzone.com 1.87 www.diamondoffshore.com 1.86 www.myerspolaris.com 1.86 www.ryandeyer.com 1.86 www.okllo.com 1.84 www.ifihadtochoose.com 1.83 www.ivoirmixdj.com 1.83
  • 50. 50 50 URL Logis)c Parameter www.iglesiaelfaroinc.org 2.38 www.jumpseatnews.com 2.25 www.bluelineprop.com 2.21 www.ktxdtv.com 2.14 www.southjefffootball.org 2.1 www.unitedafa.org 2.09 www.parliamenthouse.com 2.07 www.yunghova.com 2.06 www.interlinetravel.com 2.03 www.aclin.org 2.03 www.swissport.com 2.03 www.gcsanc.com 2.01 www.swacu.org 2.01 www.airlinepilotcentral.com 1.97 www.homotrophy.com 1.97 www.beggsfuneralhome.net 1.94 www.tvathleFcs.org 1.92 www.2shopper.com 1.91 www.nextmagazine.com 1.91 www.dailyjocks.com 1.87 www.pullzone.com 1.87 www.diamondoffshore.com 1.86 www.myerspolaris.com 1.86 www.ryandeyer.com 1.86 www.okllo.com 1.84 www.ifihadtochoose.com 1.83 www.ivoirmixdj.com 1.83
  • 51. 51 51 URL Logis)c Parameter www.iglesiaelfaroinc.org 2.38 www.jumpseatnews.com 2.25 www.bluelineprop.com 2.21 www.ktxdtv.com 2.14 www.southjefffootball.org 2.1 www.unitedafa.org 2.09 www.parliamenthouse.com 2.07 www.yunghova.com 2.06 www.interlinetravel.com 2.03 www.aclin.org 2.03 www.swissport.com 2.03 www.gcsanc.com 2.01 www.swacu.org 2.01 www.airlinepilotcentral.com 1.97 www.homotrophy.com 1.97 www.beggsfuneralhome.net 1.94 www.tvathleFcs.org 1.92 www.2shopper.com 1.91 www.nextmagazine.com 1.91 www.dailyjocks.com 1.87 www.pullzone.com 1.87 www.diamondoffshore.com 1.86 www.myerspolaris.com 1.86 www.ryandeyer.com 1.86 www.okllo.com 1.84 www.ifihadtochoose.com 1.83 www.ivoirmixdj.com 1.83
  • 52. 52 52 URL Logis)c Parameter www.iglesiaelfaroinc.org 2.38 www.jumpseatnews.com 2.25 www.bluelineprop.com 2.21 www.ktxdtv.com 2.14 www.southjefffootball.org 2.1 www.unitedafa.org 2.09 www.parliamenthouse.com 2.07 www.yunghova.com 2.06 www.interlinetravel.com 2.03 www.aclin.org 2.03 www.swissport.com 2.03 www.gcsanc.com 2.01 www.swacu.org 2.01 www.airlinepilotcentral.com 1.97 www.homotrophy.com 1.97 www.beggsfuneralhome.net 1.94 www.tvathleFcs.org 1.92 www.2shopper.com 1.91 www.nextmagazine.com 1.91 www.dailyjocks.com 1.87 www.pullzone.com 1.87 www.diamondoffshore.com 1.86 www.myerspolaris.com 1.86 www.ryandeyer.com 1.86 www.okllo.com 1.84 www.ifihadtochoose.com 1.83 www.ivoirmixdj.com 1.83
  • 53. 53 53 URL Logis)c Parameter www.iglesiaelfaroinc.org 2.38 www.jumpseatnews.com 2.25 www.bluelineprop.com 2.21 www.ktxdtv.com 2.14 www.southjefffootball.org 2.1 www.unitedafa.org 2.09 www.parliamenthouse.com 2.07 www.yunghova.com 2.06 www.interlinetravel.com 2.03 www.aclin.org 2.03 www.swissport.com 2.03 www.gcsanc.com 2.01 www.swacu.org 2.01 www.airlinepilotcentral.com 1.97 www.homotrophy.com 1.97 www.beggsfuneralhome.net 1.94 www.tvathleFcs.org 1.92 www.2shopper.com 1.91 www.nextmagazine.com 1.91 www.dailyjocks.com 1.87 www.pullzone.com 1.87 www.diamondoffshore.com 1.86 www.myerspolaris.com 1.86 www.ryandeyer.com 1.86 www.okllo.com 1.84 www.ifihadtochoose.com 1.83 www.ivoirmixdj.com 1.83
  • 54. 54 Predict who goes to JFK? Random Deterministic People who work there …
  • 56. 56 56 Beware of (unintentional) discrimination based on predictability