-
1.
Predictive Modeling & Data-
Driven Product Insights at
LinkedIn
-
2.
This talk: the two sides of data at LinkedIn
Using data to build products that delight our users
Using data to uncover insights
-
3.
Connect the world’s
professionals to make them
more productive and
successful
-
4.
150M+ professional profiles
-
5.
What can we do with all of this data?
Build products.
-
6.
150MM+ professional profiles
-
7.
Data tools and infrastructure
-
8.
Data science
-
9.
What can we do with all of this data?
Derive insights that are actionable and
improve the business or our members’
experience.
Question: What actions on the site are
predictive of future engagement?
-
10.
Panel Data Econometrics
Identifying site activities that predict future engagement
-
11.
What can we do with all of this data?
Derive insights that are just plain cool.
-
12.
What can we do with all of this data?
Insights lead to products.
And what can big data products do?
-
13.
We are good at getting people to make
different decisions…
…but we can do more to help people make
better decisions.
Some stats4BN searches150M members60M USMembers first, monetization comes second
2 members a second…Plus dynamics over time.Just an awesome datasetEven though only been around since 2003, we have data going much further back because our members' careers span that timeGoal: build simple, brilliant products that delight our users. ME: and use data to enhance those products where applicable
Two thingsBuild data productsData insights…going to talk a bit more about that in the second part of the talk
How many of these products are data driven? All of them.
How many of these products are data driven? All of them.
2 members a second…Plus dynamics over time.Just an awesome datasetEven though only been around since 2003, we have data going much further back because our members' careers span that timeGoal: build simple, brilliant products that delight our users. ME: and use data to enhance those products where applicable
How many of these products are data driven? All of them.
Two thingsBuild data productsData insights…going to talk a bit more about that in the second part of the talk
Over 75TB/day processedOver 10BN rows / dayReal time availability for key eventsMost tracking events available after 15 minutes via kafka and hadoop
Two thingsBuild data productsData insights…going to talk a bit more about that in the second part of the talk
Ultimately it’s not about data or tools, it’s about asking the right questions and employing star data scientists who own the end to end. Examples of how we work…
Ultimately it’s not about data or tools, it’s about asking the right questions and employing star data scientists who own the end to end. Examples of how we work…
Two thingsBuild data productsData insights…going to talk a bit more about that in the second part of the talk
Panel data: Following observations over time allows us to control for subject-specific (unobservable) effects Going further away from the gold standard of A/B testing and moving closer to establishing predictive power
Two thingsBuild data productsData insights…going to talk a bit more about that in the second part of the talk
Look at the length of the names – now that’s an interesting story! There’s Chip, Todd and Trey - the quintessential sales guys. CEOs are more diverse – but they still want to be your friend -- so they use nicknames.
Look at the length of the names – now that’s an interesting story! There’s Chip, Todd and Trey - the quintessential sales guys. CEOs are more diverse – but they still want to be your friend -- so they use nicknames.
Which companies are over-represented in founders’ histories?
Which companies are over-represented in founders’ histories?
Two thingsBuild data productsData insights…going to talk a bit more about that in the second part of the talk
Two thingsBuild data productsData insights…going to talk a bit more about that in the second part of the talk